educationR: A Comprehensive Collection of Educational Datasets

library(educationR)

library(ggplot2)

library(dplyr)
#> 
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#> 
#>     filter, lag
#> The following objects are masked from 'package:base':
#> 
#>     intersect, setdiff, setequal, union

Introduction

The educationR package provides a comprehensive collection of educational datasets, focusing on various aspects of education such as student performance, learning methods, test scores, absenteeism, and other educational metrics. This package is designed to serve as a resource for educational researchers, data analysts, and statisticians who wish to explore and analyze data in the field of education.

The datasets in the educationR package have been carefully curated and are ready to use for your data analysis needs.

Dataset Suffixes

Each dataset in the educationR package comes with a suffix that indicates the type and format of the dataset. These suffixes help users quickly identify the structure of the data, such as:

  • tbl_df: A tibble (a modern version of a data frame in R)
  • df: A standard data frame
  • table: A table (used for contingency tables or cross-tabulations)

Example Datasets

Below are some examples of datasets included in the educationR package:

  • Achieve_tbl_df: A tibble containing math achievement test scores by gender.

  • German_tbl_df: A tibble documenting before-and-after German copying errors post-course.

  • QuizPulse10_df: A data frame comparing quiz scores with lecture pulse rates.

  • UCBAdmissions_table: A table documenting student admissions at UC Berkeley.

Data Visualization with educationR data sets

Here are a couple of examples of how to use educationR package datasets to create data visualizations related to educational matters:

1. Visualization of Math Achievement by Gender


# Example: Visualizing math achievement by gender
Achieve_tbl_df %>%
  ggplot(aes(x = gender, y = score)) +
  geom_boxplot() +
  labs(title = "Math Achievement by Gender",
       x = "Gender",
       y = "Achievement Score") +
  theme_minimal()

2. Visualization of Quiz Scores vs. Lecture Pulse Rates


# Example: Visualizing the relationship between quiz scores and lecture pulse rates
QuizPulse10_df %>%
  ggplot(aes(x = Quiz, y = Lecture)) +
  geom_point(alpha = 0.6) +
  labs(title = "Quiz Scores vs. Lecture Pulse Rates",
       x = "Quiz Score",
       y = "Pulse Rate (beats per minute)") +
  theme_minimal()

Conclusion

The educationR package provides a wealth of educational data for analysis. By using the dataset suffixes, users can quickly identify the type of data they are working with, ensuring a smooth analysis process.

For more information on each dataset and further examples, please refer to the full package documentation.