Package 'educationR'

Title: A Comprehensive Collection of Educational Datasets
Description: Provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics. This package serves as a resource for educational researchers, data analysts, and statisticians to explore and analyze data in the field of education.
Authors: Renzo Caceres Rossi [aut, cre]
Maintainer: Renzo Caceres Rossi <[email protected]>
License: GPL-3
Version: 0.1.0
Built: 2024-12-06 01:36:10 UTC
Source: CRAN

Help Index


Ability and Intelligence Tests

Description

This dataset, ability_list, is a list containing information about six ability and intelligence tests administered to 112 individuals. The dataset provides a covariance matrix, the means (centers) of the variables, and the number of observations.

Usage

data(ability_list)

Format

A list with 3 components:

cov

A covariance matrix (numeric matrix) of dimensions 6x6, representing the relationships between six different tests.

center

A numeric vector of length 6 containing the mean scores for each of the six tests.

n.obs

The total number of observations (integer), which is 112.

Details

The dataset name has been kept as 'ability_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'list' indicates that the dataset is a list object. The original content has not been modified in any way.

Source

Generated for educational purposes.
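
Examples

A minimal sketch of working with a list of this shape. It assumes that ability_list follows the cov / center / n.obs layout documented above. So that the code runs without the package installed, it uses base R's ability.cov (from the datasets package), which has that same layout, as a stand-in.

```r
# Stand-in with the documented layout (cov, center, n.obs).
# With educationR installed, one could instead run:
#   data("ability_list", package = "educationR"); abl <- ability_list
abl <- datasets::ability.cov

# Convert the 6x6 covariance matrix to a correlation matrix.
cor_mat <- cov2cor(abl$cov)
print(round(cor_mat, 2))

# Fit a one-factor model directly from the covariance matrix;
# n.obs is needed for the goodness-of-fit test.
fa <- factanal(factors = 1, covmat = abl$cov, n.obs = abl$n.obs)
print(fa$loadings)
```

Because only summary statistics are stored, functions that need the raw scores cannot be used; factanal() accepts a covariance matrix through its covmat argument, which is why it is used here.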


Absenteeism from School in New South Wales

Description

This dataset, absenteeism_tbl_df, is a tibble containing information about absenteeism from school and certain demographic characteristics of children in rural New South Wales, Australia. The dataset includes data from 146 randomly sampled students during a particular school year, providing insights into the relationships between absenteeism and variables such as ethnicity, sex, age group, and learning categories.

Usage

data(absenteeism_tbl_df)

Format

A tibble with 146 observations and 5 variables:

eth

Ethnicity of the student (factor with 2 levels).

sex

Sex of the student (factor with 2 levels).

age

Age group of the student (factor with 4 levels).

lrn

Learning category of the student, e.g., slow learner or not (factor with 2 levels).

days

Number of days the student was absent from school (integer).

Details

The dataset name has been kept as 'absenteeism_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data collected from a study in rural New South Wales, Australia.
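
Examples

A short sketch of summarising absences by group. The rows below are hypothetical and only mimic the documented columns; the factor level labels are assumptions, not taken from the package.

```r
# Hypothetical rows with the documented columns (not the package's data).
# With educationR installed: data("absenteeism_tbl_df", package = "educationR")
abs_toy <- data.frame(
  eth  = factor(c("A", "A", "N", "N", "A", "N")),
  sex  = factor(c("F", "M", "F", "M", "F", "M")),
  age  = factor(c("F0", "F1", "F2", "F3", "F1", "F2")),
  lrn  = factor(c("AL", "SL", "AL", "SL", "AL", "AL")),
  days = c(2L, 11L, 5L, 20L, 0L, 14L)
)

# Mean number of days absent for each ethnicity-by-sex combination.
mean_days <- aggregate(days ~ eth + sex, data = abs_toy, FUN = mean)
print(mean_days)
```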


Math Achievement Test Scores by Gender

Description

This dataset, Achieve_tbl_df, is a tibble containing information about math achievement test scores for 25 high school students, categorized by gender. The dataset provides insights into the distribution of scores between male and female students.

Usage

data(Achieve_tbl_df)

Format

A tibble with 25 observations and 2 variables:

score

Math achievement test score of the student (integer).

gender

Gender of the student (factor with 2 levels: "Male", "Female").

Details

The dataset name has been kept as 'Achieve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Math Test Scores versus Anxiety Scores

Description

This dataset, Anxiety_tbl_df, is a tibble containing information about math test scores and anxiety levels before the test for 20 students. The dataset provides insights into the relationship between anxiety levels and math test performance.

Usage

data(Anxiety_tbl_df)

Format

A tibble with 20 observations and 2 variables:

anxiety

Anxiety score of the student before taking the math test (integer).

math

Math test score of the student (integer).

Details

The dataset name has been kept as 'Anxiety_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Grad Rates: Athletes vs. Non-Athletes Big Ten Conf.

Description

This dataset, Bigten_tbl_df, is a tibble containing information about the graduation rates of student athletes and nonathletes in the Big Ten Conference. The dataset includes data from two academic years, showing the graduation rates by school and athlete status (athletes vs nonathletes).

Usage

data(Bigten_tbl_df)

Format

A tibble with 44 observations and 4 variables:

school

Name of the school (character).

year

Year of the data (factor with 2 levels).

rate

Graduation rate percentage (integer).

status

Athlete status (character, either 'Athlete' or 'Nonathlete').

Details

The dataset name has been kept as 'Bigten_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

NCAA Graduation Rates Report, 2000.


Test Scores on First Exam in Biology Class

Description

This dataset, Biology_tbl_df, is a tibble containing the test scores of 30 students on their first exam in a biology class. The dataset provides insight into the distribution of scores among the students.

Usage

data(Biology_tbl_df)

Format

A tibble with 30 observations and 1 variable:

score

Test scores on the first biology exam (integer).

Details

The dataset name has been kept as 'Biology_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Education Level of Blacks by Gender

Description

This dataset, Blackedu_tbl_df, is a tibble containing information about the education level of Black individuals, categorized by gender. The dataset includes 3800 observations and provides insights into the distribution of education levels across different gender groups.

Usage

data(Blackedu_tbl_df)

Format

A tibble with 3800 observations and 2 variables:

gender

Gender of the individual (factor with 2 levels).

education

Education level of the individual (factor with 5 levels).

Details

The dataset name has been kept as 'Blackedu_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Bureau of Census data.


Books Read & Spelling Scores: 17 Third Graders

Description

This dataset, Books_tbl_df, is a tibble containing information about the number of books read and spelling scores for 17 third-grade students. The dataset provides insights into the relationship between the number of books read and spelling scores in this group of students.

Usage

data(Books_tbl_df)

Format

A tibble with 17 observations and 2 variables:

book

Number of books read by the student (integer).

spelling

Spelling score of the student (integer).

Details

The dataset name has been kept as 'Books_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Community college housing (simulated data)

Description

This dataset, cchousing_tbl_df, is a tibble containing simulated data on housing prices for students at a community college. The dataset provides the housing prices for 75 students, offering insights into the distribution of housing prices in this educational setting.

Usage

data(cchousing_tbl_df)

Format

A tibble with 75 observations and 1 variable:

price

Housing price for the student (numeric).

Details

The dataset name has been kept as 'cchousing_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


College Credits

Description

This dataset, credits_tbl_df, is a tibble containing simulated data on the number of college credits taken by students each semester. The dataset includes data from 100 students, providing insights into the distribution of credits taken by students in a college setting.

Usage

data(credits_tbl_df)

Format

A tibble with 100 observations and 1 variable:

credits

Number of college credits taken by the student (integer).

Details

The dataset name has been kept as 'credits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package (simulated data).


US Crime Rates vs. High School Dropout

Description

This dataset, crime_degree_tbl_df, is a tibble containing the crime rate and the percentage of the population without a high school degree for 51 U.S. jurisdictions (51 rows, presumably the 50 states plus the District of Columbia).

Usage

data(crime_degree_tbl_df)

Format

A tibble with 51 observations and 3 variables:

state

State name (character).

nodegree

Percentage of the population without a high school degree (numeric).

crime

Crime rate (numeric).

Details

The dataset name has been kept as 'crime_degree_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Detroit Auto Workers: Education Levels

Description

This dataset, Detroit_tbl_df, is a tibble containing data on the educational levels of a sample of 40 auto workers in Detroit. The dataset includes information on the highest level of education attained by each worker.

Usage

data(Detroit_tbl_df)

Format

A tibble with 40 observations and 1 variable:

educ

Educational level of the auto worker (integer).

Details

The dataset name has been kept as 'Detroit_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Dev Students: 2-Year & 4-Year College Demographics

Description

This dataset, Develop_tbl_df, is a tibble containing demographic data on students enrolled in developmental education programs at 2-year and 4-year colleges. The dataset includes information on the racial background of the students and whether they are enrolled in 2-year or 4-year colleges.

Usage

data(Develop_tbl_df)

Format

A tibble with 5656 observations and 2 variables:

race

Racial background of the student (factor with 5 levels).

college

Type of college the student is enrolled in (factor with 2 levels: 2-year or 4-year).

Details

The dataset name has been kept as 'Develop_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from "Research in Development Education" (1994), V. 11, 2.


Fall '95 Developmental Math: Failed Student Scores

Description

This dataset, Devmath_tbl_df, is a tibble containing test scores for students who failed developmental mathematics in the fall semester of 1995. The dataset includes the scores of these students as part of a simulated study.

Usage

data(Devmath_tbl_df)

Format

A tibble with 40 observations and 1 variable:

score

Test scores of students who failed developmental mathematics (integer).

Details

The dataset name has been kept as 'Devmath_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package. Data provided by Dr. Anita Kitchens.


Drug use of students and parents

Description

This dataset, drug_use_tbl_df, is a tibble containing data on marijuana use by students and their parents. Each of the 445 observations is one student-parent pair and records whether the student has used the drug and whether the parent has.

Usage

data(drug_use_tbl_df)

Format

A tibble with 445 observations and 2 variables:

student

Whether the student has used drugs (factor with 2 levels).

parents

Whether the parent has used drugs (factor with 2 levels).

Details

The dataset name has been kept as 'drug_use_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package. Ellis GJ and Stone LH. 1979. Marijuana Use in College: An Evaluation of a Modeling Explanation. Youth and Society 10:323-334.
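
Examples

With two categorical columns, a natural first step is a two-way table. The sketch below uses hypothetical pairs with the documented column names; the level labels ("uses", "used", "not") are assumptions.

```r
# Hypothetical student-parent pairs (not the package's data).
# With educationR installed: data("drug_use_tbl_df", package = "educationR")
drug_toy <- data.frame(
  student = factor(c("uses", "uses", "not", "not", "uses", "not", "not", "uses")),
  parents = factor(c("used", "not", "not", "used", "used", "not", "not", "used"))
)

# Cross-tabulate student use against parent use.
tab <- table(student = drug_toy$student, parents = drug_toy$parents)
print(tab)

# Share of students in each category, within each parent category.
col_prop <- prop.table(tab, margin = 2)
print(round(col_prop, 2))
```

On the full 445 rows, chisq.test(tab) would be one way to test whether the two variables are independent; it is left out here because the toy table is too small for the approximation to be meaningful.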


Dyslexia: Data on College Students Diagnosed with Dyslexia

Description

This dataset, Dyslexia_tbl_df, is a tibble containing data on a group of college students diagnosed with dyslexia. The dataset includes various personal characteristics such as age, gender, handedness, weight, height, and number of children, along with the number of words they were able to read correctly.

Usage

data(Dyslexia_tbl_df)

Format

A tibble with 8 observations and 7 variables:

words

Number of words read correctly (integer).

age

Age of the student (integer).

gender

Gender of the student (character).

handed

Handedness of the student (character).

weight

Weight of the student (integer).

height

Height of the student (integer).

children

Number of children the student has (integer).

Details

The dataset name has been kept as 'Dyslexia_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


educationR: A Comprehensive Collection of Educational Datasets

Description

This package provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics.

Author(s)

Maintainer: Renzo Cáceres Rossi [email protected]


Engineering Grads Salary: 10 Years Post-Graduation

Description

This dataset, Engineer_tbl_df, is a tibble containing salary data for engineering graduates 10 years after graduation. The dataset includes information on the salary of the graduates and the type of university they graduated from (categorized into three types of universities).

Usage

data(Engineer_tbl_df)

Format

A tibble with 51 observations and 2 variables:

salary

Salary of the engineering graduate 10 years after graduation (integer).

university

Type of university the graduate attended (factor with 3 levels).

Details

The dataset name has been kept as 'Engineer_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
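
Examples

A sketch of comparing salaries across the three university types. The values and the level labels "A", "B", "C" are hypothetical; only the column names follow the Format section above.

```r
# Hypothetical data with the documented columns (not the package's data).
# With educationR installed: data("Engineer_tbl_df", package = "educationR")
eng_toy <- data.frame(
  salary     = c(52L, 58L, 61L, 47L, 50L, 49L, 66L, 70L, 64L),
  university = factor(rep(c("A", "B", "C"), each = 3))
)

# Mean salary per university type.
group_means <- tapply(eng_toy$salary, eng_toy$university, mean)
print(group_means)

# One-way analysis of variance: does mean salary differ by type?
fit <- aov(salary ~ university, data = eng_toy)
print(summary(fit))
```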


College entrance exam scores for 24 high school seniors

Description

This dataset, Entrance_tbl_df, is a tibble containing the college entrance exam scores of 24 high school seniors. The dataset includes information on their exam scores.

Usage

data(Entrance_tbl_df)

Format

A tibble with 24 observations and 1 variable:

score

College entrance exam scores (integer).

Details

The dataset name has been kept as 'Entrance_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Exam Scores: Class of 19 Students

Description

This dataset, exams_tbl_df, is a tibble containing the exam scores of a class of 19 students. The dataset includes information on their performance in a specific exam.

Usage

data(exams_tbl_df)

Format

A tibble with 19 observations and 1 variable:

scores

Exam scores of students (integer).

Details

The dataset name has been kept as 'exams_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Female College Student Heights

Description

This dataset, fheights_tbl_df, is a tibble containing the heights of 24 female college students, measured in inches. The dataset provides insight into the physical characteristics of a specific demographic group.

Usage

data(fheights_tbl_df)

Format

A tibble with 24 observations and 1 variable:

heights

Heights of female college students, measured in inches (integer).

Details

The dataset name has been kept as 'fheights_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


German Copying Errors Before and After an Experimental Course

Description

This dataset, German_tbl_df, is a tibble containing the number of errors students made when copying a German passage, recorded before and after an experimental German course. The 20 observations are in long format, one row per student per occasion, so each student's before and after counts can be compared to assess the effectiveness of the course.

Usage

data(German_tbl_df)

Format

A tibble with 20 observations and 3 variables:

student

Identifier for the student (character).

when

Indicates whether the errors were recorded "before" or "after" the experimental course (character).

errors

Number of errors made by the student when copying the German passage (integer).

Details

The dataset name has been kept as 'German_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
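
Examples

Because the before and after counts sit in separate rows, a paired comparison first needs each student's two rows lined up. The sketch below does this on hypothetical data with the documented columns; the exact spelling of the values in when ("before", "after") is an assumption.

```r
# Hypothetical long-format data (not the package's data).
# With educationR installed: data("German_tbl_df", package = "educationR")
german_toy <- data.frame(
  student = rep(c("s1", "s2", "s3", "s4"), times = 2),
  when    = rep(c("before", "after"), each = 4),
  errors  = c(10L, 8L, 12L, 9L, 7L, 8L, 9L, 5L),
  stringsAsFactors = FALSE
)

# Line up each student's two measurements by student identifier.
before <- german_toy[german_toy$when == "before", ]
after  <- german_toy[german_toy$when == "after", ]
paired <- merge(before, after, by = "student",
                suffixes = c("_before", "_after"))

# Negative values mean fewer errors after the course.
paired$change <- paired$errors_after - paired$errors_before
print(paired[, c("student", "errors_before", "errors_after", "change")])

# Paired t-test on the within-student change.
tt <- t.test(paired$errors_after, paired$errors_before, paired = TRUE)
print(tt)
```

Merging on student rather than relying on row order keeps the pairing correct even if the rows are not sorted consistently.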


Analytical Skills of Young Gifted Children

Description

This dataset, gifted_tbl_df, is a tibble containing data on the analytical skills of 36 young gifted children, along with several factors that may influence these skills. The dataset includes measures such as parental IQ, early developmental milestones, and television habits.

Usage

data(gifted_tbl_df)

Format

A tibble with 36 observations and 8 variables:

score

Analytical skills score of the child (integer).

fatheriq

IQ of the father (integer).

motheriq

IQ of the mother (integer).

speak

Age in months when the child first said "mummy" or "daddy" (integer).

count

Age in months when the child first counted to 10 successfully (integer).

read

Average number of hours per week the child's parents read to the child (numeric).

edutv

Average number of hours per week the child watched educational TV programs in the past three months (numeric).

cartoons

Average number of hours per week the child watched cartoons on TV in the past three months (numeric).

Details

The dataset name has been kept as 'gifted_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package. Graybill, F.A. & Iyer, H.K. (1994). Regression Analysis: Concepts and Applications. Duxbury, pp. 511-516.


High School GPA versus College GPA

Description

This dataset, GPA_college_tbl_df, is a tibble containing data on the relationship between high school GPA and college GPA for 10 students. The dataset is valuable for analyzing academic performance correlations across different educational levels.

Usage

data(GPA_college_tbl_df)

Format

A tibble with 10 observations and 2 variables:

hsgpa

High school GPA (numeric).

collgpa

College GPA (numeric).

Details

The dataset name has been kept as 'GPA_college_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
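
Examples

A sketch of relating the two GPA columns with a simple linear regression. The numbers are hypothetical; only the column names hsgpa and collgpa come from the Format section above.

```r
# Hypothetical data with the documented columns (not the package's data).
# With educationR installed: data("GPA_college_tbl_df", package = "educationR")
gpa_toy <- data.frame(
  hsgpa   = c(2.5, 3.0, 3.2, 3.5, 3.8, 4.0),
  collgpa = c(2.2, 2.8, 2.9, 3.1, 3.6, 3.7)
)

# Regress college GPA on high school GPA.
fit <- lm(collgpa ~ hsgpa, data = gpa_toy)
print(coef(fit))

# Pearson correlation; its square equals R-squared in simple regression.
r <- cor(gpa_toy$hsgpa, gpa_toy$collgpa)
print(r)
```

With only 10 observations in the real dataset, any fitted slope will have a wide confidence interval, which confint(fit) would show.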


Survey of Duke Students on GPA, Studying, and More

Description

This dataset, gpa_tbl_df, is a tibble containing survey data from 55 Duke University students. It includes information on students' GPA, weekly study hours, average hours of sleep per night, time spent going out per week, and gender. The dataset provides valuable insights into the relationship between academic performance and lifestyle habits.

Usage

data(gpa_tbl_df)

Format

A tibble with 55 observations and 5 variables:

gpa

Grade Point Average (numeric).

studyweek

Number of hours spent studying per week (integer).

sleepnight

Average hours of sleep per night (numeric).

out

Average hours spent going out per week (numeric).

gender

Gender of the student (factor with levels "Male" and "Female").

Details

The dataset name has been kept as 'gpa_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Test Grades in a Beginning Statistics Class

Description

This dataset, Grades_stats_tbl_df, is a tibble containing test grades for a beginning statistics class. It includes the grades of 29 students and provides insights into the performance distribution in an introductory statistics course.

Usage

data(Grades_stats_tbl_df)

Format

A tibble with 29 observations and 1 variable:

grades

Test grades (integer).

Details

The dataset name has been kept as 'Grades_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Relationship Between Watching TV and Grades (Simulated data)

Description

This dataset, gradestv_tbl_df, is a tibble containing simulated data for analyzing the relationship between the number of hours per week students watch TV and their grades in a statistics class. It provides a simple dataset for exploring correlations or regression models in educational settings.

Usage

data(gradestv_tbl_df)

Format

A tibble with 25 observations and 2 variables:

tv

Number of hours per week students watch TV (integer).

grades

Grade obtained in a statistics class (integer).

Details

The dataset name has been kept as 'gradestv_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Graduation Rates for Student Athletes in the Southeastern Conf.

Description

This dataset, Graduate_tbl_df, is a tibble containing data on the graduation rates for student athletes in various schools within the Southeastern Conference. It includes the name of the school, a code for identification, and the graduation rate as a percentage.

Usage

data(Graduate_tbl_df)

Format

A tibble with 12 observations and 3 variables:

school

Name of the school (character).

code

Code for the school (character).

percent

Graduation rate as a percentage (integer).

Details

The dataset name has been kept as 'Graduate_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Study Habits of Students in Two Matched School Districts

Description

This dataset, Habits_tbl_df, is a tibble containing data on the study habits of students in two matched school districts. It includes the study-habit measurement for each district, the difference between the two districts, and the signed rank of that difference.

Usage

data(Habits_tbl_df)

Format

A tibble with 11 observations and 4 variables:

A

Number of study hours in the first school district (integer).

B

Number of study hours in the second school district (integer).

differ

Difference in study hours between the two school districts (integer).

signrks

Signed rank of the difference in study hours, as used in a Wilcoxon signed-rank test (numeric).

Details

The dataset name has been kept as 'Habits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
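
Examples

The column name signrks suggests signed ranks of the paired differences. Assuming that reading, the sketch below shows how such columns can be derived from A and B and how they relate to the Wilcoxon signed-rank statistic. The data are hypothetical, and the direction of the difference (B minus A) is an assumption.

```r
# Hypothetical matched pairs (not the package's data).
# With educationR installed: data("Habits_tbl_df", package = "educationR")
habits_toy <- data.frame(
  A = c(105L, 109L, 115L, 112L, 124L),
  B = c(115L, 103L, 110L, 125L, 99L)
)

# Assumed direction of the difference: B minus A.
habits_toy$differ <- habits_toy$B - habits_toy$A

# Signed rank: rank of |difference|, carrying the sign of the difference.
habits_toy$signrks <- sign(habits_toy$differ) * rank(abs(habits_toy$differ))
print(habits_toy)

# The Wilcoxon statistic V is the sum of the positive signed ranks.
v_by_hand <- sum(habits_toy$signrks[habits_toy$signrks > 0])
w <- wilcox.test(habits_toy$B, habits_toy$A, paired = TRUE)
print(c(by_hand = v_by_hand, wilcox = unname(w$statistic)))
```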


Effects of Open Education Reported by Hedges and Olkin (1985)

Description

This dataset, HedgesOlkin85_df, is a data frame of effect sizes on the effects of open education, as reported by Hedges and Olkin (1985). Each row is one study and gives the effect sizes for attitude toward school and for reading achievement, together with their variances and covariance.

Usage

data(HedgesOlkin85_df)

Format

A data frame with 4 observations and 6 variables:

study

Study identifier (numeric).

d_att

Effect size for attitude toward school (numeric).

d_ach

Effect size for reading achievement (numeric).

var_att

Variance of the attitude effect size (numeric).

cov_att_ach

Covariance between the attitude and achievement effect sizes (numeric).

var_ach

Variance of the achievement effect size (numeric).

Details

The dataset name has been kept as 'HedgesOlkin85_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.

Source

The dataset was taken from the metaSEM package.

References

Hedges, L. V., & Olkin, I. (1985). Statistical methods for meta-analysis. Orlando, FL: Academic Press.
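
Examples

For a multivariate meta-analysis, the three variance and covariance columns are usually assembled into one 2x2 covariance matrix per study. The sketch below shows that step on hypothetical values; the helper vcov_for_study() is introduced here for illustration and is not part of educationR or metaSEM.

```r
# Hypothetical effect sizes with the documented columns (not the package's data).
# With educationR installed: data("HedgesOlkin85_df", package = "educationR")
ho_toy <- data.frame(
  study       = c(1, 2),
  d_att       = c(0.40, 0.10),
  d_ach       = c(-0.20, -0.05),
  var_att     = c(0.050, 0.011),
  cov_att_ach = c(0.020, 0.005),
  var_ach     = c(0.048, 0.011)
)

# Hypothetical helper: 2x2 covariance matrix of (d_att, d_ach) for one row.
vcov_for_study <- function(row) {
  matrix(c(row$var_att,     row$cov_att_ach,
           row$cov_att_ach, row$var_ach),
         nrow = 2, byrow = TRUE,
         dimnames = list(c("att", "ach"), c("att", "ach")))
}

V_list <- lapply(seq_len(nrow(ho_toy)), function(i) vcov_for_study(ho_toy[i, ]))
names(V_list) <- paste0("study_", ho_toy$study)
print(V_list$study_1)

# Correlation between the two effect sizes implied by each matrix.
implied_cor <- sapply(V_list, function(V) cov2cor(V)["att", "ach"])
print(round(implied_cor, 3))
```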


Homework Hours: Private vs. Public High Schools

Description

This dataset, Homework_tbl_df, is a tibble containing data on the number of hours per week high school students spend on homework. The dataset compares students from private and public schools, providing insights into the study habits and academic workload of students in these two types of schools.

Usage

data(Homework_tbl_df)

Format

A tibble with 30 observations and 2 variables:

school

Type of school the student attends, either private or public (character).

time

Number of hours per week the student spends on homework (numeric).

Details

The dataset name has been kept as 'Homework_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
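
A minimal sketch of a two-group comparison on data shaped like Homework_tbl_df. The values are made up and the object names hw, group_means, and welch are hypothetical; only the column names school and time come from the Format section.

```r
# Stand-in data with made-up values in the documented layout.
# With educationR installed: data(Homework_tbl_df, package = "educationR")
hw <- data.frame(
  school = rep(c("private", "public"), each = 4),
  time   = c(21, 18, 25, 20, 15, 17, 12, 19)
)

# Mean weekly homework hours per school type.
group_means <- tapply(hw$time, hw$school, mean)

# Welch two-sample t-test (t.test() does not assume equal variances by default).
welch <- t.test(time ~ school, data = hw)
```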


Job satisfaction and stress level for 9 school teachers

Description

This dataset, Jobsat_tbl_df, is a tibble containing data on job satisfaction and stress levels for 9 school teachers. The dataset provides insights into how teachers' stress levels relate to their job satisfaction, which can be valuable for understanding workplace dynamics and improving teacher well-being.

Usage

data(Jobsat_tbl_df)

Format

A tibble with 9 observations and 2 variables:

wspt

Stress level of the teacher, measured by the Wilson Stress Profile for Teachers (integer).

satisfaction

Job satisfaction level of the teacher (numeric).

Details

The dataset name has been kept as 'Jobsat_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


First Grade Reading Scores: With vs. Without Kindergarten

Description

This dataset, Kinder_tbl_df, is a tibble containing data on the reading scores of first grade children who attended kindergarten versus those who did not. The dataset provides insights into the impact of attending kindergarten on early reading abilities, which can help in evaluating the effectiveness of early childhood education programs.

Usage

data(Kinder_tbl_df)

Format

A tibble with 8 observations and 3 variables:

pair

Identifier of the pair of children being compared (integer).

kinder

Reading score for children who attended kindergarten (integer).

nokinder

Reading score for children who did not attend kindergarten (integer).

Details

The dataset name has been kept as 'Kinder_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
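
Because each row holds the two scores of one pair, a paired analysis is the natural fit. The following is a minimal sketch under that assumption; the values are made up and the object names kinder_df, diffs, and paired are hypothetical.

```r
# Stand-in data with made-up values in the documented layout.
# With educationR installed: data(Kinder_tbl_df, package = "educationR")
kinder_df <- data.frame(
  pair     = 1:4,
  kinder   = c(82L, 75L, 90L, 68L),
  nokinder = c(78L, 76L, 84L, 65L)
)

# Within-pair differences (kindergarten minus no kindergarten).
diffs <- kinder_df$kinder - kinder_df$nokinder

# Paired t-test; equivalent to a one-sample t-test on diffs.
paired <- t.test(kinder_df$kinder, kinder_df$nokinder, paired = TRUE)
```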


Reading skills of 24 matched low-ability students

Description

This dataset, Lowabil_tbl_df, is a tibble containing data on the reading skills of 24 low-ability students arranged in 12 matched pairs. Each row represents one pair and holds the score of the student in the experimental group and the score of the matched student in the control group. The data can be used to evaluate the effectiveness of an intervention or treatment in improving reading skills.

Usage

data(Lowabil_tbl_df)

Format

A tibble with 12 observations and 3 variables:

pair

Pair identifier for the matched students (integer).

experiment

Reading score for students in the experimental group (integer).

control

Reading score for students in the control group (integer).

Details

The dataset name has been kept as 'Lowabil_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
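
The wide layout (one row per pair) can be reshaped to one row per student when a long format is more convenient, for example for plotting. This is a minimal sketch with made-up values; the object names lowabil and long are hypothetical.

```r
# Stand-in data with made-up values in the documented layout.
# With educationR installed: data(Lowabil_tbl_df, package = "educationR")
lowabil <- data.frame(
  pair       = 1:3,
  experiment = c(77L, 74L, 82L),
  control    = c(62L, 70L, 80L)
)

# One row per student: the pair id is repeated once for each group.
long <- data.frame(
  pair  = rep(lowabil$pair, times = 2),
  group = rep(c("experiment", "control"), each = nrow(lowabil)),
  score = c(lowabil$experiment, lowabil$control)
)
```

Applied to the full dataset, the same reshaping would turn the 12 rows into 24, one per student.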


Survey of Duke students and the area of their major

Description

This dataset, major_survey_tbl_df, is a tibble containing information about the GPAs of 218 Duke students and their academic major. The dataset provides insights into the relationship between GPA and the field of study, allowing for an exploration of how academic performance varies across different majors.

Usage

data(major_survey_tbl_df)

Format

A tibble with 218 observations and 2 variables:

gpa

Grade point average of the student (numeric).

major

Academic major of the student (factor with 3 levels).

Details

The dataset name has been kept as 'major_survey_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Standardized Math Test Scores for 30 Students

Description

This dataset, Math_scores_tbl_df, is a tibble containing the standardized math test scores of 30 students. Because it holds a single variable, it is best suited to analyzing the distribution of the scores.

Usage

data(Math_scores_tbl_df)

Format

A tibble with 30 observations and 1 variable:

score

Standardized math test score (integer).

Details

The dataset name has been kept as 'Math_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
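
A minimal sketch of a distribution summary for a single score column such as the one in Math_scores_tbl_df. The values are made up and the object names score, five_num, and bins are hypothetical.

```r
# Stand-in scores with made-up values.
# With educationR installed:
#   data(Math_scores_tbl_df, package = "educationR")
#   score <- Math_scores_tbl_df$score
score <- c(55L, 62L, 70L, 71L, 75L, 80L, 84L, 90L, 93L)

# Tukey five-number summary: min, lower hinge, median, upper hinge, max.
five_num <- fivenum(score)

# Bin counts for a histogram with 10-point bins, computed without plotting.
bins <- hist(score, breaks = seq(50, 100, by = 10), plot = FALSE)
```

The bin width of 10 and the range 50 to 100 are choices made for the stand-in values; the real scores may need different breaks.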


Incoming Freshmen Math Competency at a Community College

Description

This dataset, Mathcomp_tbl_df, is a tibble containing the standardized math competency scores for a group of 31 entering freshmen at a small community college. The dataset provides insights into the initial math skills of incoming students, which could be useful for evaluating preparedness and designing interventions to improve academic success.

Usage

data(Mathcomp_tbl_df)

Format

A tibble with 31 observations and 1 variable:

score

Standardized math competency score (integer).

Details

The dataset name has been kept as 'Mathcomp_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Math Proficiency and SAT Math Scores by State

Description

This dataset, Mathpro_tbl_df, is a tibble containing math proficiency percentages, SAT math scores, and group classifications for 51 U.S. jurisdictions (the United States has 50 states, so the 51 rows presumably include the District of Columbia). The dataset supports comparisons of math proficiency and SAT math scores across states.

Usage

data(Mathpro_tbl_df)

Format

A tibble with 51 observations and 4 variables:

state

Name of the state (character).

sat_math

Average SAT math score for the state (integer).

profic

Math proficiency percentage for the state (numeric).

group

Group classification for the state (integer).

Details

The dataset name has been kept as 'Mathpro_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Graduate School Applicants' Miller Personality Test Scores

Description

This dataset, Miller_scores_tbl_df, is a tibble containing the Miller Personality Test scores for 25 college students applying for graduate school. Because it holds a single variable, it is best suited to analyzing the distribution of the scores.

Usage

data(Miller_scores_tbl_df)

Format

A tibble with 25 observations and 1 variable:

miller

Miller Personality Test score for a student (integer).

Details

The dataset name has been kept as 'Miller_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Music Recognition Learning in Identical Twins

Description

This dataset, Music_tbl_df, is a tibble containing improvement scores for identical twins taught music recognition using two different techniques. Each row represents one pair of twins and records an improvement score under each method, along with the difference between the two scores.

Usage

data(Music_tbl_df)

Format

A tibble with 12 observations and 3 variables:

method1

Improvement scores using the first music recognition technique (integer).

method2

Improvement scores using the second music recognition technique (integer).

differ

Difference in improvement scores between the two methods (integer).

Details

The dataset name has been kept as 'Music_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


State University Freshmen: Parental Education Profiles

Description

This dataset, Parented_tbl_df, is a tibble containing information about the education levels of parents of 200 entering freshmen at a state university. The dataset describes the distribution of parental education levels, broken down by parent type; it contains no student outcome variables.

Usage

data(Parented_tbl_df)

Format

A tibble with 200 observations and 2 variables:

education

Education level of the parent (factor with 6 levels).

parent

Parent type (factor with 2 levels: "mother" or "father").

Details

The dataset name has been kept as 'Parented_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
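
A minimal sketch of a cross-tabulation of the two factors. The rows are made up, the level names level_1 to level_3 are placeholders (the real factor has 6 levels that are not listed here), and the object names parented, counts, and col_share are hypothetical.

```r
# Stand-in data with placeholder education levels and made-up rows.
# With educationR installed: data(Parented_tbl_df, package = "educationR")
parented <- data.frame(
  education = factor(c("level_1", "level_2", "level_2",
                       "level_3", "level_1", "level_3")),
  parent    = factor(c("mother", "father", "mother",
                       "father", "father", "mother"))
)

# Counts of education level by parent type.
counts <- table(parented$education, parented$parent)

# Share of each education level within mothers and within fathers.
col_share <- prop.table(counts, margin = 2)
```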


Racial Prejudice Scores of High School Students

Description

This dataset, Prejudic_tbl_df, is a tibble containing scores measuring racial prejudice among a sample of 25 high school students. The dataset provides insights into attitudes related to racial prejudice within this demographic.

Usage

data(Prejudic_tbl_df)

Format

A tibble with 25 observations and 1 variable:

prejud

Racial prejudice score (integer).

Details

The dataset name has been kept as 'Prejudic_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Effects of Learning Methods on Statistics Students

Description

This dataset, Program_stats_tbl_df, is a tibble containing information about the effects of four different methods of programmed learning on statistics students. It includes the learning method used and the corresponding scores of the students.

Usage

data(Program_stats_tbl_df)

Format

A tibble with 44 observations and 2 variables:

method

Programmed learning method (character).

score

Score obtained by the student (integer).

Details

The dataset name has been kept as 'Program_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
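
With one categorical factor and one numeric response, a one-way analysis of variance is a common starting point. This is a minimal sketch with made-up scores; the method labels method_a to method_d are placeholders and the object names prog, fit, and anova_table are hypothetical.

```r
# Stand-in data with placeholder method labels and made-up scores.
# With educationR installed: data(Program_stats_tbl_df, package = "educationR")
prog <- data.frame(
  method = rep(c("method_a", "method_b", "method_c", "method_d"), each = 3),
  score  = c(70L, 75L, 80L, 60L, 65L, 70L, 85L, 90L, 95L, 72L, 74L, 76L)
)

# The column is documented as character, so convert it to a factor first.
prog$method <- factor(prog$method)

# One-way ANOVA of score on learning method.
fit <- aov(score ~ method, data = prog)
anova_table <- summary(fit)[[1]]
```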


Relationship Between PSAT and SAT Scores

Description

This dataset, PSAT_SAT_tbl_df, is a tibble containing paired data on PSAT and SAT scores for a sample of students. It allows for the exploration of the relationship between performance on the PSAT and SAT exams.

Usage

data(PSAT_SAT_tbl_df)

Format

A tibble with 7 observations and 2 variables:

psat

PSAT scores (integer).

sat

SAT scores (integer).

Details

The dataset name has been kept as 'PSAT_SAT_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.
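
A minimal sketch of exploring the relationship between the two columns with a correlation and a simple linear regression. The scores are made up and the object names scores, r, fit, and pred are hypothetical.

```r
# Stand-in data with made-up values in the documented layout.
# With educationR installed: data(PSAT_SAT_tbl_df, package = "educationR")
scores <- data.frame(
  psat = c(90L, 100L, 110L, 120L, 130L),
  sat  = c(950L, 1010L, 1100L, 1150L, 1260L)
)

# Pearson correlation between the two scores.
r <- cor(scores$psat, scores$sat)

# Least-squares line predicting SAT from PSAT.
fit <- lm(sat ~ psat, data = scores)

# Predicted SAT score for a hypothetical PSAT score of 105.
pred <- predict(fit, newdata = data.frame(psat = 105))
```

With only 7 observations in the real dataset, any fitted line should be read as descriptive rather than as a reliable predictive model.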


Quiz vs Lecture Pulse Rates

Description

This dataset, QuizPulse10_df, is a data frame containing paired data on pulse rates for 10 students during a quiz and a lecture. The dataset allows for the exploration of how pulse rates differ during these two activities.

Usage

data(QuizPulse10_df)

Format

A data frame with 10 observations and 3 variables:

Student

Student ID (integer).

Quiz

Pulse rate during the quiz (integer).

Lecture

Pulse rate during the lecture (integer).

Details

The dataset name has been kept as 'QuizPulse10_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.

Source

The dataset was taken from the Lock5Data package.


Reading Scores vs IQ Scores

Description

This dataset, Readiq_tbl_df, is a tibble containing paired data on reading scores and IQ scores for a sample of individuals. It allows for the exploration of the relationship between reading ability and IQ scores.

Usage

data(Readiq_tbl_df)

Format

A tibble with 14 observations and 2 variables:

reading

Reading scores (integer).

iq

IQ scores (integer).

Details

The dataset name has been kept as 'Readiq_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Simulated data for SAT score improvement

Description

This dataset, sat_improve_tbl_df, is a tibble containing simulated data on SAT score improvements for a sample of students who took a course from an SAT score improvement company. It allows for the exploration of score improvements from students who enrolled in the course.

Usage

data(sat_improve_tbl_df)

Format

A tibble with 30 observations and 1 variable:

sat_improve

SAT score improvement (integer).

Details

The dataset name has been kept as 'sat_improve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.
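
A minimal sketch of a one-sample test of whether the mean improvement exceeds zero. The values are made up and the object names improve and res are hypothetical.

```r
# Stand-in improvements with made-up values.
# With educationR installed:
#   data(sat_improve_tbl_df, package = "educationR")
#   improve <- sat_improve_tbl_df$sat_improve
improve <- c(40L, -10L, 25L, 60L, 5L, 30L, -20L, 45L)

# One-sided, one-sample t-test of H0: mean improvement = 0
# against H1: mean improvement > 0.
res <- t.test(improve, mu = 0, alternative = "greater")
```

The one-sided alternative reflects the question of whether the course raises scores; a two-sided test would be the choice if a decrease were also of interest.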


Schizophrenic Learning Ability After Tranquilizer Dose

Description

This dataset, Schizoph_tbl_df, is a tibble containing standardized exam scores for 13 patients. The data were collected to investigate the learning ability of patients with schizophrenia after they were given a specified dose of a tranquilizer.

Usage

data(Schizoph_tbl_df)

Format

A tibble with 13 observations and 1 variable:

score

Standardized exam scores (integer).

Details

The dataset name has been kept as 'Schizoph_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Final exam scores for twenty students

Description

This dataset, stats_scores_tbl_df, is a tibble containing the final exam scores of 20 students. The data provides insights into the academic performance of the students in their final exam for a statistics course.

Usage

data(stats_scores_tbl_df)

Format

A tibble with 20 observations and 1 variable:

scores

Final exam scores (integer).

Details

The dataset name has been kept as 'stats_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Sleep for 110 college students (simulated)

Description

This dataset, student_sleep_tbl_df, is a tibble containing simulated data on the number of hours that 110 college students slept in a single night. Because it holds a single variable, it is best suited to analyzing the distribution of sleep duration among college students.

Usage

data(student_sleep_tbl_df)

Format

A tibble with 110 observations and 1 variable:

hours

Number of hours slept (numeric).

Details

The dataset name has been kept as 'student_sleep_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.


Hours Studied by Freshmen

Description

This dataset, Study_freshmen_tbl_df, is a tibble containing the number of hours studied per week by a sample of 50 freshmen. Because it holds a single variable, it is best suited to exploring the distribution of weekly study time among college freshmen.

Usage

data(Study_freshmen_tbl_df)

Format

A tibble with 50 observations and 1 variable:

hours

Number of hours studied per week (integer).

Details

The dataset name has been kept as 'Study_freshmen_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Tennessee Self-Concept Scores for Gifted Students

Description

This dataset, Tennessee_gifted_df, is a data frame containing Tennessee self-concept scores for 20 gifted high school students. It allows for the exploration of the distribution of self-concept scores among gifted students.

Usage

data(Tennessee_gifted_df)

Format

A data frame with 20 observations and 1 variable:

score

Self-concept scores for gifted students (numeric).

Details

The dataset name has been kept as 'Tennessee_gifted_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.


Textbook Costs for Different Fields

Description

This dataset, TextbookCosts_df, is a data frame containing information about the number of textbooks and their total costs for different fields of study. It allows for the exploration of textbook expenses in various academic disciplines.

Usage

data(TextbookCosts_df)

Format

A data frame with 40 observations and 3 variables:

Field

Field of study (factor with 4 levels).

Books

Number of textbooks (integer).

Cost

Total cost of textbooks, in dollars (integer).

Details

The dataset name has been kept as 'TextbookCosts_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.

Source

The dataset was taken from the Lock5Data package.
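
A minimal sketch of comparing textbook expenses across fields. The rows are made up, the field labels field_a to field_d are placeholders for the 4 real levels, and the names books, cost_per_book, and mean_cost are hypothetical.

```r
# Stand-in data with placeholder field labels and made-up values.
# With educationR installed: data(TextbookCosts_df, package = "educationR")
books <- data.frame(
  Field = factor(rep(c("field_a", "field_b", "field_c", "field_d"), each = 2)),
  Books = c(2L, 3L, 1L, 4L, 1L, 2L, 3L, 3L),
  Cost  = c(60L, 90L, 45L, 120L, 150L, 210L, 99L, 120L)
)

# Average cost per textbook for each row.
books$cost_per_book <- books$Cost / books$Books

# Mean total cost by field of study.
mean_cost <- aggregate(Cost ~ Field, data = books, FUN = mean)
```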


Student Admissions at UC Berkeley

Description

This dataset, UCBAdmissions_table, is a contingency table containing aggregate data on applicants to graduate school at UC Berkeley in 1973. The data is classified by admission status, gender, and department, and it provides insights into admissions patterns for the six largest departments.

Usage

data(UCBAdmissions_table)

Format

A contingency table with 24 cells (2 × 2 × 6), classified by:

Admit

Admission status (Admitted, Rejected).

Gender

Gender of the applicants (Male, Female).

Dept

Department (A, B, C, D, E, F).

The table entries are the number of applicants.

Details

The dataset name has been kept as 'UCBAdmissions_table' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'table' indicates that the dataset is a table object. The original content has not been modified in any way.

Source

The dataset was taken from the datasets package.
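
A minimal sketch of comparing admission rates overall and within departments. It uses UCBAdmissions from the datasets package as a stand-in, on the assumption (taken from the Source and Details sections) that UCBAdmissions_table is an unmodified copy. The object names tab, overall, and dept_rates are hypothetical.

```r
# Stand-in: the original table shipped with base R.
# With educationR installed:
#   data(UCBAdmissions_table, package = "educationR")
#   tab <- UCBAdmissions_table
tab <- datasets::UCBAdmissions

# Collapse over departments: Gender x Admit counts, then row proportions.
overall <- prop.table(margin.table(tab, c(2, 1)), margin = 1)

# Admission rate within each Gender x Dept combination.
dept_rates <- prop.table(tab, margin = c(2, 3))["Admitted", , ]
```

Comparing overall with dept_rates shows why this table is a standard illustration of Simpson's paradox: the aggregate admission rate is higher for male applicants, yet within Department A it is higher for female applicants.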