Package 'SGPdata'

Title: Exemplar Data Sets for Student Growth Percentiles (SGP) Analyses
Description: Data sets utilized by the 'SGP' package as exemplars for users to conduct their own student growth percentiles (SGP) analyses.
Authors: Damian W. Betebenner [aut, cre], Adam R. Van Iwaarden [aut], Ben Domingue [aut]
Maintainer: Damian W. Betebenner <[email protected]>
License: GPL-3
Version: 28.0-0.0
Built: 2024-09-14 06:38:28 UTC
Source: CRAN

Help Index


Exemplar Data Sets for Student Growth Percentile (SGP) Analyses

Description

The 'SGPdata' package contains five exemplar data sets utilized by the 'SGP' package for testing and examples.

Details

Package: SGPdata
Type: Package
Version: 28.0-0.0
Date: 2024-7-14
License: GPL-3
LazyLoad: Yes
LazyData: Yes

Longitudinal Student Assessment Data

Description

A dataset comprising a subset of five years of state assessment reading data suitable for student growth percentile and percentile growth projection/trajectory analyses. The dataset is used in examples provided in the documentation for the studentGrowthPercentiles and
studentGrowthProjections functions, and serves as an exemplar for user construction of their own datasets for growth percentile analyses.

Format

A data frame of student level observations for five years on the following variables.

ID

Student ID

GRADE_2020

Student Grade Tested 2020, possibly missing

GRADE_2021

Student Grade Tested 2021, possibly missing

GRADE_2022

Student Grade Tested 2022, possibly missing

GRADE_2023

Student Grade Tested 2023, possibly missing

GRADE_2024

Student Grade Tested 2024, possibly missing

SS_2020

Student Scale Score 2020, possibly missing

SS_2021

Student Scale Score 2021, possibly missing

SS_2022

Student Scale Score 2022, possibly missing

SS_2023

Student Scale Score 2023, possibly missing

SS_2024

Student Scale Score 2024, possibly missing

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Multi-year operational analyses benefit from putting data in long format. See sgpData_LONG and associated documentation for a comprehensive account.

Source

Anonymized student level state assessment data

See Also

sgpData_LONG


Lookup table associating INSTRUCTOR_NUMBER with student ID by YEAR and CONTENT_AREA

Description

A lookup table comprising student IDs associated with INSTRUCTOR_NUMBERa subset of five years of state assessment reading and mathematics data suitable for student growth percentile and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique teacher by student by year by content area combination.

Format

A data frame of student level observations for five years and two content areas for the following variables.

ID

Unique Student Identification Number

CONTENT_AREA

Content area for student/teacher record (Reading or Mathematics)

YEAR

Year for student/teacher record

INSTRUCTOR_NUMBER

Unique instructor number identifier

INSTRUCTOR_LAST_NAME

Instructor last name

INSTRUCTOR_FIRST_NAME

Instructor first name

INSTRUCTOR_WEIGHT

Proportional weight associated with student/teacher record

INSTRUCTOR_ENROLLMENT_STATUS

Indicator of full enrollment status of student with teacher

Source

Anonymized student/teacher lookup table in LONG format

See Also

sgpData_LONG


Longitudinal Student Assessment Data in LONG format

Description

A dataset comprising a subset of five years of state assessment reading and mathematics data suitable for student growth percentile (SGP) and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique student, by year by content area combination. The example data is used in examples and illustration in the enclosed documentation to show how SGPs can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for five years and two content areas for the following variables.

ID

Unique Student Identification Number

LAST_NAME

Student last name

FIRST_NAME

Student first name

CONTENT_AREA

Content area for student observation (Reading or Mathematics)

YEAR

Year for student observation

GRADE

Grade level of student observation

SCALE_SCORE

Student Scale Score

ACHIEVEMENT_LEVEL

Achievement level associated with student scale score

GENDER

Student gender

ETHNICITY

Student ethnicity

FREE_REDUCED_LUNCH_STATUS

Student free/reduced lunch status

ELL_STATUS

Student English Language Learner status

IEP_STATUS

Student Individual Education Plan status

GIFTED_AND_TALENTED_PROGRAM_STATUS

Student Gifted and Talented Program status

SCHOOL_NUMBER

School number associated with student record

SCHOOL_NAME

School name associated with school number and student record

EMH_LEVEL

Elementary, Middle, High school indicator for school attended

DISTRICT_NUMBER

District number associated with student record

DISTRICT_NAME

District name associated with school number and student record

SCHOOL_ENROLLMENT_STATUS

Indicator of full academic year student enrollment in school

DISTRICT_ENROLLMENT_STATUS

Indicator of full academic year student enrollment in school

STATE_ENROLLMENT_STATUS

Indicator of full academic year student enrollment in school

VALID_CASE

Indicator of whether the case is valid or invalid

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Operational users of this package are advised to use data formatted similar to the sgpData_LONG with the higher level functions prepareSGP, analyzeSGP, combineSGP, summarizeSGP, visualizeSGP, and outputSGP

Source

Anonymized student level state assessment data in LONG format

See Also

sgpData


Longitudinal Student Assessment Data in LONG format with COVID-19 2020 interruption

Description

A dataset comprising a subset of seven years of state assessment ELA and mathematics data across an 8 year span suitable for student growth percentile (SGP) and percentile growth projection/trajectory analyses. The dataset is missing 2020 data to help users model COVID related interuptions to student growth. The dataset is in LONG format with each record representing a unique student, by year by content area combination. The example data is used in examples and illustration in the enclosed documentation to show how SGPs can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for seven years (across an 8 year span) and two content areas for the following variables.

VALID_CASE

Indicator of whether the case is valid or invalid

CONTENT_AREA

Content area for student observation (ELA or Mathematics)

YEAR

Year for student observation

ID

Unique Student Identification Number

IEP_STATUS

Student Individual Education Plan status

GRADE

Grade level of student observation

SCALE_SCORE

Student Scale Score

SCALE_SCORE_without_COVID_IMPACT

Student Scale Score without COVID impact. Original scale score

ACHIEVEMENT_LEVEL

Achievement level associated with student scale score

ETHNICITY

Student ethnicity

FREE_REDUCED_LUNCH_STATUS

Student free/reduced lunch status

ELL_STATUS

Student English Language Learner status

GENDER

Student gender

SCHOOL_NUMBER

School number associated with student record

SCHOOL_NAME

School name associated with school number and student record

DISTRICT_NUMBER

District number associated with student record

DISTRICT_NAME

District name associated with school number and student record

Note

All datasets used with studentGrowthPercentiles and studentGrowthProjections must be specifically formatted as wide format files. The first variable/column is the student ID variable. The next set of columns provide the grade of the student across all the years provided in the dataset (possibly missing). The last set of columns provide the scales scores of the student in the respective grades. Operational users of this package are advised to use data formatted similar to the sgpData_LONG_COVID with the higher level functions prepareSGP, analyzeSGP, combineSGP, summarizeSGP, visualizeSGP, and outputSGP

Source

Anonymized and altered student level state assessment data in LONG format

See Also

sgpData_LONG


Longitudinal Student Assessment Data in LONG format with DATE for SGPt analyses.

Description

A dataset comprising a subset of 3 years (and time periods) of interim assessment early literacy, reading and mathematics data suitable for time dependent student growth percentile (SGPt) and percentile growth projection/trajectory analyses. The dataset is in LONG format with each record representing a unique student, by year/time period by content area combination. The example data is used in examples and illustrations in the enclosed documentation to show how SGPts can be operationalized across years with the results being used to produce a variety of high quality visualizations both at the aggregate (e.g., school) and individual level.

Format

A data frame of student level observations for five years and two content areas for the following variables.

COUNTRY

Variable indicating the COUNTRY associated with test record.

STATE

State associated with the test record.

VALID_CASE

VALID_CASE variable.

CONTENT_AREA

Variable indicating content area associated with test record.

YEAR

Variable indicating the year associated with the test record.

DATE

Variable indicating the year associated with the test record: Must be in YYYY-MM-DD format.

ID

Unique student identifier associated with test record.

GRADE

Variable indicating the grade associated with the test record.

SCALE_SCORE

Variable providing the scale score associated with the test record.

SCALE_SCORE_RASCH

Variable providing the Rasch score associated with the test record.

SEM

Variable indicating the standard error of measurement associated with the scale score.

ACHIEVEMENT_LEVEL

Variable indicating the achievement level associated with the scale score.

Source

Anonymized student level interim assessment data in LONG format

See Also

sgpData_LONG