| Title: | Ranked Set Sampling Genome-Wide Association Studies Toolkit |
|---|---|
| Description: | Provides methods for genome-wide association studies (GWAS) using ranked set sampling (RSS) designs. The package includes tools for ranked set sample selection, standard and RSS-based association analyses, simulation of genotype and phenotype data, statistical comparison of RSS and simple random sampling (SRS) approaches, visualization of GWAS results, and power analysis under alternative sampling schemes. |
| Authors: | Khalid Ul Islam Rather [aut, cre] |
| Maintainer: | Khalid Ul Islam Rather <[email protected]> |
| License: | GPL-3 |
| Version: | 0.1.1 |
| Built: | 2026-06-18 19:34:11 UTC |
| Source: | https://github.com/cran/RankGWASK |
Compare top SNPs identified using RSS and SRS.
compareRSSvsSRS( data, phenotype = "Yield", ranking = "PlantHeight", set.size = 5 )compareRSSvsSRS( data, phenotype = "Yield", ranking = "PlantHeight", set.size = 5 )
data |
Dataset. |
phenotype |
Phenotype variable. |
ranking |
Ranking variable. |
set.size |
RSS set size. |
A list of GWAS results.
data(gwas_data) comp <- compareRSSvsSRS(gwas_data)data(gwas_data) comp <- compareRSSvsSRS(gwas_data)
Simulated genome-wide association study (GWAS) dataset containing 500 individuals, 100 SNP markers, a Yield phenotype, and a PlantHeight ranking variable.
gwas_datagwas_data
A data frame with 500 rows and 103 variables.
The dataset contains:
ID: Individual identifier.
SNP1-SNP100: Genotype markers coded as 0, 1, or 2.
Yield: Quantitative phenotypic trait.
PlantHeight: Ranking variable used in Ranked Set Sampling (RSS).
This dataset was generated for illustrating Genome-Wide Association Studies (GWAS) under Ranked Set Sampling (RSS) and Simple Random Sampling (SRS) designs. SNP markers are coded as 0, 1, or 2, representing genotype classes.
Simulated data generated using simulateGWAS().
data(gwas_data) dim(gwas_data) names(gwas_data)[1:10]data(gwas_data) dim(gwas_data) names(gwas_data)[1:10]
Perform GWAS after Ranked Set Sampling.
rssGWAS(data, phenotype = "Yield", ranking = "PlantHeight", set.size = 5)rssGWAS(data, phenotype = "Yield", ranking = "PlantHeight", set.size = 5)
data |
Dataset. |
phenotype |
Phenotype column. |
ranking |
Ranking variable. |
set.size |
RSS set size. |
GWAS results.
data(gwas_data) res <- rssGWAS( gwas_data, phenotype = "Yield", ranking = "PlantHeight" )data(gwas_data) res <- rssGWAS( gwas_data, phenotype = "Yield", ranking = "PlantHeight" )
Create Manhattan plot from GWAS results.
rssManhattan(results)rssManhattan(results)
results |
Output from standardGWAS() or rssGWAS(). |
Manhattan plot.
data(gwas_data) res <- standardGWAS(gwas_data) rssManhattan(res)data(gwas_data) res <- standardGWAS(gwas_data) rssManhattan(res)
Create QQ plot from GWAS results.
rssQQ(results)rssQQ(results)
results |
GWAS results. |
QQ plot.
data(gwas_data) res <- standardGWAS(gwas_data) rssQQ(res)data(gwas_data) res <- standardGWAS(gwas_data) rssQQ(res)
Performs Ranked Set Sampling (RSS) by dividing data into sets, ranking within sets, and selecting order statistics.
rssSample(data, ranking, set.size = 5, m = 1, r = 1)rssSample(data, ranking, set.size = 5, m = 1, r = 1)
data |
A data frame. |
ranking |
Character string specifying ranking variable. |
set.size |
Integer: number of units per set. |
m |
Integer: number of cycles (default 1). |
r |
Integer: number of repetitions (default 1). |
A data frame containing RSS selected observations.
data(gwas_data) rssSample(gwas_data, "PlantHeight", set.size = 5, m = 2, r = 1)data(gwas_data) rssSample(gwas_data, "PlantHeight", set.size = 5, m = 2, r = 1)
Generate simulated GWAS genotype and phenotype data.
simulateGWAS(n = 500, nSNP = 100, seed = 123)simulateGWAS(n = 500, nSNP = 100, seed = 123)
n |
Number of individuals. |
nSNP |
Number of SNP markers. |
seed |
Random seed. |
A data frame containing simulated SNP markers, phenotype values and ranking variable.
sim <- simulateGWAS() head(sim)sim <- simulateGWAS() head(sim)
Draw a simple random sample.
srsSample(data, n)srsSample(data, n)
data |
Data frame. |
n |
Sample size. |
A sampled data frame.
Perform single-marker genome-wide association analysis using linear regression.
standardGWAS(data, phenotype = "Yield")standardGWAS(data, phenotype = "Yield")
data |
A data frame containing SNP markers and phenotype. |
phenotype |
Character string specifying the phenotype column. |
A data frame containing SNP effects, standard errors, and p-values sorted by significance.
data(gwas_data) res <- standardGWAS( gwas_data, phenotype = "Yield" ) head(res)data(gwas_data) res <- standardGWAS( gwas_data, phenotype = "Yield" ) head(res)