Title: | Multivariate Genomic Selection |
---|---|
Description: | Estimating trait heritability and handling overfitting. This package includes a collection of functions for (1) estimating genetic variance-covariances and calculate trait heritability; and (2) handling overfitting by calculating the variance components and the heritability through cross validation. |
Authors: | Zhenyu Jia |
Maintainer: | Zhenyu Jia <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.3 |
Built: | 2024-11-20 06:25:37 UTC |
Source: | CRAN |
The package GSMX consists of the functions to estimate genetic variance-covariances and calculate trait heritability, and handle overfitting by calculating the variance components and the heritability through cross validation.
Package: | GSMX |
Type: | GSMX |
Version: | 1.3 |
Date: | 2017-10-16 |
License: GPL>=2 | |
Genomic selection (GS) is a form of marker-assisted selection (MAS) where markers across the entire genome are used such that all quantitative trait loci (QTL) are in linkage disequilibrium with at least one marker.
Increased number of markers and their density along with increased sample size improve the resolution of QTL mapping, and therefore improve estimated breeding values and genetic gain.
However, the genetic variance-covariances were estimated from training samples using a large number of markers including many trait-irrelevant markers, and then being used for calculating trait heritability, leading to severe overfitting.
In this package, we developed an algorithm to handle such overfitting by calculating the variance components and the heritability through cross validation.
This method provides an accurate estimation of trait heritability (equivalent to trait predictability), and objectively reflects the level of applicability of the GS models to other breeding materials..
Zhenyu Jia
Control overfitting heritability in genomic selection through cross validation
library(GSMX) data(pseudo.kin) data(pseudo.data) myfit=gsm(pseudo.data, pseudo.kin, nfold=5)
library(GSMX) data(pseudo.kin) data(pseudo.data) myfit=gsm(pseudo.data, pseudo.kin, nfold=5)
The function estimate genetic variance-covariances and calculate heritability and predictability for multivariate genetic selection using cross validation
gsm(mydata, mykin, nfold)
gsm(mydata, mykin, nfold)
mydata |
dataset with two traits |
mykin |
kinship matrix |
nfold |
number of folds for cross validation |
res |
Results |
library(GSMX) data(pseudo.kin) data(pseudo.data) myfit=gsm(pseudo.data, pseudo.kin, nfold=5)
library(GSMX) data(pseudo.kin) data(pseudo.data) myfit=gsm(pseudo.data, pseudo.kin, nfold=5)
Simulated dataset
library(GSMX) data(pseudo.data) length(pseudo.data)
library(GSMX) data(pseudo.data) length(pseudo.data)
Simulated kinship matrix
library(GSMX) data(pseudo.kin) dim(pseudo.kin)
library(GSMX) data(pseudo.kin) dim(pseudo.kin)