Title: | Transmission Ratio Distortion |
---|---|
Description: | Transmission Ratio Distortion (TRD) is a genetic phenomenon where the two alleles from either parent are not transmitted to the offspring at the expected 1:1 ratio under Mendelian inheritance, leading to spurious signals in genetic association studies. Functions in this package are developed to account for this phenomenon using loglinear model and Transmission Disequilibrium Test (TDT). Some population information can also be calculated. |
Authors: | Lam Opal Huang |
Maintainer: | Lam Opal Huang <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.1 |
Built: | 2024-12-21 06:27:37 UTC |
Source: | CRAN |
Transmission Ratio Distortion (TRD) is a genetic phenomenon where two alleles from either parent are not transmitted to the offspring at the expected 1:1 ratio under Mendelian inheritance. Occurrence of TRD in general population can lead to spurious association signals for the study on case populations, including genotype relative risk (GRR) and imprinting effect. Therefore, it is necessary to adjust for TRD in association studies when TRD is suspected to be present.
This package has implemented the loglinear model [Weinberg et al. 1998], the adjusted loglinear model for non-sex-of-parent-specific TRD (NST) [Huang et al. 2014] and sex-of-parent-specific TRD (ST) [Huang et al. 2015]; Transmission Disequilibrium Test(TDT)[Spielman et al. 1993], and adjusted TDT [Labbe et al. 2013].
A simulation function is included in the package, which generates a population of case- and control-trios, with parent-of-origin information for disease allele in heterozygous child (rtrios), based on penetrance, ratio of maternal to paternal-induced disease risk for a heterozygous child, transmission probability of minor allele for mother and father, and minor allele frequency. Population and sample sizes can be specified by user. The data can be used directly to fit loglinear model (ll) or to perform a TDT (tdt) with or without adjustment for TRD. Count data for the 15/16 mother-father-child category can also be computed (strata.cnt).
The loglinear model can take input as trios or count data with or without parent-of-origin information. Types of loglinear model which can be fitted include genotype relative risk, gene-dosage, dominant models, with (grri, gdi, domi) or without (grr, gd, dom) imprinting effect. Real dataset of same structure can also be used to fit all types of loglinear model. Other functions are available to calculate population information, such as genotype counts (gcount), minor allele frequency (find.maf), transmission probability of minor allele (find.t), and theoretical prevalence of disease (prev). Note that trios with missing data are discarded when fitting the loglinear models.
Package: | TRD |
Type: | Package |
Version: | 1.1 |
Date: | 2015-11-10 |
License: | 3.2.2 |
Lam Opal Huang
Maintainer: Lam Opal Huang <[email protected]>
Huang LO, Infante-Rivard C, Labbe A, (2015). Modeling sex-of-parent-specific Transmission Ratio Distortion and imprinting effect in loglinear model using case-trios.(Working manuscript)
Huang LO, Infante-Rivard C, Labbe A, (2015). Adjustment for Transmission Ratio Distortion in the analysis of case-parent trios using a loglinear model.(Working manuscript)
Labbe A, Huang LO, Infante-Rivard C (2013). Transmission Ratio Distortion: A Neglected Phenomenon with Many Consequences in Genetic Analysis and Population Genetics:Naumova AK and Greenwood CMT (eds.), Epigenetics and Complex Traits: 265-85, Springer.
Huang LO, Labbe A, Infante-Rivard C (2012). Transmission ratio distortion: review of concept and implications for genetic association studies. Human Genetics 132: 245-63.
Weinberg CR, Wilcox AJ, Lie RT (1998) A log-linear approach to case-parent-triad data: assessing effects of disease genes that act either directly or through maternal effects and that may be subject to parental imprinting. American Journal of Human Genetics 62: 969-78.
Spielman RS, McGinnis RE, Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). American Journal of Human Genetics 52: 506.
Compute MAF in child, parents and triad of case-trios, control-trios, or the whole sample. Minor allele normally refers to the disease allele, which has lower frequency than the alternative allele(s). Trios with partially missing data are included.
find.maf(sample)
find.maf(sample)
sample |
case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. |
cases |
MAF in affected children, parents, or triads of case-trios |
ctrls |
MAF in unaffected children, parents, or triads of control-trios |
sample |
MAF in children, parents, or triads of the whole sample |
Lam Opal Huang
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) find.maf(trios$case) find.maf(trios$ctrl) find.maf(rbind(trios$case,trios$ctrl))
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) find.maf(trios$case) find.maf(trios$ctrl) find.maf(rbind(trios$case,trios$ctrl))
Calculate the transmission ratio of the minor allele from parents to offspring for case- and control-trios, in mother, father, or both parents. Minor allele normally refers to the disease allele, which has lower frequency than the alternative allele(s). Note that trios with missing data are discarded.
find.t(sample)
find.t(sample)
sample |
case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. |
cases |
transmission ratio of minor allele from mother, father and both parents of case-trios to the offspring |
ctrls |
transmission ratio of minor allele from mother, father and both parents of control-trios to the offspring |
Lam Opal Huang
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) find.t(trios$case) find.t(trios$ctrl) find.t(rbind(trios$case,trios$ctrl))
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) find.t(trios$case) find.t(trios$ctrl) find.t(rbind(trios$case,trios$ctrl))
Use the input trios data to calculate the transmission ratio of the minor allele from parents to offspring for both case- and control-trios population.
findr(sample)
findr(sample)
sample |
A sample of case- and control-trios with first column as the disease status (1=case, 0=control), second column as mother genotype, third father genotype and fourth child genotype. |
r1 |
transmission ratio of minor allele in case-trios |
r2 |
transmission ratio of minor allele in control-trios |
Lam Opal Huang
trios=rtrios(1000,50,0.1,0.2,0.3,0.5,0.5,0.1) findr(rbind(trios[[1]],trios[[2]]))
trios=rtrios(1000,50,0.1,0.2,0.3,0.5,0.5,0.1) findr(rbind(trios[[1]],trios[[2]]))
Calculate genotype counts (additive model) of case-trios, control-trios and the whole sample, separately for children, mothers and fathers. Additive model means the genotype is the number of copies of minor allele that the individual carries assuming a bi-allelic locus. This genotype will be 0 for homozygous wild-type, 1 for heterozygous, and 2 for homozygous mutant. Minor allele normally refers to the disease allele, which has lower frequency than the alternative allele(s). Trios with partially missing data are included.
gcount(sample)
gcount(sample)
sample |
case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. |
cases |
genotype counts of children, mothers and fathers, in case-trios |
ctrls |
genotype counts of children, mothers and fathers, in control-trios |
sample |
genotype counts of children, mothers and fathers, in the whole sample |
Lam Opal Huang
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) gcount(trios$case) gcount(trios$ctrl) gcount(rbind(trios$case,trios$ctrl))
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) gcount(trios$case) gcount(trios$ctrl) gcount(rbind(trios$case,trios$ctrl))
Use input data, either sample of case-trios ('sample') or count data for 15/16-MFC-genotype categories ('counts') to fit loglinear model with (Model 2) and without (Model 1) adjustment for TRD, model type could be genotype relative risk ('grr'), dominant ('dom'), gene-dosage ('gd'), genotype relative risk with imprinting variable ('grri'), dominant with imprinting variable ('domi'), or gene-dosage with imprinting variable ('gdi'). Note that the models fit only complete trios with no missing data.
ll(data, data.type, tmother, tfather, model.type)
ll(data, data.type, tmother, tfather, model.type)
data |
input data can be (1) case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. (2) a count dataset for the 15 or 16-MFC-genotype categories obtainable from the function strata.cnt(..), with the latter having 2 categories (111M & 111F) for triply heterozygous trios. |
data.type |
type of data: 'sample' or 'counts' |
tmother |
transmission ratio of minor alleles from mother to child |
tfather |
transmission ratio of minor alleles from father to child |
model.type |
'grr', 'dom', 'gd', 'grri', 'domi', or 'gdi' |
If the following warning messages occur, increase the population and sample size. Warning messages: glm.fit: fitted rates numerically 0 occurred
Summary table |
Summary statistics of fitting the loglinear model |
Lam Opal Huang
Huang LO, Infante-Rivard C, Labbe A, (2015). Modeling sex-of-parent-specific Transmission Ratio Distortion and imprinting effect in loglinear model using case-trios.(Working manuscript)
Huang LO, Infante-Rivard C, Labbe A, (2015). Adjustment for Transmission Ratio Distortion in the analysis of case-parent trios using a loglinear model.(Working manuscript)
Weinberg CR, Wilcox AJ, Lie RT (1998) A log-linear approach to case-parent-triad data: assessing effects of disease genes that act either directly or through maternal effects and that may be subject to parental imprinting. American Journal of Human Genetics 62: 969-78.
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) out=ll(trios$case,'sample',0.5,0.5,'grr') cnt=strata.cnt(trios$case) out=ll(cnt,'counts',0.5,0.5,'grr')
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) out=ll(trios$case,'sample',0.5,0.5,'grr') cnt=strata.cnt(trios$case) out=ll(cnt,'counts',0.5,0.5,'grr')
Use 3 penetrance factors for genotype 0, 1 and 2 (additive model), ratio of maternal to paternal-induced disease risk for a heterozygous child, tranmission ratio of minor allele from mother and father to offspring, and minor allele frequency (MAF) to calculate the disease prevalence. Additive model means the genotype is the number of copies of minor allele that the individual carries assuming a bi-allelic locus. This genotype will be 0 for homozygous wild-type, 1 for heterozygous, and 2 for homozygous mutant. Minor allele normally refers to the disease allele, which has lower frequency than the alternative allele(s).
prev(f0, f1, f2, g, tmother, tfather, maf)
prev(f0, f1, f2, g, tmother, tfather, maf)
f0 |
penetrance for individuals with genotype 0 (additive model) |
f1 |
penetrance for individuals with genotype 1 (additive model) |
f2 |
penetrance for individuals with genotype 2 (additive model) |
g |
Ratio of maternal- over paternal-induced disease risk |
tmother |
transmission ratio of minor allele from mother to child |
tfather |
transmission ratio of minor allele from father to child |
maf |
minor allele frequency |
d |
disease prevalence |
Lam Opal Huang
prev(0.1,0.2,0.3,1,0.5,0.5,0.1)
prev(0.1,0.2,0.3,1,0.5,0.5,0.1)
Using penetrance factors, ratio of maternal- to paternal-induced disease risk for a heterozygous child, tranmission ratio of minor allele from mother and father to offspring, and minor allele frequency (MAF) to generate a simulated population of case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column as the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. Minor allele normally refers to the disease allele, which has lower frequency than the alternative allele(s).
rtrios(n, ssize, f0, f1, f2, g, tmother, tfather, maf)
rtrios(n, ssize, f0, f1, f2, g, tmother, tfather, maf)
n |
simulated population size |
ssize |
sample size |
f0 |
penetrance for individuals with genotype 0 (additive model) |
f1 |
penetrance for individuals with genotype 1 (additive model) |
f2 |
penetrance for individuals with genotype 2 (additive model) |
g |
Ratio of maternal- over paternal-induced disease risk |
tmother |
transmission ratio of minor allele from mother to child |
tfather |
transmission ratio of minor allele from father to child |
maf |
minor allele frequency |
case |
case-trios data with parent-of-origin indicator |
ctrl |
control-trios data with parent-of-origin indicator |
Lam Opal Huang
find.maf
find.t
gcount
ll
prev
strata.cnt
tdt
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1)
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1)
Count trios with mother-father-child genotypes MFC (additive model) as mating type 1: 222; mating type 2: 212,122,211,121; mating type 3: 201,021; mating type 4: 112,(111M,111F) or 111,110; mating type 5: 101,100,011,010; mating type 6: 000; where 111M and 111F are for categories with heterozygous children inherited the disease allele from the mother and father, respectively. Additive model means the genotype is the number of copies of minor allele that the individual carries assuming a bi-allelic locus. This genotype will be 0 for homozygous wild-type, 1 for heterozygous, and 2 for homozygous mutant. Note that trios with missing data are discarded.
strata.cnt(sample)
strata.cnt(sample)
sample |
case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. |
cnt.MFC |
count of stratum with genotypes MFC (additive model) |
Lam Opal Huang
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) strata.cnt(trios$case) strata.cnt(trios$ctrl)
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) strata.cnt(trios$case) strata.cnt(trios$ctrl)
Compute TDT on case- and control-trios, and adjusted TDT on case-trios using transmission ratio of minor allele calculated from control-trios [Labbe et al. 2013]. When control-trios are not supplied in the input data, only TDT on case-trios is computed. Similarly, when case-trios are not supplied, only TDT on control-trios is computed. The adjusted TDT uses both case- and control-trios, and takes into account of TRD. Note that trios with missing data are discarded.
tdt(sample)
tdt(sample)
sample |
case- and/or control-trios dataset with first column as disease status (1=case, 0=control), the second, third and forth columns as maternal, paternal and child genotypes. The fifth column, if entered as part of the input data, is the parent-of-origin indicator of disease allele in a heterozygous child (a value of 1 indicates the mother transmitted the disease allele and 0 indicates the father transmitted). The parent-of-origin indicator is only meaningful for a heterozygous child. |
case statistics |
TDT statistics for case-trios |
control statistics |
TDT statistics for control-trios |
Adjusted-case statistics |
Adjusted TDT statistics for case-trios using transmission ratio of minor allele in control-trios |
case p-value |
TDT p-value for case-trios |
control p-value |
TDT p-value for control-trios |
Adjusted-case p-value |
Adjusted TDT p-value for case-trios using transmission ratio of minor allele in control-trios |
Lam Opal Huang
Labbe A, Huang LO, Infante-Rivard C (2013). Transmission Ratio Distortion: A Neglected Phenomenon with Many Consequences in Genetic Analysis and Population Genetics:Naumova AK and Greenwood CMT (eds.), Epigenetics and Complex Traits: 265-85, Springer.
Spielman RS, McGinnis RE, Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). American Journal of Human Genetics 52: 506.
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) tdt(trios$case) tdt(trios$ctrl) tdt(rbind(trios$case,trios$ctrl))
trios=rtrios(100000,500,0.1,0.2,0.3,1,0.5,0.5,0.1) tdt(trios$case) tdt(trios$ctrl) tdt(rbind(trios$case,trios$ctrl))
Gives the diagnoal counts of McNemar Table for heterozygous transmission.
tdt.cnt(sample)
tdt.cnt(sample)
sample |
case- or control-trios dataset with first column as disease status (1=case, 0=control), second column as maternal genotype, third paternal genotype, and fourth child genotype. |
b |
transmission count of minor over major allele in heterozygous parents |
c |
transmission count of major over minor allele in heterozygous parents |
Lam Opal Huang
Spielman RS, McGinnis RE, Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). American Journal of Human Genetics 52: 506.
trios=rtrios(1000,50,0.1,0.2,0.3,0.5,0.5,0.1) case.cnt=tdt.cnt(trios$case) ctrl.cnt=tdt.cnt(trios$control)
trios=rtrios(1000,50,0.1,0.2,0.3,0.5,0.5,0.1) case.cnt=tdt.cnt(trios$case) ctrl.cnt=tdt.cnt(trios$control)