Title: | Gene Set Analysis Exploiting Pathway Topology |
---|---|
Description: | Using Gaussian graphical models we propose a novel approach to perform pathway analysis using gene expression. Given the structure of a graph (a pathway) we introduce two statistical tests to compare the mean and the concentration matrices between two groups. Specifically, these tests can be performed on the graph and on its connected components (cliques). The package is based on the method described in Massa M.S., Chiogna M., Romualdi C. (2010) <doi:10.1186/1752-0509-4-121>. |
Authors: | Sofia Massa, Gabriele Sales |
Maintainer: | Gabriele Sales <[email protected]> |
License: | GPL-3 |
Version: | 1.5.0 |
Built: | 2024-11-01 11:17:56 UTC |
Source: | CRAN |
Test the equality of the means in two experimental conditions for all the cliques of a pathway, conditioning on the results of the test on the concentration matrices
clique.mean.test(y1,y2,dag,alpha,perm.num=1000,paired=FALSE)
clique.mean.test(y1,y2,dag,alpha,perm.num=1000,paired=FALSE)
y1 |
a matrix with n1 individuals (rows) in the first experimental condition and p genes (columns). |
y2 |
a matrix with n2 individuals (rows) in the second experimental condition and p genes (columns). The genes in the two experimental conditions must be the same. |
dag |
graphNEL object, a directed acyclic graph (DAG) corresponding to the pathway of interest. See package |
alpha |
significance level of the test. |
perm.num |
number of permutation used by the test. |
paired |
logical flag. If |
The function tests the equality of the means of each clique of a
pathway in two experimental conditions, conditioning on the results of
clique.var.test
.
The graph of a pathway is first converted into a DAG, then the moral
graph is obtained and if the latter graph is decomposable then the
test is performed on all its cliques. If the moral graph is not
decomposable, its triangulated version is obtained and then the test
is performed on all its cliques.
The expression data may contain some genes differing from those in the pathway: in such case the function automatically takes the intersection between the two gene sets.
This function requires gRBase
and qpgraph
packages.
p.value |
p-value of the test on the mean for each clique. |
p.value.var |
p-value of the test on the variance for each clique. If paired is |
cliques |
the list of the corresponding cliques. |
var.equal |
logical, a vector with a TRUE for a significant clique and a FALSE otherwise. If paired is |
t.value |
the observed values of the statistic to test the mean for each clique. |
lambda.value |
the observed values of the statistic to test the
variance for each clique. If paired is |
graph |
the triangularized and moralized graph. |
M. Sofia Massa, Gabriele Sales
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
pathway.var.test
,pathway.mean.test
,
clique.var.test
,
data(examples) clique.mean.test(y1, y2, dag_bcell, 0.05)
data(examples) clique.mean.test(y1, y2, dag_bcell, 0.05)
Test the equality of concentration matrices in two experimental conditions for all cliques of a pathway
clique.var.test(y1,y2,dag,alpha)
clique.var.test(y1,y2,dag,alpha)
y1 |
a matrix with n1 individuals (rows) in the first experimental condition and p genes (columns). |
y2 |
a matrix with n2 individuals (rows) in the second experimental condition and p genes (columns). The genes in the two experimental conditions must be the same. |
dag |
graphNEL object, a directed acyclic graph (DAG) corresponding to the pathway of interest. See package |
alpha |
significance level of the test. |
The function tests the equality of the concentration matrices of each clique of a pathway in two experimental conditions. The graph of a pathway is first converted into a DAG, then the moral graph is obtained and if the latter graph is decomposable then the test is performed on all its cliques. If the moral graph is not decomposable, its triangulated version is obtained and then the test is performed on all its cliques.
The expression data may contain some genes differing from those in the pathway: in such case the function automatically takes the intersection between the two gene sets.
This function requires gRBase
and qpgraph
packages.
p.value |
the observed value of the significance level. |
cliques |
the list of cliques where the test is performed. |
var.equal |
logical, a vector with a TRUE for a significant clique and a FALSE otherwise. |
lambda.var |
the observed value of the statistic. |
graph |
the triangulated and moral graphs. |
M. Sofia Massa, Gabriele Sales
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
pathway.var.test
, pathway.mean.test
,
clique.mean.test
,
data(examples) clique.var.test(y1, y2, dag_bcell, 0.05)
data(examples) clique.var.test(y1, y2, dag_bcell, 0.05)
The B cell receptor signaling pathway (BCR) converted to a DAG. In case of nodes with multiple genes, the most differentially expressed between groups of patients defined in y1 (acute lymphocytic leukemia patients with BCR/ABL gene rearrangement) and y2 (acute lymphocytic leukemia patients with BCR/ABL gene rearrangement) matrices.
Chiaretti S., Li X., Gentleman R., Vitale A., Wang K.S., Mandelli F., Fo R., Ritz J. (2005) Gene expression profiles of B-lineage adult acute lymphocytic leukemia reveal genetic patterns that identify lineage derivation and distinct mechanisms of transformation. Clin Cancer Res, 11:7209-7219. https://aacrjournals.org/clincancerres/article/11/20/7209/188537/Gene-Expression-Profiles-of-B-lineage-Adult-Acute
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
Test the equality of means in two experimental conditions for a pathway, conditioning on the result of the test on the concentration matrices
pathway.mean.test(y1,y2,dag,alpha,perm.num=10000,variance=TRUE,paired=FALSE)
pathway.mean.test(y1,y2,dag,alpha,perm.num=10000,variance=TRUE,paired=FALSE)
y1 |
a matrix with n1 individuals (rows) in the first experimental condition and p genes (columns). |
y2 |
a matrix with n2 individuals (rows) in the second experimental condition and p genes (columns). The genes in the two experimental conditions must be the same. |
dag |
graphNEL object, a directed acyclic graph (DAG) corresponding to the pathway of interest. See package |
alpha |
significance level of the test. |
variance |
logical flag. If |
perm.num |
number of permutations of the test. |
paired |
logical flag. If |
The function tests the equality of the means of two
experimental conditions, conditioning on the result of
pathway.var.test
.
The expression data may contain some genes differing from those in the pathway: in such case the function automatically takes the intersection between the two gene sets.
This function requires gRBase
and qpgraph
packages.
p.value |
the p-value of the test on the mean. |
p.value.var |
the p-value of the test on the variance. If paired is |
lambda.value |
the observed value of the test statistic for the
variance. If paired is |
qchisq.value |
the 1 - alpha quantile value of the null
distribution of the test statistic on the variance. If paired is |
cli.moral |
a list containing the cliques of the moral graph. |
var.equal |
logical flag. If |
graph |
a graphNEL object representing the moral graph. |
df.var |
the degrees of freedom of the null distribution for the
variance test. If paired is |
df.mean |
the degrees of freedom of the null distribution for the
mean test. If paired is |
t.value |
the observed value of the test statistic for the mean. |
M. Sofia Massa, Gabriele Sales
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
pathway.var.test
, clique.var.test
,
clique.mean.test
,
data(examples) pathway.mean.test(y1, y2, dag_bcell, 0.05, 100)
data(examples) pathway.mean.test(y1, y2, dag_bcell, 0.05, 100)
Test the equality of concentration matrices in two experimental conditions for a pathway
pathway.var.test(y1,y2,dag,alpha,variance=FALSE,s1=NULL,s2=NULL)
pathway.var.test(y1,y2,dag,alpha,variance=FALSE,s1=NULL,s2=NULL)
y1 |
a matrix with n1 individuals (rows) in the first experimental condition and p genes (columns). |
y2 |
a matrix with n2 individuals (rows) in the second experimental condition and p genes (columns). The genes in the two experimental conditions must be the same. |
dag |
graphNEL object, directed acyclic graph (DAG) corresponding to the pathway of interest. See package |
alpha |
significance level of the test. |
variance |
logical flag. If |
s1 |
y1 covariance matrix estimation. |
s2 |
y2 covariance matrix estimation. |
The graph of a pathway is first converted into a DAG and then into a moral graph. The data is modelled with two Gaussian graphical models with zero mean and graph provided by the moral graph. The function tests the equality of the two concentration matrices (inverse of the covariance matrices).
The expression data may contain some genes differing from those in the pathway: in such case the function automatically takes the intersection between the two gene sets.
A necessary condition for the existence of the covariance estimates is that
the number of statistical units (samples) is greater than the number of
variables. If this is not the case, penalized techniques for estimating
and
have to be employed, that are currently not provided by the package. In theses
cases, one can perform penalized estimation of
and
outside topologyGSA, and then provide such estimates as input arguments
to the function
pathway.var.test
to compute the value of the test for
homogeneity. In this case, computation of the p-value deserves attention,
as standard results on the asymptotic distribution of the test statistic may
no longer be valid. Therefore, computation of the p-value has to be dealt
with by the user.
This function requires gRBase
and qpgraph
packages.
lambda.value |
the observed value of the test statistic. |
qchisq.value |
the 1 - alpha quantile value of the null distribution of the test statistic on the variance. |
p.value |
the observed value of the significance level of the test. |
cli.moral |
a list containing the cliques of the moral graph. |
var.equal |
logical flag. If TRUE variances are heteroschedastic. |
graph |
a graphNEL object representing the moral graph. |
df |
the degrees of freedom of the null distribution. |
s1 |
if variance=TRUE, the estimate of y1 covariance. |
s2 |
if variance=TRUE, the estimate of y2 covariance. |
M. Sofia Massa, Gabriele Sales
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
Lauritzen, S.L. (1996). Graphical models. Clarendon Press, Oxford.
pathway.mean.test
, clique.var.test
,
clique.mean.test
.
data(examples) pathway.var.test(y1, y2, dag_bcell, 0.05)
data(examples) pathway.var.test(y1, y2, dag_bcell, 0.05)
This functions prints a summary of the results of each statistical test.
## S3 method for class 'clique.var.test' print(x, ...) ## S3 method for class 'clique.mean.test' print(x, ...) ## S3 method for class 'pathway.var.test' print(x, ...) ## S3 method for class 'pathway.mean.test' print(x, ...)
## S3 method for class 'clique.var.test' print(x, ...) ## S3 method for class 'clique.mean.test' print(x, ...) ## S3 method for class 'pathway.var.test' print(x, ...) ## S3 method for class 'pathway.mean.test' print(x, ...)
x |
the result of a statistical test |
... |
ignored |
M. Sofia Massa, Gabriele Sales
This function highlights the significant cliques on the moralized and triangulated graph. Different colors correspond to different values of -log(pvalue)
render.significant.cliques(info, alpha)
render.significant.cliques(info, alpha)
info |
the result of |
alpha |
p-value cutoff to define significant cliques |
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
clique.var.test
, clique.mean.test
.
data(examples) info <- clique.var.test(y1, y2, dag_bcell, 0.1) ## Not run: render.significant.cliques(info)
data(examples) info <- clique.var.test(y1, y2, dag_bcell, 0.1) ## Not run: render.significant.cliques(info)
Several distinct genetic mechanisms lead to acute lymphocytic leukemia (ALL) malignant transformations deriving
from distinct lymphoid precursor cells that have been committed to either T-lineage or B-lineage differentiation.
Chromosome translocations and molecular rearrangements are common events in B-lineage ALL and reflect distinct
mechanisms of transformation. The B cell Receptor (BCR/ABL) gene rearrangement occurs in about 25% of cases in
adult ALL, and much less frequently in pediatric ALL.
y1
dataset contains gene expression signatures of BCR signaling pathway (composed by 35 gene products, n.columns = 35)
in acute lymphocytic leukemia (ALL) cells associated with presence of BCR/ABL gene rearrangement (n. rows = 37).
Expression values are normalized according to rma
and quantile
normalization.
Chiaretti S., Li X., Gentleman R., Vitale A., Wang K.S., Mandelli F., Fo R., Ritz J. (2005) Gene expression profiles of B-lineage adult acute lymphocytic leukemia reveal genetic patterns that identify lineage derivation and distinct mechanisms of transformation. Clin Cancer Res, 11:7209-7219. https://aacrjournals.org/clincancerres/article/11/20/7209/188537/Gene-Expression-Profiles-of-B-lineage-Adult-Acute
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121
Several distinct genetic mechanisms lead to acute lymphocytic leukemia (ALL) malignant transformations deriving
from distinct lymphoid precursor cells that have been committed to either T-lineage or B-lineage differentiation.
Chromosome translocations and molecular rearrangements are common events in B-lineage ALL and reflect distinct
mechanisms of transformation. The B cell Receptor (BCR/ABL) gene rearrangement occurs in about 25% of cases in
adult ALL, and much less frequently in pediatric ALL.
y2
dataset contains gene expression signatures of BCR signaling pathway (composed by 35 gene products, n.columns = 35)
in acute lymphocytic leukemia (ALL) cells not associated with BCR/ABL gene rearrangement (n. rows = 41).
Expression values are normalized according to rma
and quantile
normalization.
Chiaretti S., Li X., Gentleman R., Vitale A., Wang K.S., Mandelli F., Fo R., Ritz J. (2005) Gene expression profiles of B-lineage adult acute lymphocytic leukemia reveal genetic patterns that identify lineage derivation and distinct mechanisms of transformation. Clin Cancer Res, 11:7209-7219. https://aacrjournals.org/clincancerres/article/11/20/7209/188537/Gene-Expression-Profiles-of-B-lineage-Adult-Acute
Massa, M.S., Chiogna, M., Romualdi, C. (2010). Gene set analysis exploiting the topology of a pathway. BMC Systems Biology, 4:121 https://bmcsystbiol.biomedcentral.com/articles/10.1186/1752-0509-4-121