Title: | Network Assisted Algorithm for Epigenetic Studies Using Mean and Variance Combined Signals |
---|---|
Description: | Package for a Network assisted algorithm for Epigenetic studies using mean and variance Combined signals: NEpiC. NEpiC combines both signals in mean and variance differences in methylation level between case and control groups searching for differentially methylated sub-networks (modules) using the protein-protein interaction network. |
Authors: | Peifeng Ruan |
Maintainer: | Peifeng Ruan <[email protected]> |
License: | GPL-2 |
Version: | 1.0.1 |
Built: | 2024-12-01 07:59:28 UTC |
Source: | CRAN |
c_dms uses the combined signals in both mean and variance differences in methylation measures between case and control groups to search for differentially methylated sub-networks (modules) using a protein-protein interaction (PPI) network. It returns ordered dense modules.
c_dms(network, gene2weight, d = 1, r = 0.1)
c_dms(network, gene2weight, d = 1, r = 0.1)
network |
A dataframe describing the edges of s PPI network. Two columns are needed in this dataset: interactor A and interactor B. |
gene2weight |
A dataframe describing combined gene-level scores. The first column of this dataframe, named 'gene', contains the gene names, which should use the same annotation as those used in the network. The second column 'weight' contains gene-level combined scores. |
d |
An integer determining the order of neighbour genes to be searched in NEpiC algorithm. We suggest to set d=1 for NEpiC. |
r |
A number determining the threshold for including a new gene into the module. |
A list containing several statistics for modules, including genes in modules, module scores before and after normalization.
Peifeng Ruan, Jing Shen, Regina M. Santella, Shuigeng Zhou, Shuang Wang.(2015). NEpiC: a Network-assisted algorithm for Epigenetic studies using mean and variance Combined signals.
Peilin Jia, Siyuan Zheng, Jirong Rong, Wei Zheng, Zhongming Zhao. (2011) Bioformatics. dmGWAS: dense module searching for genome-wide association studies in protein-protein interaction networks.
#res=c_dms(network, gene2weight, d = 1, r = 0.1)
#res=c_dms(network, gene2weight, d = 1, r = 0.1)
A function computing gene-level scores combining both signals in mean and variance differences.
c_score(mean_sig, var_sig, lambda = "default")
c_score(mean_sig, var_sig, lambda = "default")
mean_sig |
A dataframe describing mean signals for every CpG sites. Three columns are needed in this dataset, which are 'cpg', 'gene' and 'mean_p'. 'cpg' column has names of CpG sites. 'gene' column has has gene annotations for CpG sites. 'mean_p' column has p values of mean differences for CpG sites. |
var_sig |
A dataframe describing variance signals for every CpG sites. Three columns are needed in this dataset, which are 'cpg', 'gene' and 'var_p'. 'cpg' column has names of CpG sites. 'gene' column has has gene annotations for CpG sites. 'var_p' column has p values of variance differences for CpG sites. |
lambda |
A parameter to balance mean and variance scores. NEpiC computes it as default if it is not otherwise specified by user. If lambda = 1, NEpiC will search dense modules using only mean signals. If lambda = 0, NEpiC will search dense modules using only variance signals. Users can also specify any float between 0 and 1 for lambda if they have any additional prior biology information. |
c_score returns a dataframe describing the combined gene-level scores, which can be used directly in c_dms.
NEpiC: a Network-assisted algorithm for Epigenetic studies using mean and variance Combined signals. Peifeng Ruan, Jing Shen, Regina M. Santella, Shuigeng Zhou, Shuang Wang.(2015)
#c_score(mean_sig, var_sig, lambda = "default")
#c_score(mean_sig, var_sig, lambda = "default")
Chosing genes in top ranked modules as candidate genes for outcomes. Further prioritized candidate genes can be chosen based on percentages selected by top ranked modules selected if prioritizing = T.
candidate_genes(res, pct = 0.01, prioritizing)
candidate_genes(res, pct = 0.01, prioritizing)
res |
The result for c_dms. |
pct |
A float which determines the number of candidate genes. For example, if pct is 0.01 as default setting, the genes in top 1 percent modules will be chosen as the candidate genes. |
prioritizing |
To further prioritize the candidate genes or not. |
A list for candidate genes or prioritized candidate genes.
NEpiC: a Network-assisted algorithm for Epigenetic studies using mean and variance Combined signals Peifeng Ruan, Jing Shen, Regina M. Santella, Shuigeng Zhou, Shuang Wang.(2015)
# candidate_genes(res, pct = 0.01, prioritizing = T)
# candidate_genes(res, pct = 0.01, prioritizing = T)
A function shuffling the tumor/adjacent normal labels within a tumor/adjacent normal pair and repeating the permutation procedures to calculate the permutated module scores for the identified top 1 percentage modules.
permutation(normal_data, tumor_data, annotation, res, n = 100)
permutation(normal_data, tumor_data, annotation, res, n = 100)
normal_data |
The Illumina 450k methylation array results for normal samples. The rows represent the methylation sites and the columns represent different samples. |
tumor_data |
The Illumina 450k methylation array results for adjacent tumor paire samples. The rows represent the methylation sites and the columns represent different samples. |
annotation |
The annotation file for Illumina 450k methylation array data. The first column is the methylation sites' names, whose column name is "cpg" and the second column is the methylation sites' gene annotations, whose column name is "gene" |
res |
The result of the c_dms, which contains the information of identified modules. |
n |
The numbers of permutations. |
A dataset contains identified modules' names, identified modules' score, identified modules' permutated scores and the last column contains the permutation p values for identified modules.
Peifeng Ruan, Jing Shen, Regina M. Santella, Shuigeng Zhou, Shuang Wang.(2015). NEpiC: a Network-assisted algorithm for Epigenetic studies using mean and variance Combined signals.
#perm_result = function (normal_data, tumor_data, annotation, res, n = 100)
#perm_result = function (normal_data, tumor_data, annotation, res, n = 100)