Title: | Investigating the Functional Characteristics of Selected SNPs and Their Vicinity Genomic Region |
---|---|
Description: | To investigate the functional characteristics of selected SNPs and their vicinity genomic region. Linked SNPs in moderate to high linkage disequilibrium (e.g. r2>0.50) with the corresponding index SNPs will be selected for further analysis. |
Authors: | Alireza Ani [aut, cre], Zoha Kamali [aut], Ahmad Vaez [aut] |
Maintainer: | Alireza Ani <[email protected]> |
License: | GPL-3 |
Version: | 0.2.6.0 |
Built: | 2024-12-06 06:50:31 UTC |
Source: | CRAN |
This function receives a list of variants and checks their information on Ensembl website via the Ensembl REST API server.
annotate( rslist, server, db, outputPath, window_size = 500, r2 = 0.5, LDlist = TRUE, cadd = FALSE, geneNames.file = NULL, regulatoryType.file = NULL, cores = 0 )
annotate( rslist, server, db, outputPath, window_size = 500, r2 = 0.5, LDlist = TRUE, cadd = FALSE, geneNames.file = NULL, regulatoryType.file = NULL, cores = 0 )
rslist |
A vector of rs numbers. |
server |
Name of the server. "https://rest.ensembl.org" can be used for GRCh38 and "https://grch37.rest.ensembl.org" for GRCh37. |
db |
The population database for calculating LD scores. This can be found using |
outputPath |
The report file will be saved in this path as an Excel file (.xlsx) |
window_size |
Number of base pairs around the variant for checking LD scores (max = 500kb) |
r2 |
The LD threshold for selecting variants around the target SNP. |
LDlist |
If set to TRUE, variants in high LD will be found and added to the output. |
cadd |
If set to TRUE, the CADD scores will be added to variant information. |
geneNames.file |
path the gene information file (*.rds). Default value is NULL and ENSEMBL website will be checked if no file is provided. |
regulatoryType.file |
path the variants regulatory type information file (*.rds). Default value is NULL and this step will be skipped if no file is provided. |
cores |
set to a value above 0 for parallel processing. |
a data table with all variant information is returned.
## Not run: # select the required server server <- "https://grch37.rest.ensembl.org" # select the database for population data # this can be selected from listDatabases() function db <- "1000GENOMES:phase_3:EUR" # create a vector of required SNPs rslist=c('rs236349') output <- annotate(rslist,server,db, outputPath = paste(tempdir(),'sampleOutput.xlsx',sep="/"), window_size = 500, r2 = .9, cadd = FALSE) ## End(Not run)
## Not run: # select the required server server <- "https://grch37.rest.ensembl.org" # select the database for population data # this can be selected from listDatabases() function db <- "1000GENOMES:phase_3:EUR" # create a vector of required SNPs rslist=c('rs236349') output <- annotate(rslist,server,db, outputPath = paste(tempdir(),'sampleOutput.xlsx',sep="/"), window_size = 500, r2 = .9, cadd = FALSE) ## End(Not run)
This function returns a list of variables that are in high LD with the lead variant.
LDlist(rslist, server, db, window_size, r2)
LDlist(rslist, server, db, window_size, r2)
rslist |
A vector of rs numbers. |
server |
Name of the server. "https://rest.ensembl.org" can be used for GRCh38 and "https://grch37.rest.ensembl.org" for GRCh37. |
db |
The population database for calculating LD scores. This can be found using |
window_size |
Number of base pairs around the variant for checking LD scores (max = 500kb) |
r2 |
The LD threshold for selecting variants around the target SNP. |
a data table with variant information.
This function list the name, description and size of the available populations in 1000 Genomes project database. This database will be used for returning variables in high LD with the target SNP.
listDatabases(server)
listDatabases(server)
server |
name of the server. "https://rest.ensembl.org" can be used for GRCh38 and "https://grch37.rest.ensembl.org" for GRCh37. |
A data table is returned which includes the name, description and size of the available populations in 1000 Genomes project database.
This function test whether the Ensembl server is accessible or not
pingServer(server)
pingServer(server)
server |
name of the server. "https://rest.ensembl.org" can be used for GRCh38 and "https://grch37.rest.ensembl.org" for GRCh37. |
a message is displayed to the user
Shows the data releases available on this REST server. May return more than one release (unfrequent non-standard Ensembl configuration).
releaseVersion(server)
releaseVersion(server)
server |
name of the server. |
a message is displayed to the user