Package: variantspark 0.1.1

Samuel Macêdo

variantspark: A 'Sparklyr' Extension for 'VariantSpark'

This is a 'sparklyr' extension integrating 'VariantSpark' and R. 'VariantSpark' is a framework based on 'scala' and 'spark' to analyze genome datasets, see <https://bioinformatics.csiro.au/>. It was tested on datasets with 3000 samples each one containing 80 million features in either unsupervised clustering approaches and supervised applications, like classification and regression. The genome datasets are usually writing in VCF, a specific text file format used in bioinformatics for storing gene sequence variations. So, 'VariantSpark' is a great tool for genome research, because it is able to read VCF files, run analyses and return the output in a 'spark' data frame.

Authors:Samuel Macêdo [aut, cre], Javier Luraschi [aut]

variantspark_0.1.1.tar.gz
variantspark_0.1.1.tar.gz(r-4.5-noble)variantspark_0.1.1.tar.gz(r-4.4-noble)
variantspark_0.1.1.tgz(r-4.4-emscripten)variantspark_0.1.1.tgz(r-4.3-emscripten)
variantspark.pdf |variantspark.html✨
variantspark/json (API)

# Install 'variantspark' in R:

install.packages('variantspark', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

On CRAN:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

1.70 score 114 downloads 7 exports 39 dependencies

Last updated 6 years agofrom:e0ef195154. Checks:3 OK. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 14 2025
R-4.5-linux	OK	Mar 14 2025
R-4.4-linux	OK	Mar 14 2025

Exports:importance_tbl sample_names vs_connect vs_importance_analysis vs_read_csv vs_read_labels vs_read_vcf

Dependencies:askpass blob cli codetools config cpp11 curl DBI dbplyr dplyr fansi generics globals glue httr jsonlite lifecycle magrittr mime openssl pillar pkgconfig purrr R6 rlang rstudioapi sparklyr stringi stringr sys tibble tidyr tidyselect utf8 uuid vctrs withr xml2 yaml

Help page	Topics
Extract the importance data frame	importance_tbl
Display sample names	sample_names
Creating a variantspark connection	vs_connect
Importance Analysis	vs_importance_analysis
Reading a CSV file	vs_read_csv
Reading labels	vs_read_labels
Reading a VCF file	vs_read_vcf

Package: variantspark 0.1.1

variantspark: A 'Sparklyr' Extension for 'VariantSpark'

Citation

Readme and manuals

Help Manual

Usage by other packages (reverse dependencies)