Package: variantspark 0.1.1

Samuel Macêdo

variantspark: A 'Sparklyr' Extension for 'VariantSpark'

This is a 'sparklyr' extension integrating 'VariantSpark' and R. 'VariantSpark' is a framework based on 'scala' and 'spark' to analyze genome datasets, see <https://bioinformatics.csiro.au/>. It was tested on datasets with 3000 samples each one containing 80 million features in either unsupervised clustering approaches and supervised applications, like classification and regression. The genome datasets are usually writing in VCF, a specific text file format used in bioinformatics for storing gene sequence variations. So, 'VariantSpark' is a great tool for genome research, because it is able to read VCF files, run analyses and return the output in a 'spark' data frame.

Authors:Samuel Macêdo [aut, cre], Javier Luraschi [aut]

variantspark_0.1.1.tar.gz
variantspark_0.1.1.tar.gz(r-4.5-noble)variantspark_0.1.1.tar.gz(r-4.4-noble)
variantspark_0.1.1.tgz(r-4.4-emscripten)variantspark_0.1.1.tgz(r-4.3-emscripten)
variantspark.pdf |variantspark.html
variantspark/json (API)

# Install 'variantspark' in R:
install.packages('variantspark', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.

1.90 score 16 scripts 130 downloads 7 exports 39 dependencies

Last updated 5 years agofrom:e0ef195154. Checks:OK: 2. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 14 2024
R-4.5-linuxOKNov 14 2024

Exports:importance_tblsample_namesvs_connectvs_importance_analysisvs_read_csvvs_read_labelsvs_read_vcf

Dependencies:askpassblobclicodetoolsconfigcpp11curlDBIdbplyrdplyrfansigenericsglobalsgluehttrjsonlitelifecyclemagrittrmimeopensslpillarpkgconfigpurrrR6rlangrstudioapisparklyrstringistringrsystibbletidyrtidyselectutf8uuidvctrswithrxml2yaml