Package: DataSimilarity 0.4.0

Marieke Stolte
DataSimilarity: Quantifying Similarity of Datasets and Multivariate Two- And k-Sample Testing
A collection of methods for quantifying the similarity of two or more datasets, many of which can be used for two- or k-sample testing. It provides newly implemented methods as well as wrapper functions for existing methods that enable calling many different methods in a unified framework. The methods were selected from the review and comparison of Stolte et al. (2024) <doi:10.1214/24-SS149>. An empirical comparison of the methods was performed in Stolte et al. (2026) <doi:10.48550/arXiv.2604.11458> for categorical data and in Stolte et al. (2026) <doi:10.48550/arXiv.2604.12327> for numeric data.
DataSimilarity_0.4.0.tar.gz
DataSimilarity_0.4.0.tar.gz (r-4.7-any) | DataSimilarity_0.4.0.tar.gz (r-4.6-any)
DataSimilarity_0.4.0.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
DataSimilarity/json (API)
# Install 'DataSimilarity' in R:
install.packages('DataSimilarity', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))
- method.table - List of Methods Included in the Package
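As a minimal sketch of how the bundled `method.table` dataset might be inspected after installation: the snippet below assumes only that the package is installed and that `method.table` can be loaded with `data()`. Its column layout is not described on this page, so the code makes no assumption about it and only prints the object's structure.

```r
# Load the dataset listed above and look at its structure.
# Guarded so the snippet is a no-op when the package is not installed.
if (requireNamespace("DataSimilarity", quietly = TRUE)) {
  data("method.table", package = "DataSimilarity")
  str(method.table)   # structure of the methods overview, whatever its columns are
  head(method.table)  # first few entries
}
```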
This package does not link to any GitHub/GitLab/R-Forge repository. No issue tracker or development information is available.
Last updated from: b0bceebe6a. Checks: 4 OK. Indexed: yes.
| Target | Result | Time |
|---|---|---|
| linux-devel-x86_64 | OK | 280 |
| source / vignettes | OK | 687 |
| linux-release-x86_64 | OK | 299 |
| wasm-release | OK | 172 |
Exports: AUC, Bahr, BallDivergence, BF, BG, BG2, BMG, BQS, C2ST, CCS, CCS_cat, CF, CF_cat, CMDistance, Cramer, DataSimilarity, DiProPerm, DISCOB, DISCOF, DS, dwdProj, Energy, engineerMetric, f.a, f.aCat, f.s, f.sCat, findSigma, findSimilarityMethod, FR, FR_cat, FStest, GGRL, GGRLCat, GPK, gTests, gTests_cat, gTestsMulti, HamiltonPath, hammingDist, HMN, Jeffreys, kerTests, KMD, knn, knn.bf, knn.fast, LHZ, LHZStatistic, MD, MMCM, MMD, MST, MST5, MW, NKT, OTDD, Petrie, rectPartition, RISE, RItest, Rosenbaum, SC, SH, svmProj, tStat, Wasserstein, YMRZL, ZC, ZC_cat
Dependencies: boot
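
The description above says that many of the methods can be used for two-sample testing. To illustrate the general idea, here is a rough base-R sketch of a permutation two-sample test built on an energy-type distance statistic. It is not the package's implementation and does not call any of its functions: `energy_stat` and `perm_test` are hypothetical helper names introduced for this example, and the simulated data are made up.

```r
# Energy-type statistic (V-statistic form) with Euclidean distances:
# 2 * mean|X - Y| - mean|X - X'| - mean|Y - Y'|
energy_stat <- function(x, y) {
  n1 <- nrow(x)
  n2 <- nrow(y)
  d <- as.matrix(dist(rbind(x, y)))  # all pairwise distances in the pooled sample
  i1 <- seq_len(n1)
  i2 <- n1 + seq_len(n2)
  2 * mean(d[i1, i2]) - mean(d[i1, i1]) - mean(d[i2, i2])
}

# Permutation test: reshuffle the group labels and recompute the statistic.
perm_test <- function(x, y, stat = energy_stat, n_perm = 999) {
  pooled <- rbind(x, y)
  n1 <- nrow(x)
  n <- nrow(pooled)
  observed <- stat(x, y)
  perm_stats <- replicate(n_perm, {
    idx <- sample(n)
    stat(pooled[idx[seq_len(n1)], , drop = FALSE],
         pooled[idx[-seq_len(n1)], , drop = FALSE])
  })
  # Share of permuted statistics at least as large as the observed one,
  # with the usual +1 correction so the p-value is never exactly zero.
  p_value <- (1 + sum(perm_stats >= observed)) / (n_perm + 1)
  list(statistic = observed, p.value = p_value)
}

# Made-up example data: two bivariate normal samples with shifted means.
set.seed(1)
x <- matrix(rnorm(100), ncol = 2)
y <- matrix(rnorm(100, mean = 1), ncol = 2)
res <- perm_test(x, y)
print(res)
```

Large values of the statistic indicate dissimilar samples, so the p-value counts permutations that look at least as dissimilar as the observed split. Any other two-sample statistic with the same orientation could be passed through the `stat` argument.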