Package: ProjectionBasedClustering 1.2.2

Michael Thrun

ProjectionBasedClustering: Projection Based Clustering

A clustering approach applicable to every projection method is proposed here. The two-dimensional scatter plot of any projection method can construct a topographic map which displays unapparent data structures by using distance and density information of the data. The generalized U*-matrix renders this visualization in the form of a topographic map, which can be used to automatically define the clusters of high-dimensional data. The whole system is based on Thrun and Ultsch, "Using Projection based Clustering to Find Distance and Density based Clusters in High-Dimensional Data" <doi:10.1007/s00357-020-09373-2>. Selecting the correct projection method will result in a visualization in which mountains surround each cluster. The number of clusters can be determined by counting valleys on the topographic map. Most projection methods are wrappers for already available methods in R. By contrast, the neighbor retrieval visualizer (NeRV) is based on C++ source code of the 'dredviz' software package, and the Curvilinear Component Analysis (CCA) is translated from 'MATLAB' ('SOM Toolbox' 2.0) to R.

Authors:Michael Thrun [aut, cre, cph], Quirin Stier [ctb, rev], Brinkmann Luca [ctb], Florian Lerch [aut], Felix Pape [aut], Tim Schreier [aut], Luis Winckelmann [aut], Kristian Nybo [cph], Jarkko Venna [cph], van der Maaten Laurens [cph]

ProjectionBasedClustering_1.2.2.tar.gz
ProjectionBasedClustering_1.2.2.tar.gz(r-4.7-arm64)ProjectionBasedClustering_1.2.2.tar.gz(r-4.7-x86_64)ProjectionBasedClustering_1.2.2.tar.gz(r-4.6-arm64)ProjectionBasedClustering_1.2.2.tar.gz(r-4.6-x86_64)
ProjectionBasedClustering_1.2.2.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
ProjectionBasedClustering/json (API)

# Install 'ProjectionBasedClustering' in R:
install.packages('ProjectionBasedClustering', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/mthrun/projectionbasedclustering/issues

Uses libs:
  • c++– GNU Standard C++ Library v3
  • openmp– GCC OpenMP (GOMP) support library
Datasets:
  • DefaultColorSequence - Default color sequence for plots
  • Hepta - Hepta is part of the Fundamental Clustering Problem Suit (FCPS) [Thrun/Ultsch, 2020].

On CRAN:

Conda:

cppopenmp

3.32 score 4 packages 35 scripts 894 downloads 21 exports 89 dependencies

Last updated from:437f93e099. Checks:6 OK. Indexed: no.

TargetResultTimeFilesSyslog
linux-devel-arm64OK213
linux-devel-x86_64OK211
source / vignettesOK253
linux-release-arm64OK204
linux-release-x86_64OK213
wasm-releaseOK211

Exports:CCAContTrustMeasureICAinteractiveClusteringinteractiveGeneralizedUmatrixIslandinteractiveProjectionBasedClusteringIPBCIsomapKLMeasureKruskalStressMDSNeRVPCAPlotProjectedPointsPolarSwarmProjection2BestmatchesProjectionBasedClusteringProjectionPursuitSammonsMappingtSNEUniformManifoldApproximationProjection

Dependencies:abindaskpassbase64encbslibcachemcliclustercommonmarkcpp11crosstalkcurldata.tabledeldirdigestdplyrevaluatefarverfastmapfontawesomefsGeneralizedUmatrixgenericsgeometryggplot2gluegtablehighrhtmltoolshtmlwidgetshttpuvhttrisobandjquerylibjsonliteknitrlabelinglaterlatticelazyevallifecyclelinproglpSolvemagicmagrittrMASSMatrixmemoisemgcvmimenlmeopensslotelpermutepillarpkgconfigplotlypromisespurrrR6rappdirsRColorBrewerRcppRcppArmadilloRcppParallelRcppProgressrlangrmarkdownS7sassscalesshinyshinyjsshinythemessourcetoolsstringistringrsystibbletidyrtidyselecttinytexutf8vctrsveganviridisLitewithrxfunxtableyaml