# --------------------------------------------
# CITATION file created with {cffr} R package
# See also: https://docs.ropensci.org/cffr/
# --------------------------------------------

cff-version: 1.2.0
message: 'To cite package "genieclust" in publications use:'
type: software
license: AGPL-3.0-only
title: 'genieclust: Fast and Robust Hierarchical Clustering with Noise Points Detection'
version: 1.1.6
doi: 10.1016/j.softx.2021.100722
identifiers:
- type: doi
  value: 10.32614/CRAN.package.genieclust
- type: url
  value: https://clustering-benchmarks.gagolewski.com/
abstract: A retake on the Genie algorithm (Gagolewski, 2021) - a robust hierarchical
  clustering method (Gagolewski, Bartoszuk, Cena, 2016). Now faster and more memory
  efficient; determining the whole hierarchy for datasets of 10M points in low-dimensional
  Euclidean spaces or 100K points in high-dimensional ones takes only 1-2 minutes.
  Allows clustering with respect to mutual reachability distances so that it can
  act as a noise point detector or a robustified version of 'HDBSCAN*' (that is
  able to detect a predefined number of clusters and hence it does not depend on
  the somewhat fragile 'eps' parameter). The package also features an implementation
  of inequality indices (the Gini and Bonferroni indices), external cluster validity
  measures (e.g., the normalised clustering accuracy and partition similarity scores
  such as the adjusted Rand, Fowlkes-Mallows, adjusted mutual information, and the
  pair sets index), and internal cluster validity indices (e.g., the Calinski-Harabasz,
  Davies-Bouldin, Ball-Hall, Silhouette, and generalised Dunn indices). See also
  the 'Python' version of 'genieclust' available on 'PyPI', which supports sparse
  data, more metrics, and even larger datasets.
authors:
- family-names: Gagolewski
  given-names: Marek
  email: marek@gagolewski.com
  orcid: https://orcid.org/0000-0003-0637-6028
preferred-citation:
  type: article
  title: 'genieclust: Fast and robust hierarchical clustering'
  authors:
  - family-names: Gagolewski
    given-names: Marek
    email: marek@gagolewski.com
    orcid: https://orcid.org/0000-0003-0637-6028
  journal: SoftwareX
  year: '2021'
  volume: '15'
  doi: 10.1016/j.softx.2021.100722
  start: '100722'
repository: https://CRAN.R-project.org/package=genieclust
repository-code: https://github.com/gagolews/genieclust
url: https://genieclust.gagolewski.com/
date-released: '2024-08-22'
contact:
- family-names: Gagolewski
  given-names: Marek
  email: marek@gagolewski.com
  orcid: https://orcid.org/0000-0003-0637-6028
references:
- type: article
  title: 'Genie: A new, fast, and outlier-resistant hierarchical clustering algorithm'
  authors:
  - family-names: Gagolewski
    given-names: Marek
  - family-names: Bartoszuk
    given-names: Maciej
  - family-names: Cena
    given-names: Anna
  journal: Information Sciences
  year: '2016'
  volume: '363'
  doi: 10.1016/j.ins.2016.05.003
  start: '8'
  end: '23'