Package: SuperCell 1.0.1

Leonard Herault

SuperCell: Simplification of scRNA-Seq Data by Merging Together Similar Cells

Aggregates large single-cell data into metacell dataset by merging together gene expression of very similar cells. 'SuperCell' uses 'velocyto.R' <doi:10.1038/s41586-018-0414-6> <https://github.com/velocyto-team/velocyto.R> for RNA velocity. We also recommend installing 'scater' Bioconductor package <doi:10.18129/B9.bioc.scater> <https://bioconductor.org/packages/release/bioc/html/scater.html>.

Authors:Mariia Bilous [aut], Leonard Herault [cre]

SuperCell_1.0.1.tar.gz
SuperCell_1.0.1.tar.gz(r-4.5-noble)SuperCell_1.0.1.tar.gz(r-4.4-noble)
SuperCell_1.0.1.tgz(r-4.4-emscripten)SuperCell_1.0.1.tgz(r-4.3-emscripten)
SuperCell.pdf |SuperCell.html
SuperCell/json (API)

# Install 'SuperCell' in R:
install.packages('SuperCell', repos = 'https://cloud.r-project.org')

Bug tracker:https://github.com/gfellerlab/supercell/issues2 issues

Datasets:

On CRAN:

Conda:

software

3.30 score 475 downloads 30 exports 144 dependencies

Last updated 5 months agofrom:141ae9293e. Checks:2 OK, 1 NOTE. Indexed: no.

TargetResultLatest binary
Doc / VignettesOKMar 25 2025
R-4.5-linuxOKMar 25 2025
R-4.4-linuxNOTEMar 25 2025

Exports:anndata_2_supercellbuild_knn_graphmetacell2_anndata_2_supercellsc_mixing_scoreSCimplifySCimplify_for_velocitySCimplify_from_embeddingsupercell_2_scesupercell_2_Seuratsupercell_assignsupercell_clustersupercell_DimPlotsupercell_estimate_velocitysupercell_FindAllMarkerssupercell_FindMarkerssupercell_GEsupercell_GeneGenePlotsupercell_mergesupercell_mergeGEsupercell_plotsupercell_plot_GEsupercell_plot_tSNEsupercell_plot_UMAPsupercell_prcompsupercell_puritysupercell_rescalesupercell_silhouettesupercell_tSNEsupercell_UMAPsupercell_VlnPlot

Dependencies:askpassbackportsbase64encbitbit64bootbroombslibcachemcheckmateclicliprclustercodetoolscolorspacecorpcorcpp11crayondata.tabledbscandigestdoFuturedplyrentropyevaluatefansifarverfastclusterfastmapfontawesomeforcatsforeachforeignFormulafsfuturefuture.applygdatagenericsggplot2glmnetglobalsgluegridExtragtablegtoolshavenherehighrHmischmshtmlTablehtmltoolshtmlwidgetsigraphirlbaisobanditeratorsjomojquerylibjsonliteknitrlabelinglatticelifecyclelistenvlme4magrittrMASSMatrixmatrixStatsmemoisemgcvmicemimeminqamitmlmunsellnlmenloptrnnetnumDerivopensslordinalpanparallellypatchworkpermutepillarpkgconfigplotfunctionsplyrpngprettyunitsprogressprogressrproxypurrrR6RANNrappdirsrbibutilsRColorBrewerRcppRcppEigenRcppTOMLRdpackreadrreformulasreticulaterlangrmarkdownrpartrprojrootRSpectrarstudioapiRtsnesassscalesshapestringistringrsurvivalsystibbletidyrtidyselecttinytexTraMineRtzdbucminfumaputf8vctrsveganvegclustviridisviridisLitevroomWeightedClusterweightswithrxfunyaml

Combined or independent SuperCell runs for different samples

Rendered fromb_Combined_vs_independent_sample_processing.Rmdusingknitr::rmarkdownon Mar 25 2025.

Last update: 2024-09-06
Started: 2024-09-06

Example of the SuperCell pipeline

Rendered froma_SuperCell.Rmdusingknitr::rmarkdownon Mar 25 2025.

Last update: 2024-09-06
Started: 2024-09-06

Citation

To cite package ‘SuperCell’ in publications use:

Bilous M (2024). SuperCell: Simplification of scRNA-Seq Data by Merging Together Similar Cells. R package version 1.0.1, https://CRAN.R-project.org/package=SuperCell.

Corresponding BibTeX entry:

  @Manual{,
    title = {SuperCell: Simplification of scRNA-Seq Data by Merging
      Together Similar Cells},
    author = {Mariia Bilous},
    year = {2024},
    note = {R package version 1.0.1},
    url = {https://CRAN.R-project.org/package=SuperCell},
  }

Readme and manuals

R-CMD-check 1EDOI

Coarse-graining of large single-cell RNA-seq data into metacells

SuperCell is an R package for coarse-graining large single-cell RNA-seq data into metacells and performing downstream analysis at the metacell level.

The exponential scaling of scRNA-seq data represents an important hurdle for downstream analyses. One of the solutions to facilitate the analysis of large-scale and noisy scRNA-seq data is to merge transcriptionally highly similar cells into metacells. This concept was first introduced by Baran et al., 2019 (MetaCell) and by Iacono et al., 2018 (bigSCale). More recent methods to build metacells have been described in Ben-Kiki et al. 2022 (MetaCell2), Bilous et al., 2022 (SuperCell) and Persad et al., 2022 (SEACells). Despite some differences in the implementation, all the methods are network-based and can be summarized as follows:

1. A single-cell network is computed based on cell-to-cell similarity (in transcriptomic space)

2. Highly similar cells are identified as those forming dense regions in the single-cell network and merged together into metacells (coarse-graining)

3. Transcriptomic information within each metacell is combined (average or sum).

4. Metacell data are used for the downstream analyses instead of large-scale single-cell data

Unlike clustering, the aim of metacells is not to identify large groups of cells that comprehensively capture biological concepts, like cell types, but to merge cells that share highly similar profiles, and may carry repetitive information. Therefore metacells represent a compromise structure that optimally remove redundant information in scRNA-seq data while preserving the biologically relevant heterogeneity.

An important concept when building metacells is the graining level (γ), which we define as the ratio between the number of single cells in the initial data and the number of metacells. We suggest applying γ between 10 and 50, which significantly reduces the computational resources needed to perform the downstream analyses while preserving most of the result of the initial (i.e., single-cell) analyses.

Installation

SuperCell requires igraph, RANN, WeightedCluster, corpcor, weights, Hmisc, Matrix, matrixStats, plyr, irlba, grDevices, patchwork, ggplot2. SuperCell uses velocyto.R for RNA velocity.

install.packages("igraph")
install.packages("RANN")
install.packages("WeightedCluster")
install.packages("corpcor")
install.packages("weights")
install.packages("Hmisc")
install.packages("Matrix")
install.packages("patchwork")
install.packages("plyr")
install.packages("irlba")

Installing SuperCell package from gitHub

if (!requireNamespace("remotes")) install.packages("remotes")
remotes::install_github("GfellerLab/SuperCell")

library(SuperCell)

Examples

  1. Building and analyzing metacells with SuperCell
  2. Building metacells with SuperCell and alayzing them with a standard Seurat pipeline
  3. Data integration of metacells built with SuperCell

[License]

SuperCell is developed by the group of David Gfeller at University of Lausanne.

SuperCell is available under GPL-3 License.

For scientific questions, please contact Mariia Bilous (mariia.bilous@unil.ch) or David Gfeller (David.Gfeller@unil.ch).

How to cite

If you use SuperCell in a publication, please cite: Bilous et al. Metacells untangle large and complex single-cell transcriptome networks, BMC Bioinformatics (2022).

Help Manual

Help pageTopics
Convert Anndata metacell object (Metacell-2 or SEACells) to Super-cell like objectanndata_2_supercell
Build kNN graphbuild_knn_graph
Build kNN graph using RANN::nn2 (used in '"build_knn_graph"')build_knn_graph_nn2
Cancer cell lines datasetcell_lines
Build kNN graph from distance (used in '"build_knn_graph"')knn_graph_from_dist
Convert Metacells (Metacell-2) to Super-cell like objectmetacell2_anndata_2_supercell
Compute mixing of single-cells within supercellsc_mixing_score
Detection of metacells with the SuperCell approachSCimplify
Construct super-cells from spliced and un-spliced matricesSCimplify_for_velocity
Detection of metacells with the SuperCell approach from low dim representationSCimplify_from_embedding
Super-cells to SingleCellExperiment objectsupercell_2_sce
Super-cells to Seurat objectsupercell_2_Seurat
Assign super-cells to the most aboundant clustersupercell_assign
Cluster super-cell datasupercell_cluster
Plot metacell 2D plot (PCA, UMAP, tSNE etc)supercell_DimPlot
Run RNAvelocity for super-cells (slightly modified from <https://github.com/velocyto-team/velocyto.R>) Not yet adjusted for super-cell size (not sample-weighted)supercell_estimate_velocity
Differential expression analysis of supep-cell data. Most of the parameters are the same as in Seurat FindAllMarkers (for simplicity)supercell_FindAllMarkers
Differential expression analysis of supep-cell data. Most of the parameters are the same as in Seurat FindMarkers (for simplicity)supercell_FindMarkers
Simplification of scRNA-seq datasetsupercell_GE
Simplification of scRNA-seq dataset (old version, not used since 12.02.2021)supercell_GE_idx
Gene-gene correlation plotsupercell_GeneGenePlot
Plot Gene-gene correlation plot for 1 featuresupercell_GeneGenePlot_single
Merging independent SuperCell objectssupercell_merge
Merging metacell gene expression matrices from several independent SuperCell objectssupercell_mergeGE
Plot metacell NWsupercell_plot
Plot super-cell NW colored by an expression of a gene (gradient color)supercell_plot_GE
Plot super-cell tSNE (Use supercell_DimPlot instead) Plots super-cell tSNE (result of supercell_tSNE)supercell_plot_tSNE
Plot super-cell UMAP (Use supercell_DimPlot instead) Plots super-cell UMAP (result of supercell_UMAP)supercell_plot_UMAP
compute PCA for super-cell data (sample-weighted data)supercell_prcomp
Compute purity of super-cellssupercell_purity
Rescale supercell objectsupercell_rescale
Compute Silhouette index accounting for samlpe size (super cells size) ###supercell_silhouette
Compute tSNE of super-cellssupercell_tSNE
Compute UMAP of super-cellssupercell_UMAP
Violin plotssupercell_VlnPlot
Plot Violin plot for 1 featuresupercell_VlnPlot_single