GPU-accelerated single-cell analysis at scale with rapids-singlecell

dc.contributor.authorDicks, Severin
dc.contributor.authorHeumos, Lukas
dc.contributor.authorMay, Lilly
dc.contributor.authorJimenez, Sara
dc.contributor.authorAngerer, Philipp
dc.contributor.authorGold, Ilan
dc.contributor.authorVirshup, Isaac
dc.contributor.authorFischer, Felix
dc.contributor.authorGill, Michelle
dc.contributor.authorBoerries, Melanie
dc.contributor.authorNolet, Coey
dc.contributor.authorChen, Tiffany J.
dc.contributor.authorTheis, Fabian J.
dc.date.accessioned2026-03-26T14:26:48Z
dc.date.issued2026-03-02
dc.description.abstractSingle-cell sequencing technologies reveal cellular heterogeneity at high resolution, advancing our understanding of biological complexity. As datasets start to scale to tens of millions of cells, computational workflows face substantial bottlenecks, with CPU-based analytical pipelines requiring hours or days for routine processing steps like filtering, normalization, and clustering. These scalability limitations fundamentally restrict common interactive data exploration and iterative hypothesis testing. Here we introduce rapids-singlecell, a GPU-accelerated framework that integrates natively with the scverse ecosystem and operates directly on the AnnData data structure, which delivers orders-of-magnitude speedups for single-cell workflows. Built on CuPy arrays and the NVIDIA CUDA-X Data Science (RAPIDS) ecosystem, rapids-singlecell provides near drop-in GPU replacements for core scanpy-based analysis steps. Across standard single-cell workflows such as preprocessing, dimensionality reduction, neighborhood graph construction, clustering, and batch correction, rapids-singlecell achieves speedups of up to several hundred-fold compared to optimized CPU baselines. This reduces analysis time from hours to minutes on standard hardware, while maintaining consistent biological interpretations. These performance improvements make it possible to analyze large data sets in close to real time, without the need for data splitting. Together with real-time parameter tuning and iterative workflows, rapids-singlecell makes interactive large-scale single-cell analysis possible.
dc.description.sponsorshipThe authors acknowledge technical support from NVIDIA and scverse. We would also like to thank Seulah Park for creating the rapids-singlecell logo.
dc.description.urihttp://arxiv.org/abs/2603.02402
dc.format.extent14 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2082m-nqbk
dc.identifier.urihttps://doi.org/10.48550/arXiv.2603.02402
dc.identifier.urihttp://hdl.handle.net/11603/42274
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Student Collection
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectQuantitative Biology - Genomics
dc.titleGPU-accelerated single-cell analysis at scale with rapids-singlecell
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-2117-7636

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2603.02402v1.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format