TY - JOUR
T1 - A novel coarsened graph learning method for scalable single-cell data analysis
AU - Kataria, Mohit
AU - Srivastava, Ekta
AU - Arjun, Kumar
AU - Kumar, Sandeep
AU - Gupta, Ishaan
AU - Jayadeva,
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/4
Y1 - 2025/4
N2 - The emergence of single-cell technologies, including flow and mass cytometry, as well as single-cell RNA sequencing, has revolutionized the study of cellular heterogeneity, generating vast datasets rich in biological insights. Despite the effectiveness of graph-based analyses in deciphering the complexities of these datasets, managing large-scale graph representations of single-cell data remains computationally challenging. Coarsening has been employed to tackle this difficulty. However, current coarsening techniques such as Cytocoarsening, Heavy Edge Matching (HEM), and Locally Variable Edges (LVE) often suffer from slow processing speeds and limited adaptability. To address these challenges, we propose a novel approach utilizing Feature-Aware Graph Coarsening via Hashing (FACH), which integrates locality-sensitive hashing for scalable and efficient single-cell data analysis. This method directly extracts informative, low-dimensional cell representations from raw single-cell RNA sequencing and mass cytometry data, significantly improving processing speed while preserving essential data features. We demonstrate its effectiveness in downstream tasks, such as scalable graph neural network training on coarsened single-cell data, highlighting its ability to retain crucial biological features and enable efficient, accurate analyses. Our method directly extracts informative, low-dimensional cell representations from raw single-cell RNA sequencing and mass cytometry data, significantly improving processing speed and preserving critical biological features, such as transcriptional signatures and network topology. It reduces computational time by at least 50% compared to existing methods and achieves superior classification accuracy, such as 88.1% on the Baron Human dataset, underscoring its efficiency and precision in large-scale single-cell analysis.
AB - The emergence of single-cell technologies, including flow and mass cytometry, as well as single-cell RNA sequencing, has revolutionized the study of cellular heterogeneity, generating vast datasets rich in biological insights. Despite the effectiveness of graph-based analyses in deciphering the complexities of these datasets, managing large-scale graph representations of single-cell data remains computationally challenging. Coarsening has been employed to tackle this difficulty. However, current coarsening techniques such as Cytocoarsening, Heavy Edge Matching (HEM), and Locally Variable Edges (LVE) often suffer from slow processing speeds and limited adaptability. To address these challenges, we propose a novel approach utilizing Feature-Aware Graph Coarsening via Hashing (FACH), which integrates locality-sensitive hashing for scalable and efficient single-cell data analysis. This method directly extracts informative, low-dimensional cell representations from raw single-cell RNA sequencing and mass cytometry data, significantly improving processing speed while preserving essential data features. We demonstrate its effectiveness in downstream tasks, such as scalable graph neural network training on coarsened single-cell data, highlighting its ability to retain crucial biological features and enable efficient, accurate analyses. Our method directly extracts informative, low-dimensional cell representations from raw single-cell RNA sequencing and mass cytometry data, significantly improving processing speed and preserving critical biological features, such as transcriptional signatures and network topology. It reduces computational time by at least 50% compared to existing methods and achieves superior classification accuracy, such as 88.1% on the Baron Human dataset, underscoring its efficiency and precision in large-scale single-cell analysis.
KW - Coarsened graph learning
KW - Computational biology
KW - Downstream analysis
KW - Graph-based analysis
KW - Single-cell
UR - https://www.scopus.com/pages/publications/85218428611
U2 - 10.1016/j.compbiomed.2025.109873
DO - 10.1016/j.compbiomed.2025.109873
M3 - Article
AN - SCOPUS:85218428611
SN - 0010-4825
VL - 188
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 109873
ER -