Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 14;18(4):e0284527.
doi: 10.1371/journal.pone.0284527. eCollection 2023.

GRACE: Graph autoencoder based single-cell clustering through ensemble similarity learning

Affiliations

GRACE: Graph autoencoder based single-cell clustering through ensemble similarity learning

Jun Seo Ha et al. PLoS One. .

Abstract

Recent advances in single-cell sequencing techniques have enabled gene expression profiling of individual cells in tissue samples so that it can accelerate biomedical research to develop novel therapeutic methods and effective drugs for complex disease. The typical first step in the downstream analysis pipeline is classifying cell types through accurate single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm, called GRACE (GRaph Autoencoder based single-cell Clustering through Ensemble similarity larning), that can yield highly consistent groups of cells. We construct the cell-to-cell similarity network through the ensemble similarity learning framework, and employ a low-dimensional vector representation for each cell through a graph autoencoder. Through performance assessments using real-world single-cell sequencing datasets, we show that the proposed method can yield accurate single-cell clustering results by achieving higher assessment metric scores.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Graphical overview of the proposed single-cell clustering algorithm.
GRACE includes three main steps to derive the accurate single cell clustering. First, the proposed method constructs the ensemble cell-to-cell similarity network, where it can effective represent the similarities between cells based on the multiple similarity measurements through different feature genes. Second, a graph autoencoder derives a low-dimensional vector representation for each node (i.e., cell). Finally, GRACE predicts the optimal number of clusters and yields the accurate single-cell clustering based on the low-dimensional vector representation for each node.
Fig 2
Fig 2. Purity scores for each clustering algorithm.
Note that we performed 10 trials and visualize scattering points for each trial and the bar plot represents the averaged purity scores for all trials.
Fig 3
Fig 3. Jaccard index scores for each single-cell clustering algorithm.
Note that scattering points represent the Jaccard index scores for 10 trials, and the bar plot represents the averaged Jaccard index score for all trials.
Fig 4
Fig 4. Adjusted rand index for each algorithm.
Note that scattering points represent the ARI scores for 10 trials, and the bar plot represents the averaged ARI score for all trials.
Fig 5
Fig 5. Normalized mutual information for each clustering algorithm.
Note that scattering points represent the NMI scores for 10 trials, and the bar plot represents the averaged NMI score for all trials.
Fig 6
Fig 6. Comparison of the true and predicted number of clusters for each algorithm.
Fig 7
Fig 7. Low dimensional visualization of clustering results.
Low-dimensional coordinates are derived through t-SNE and predicted clusters are annotated through different colors.
Fig 8
Fig 8. Computation time of each algorithm.
All experiments were performed on Intel i5 processor with 12 cores, 48GB system memory, and NVIDIA GTX 1060 GPU. Note That the base clock frequency of the CPU is 4.10 GHz.

Similar articles

Cited by

References

    1. Hashimshony T, Wagner F, Sher N, Yanai I. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell reports. 2012;2(3):666–673. doi: 10.1016/j.celrep.2012.08.003 - DOI - PubMed
    1. Islam S, Zeisel A, Joost S, La Manno G, Zajac P, Kasper M, et al.. Quantitative single-cell RNA-seq with unique molecular identifiers. Nature methods. 2014;11(2):163–166. doi: 10.1038/nmeth.2772 - DOI - PubMed
    1. Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al.. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161(5):1202–1214. doi: 10.1016/j.cell.2015.05.002 - DOI - PMC - PubMed
    1. Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, et al.. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015;161(5):1187–1201. doi: 10.1016/j.cell.2015.04.044 - DOI - PMC - PubMed
    1. Hwang B, Lee JH, Bang D. Single-cell RNA sequencing technologies and bioinformatics pipelines. Experimental & molecular medicine. 2018;50(8):1–14. doi: 10.1038/s12276-018-0071-8 - DOI - PMC - PubMed

Publication types