Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Jan 17:rs.3.rs-3844718.
doi: 10.21203/rs.3.rs-3844718/v1.

cellMarkerPipe: Cell Marker Identification and Evaluation Pipeline in Single Cell Transcriptomes

Affiliations

cellMarkerPipe: Cell Marker Identification and Evaluation Pipeline in Single Cell Transcriptomes

Qiuming Yao et al. Res Sq. .

Update in

Abstract

Assessing marker genes from all cell clusters can be time-consuming and lack systematic strategy. Streamlining this process through a unified computational platform that automates identification and benchmarking will greatly enhance efficiency and ensure a fair evaluation. We therefore developed a novel computational platform, cellMarkerPipe (https://github.com/yao-laboratory/cellMarkerPipe), for automated cell-type specific marker gene identification from scRNA-seq data, coupled with comprehensive evaluation schema. CellMarkerPipe adaptively wraps around a collection of commonly used and state-of-the-art tools, including Seurat, COSG, SC3, SCMarker, COMET, and scGeneFit. From rigorously testing across diverse samples, we ascertain SCMarker's overall reliable performance in single marker gene selection, with COSG showing commendable speed and comparable efficacy. Furthermore, we demonstrate the pivotal role of our approach in real-world medical datasets. This general and opensource pipeline stands as a significant advancement in streamlining cell marker gene identification and evaluation, fitting broad applications in the field of cellular biology and medical research.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic Overview of the Pipeline. The cellMarkerPipe comprises three key steps: preparation, selection, and evaluation for marker genes.
Figure 2
Figure 2
Testing Scenarios for Various Methods. (a) Evaluation of methods on Zeisel data with varying numbers of selected marker genes. (b) Assessment of methods on Jurkat data with fluctuations in cell type proportions. (c) Examination of methods on PBMC data with varying numbers of cells in the input dataset, and (d) varying numbers of genes post dimension reduction. (e) COSG results showcased for the plant (Arabidopsis root) data, highlighting specific expressions with the top 10 genes in each cluster.
Figure 3
Figure 3
Performance of COSG on Single-Cell Data. (a) Evaluation of COSG performance on human gut single-cell data. (b) Evaluation of COSG performance on mice gut single-cell data. (c) Top 10 gene expressions per cell type identified by COSG for human gut samples. (d) Top 10 gene expressions per cell type identified by COSG for mice gut samples.
Figure 4
Figure 4
Running Time Measurement in Seconds for Varied Input Parameters. (a) Variation in running time with changes in the number of genes. (b) Variation in running time with changes in the number of cells in the input data. These computational tasks were executed on a computing node with 10G of allocated memory, running a single process on standard CPUs at the Holland Computing Center, University of Nebraska Lincoln.
Figure 5
Figure 5
Comparative Performance in Blood Samples. (a) Performance of methods in unedited and modified blood samples from the same patient. (b) Top 10 marker gene expressions per cell type identified by COSG for unedited and modified blood samples. (c) Top 10 gene expressions for B cells. (d) Influence of clustering method on recall and precision.

Similar articles

References

    1. Birnbaum K. D., Otegui M. S., Bailey-Serres J. & Rhee S. Y. The Plant Cell Atlas: focusing new technologies on the kingdom that nourishes the planet. Plant Physiology vol. 188 Preprint at 10.1093/plphys/kiab584 (2022). - DOI - PMC - PubMed
    1. Nieto P. et al. A single-cell tumor immune atlas for precision oncology. Genome Res 31, (2021). - PMC - PubMed
    1. Fawkner-Corbett D. et al. Spatiotemporal analysis of human intestinal development at single-cell resolution. Cell 184, (2021). - PMC - PubMed
    1. Zilbauer M. et al. A Roadmap for the Human Gut Cell Atlas. Nat Rev Gastroenterol Hepatol 20, (2023). - PMC - PubMed
    1. Rozenblatt-Rosen O. et al. Building a high-quality Human Cell Atlas. Nature Biotechnology vol. 39 Preprint at 10.1038/s41587-020-00812-4 (2021). - DOI - PubMed

Publication types

LinkOut - more resources