Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 May 31:12:655536.
doi: 10.3389/fgene.2021.655536. eCollection 2021.

Machine Intelligence in Single-Cell Data Analysis: Advances and New Challenges

Affiliations
Review

Machine Intelligence in Single-Cell Data Analysis: Advances and New Challenges

Jiajia Liu et al. Front Genet. .

Abstract

The rapid development of single-cell technologies allows for dissecting cellular heterogeneity at different omics layers with an unprecedented resolution. In-dep analysis of cellular heterogeneity will boost our understanding of complex biological systems or processes, including cancer, immune system and chronic diseases, thereby providing valuable insights for clinical and translational research. In this review, we will focus on the application of machine learning methods in single-cell multi-omics data analysis. We will start with the pre-processing of single-cell RNA sequencing (scRNA-seq) data, including data imputation, cross-platform batch effect removal, and cell cycle and cell-type identification. Next, we will introduce advanced data analysis tools and methods used for copy number variance estimate, single-cell pseudo-time trajectory analysis, phylogenetic tree inference, cell-cell interaction, regulatory network inference, and integrated analysis of scRNA-seq and spatial transcriptome data. Finally, we will present the latest analyzing challenges, such as multi-omics integration and integrated analysis of scRNA-seq data.

Keywords: CNV estimation; batch effects removal; cell cycle identification; cell type identification; cell–cell interaction; data imputation; regulatory network inference; trajectory inference.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Brief summary of computational methods for single cell data analysis.
FIGURE 2
FIGURE 2
Conceptual view of reads imputation. (A) Expected distribution of transcripts. (B) Reads notches in bulk RNA-seq. (C) Reads notches in scRNA-seq. For concise, the junction reads are not indicated.

References

    1. Abdelaal T., Michielsen L., Cats D., Hoogduin D., Mei H., Reinders M. J. T., et al. (2019). A comparison of automatic cell identification methods for single-cell RNA sequencing data. Genome Biol. 20:194. 10.1186/s13059-019-1795-z - DOI - PMC - PubMed
    1. Aibar S., Gonzalez-Blas C. B., Moerman T., Huynh-Thu V. A., Imrichova H., Hulselmans G., et al. (2017). SCENIC: single-cell regulatory network inference and clustering. Nat. Methods 14 1083–1086. 10.1038/nmeth.4463 - DOI - PMC - PubMed
    1. Aitchison J. (1982). The statistical analysis of compositional data. J. R. Statist. Soc. Ser. B 44 139–160. 10.1111/j.2517-6161.1982.tb01195.x - DOI
    1. Amodio M., van Dijk D., Srinivasan K., Chen W. S., Mohsen H., Moon K. R., et al. (2019). Exploring single-cell data with deep multitasking neural networks. Nat. Methods 16 1139–1145. 10.1038/s41592-019-0576-7 - DOI - PMC - PubMed
    1. Anafi R. C., Francey L. J., Hogenesch J. B., Kim J. (2017). CYCLOPS reveals human transcriptional rhythms in health and disease. Proc. Natl. Acad. Sci. U.S.A. 114 5312–5317. 10.1073/pnas.1619320114 - DOI - PMC - PubMed

LinkOut - more resources