Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 15:16:841145.
doi: 10.3389/fnins.2022.841145. eCollection 2022.

Identification of cortical interneuron cell markers in mouse embryos based on machine learning analysis of single-cell transcriptomics

Affiliations

Identification of cortical interneuron cell markers in mouse embryos based on machine learning analysis of single-cell transcriptomics

Zhandong Li et al. Front Neurosci. .

Abstract

Mammalian cortical interneurons (CINs) could be classified into more than two dozen cell types that possess diverse electrophysiological and molecular characteristics, and participate in various essential biological processes in the human neural system. However, the mechanism to generate diversity in CINs remains controversial. This study aims to predict CIN diversity in mouse embryo by using single-cell transcriptomics and the machine learning methods. Data of 2,669 single-cell transcriptome sequencing results are employed. The 2,669 cells are classified into three categories, caudal ganglionic eminence (CGE) cells, dorsal medial ganglionic eminence (dMGE) cells, and ventral medial ganglionic eminence (vMGE) cells, corresponding to the three regions in the mouse subpallium where the cells are collected. Such transcriptomic profiles were first analyzed by the minimum redundancy and maximum relevance method. A feature list was obtained, which was further fed into the incremental feature selection, incorporating two classification algorithms (random forest and repeated incremental pruning to produce error reduction), to extract key genes and construct powerful classifiers and classification rules. The optimal classifier could achieve an MCC of 0.725, and category-specified prediction accuracies of 0.958, 0.760, and 0.737 for the CGE, dMGE, and vMGE cells, respectively. The related genes and rules may provide helpful information for deepening the understanding of CIN diversity.

Keywords: cortical interneuron diversity; embryo; ganglionic eminences; machine learning; rule learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Overall workflow of the study.
FIGURE 2
FIGURE 2
The IFS curves of RIPPER and RF. RIPPER can reach the highest MCC of 0.577, whereas RF can reach the highest MCC of 0.725.
FIGURE 3
FIGURE 3
The Individual accuracies of some key classifier. Two RF classifiers provide almost equal performance and are superior to the RIPPER classifier.
FIGURE 4
FIGURE 4
The IFS curve of RF between 10 and 500. The RF classifier with top 120 features yielded a little lower MCC than the classifier with top 240 features, which is the optimal RF classifier.
FIGURE 5
FIGURE 5
Box plot to show the performance of RF classifier with top 120 features under 10-fold cross-validation for 10 times. ACC and MCC vary in a small range, indicating the stability of the classifier.

References

    1. Agoston Z., Li N., Haslinger A., Wizenmann A., Schulte D. (2012). Genetic and physical interaction of Meis2, Pax3 and Pax7 during dorsal midbrain development. BMC Dev. Biol. 12:10. 10.1186/1471-213X-12-10 - DOI - PMC - PubMed
    1. Bae C. J., Park B. Y., Lee Y. H., Tobias J. W., Hong C. S., Saintjeannet J. P. (2014). Identification of Pax3 and Zic1 targets in the developing neural crest. Dev. Biol. 386 473–483. 10.1016/j.ydbio.2013.12.011 - DOI - PMC - PubMed
    1. Bouilloux F., Thireau J., Ventéo S., Farah C., Karam S., Dauvilliers Y., et al. (2015). Loss of the transcription factor Meis1 prevents sympathetic neurons target-field innervation and increases susceptibility to sudden cardiac death. eLife 5:e11627. 10.7554/eLife.11627 - DOI - PMC - PubMed
    1. Breiman L. (2001). Random forests. Mach. Learn. 45 5–32.
    1. Chen L., Wang S., Zhang Y.-H., Li J., Xing Z.-H., Yang J., et al. (2017). Identify key sequence features to improve CRISPR sgRNA efficacy. IEEE Access 5 26582–26590.

LinkOut - more resources