Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 1;12(5):683.
doi: 10.3390/genes12050683.

Hippocampal Subregion and Gene Detection in Alzheimer's Disease Based on Genetic Clustering Random Forest

Affiliations

Hippocampal Subregion and Gene Detection in Alzheimer's Disease Based on Genetic Clustering Random Forest

Jin Li et al. Genes (Basel). .

Abstract

The distinguishable subregions that compose the hippocampus are differently involved in functions associated with Alzheimer's disease (AD). Thus, the identification of hippocampal subregions and genes that classify AD and healthy control (HC) groups with high accuracy is meaningful. In this study, by jointly analyzing the multimodal data, we propose a novel method to construct fusion features and a classification method based on the random forest for identifying the important features. Specifically, we construct the fusion features using the gene sequence and subregions correlation to reduce the diversity in same group. Moreover, samples and features are selected randomly to construct a random forest, and genetic algorithm and clustering evolutionary are used to amplify the difference in initial decision trees and evolve the trees. The features in resulting decision trees that reach the peak classification are the important "subregion gene pairs". The findings verify that our method outperforms well in classification performance and generalization. Particularly, we identified some significant subregions and genes, such as hippocampus amygdala transition area (HATA), fimbria, parasubiculum and genes included RYR3 and PRKCE. These discoveries provide some new candidate genes for AD and demonstrate the contribution of hippocampal subregions and genes to AD.

Keywords: clustering evolution; genetic algorithm; hippocampus; random forest; subregion.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
The Manhattan plots of CA1 of HC and AD. CA1 = cornu ammonis 1 region; HC = healthy control; AD = Alzheimer’s disease.
Figure 2
Figure 2
The anatomical representation of the 12 hippocampal subregions. The color represented different subregions. HATA = the hippocampus amygdala transition area; GL_ML_DG = the granule cell layer and molecular of the dentate gyrus; CA1 = cornu ammonis 1 region; CA3 = cornu ammonis 3 region; CA4 = cornu ammonis 4 region.
Figure 3
Figure 3
The schematic diagram of genetic clustering random forest. Genetic algorithm and clustering evolution were applied to increase the difference among basic classifiers and further improve their diversity and accuracy.
Figure 4
Figure 4
The schematic diagram of genetic evolution. The clustering evolutionary was used to select the parent.
Figure 5
Figure 5
The relationship among the clustering evolution times, the genetic evolution times and the size of initial random forest in genetic clustering random forest. The dotted line indicates the accuracy of classification. The solid lines and bars indicate the number of genetic evolution times and clustering evolution times according to the initial random forest size.
Figure 6
Figure 6
The relationship between the accuracies and the size of initial random forest.
Figure 7
Figure 7
The relationship between the genetic evolution times and the size of initial random forest.
Figure 8
Figure 8
The relationship between the clustering evolution times and the size of initial random forest. The dotted line indicates the accuracy of classification. The solid lines indicate the number of clustering evolution times according to the initial random forest size.
Figure 9
Figure 9
The relationship curves of accuracy and the four methods in 10 experiments.
Figure 10
Figure 10
The ability of the traditional random forest to classify the subsets.
Figure 11
Figure 11
The top 475 “subregion-gene pairs” and the first 15 important “subregion-gene pairs”. Nodes denote the subregions and genes. Edges denote the association between subregions and genes, and the widths of edges denote the frequency of each “subregion-gene pair”.
Figure 12
Figure 12
The classification accuracy curve of the proposed model based on three datasets.

Similar articles

Cited by

References

    1. Newton-Cheh C., Hirschhorn J.N. Genetic association studies of complex traits: Design and analysis issues. Mutat. Res. Fundam. Mol. Mech. Mutagenesis. 2005;573:54–69. doi: 10.1016/j.mrfmmm.2005.01.006. - DOI - PubMed
    1. Iglesias J.E., Augustinack J.C., Nguyen K., Player C.M., Player A., Wright M., Roy N., Frosch M.P., Mckee A.C., Wald L.L. A computational atlas of the hippocampal formation using ex vivo, ultra-high resolution mri: Application to adaptive segmentation of in vivo mri. Neuroimage. 2015;115:117–137. doi: 10.1016/j.neuroimage.2015.04.042. - DOI - PMC - PubMed
    1. Zeidman P., Maguire E.A. Anterior hippocampus: The anatomy of perception, imagination and episodic memory. Nat. Rev. Neurosci. 2016;17:173–182. doi: 10.1038/nrn.2015.24. - DOI - PMC - PubMed
    1. Cong S., Risacher S.L., West J.D., Wu Y.C., Apostolova L.G., Tallman E., Rizkalla M., Salama P., Saykin A.J., Shen L. Volumetric Comparison of Hippocampal Subfields Extracted from 4-Minute Accelerated versus 8-Minute High-resolution T2-weighted 3T MRI Scans. Brain Imaging Behav. 2018;12:1583–1595. doi: 10.1007/s11682-017-9819-3. - DOI - PMC - PubMed
    1. Cong S., Yao X., Huang Z., Risacher S.L., Nho K., Saykin A.J., Shen L. Volumetric gwas of medial temporal lobe structures identifies an erc1 locus using adni high-resolution t2-weighted mri data. Neurobiol. Aging. 2020;95:81–93. doi: 10.1016/j.neurobiolaging.2020.07.005. - DOI - PMC - PubMed

Publication types

Substances