Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 11;16(8):e0255085.
doi: 10.1371/journal.pone.0255085. eCollection 2021.

Identification of conserved transcriptome features between humans and Drosophila in the aging brain utilizing machine learning on combined data from the NIH Sequence Read Archive

Affiliations

Identification of conserved transcriptome features between humans and Drosophila in the aging brain utilizing machine learning on combined data from the NIH Sequence Read Archive

Joe L Webb et al. PLoS One. .

Abstract

Aging is universal, yet characterizing the molecular changes that occur in aging which lead to an increased risk for neurological disease remains a challenging problem. Aging affects the prefrontal cortex (PFC), which governs executive function, learning, and memory. Previous sequencing studies have demonstrated that aging alters gene expression in the PFC, however the extent to which these changes are conserved across species and are meaningful in neurodegeneration is unknown. Identifying conserved, age-related genetic and morphological changes in the brain allows application of the wealth of tools available to study underlying mechanisms in model organisms such as Drosophila melanogaster. RNA sequencing data from human PFC and fly heads were analyzed to determine conserved transcriptome signatures of age. Our analysis revealed that expression of 50 conserved genes can accurately determine age in Drosophila (R2 = 0.85) and humans (R2 = 0.46). These transcriptome signatures were also able to classify Drosophila into three age groups with a mean accuracy of 88% and classify human samples with a mean accuracy of 69%. Overall, this work identifies 50 highly conserved aging-associated genetic changes in the brain that can be further studied in model organisms and demonstrates a novel approach to uncovering genetic changes conserved across species from multi-study public databases.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Workflow of biological age prediction using XGBoost across species.
Depicting selection of genes for both aging involvement and conservation between Human and Drosophila.
Fig 2
Fig 2. Biological age prediction using XGBoost across species.
A) Sample regression analysis using all human genes to predict human age. B) Histogram of R2 values for predicting human age with all available human genes with a mean R2 of 0.61. C) A sample regression analysis with all Drosophila genes to predict Drosophila age. D) Histogram of R2 values for predicting Drosophila age using all available Drosophila genes with a mean R2 of 0.93. All results calculated using XGBoost. Histograms represent averages across 1000 bootstrapped random samplings where the regressor or classifier was trained on 75% of the samples and tested on 25% with values reported as the mean across 1000 iterations. Median AE stands for Median Average Error.
Fig 3
Fig 3. Feature selection using aging correlated genes across species.
A) Sample regression analysis using the top 1000 human aging correlated genes to predict human age. B) Histogram of R2 values using the top 1000 human aging correlated genes to predict human age with a mean R2 of 0.62. C) Sample regression analysis predicting Drosophila age using the top 1000 aging correlated Drosophila genes. D) Histogram of R2 values predicting Drosophila age using the top 1000 Drosophila aging correlated genes with mean R2 of 0.95. All results calculated using XGBoost. All histograms represent averages across 1000 bootstrapped random samplings where the regressor or classifier was trained on 75% of the samples and tested on 25%. All histogram values are reported as the mean across 1000 iterations. Median AE stands for median average error.
Fig 4
Fig 4. Intersection of homologous aging-correlated genes.
A) Sample regression analysis of the overlapping 50 aging correlated genes across humans. B) Histogram of R2 values of human age prediction using the conserved 50 aging correlated genes with a mean R2 of 0.46. C) Sample regression analysis of the mean R2 values predicting Drosophila age using the 50 aging correlated genes. D) Histogram of R2 values predicting Drosophila age using the conserved 50 aging correlated genes with a mean R2 of 0.85. All results calculated using XGBoost. All histogram results represent averages across 1000 bootstrapped random samplings where the regressor or classifier was trained on 75% of the samples and tested on 25%. All histogram values are reported as the mean across 1000 iterations. Median AE stands for Median Average Error.
Fig 5
Fig 5
A) A heat map of gene expression in young, middle aged, and old human prefrontal cortex of the 50 conserved genes. Shift of color from blue to red indicates an increase in expression relative to all gene expression in the dataset as indicated B) Interactome representing STRING Network interactions of 50 conserved genes. Networks connected through ‘edge’ lines represented by STRING confidence score. Line thickness indicates interaction confidence scores with greater thickness indicating higher confidence interaction. The table on the right indicates KEGG pathways represented by the STRING network. Colors in nodes represent involvement in pathway, see table. Orange border surrounding a node indicates occurrence within 1000 genes with a reported association of either Parkinson’s or Alzheimer’s disease.

References

    1. Grady CL, McIntosh AR, Craik FIM. Task-related activity in prefrontal cortex and its relation to recognition memory performance in young and old adults. Neuropsychologia. 2005;43(10):1466–81. doi: 10.1016/j.neuropsychologia.2004.12.016 - DOI - PubMed
    1. Morrison JH, Baxter MG. The ageing cortical synapse: Hallmarks and implications for cognitive decline. Vol. 13, Nature Reviews Neuroscience. 2012. p. 240–50. doi: 10.1038/nrn3200 - DOI - PMC - PubMed
    1. Zhuang J, Zhang L, Dai S, Cui L, Guo C, Sloofman L, et al.. Comparison of multi-tissue aging between human and mouse. Sci Rep. 2019Dec1;9(1). doi: 10.1038/s41598-019-42485-3 - DOI - PMC - PubMed
    1. West AE, Greenberg ME. Neuronal activity-regulated gene transcription in synapse development and cognitive function. Vol. 3, Cold Spring Harbor Perspectives in Biology. 2011. p. 1–21. doi: 10.1101/cshperspect.a005744 - DOI - PMC - PubMed
    1. Loerch PM, Lu T, Dakin KA, Vann JM, Isaacs A, Geula C, et al.. Evolution of the aging brain transcriptome and synaptic regulation. PLoS One. 2008Oct2;3(10). doi: 10.1371/journal.pone.0003329 - DOI - PMC - PubMed

Publication types

LinkOut - more resources