DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models
- PMID: 30243148
- DOI: 10.1016/j.fsigen.2018.09.003
DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models
Abstract
The field of DNA intelligence focuses on retrieving information from DNA evidence that can help narrow down large groups of suspects or define target groups of interest. With recent breakthroughs on the estimation of geographical ancestry and physical appearance, the estimation of chronological age comes to complete this circle of information. Recent studies have identified methylation sites in the human genome that correlate strongly with age and can be used for the development of age-estimation algorithms. In this study, 110 whole blood samples from individuals aged 11-93 years were analysed using a DNA methylation quantification assay based on bisulphite conversion and massively parallel sequencing (Illumina MiSeq) of 12 CpG sites. Using this data, 17 different statistical modelling approaches were compared based on root mean square error (RMSE) and a Support Vector Machine with polynomial function (SVMp) model was selected for further testing. For the selected model (RMSE = 4.9 years) the mean average error (MAE) of the blind test (n = 33) was calculated at 4.1 years, with 52% of the samples predicting with less than 4 years of error and 86% with less than 7 years. Furthermore, the sensitivity of the method was assessed both in terms of methylation quantification accuracy and prediction accuracy in the first validation of this kind. The described method retained its accuracy down to 10 ng of initial DNA input or ∼2 ng bisulphite PCR input. Finally, 34 saliva samples were analysed and following basic normalisation, the chronological age of the donors was predicted with less than 4 years of error for 50% of the samples and with less than 7 years of error for 70%.
Keywords: Age prediction; Artificial neural networks; DNA methylation; Machine learning; Saliva; Sperm; Whole blood.
Copyright © 2018 Elsevier B.V. All rights reserved.
Similar articles
-
DNA methylation-based forensic age prediction using artificial neural networks and next generation sequencing.Forensic Sci Int Genet. 2017 May;28:225-236. doi: 10.1016/j.fsigen.2017.02.009. Epub 2017 Feb 28. Forensic Sci Int Genet. 2017. PMID: 28254385 Free PMC article.
-
Chronological age prediction based on DNA methylation: Massive parallel sequencing and random forest regression.Forensic Sci Int Genet. 2017 Nov;31:19-28. doi: 10.1016/j.fsigen.2017.07.015. Epub 2017 Aug 1. Forensic Sci Int Genet. 2017. PMID: 28841467
-
Platform-independent models for age prediction using DNA methylation data.Forensic Sci Int Genet. 2019 Jan;38:39-47. doi: 10.1016/j.fsigen.2018.10.005. Epub 2018 Oct 9. Forensic Sci Int Genet. 2019. PMID: 30336352
-
Forensic individual age estimation with DNA: From initial approaches to methylation tests.Forensic Sci Rev. 2017 Jul;29(2):121-144. Forensic Sci Rev. 2017. PMID: 28691915 Review.
-
Methodological aspects of whole-genome bisulfite sequencing analysis.Brief Bioinform. 2015 May;16(3):369-79. doi: 10.1093/bib/bbu016. Epub 2014 May 27. Brief Bioinform. 2015. PMID: 24867940 Review.
Cited by
-
Applications of massively parallel sequencing in forensic genetics.Genet Mol Biol. 2022 Sep 19;45(3 Suppl 1):e20220077. doi: 10.1590/1678-4685-GMB-2022-0077. eCollection 2022. Genet Mol Biol. 2022. PMID: 36121926 Free PMC article.
-
Predicting Chronological Age from DNA Methylation Data: A Machine Learning Approach for Small Datasets and Limited Predictors.Methods Mol Biol. 2022;2432:187-200. doi: 10.1007/978-1-0716-1994-0_14. Methods Mol Biol. 2022. PMID: 35505216
-
Accurate age estimation from blood samples of Han Chinese individuals using eight high-performance age-related CpG sites.Int J Legal Med. 2022 Nov;136(6):1655-1665. doi: 10.1007/s00414-022-02865-3. Epub 2022 Jul 11. Int J Legal Med. 2022. PMID: 35819508
-
Longitudinal changes and variation in human DNA methylation analysed with the Illumina MethylationEPIC BeadChip assay and their implications on forensic age prediction.Sci Rep. 2023 Dec 8;13(1):21658. doi: 10.1038/s41598-023-49064-7. Sci Rep. 2023. PMID: 38066081 Free PMC article.
-
Postmortem age estimation via DNA methylation analysis in buccal swabs from corpses in different stages of decomposition-a "proof of principle" study.Int J Legal Med. 2021 Jan;135(1):167-173. doi: 10.1007/s00414-020-02360-7. Epub 2020 Jul 7. Int J Legal Med. 2021. PMID: 32632799 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical