A Machine Learning Approach to Parkinson's Disease Blood Transcriptomics
- PMID: 35627112
- PMCID: PMC9141063
- DOI: 10.3390/genes13050727
A Machine Learning Approach to Parkinson's Disease Blood Transcriptomics
Abstract
The increased incidence and the significant health burden associated with Parkinson's disease (PD) have stimulated substantial research efforts towards the identification of effective treatments and diagnostic procedures. Despite technological advancements, a cure is still not available and PD is often diagnosed a long time after onset when irreversible damage has already occurred. Blood transcriptomics represents a potentially disruptive technology for the early diagnosis of PD. We used transcriptome data from the PPMI study, a large cohort study with early PD subjects and age matched controls (HC), to perform the classification of PD vs. HC in around 550 samples. Using a nested feature selection procedure based on Random Forests and XGBoost we reached an AUC of 72% and found 493 candidate genes. We further discussed the importance of the selected genes through a functional analysis based on GOs and KEGG pathways.
Keywords: Parkinson’s disease; blood transcriptomics; feature selection; inflammation; machine learning; mitochondrial dysfunction; oxidative stress; xgboost.
Conflict of interest statement
The authors declare no conflict of interest.
Figures







References
-
- GBD Disease Incidence, Prevalence Collaborators Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392:1789–1858. doi: 10.1016/S0140-6736(18)32279-7. - DOI - PMC - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Medical