Machine Learning-Based Transcriptomic Diagnosis of Periodontitis
- PMID: 41265166
- PMCID: PMC12666856
- DOI: 10.1016/j.identj.2025.104028
Machine Learning-Based Transcriptomic Diagnosis of Periodontitis
Abstract
Background: Periodontitis, a prevalent chronic inflammatory disease, remains a global health challenge with conventional diagnostic methods hindered by subjectivity and low sensitivity. This study aimed to develop a machine learning (ML)-based diagnostic framework using transcriptomic data to enhance diagnostic accuracy and efficiency.
Methods: Transcriptomic datasets from 616 samples (452 periodontitis, 164 healthy controls) were retrieved from the Gene Expression Omnibus (GEO). Differentially expressed genes (DEGs) were identified, and functional enrichment, weighted gene co-expression network analysis (WGCNA), and immune infiltration profiling were performed. Key biomarkers were refined using Boruta and Least Absolute Shrinkage and Selection Operator (LASSO) algorithms. Independent six ML models were constructed and validated. A nomogram for risk prediction, transcription factor networks, and drug-target interactions were analysed.
Results: Five diagnostic biomarkers (CSF2RB, COL15A1, MME, NEFM, CYP24A1) were identified, with robust performance across datasets. The Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) achieved perfect classification in training and high accuracy in external validation. Immune infiltration analysis revealed significant correlations between biomarkers and immune cell populations (eg, dendritic cells, T cells). Transcription factor networks highlighted NFYA and SP1 as central regulators. Drug prediction identified re-purposable candidates with validated molecular docking affinity.
Conclusion: This study establishes a ML-driven diagnostic framework for periodontitis, integrating transcriptomic, immune, and regulatory network insights. These gene biomarkers may provide novel insight into periodontitis pathogenesis, while our diagnostic models show potential for clinical utility in personalised diagnosis, targeted intervention, and therapeutic development.
Plain language summary: Periodontitis is a common, serious condition often diagnosed too late using traditional methods that can be subjective. To improve detection, we developed an machine learning (ML) tool that analyses genetic activity in gum tissue. Using data from 616 patient samples, we identified five key genes (CSF2RB, COL15A1, MME, NEFM, CYP24A1) that act as biological 'flags' for gum disease. These genes are linked to immune responses that drive gum inflammation. Our ML models - especially two types called Random Forest and XGBoost - perfectly spotted gum disease in initial tests and remained highly accurate in new patient groups. We also created a simple scoring chart (nomogram) to predict individual risk. The genes we found interact with immune cells and vitamin D pathways, revealing new disease mechanisms. This work provides a faster, more objective way to diagnose gum disease and opens doors for personalised treatments.
Keywords: Artificial intelligence; Diagnostic models; Machine learning; Periodontitis; Precision medicine; Statistical.
Copyright © 2025 The Authors. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Conflict of interest None disclosed.
Figures
References
-
- Bartold PM. Lifestyle and periodontitis: the emergence of personalized periodontics. Periodontol 2000. 2018;78(1):7–11. - PubMed
-
- Stavropoulos A., Bertl K., Spineli L.M., Sculean A., Cortellini P., Tonetti M. Medium- and long-term clinical benefits of periodontal regenerative/reconstructive procedures in intrabony defects: systematic review and network meta-analysis of randomized controlled clinical studies. J Clin Periodontol. 2021;48(3):410–430. - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
