Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8
- PMID: 40833705
- DOI: 10.1007/s10709-025-00243-6
Abstract
Bacteria are constantly exposed to diverse environmental stresses, necessitating complex adaptive mechanisms for survival. Thermus thermophilus, a thermophilic extremophile, serves as an excellent model for investigating these responses due to its remarkable resilience to harsh conditions. Recent advances in artificial intelligence, particularly in machine learning, have transformed the identification of novel stress-responsive biomarkers. In this study, we analyzed transcriptomic data from 65 T. thermophilus HB8 samples subjected to various abiotic stresses to identify key genes involved in stress adaptation. We applied a suite of supervised machine learning algorithms to classify samples and prioritize informative features. Among the tested models, Extreme Gradient Boosting (XGBoost) and Random Forest (RF) achieved the highest classification performance, with XGBoost attaining perfect discrimination between stressed and control samples (AUC = 1.00) and RF closely following (AUC = 0.99). Feature importance analysis consistently identified three candidate genes: TTHA0029, TTHA1720, and TTHA1359. Functional validation using RT-qPCR confirmed the significant upregulation of TTHA0029 and TTHA1720 under salt and hydrogen peroxide stress, suggesting roles in redox regulation and ionic homeostasis. Phylogenetic analysis further revealed the specificity of these genes to the Thermus genus. Overall, our findings highlight central molecular players in stress tolerance in T. thermophilus and demonstrate the utility of machine learning in biomarker discovery. The identified genes, TTHA0029 and TTHA1720, may serve as promising targets for genetic engineering to improve stress resilience in both crops and industrially relevant microorganisms.
Keywords: Bacteria; Environmental stress; Gene expression; Machine learning; RT-qPCR.
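As a rough illustration of the metric reported in the abstract, the sketch below computes the area under the ROC curve (AUC) as the probability that a stressed sample scores higher than a control sample, and then ranks genes by how well each one alone separates the two groups. This is not the authors' pipeline: per-gene AUC ranking is a simple stand-in for the XGBoost and Random Forest feature importances used in the study, and the gene names (`gene_A`, `gene_B`), expression values, and function names are hypothetical toy inputs chosen only for demonstration.

```python
def auc(scores, labels):
    """AUC as the fraction of (stressed, control) pairs ranked correctly.

    labels: 1 = stressed, 0 = control. Ties count as half a correct pair.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rank_genes(expression, labels):
    """Rank genes by |AUC - 0.5|, so strong up- or down-regulation ranks first."""
    scored = {gene: auc(values, labels) for gene, values in expression.items()}
    return sorted(scored.items(), key=lambda kv: abs(kv[1] - 0.5), reverse=True)


# Hypothetical toy data: 4 stressed samples followed by 4 controls.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
expression = {
    "gene_A": [5.1, 4.8, 5.5, 4.9, 2.0, 2.2, 1.9, 2.1],  # higher under stress
    "gene_B": [3.0, 2.9, 3.2, 3.1, 3.1, 3.0, 2.8, 3.3],  # no clear separation
}

ranking = rank_genes(expression, labels)
for gene, value in ranking:
    print(f"{gene}: AUC = {value:.2f}")
```

An AUC of 1.00 means every stressed sample scored above every control sample, while 0.50 corresponds to chance-level separation. With only 65 samples, a perfect score is best read alongside how the model was validated, which the abstract does not state.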
© 2025. The Author(s), under exclusive licence to Springer Nature Switzerland AG.
Conflict of interest statement
Declarations. Competing interests: The authors declare no competing interests.