Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 20;153(1):28.
doi: 10.1007/s10709-025-00243-6.

Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8

Affiliations

Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8

Abbas Karimi-Fard et al. Genetica. .

Abstract

Bacteria are constantly exposed to diverse environmental stresses, necessitating complex adaptive mechanisms for survival. Thermus thermophilus, a thermophilic extremophile, serves as an excellent model for investigating these responses due to its remarkable resilience to harsh conditions. Recent advances in artificial intelligence, particularly in machine learning, have transformed the identification of novel stress-responsive biomarkers. In this study, we analyzed transcriptomic data from 65 T. thermophilus HB8 samples subjected to various abiotic stresses to identify key genes involved in stress adaptation. We applied a suite of supervised machine learning algorithms to classify samples and prioritize informative features. Among the tested models, Extreme Gradient Boosting (XGBoost) and Random Forest (RF) achieved the highest classification performance, with XGBoost attaining perfect discrimination between stressed and control samples (AUC = 1.00) and RF closely following (AUC = 0.99). Feature importance analysis consistently identified three candidate genes: TTHA0029, TTHA1720, and TTHA1359. Functional validation using RT-qPCR confirmed the significant upregulation of TTHA0029 and TTHA1720 under salt and hydrogen peroxide stress, suggesting roles in redox regulation and ionic homeostasis. Phylogenetic analysis further revealed the specificity of these genes to the Thermus genus. Overall, our findings highlight central molecular players in stress tolerance in T. thermophilus and demonstrate the utility of machine learning in biomarker discovery. The identified genes, TTHA0029 and TTHA1720, may serve as promising targets for genetic engineering to improve stress resilience in both crops and industrially relevant microorganisms.

Keywords: Bacteria; Environmental stress; Gene expression; Machine learning; RT-qPCR.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Similar articles

References

    1. Agari Y, Kuramitsu S, Shinkai A (2010) Identification of novel genes regulated by the oxidative stress-responsive transcriptional activator SdrP in thermus thermophilus HB8. FEMS Microbiol Lett 313:127–134 - PubMed
    1. Ahsen ME (2025) Harnessing unsupervised ensemble learning for biomedical applications: A review of methods and advances. Mathematics 13:420
    1. Apel K, Hirt H (2004) Reactive oxygen species: metabolism, oxidative stress, and signal transduction. Annu Rev Plant Biol 55:373–399 - PubMed
    1. Baldi P, Long AD (2001) A bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes. Bioinformatics 17:509–519 - PubMed
    1. Bonilla CY (2020) Generally stressed out bacteria: environmental stress response mechanisms in gram-positive bacteria. Integr Comp Biol 60:126–133 - PubMed

MeSH terms

Substances

LinkOut - more resources