Predicting Future Respiratory Hospitalizations in Extremely Premature Neonates Using Transcriptomic Data and Machine Learning
- PMID: 40868449
- PMCID: PMC12385036
- DOI: 10.3390/children12080996
Predicting Future Respiratory Hospitalizations in Extremely Premature Neonates Using Transcriptomic Data and Machine Learning
Abstract
Background: Extremely premature neonates are at increased risk for respiratory complications, often resulting in recurrent hospitalizations during early childhood. Early identification of preterm infants at highest risk of respiratory hospitalizations could enable targeted preventive interventions. While clinical and demographic factors offer some prognostic value, integrating transcriptomic data may improve predictive accuracy.
Objective: To determine whether early-life gene expression profiles can predict respiratory-related hospitalizations within the first four years of life in extremely preterm neonates.
Methods: We conducted a retrospective cohort study of 58 neonates born at <32 weeks' gestational age, using publicly available transcriptomic data from peripheral blood samples collected on days 5, 14, and 28 of life. Random forest models were trained to predict unplanned respiratory readmissions. Model performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, and area under the receiver operating characteristic curve (AUC).
Results: All three models, built using transcriptomic data from days 5, 14, and 28, demonstrated strong predictive performance (AUC = 0.90), though confidence intervals were wide due to small sample size. We identified 31 genes and eight biological pathways that were differentially expressed between preterm neonates with and without subsequent respiratory readmissions.
Conclusions: Transcriptomic data from the neonatal period, combined with machine learning, accurately predicted respiratory-related rehospitalizations in extremely preterm neonates. The identified gene signatures offer insight into early biological disruptions that may predispose preterm neonates to chronic respiratory morbidity. Validation in larger, diverse cohorts is needed to support clinical translation.
Keywords: bioinformatics; bronchopulmonary dysplasia; machine learning; preterm infants; respiratory morbidity; transcriptomics.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures



References
-
- Preterm Birth. [(accessed on 7 April 2025)]. Available online: https://www.who.int/news-room/fact-sheets/detail/preterm-birth.
-
- Srinivasjois R., Slimings C., Einarsdóttir K., Burgner D., Leonard H. Association of Gestational Age at Birth with Reasons for Subsequent Hospitalisation: 18 Years of Follow-Up in a Western Australian Population Study. PLoS ONE. 2015;10:e0130535. doi: 10.1371/journal.pone.0130535. - DOI - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources