Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep:195:110562.
doi: 10.1016/j.compbiomed.2025.110562. Epub 2025 Jun 23.

A model developed by machine learning and principal component analysis for predicting bronchiolitis obliterans syndrome after allogeneic HSCT in a Chinese population with hematologic malignancies

Affiliations

A model developed by machine learning and principal component analysis for predicting bronchiolitis obliterans syndrome after allogeneic HSCT in a Chinese population with hematologic malignancies

Jia Chen et al. Comput Biol Med. 2025 Sep.

Abstract

Bronchiolitis obliterans syndrome (BOS) is a severe pulmonary complication following allogeneic hematopoietic stem cell transplantation (allo-HSCT), with early prediction being crucial. While pulmonary function tests (PFTs) are fundamental for BOS assessment, the predictive value of pretransplant PFT results remains uncertain. In this study involving allo-HSCT recipients with hematologic malignancies who survived over 100 days post-HSCT, we aimed to determine the predictive significance of pretransplant PFTs for BOS. Data from 742 eligible patients were randomly divided into training and validation cohorts, and machine learning algorithms were employed for feature engineering, feature selection, and modeling. Principal component analysis (PCA) was utilized to reduce the dimensionality of PFT parameters. Over a median follow-up of 573.5 days, 57 patients developed BOS, with a median interval of 269 days from transplantation to BOS onset. Our multivariate logistic regression (MLR) model, incorporating the first principal components of PCA-treated PFT results along with age, sex, and previous nonpulmonary chronic graft-versus-host disease (GvHD), demonstrated discriminant AUC values of 0.671 (95 % CI, 0.522-0.820) and 0.669 (95 % CI, 0.588-0.751) in the validation and training sets, respectively. Pretransplant PFT results emerged as pivotal in predicting BOS risk post-allo-HSCT. The MLR model, developed through data-driven machine learning, effectively identifies high-risk BOS populations at an early stage.

Keywords: Bronchiolitis obliterans syndrome; Hematopoietic stem cell transplantation; Machine learning; Predictive model; Pulmonary function test.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The author declares that there is no conflict of interest to this work.

Similar articles

MeSH terms

Supplementary concepts

LinkOut - more resources