Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 May 19:10:1174631.
doi: 10.3389/fmed.2023.1174631. eCollection 2023.

Predicting total lung capacity from spirometry: a machine learning approach

Affiliations

Predicting total lung capacity from spirometry: a machine learning approach

Luka Beverin et al. Front Med (Lausanne). .

Abstract

Background and objective: Spirometry patterns can suggest that a patient has a restrictive ventilatory impairment; however, lung volume measurements such as total lung capacity (TLC) are required to confirm the diagnosis. The aim of the study was to train a supervised machine learning model that can accurately estimate TLC values from spirometry and subsequently identify which patients would most benefit from undergoing a complete pulmonary function test.

Methods: We trained three tree-based machine learning models on 51,761 spirometry data points with corresponding TLC measurements. We then compared model performance using an independent test set consisting of 1,402 patients. The best-performing model was used to retrospectively identify restrictive ventilatory impairment in the same test set. The algorithm was compared against different spirometry patterns commonly used to predict restriction.

Results: The prevalence of restrictive ventilatory impairment in the test set is 16.7% (234/1402). CatBoost was the best-performing machine learning model. It predicted TLC with a mean squared error (MSE) of 560.1 mL. The sensitivity, specificity, and F1-score of the optimal algorithm for predicting restrictive ventilatory impairment was 83, 92, and 75%, respectively.

Conclusion: A machine learning model trained on spirometry data can estimate TLC to a high degree of accuracy. This approach could be used to develop future smart home-based spirometry solutions, which could aid decision making and self-monitoring in patients with restrictive lung diseases.

Keywords: interstitial lung disease; machine learning; restriction; spirometry; total lung capacity.

PubMed Disclaimer

Conflict of interest statement

MT, AH, and PD were employed by the ArtiQ NV. MV has received consultancy fees from ArtiQ NV. WJ was a shareholder at ArtiQ NV. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Illustration of the machine learning-based algorithm for predicting total lung capacity. MSE, mean squared error.
Figure 2
Figure 2
The total lung capacity (TLC) predictions of the CatBoost model (TLCCatBoost) against the reference TLC measurements in the independent test set, grouped by true restriction defined as TLC < lower limit of normal (LLN). The black dashed line represents the line of ideal agreement.
Figure 3
Figure 3
The prediction error for each diagnosis is calculated as the difference between the average total lung capacity (TLC) value and the average TLCCatBoost prediction for that group. Bars above and below the horizontal dotted line indicate model underestimation and overestimation, respectively. COPD, chronic obstructive pulmonary disease; ILD, interstitial lung disease; OBD, other obstructive disease; NMD, neuromuscular disease; PVD, pulmonary vascular disease; TD, thoracic deformity.

References

    1. Martinez-Pitre PJ, Sabbula BR, Cascella M. In StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing (2023).
    1. Guerra S, Sherrill DL, Venker C, Ceccato CM, Halonen M, Martinez FD. Morbidity and mortality associated with the restrictive spirometric pattern: a longitudinal study. Thorax. (2010) 65:499–504. doi: 10.1136/thx.2009.126052, PMID: - DOI - PMC - PubMed
    1. Raj R, Raparia K, Lynch DA, Brown KK. Surgical lung biopsy for interstitial lung diseases. Chest. (2017) 151:1131–40. doi: 10.1016/j.chest.2016.06.019 - DOI - PubMed
    1. Wanger J, Clausen JL, Coates A, Pedersen OF, Brusasco V, Burgos F, et al. . Standardisation of the measurement of lung volumes. Eur Respir J. (2005) 26:511–22. doi: 10.1183/09031936.05.00035005 - DOI - PubMed
    1. Pellegrino R, Viegi G, Brusasco V, Crapo RO, Burgos F, Casaburi R, et al. . Interpretative strategies for lung function tests. Eur Respir J. (2005) 26:948–68. doi: 10.1183/09031936.05.00035205 - DOI - PubMed

LinkOut - more resources