Predicting total lung capacity from spirometry: a machine learning approach

Luka Beverin¹, Marko Topalovic², Armin Halilovic², Paul Desbordes², Wim Janssens³, Maarten De Vos^{4

2}

Affiliations

¹ Statistics Research Centre, KU Leuven, Leuven, Belgium.
² ArtiQ NV, Leuven, Belgium.
³ Laboratory of Respiratory Diseases and Thoracic Surgery, Department of Chronic Diseases Metabolism and Ageing, Ku Leuven, Leuven, Belgium.
⁴ Stadius, Department of Electrical Engineering, KU Leuven, Leuven, Belgium.

PMID: 37275373
PMCID: PMC10238228
DOI: 10.3389/fmed.2023.1174631

Predicting total lung capacity from spirometry: a machine learning approach

Luka Beverin et al. Front Med (Lausanne). 2023.

. 2023 May 19:10:1174631.

doi: 10.3389/fmed.2023.1174631. eCollection 2023.

Authors

Luka Beverin¹, Marko Topalovic², Armin Halilovic², Paul Desbordes², Wim Janssens³, Maarten De Vos^{4

2}

Affiliations

¹ Statistics Research Centre, KU Leuven, Leuven, Belgium.
² ArtiQ NV, Leuven, Belgium.
³ Laboratory of Respiratory Diseases and Thoracic Surgery, Department of Chronic Diseases Metabolism and Ageing, Ku Leuven, Leuven, Belgium.
⁴ Stadius, Department of Electrical Engineering, KU Leuven, Leuven, Belgium.

PMID: 37275373
PMCID: PMC10238228
DOI: 10.3389/fmed.2023.1174631

Abstract

Background and objective: Spirometry patterns can suggest that a patient has a restrictive ventilatory impairment; however, lung volume measurements such as total lung capacity (TLC) are required to confirm the diagnosis. The aim of the study was to train a supervised machine learning model that can accurately estimate TLC values from spirometry and subsequently identify which patients would most benefit from undergoing a complete pulmonary function test.

Methods: We trained three tree-based machine learning models on 51,761 spirometry data points with corresponding TLC measurements. We then compared model performance using an independent test set consisting of 1,402 patients. The best-performing model was used to retrospectively identify restrictive ventilatory impairment in the same test set. The algorithm was compared against different spirometry patterns commonly used to predict restriction.

Results: The prevalence of restrictive ventilatory impairment in the test set is 16.7% (234/1402). CatBoost was the best-performing machine learning model. It predicted TLC with a mean squared error (MSE) of 560.1 mL. The sensitivity, specificity, and F1-score of the optimal algorithm for predicting restrictive ventilatory impairment was 83, 92, and 75%, respectively.

Conclusion: A machine learning model trained on spirometry data can estimate TLC to a high degree of accuracy. This approach could be used to develop future smart home-based spirometry solutions, which could aid decision making and self-monitoring in patients with restrictive lung diseases.

Keywords: interstitial lung disease; machine learning; restriction; spirometry; total lung capacity.

PubMed Disclaimer

Conflict of interest statement

MT, AH, and PD were employed by the ArtiQ NV. MV has received consultancy fees from ArtiQ NV. WJ was a shareholder at ArtiQ NV. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Illustration of the machine learning-based algorithm for predicting total lung capacity. MSE, mean squared error.

**Figure 2**
The total lung capacity (TLC) predictions of the CatBoost model (TLC_CatBoost) against the reference TLC measurements in the independent test set, grouped by true restriction defined as TLC < lower limit of normal (LLN). The black dashed line represents the line of ideal agreement.

**Figure 3**
The prediction error for each diagnosis is calculated as the difference between the average total lung capacity (TLC) value and the average TLC_CatBoost prediction for that group. Bars above and below the horizontal dotted line indicate model underestimation and overestimation, respectively. COPD, chronic obstructive pulmonary disease; ILD, interstitial lung disease; OBD, other obstructive disease; NMD, neuromuscular disease; PVD, pulmonary vascular disease; TD, thoracic deformity.

See this image and copyright information in PMC

References

1. Martinez-Pitre PJ, Sabbula BR, Cascella M. In StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing (2023).
1. Guerra S, Sherrill DL, Venker C, Ceccato CM, Halonen M, Martinez FD. Morbidity and mortality associated with the restrictive spirometric pattern: a longitudinal study. Thorax. (2010) 65:499–504. doi: 10.1136/thx.2009.126052, PMID: - DOI - PMC - PubMed
1. Raj R, Raparia K, Lynch DA, Brown KK. Surgical lung biopsy for interstitial lung diseases. Chest. (2017) 151:1131–40. doi: 10.1016/j.chest.2016.06.019 - DOI - PubMed
1. Wanger J, Clausen JL, Coates A, Pedersen OF, Brusasco V, Burgos F, et al. . Standardisation of the measurement of lung volumes. Eur Respir J. (2005) 26:511–22. doi: 10.1183/09031936.05.00035005 - DOI - PubMed
1. Pellegrino R, Viegi G, Brusasco V, Crapo RO, Burgos F, Casaburi R, et al. . Interpretative strategies for lung function tests. Eur Respir J. (2005) 26:948–68. doi: 10.1183/09031936.05.00035205 - DOI - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Predicting total lung capacity from spirometry: a machine learning approach

Affiliations

Predicting total lung capacity from spirometry: a machine learning approach

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources