A Machine Learning Application to Predict Early Lung Involvement in Scleroderma: A Feasibility Evaluation
- PMID: 34679580
- PMCID: PMC8534403
- DOI: 10.3390/diagnostics11101880
A Machine Learning Application to Predict Early Lung Involvement in Scleroderma: A Feasibility Evaluation
Abstract
Introduction: Systemic sclerosis (SSc) is a systemic immune-mediated disease, featuring fibrosis of the skin and organs, and has the greatest mortality among rheumatic diseases. The nervous system involvement has recently been demonstrated, although actual lung involvement is considered the leading cause of death in SSc and, therefore, should be diagnosed early. Pulmonary function tests are not sensitive enough to be used for screening purposes, thus they should be flanked by other clinical examinations; however, this would lead to a risk of overtesting, with considerable costs for the health system and an unnecessary burden for the patients. To this extent, Machine Learning (ML) algorithms could represent a useful add-on to the current clinical practice for diagnostic purposes and could help retrieve the most useful exams to be carried out for diagnostic purposes.
Method: Here, we retrospectively collected high resolution computed tomography, pulmonary function tests, esophageal pH impedance tests, esophageal manometry and reflux disease questionnaires of 38 patients with SSc, applying, with R, different supervised ML algorithms, including lasso, ridge, elastic net, classification and regression trees (CART) and random forest to estimate the most important predictors for pulmonary involvement from such data.
Results: In terms of performance, the random forest algorithm outperformed the other classifiers, with an estimated root-mean-square error (RMSE) of 0.810. However, this algorithm was seen to be computationally intensive, leaving room for the usefulness of other classifiers when a shorter response time is needed.
Conclusions: Despite the notably small sample size, that could have prevented obtaining fully reliable data, the powerful tools available for ML can be useful for predicting early lung involvement in SSc patients. The use of predictors coming from spirometry and pH impedentiometry together might perform optimally for predicting early lung involvement in SSc.
Keywords: HRCT chest; artificial intelligence; esophageal dilatation; machine learning; systemic sclerosis.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Similar articles
-
Brief Report: Pulmonary Function Tests: High Rate of False-Negative Results in the Early Detection and Screening of Scleroderma-Related Interstitial Lung Disease.Arthritis Rheumatol. 2015 Dec;67(12):3256-61. doi: 10.1002/art.39405. Arthritis Rheumatol. 2015. PMID: 26316389
-
Clinical algorithms for the diagnosis and prognosis of interstitial lung disease in systemic sclerosis.Semin Arthritis Rheum. 2017 Oct;47(2):228-234. doi: 10.1016/j.semarthrit.2017.03.019. Epub 2017 Apr 1. Semin Arthritis Rheum. 2017. PMID: 28454677
-
Worsening of esophageal dilatation is associated with increase in a high-resolution computed tomography (HRCT) score in early systemic sclerosis-associated interstitial lung disease (SSc-ILD).Clin Rheumatol. 2021 Mar;40(3):955-963. doi: 10.1007/s10067-020-05346-3. Epub 2020 Aug 15. Clin Rheumatol. 2021. PMID: 32803568
-
Serum concentration of surfactant protein D in patients with systemic sclerosis: The potential marker of the interstitial lung disease severity.Best Pract Res Clin Rheumatol. 2018 Aug;32(4):541-549. doi: 10.1016/j.berh.2019.01.005. Epub 2019 Feb 14. Best Pract Res Clin Rheumatol. 2018. PMID: 31174823 Review.
-
Systemic sclerosis. A clinical overview.Adv Exp Med Biol. 1999;455:73-83. Adv Exp Med Biol. 1999. PMID: 10599326 Review.
Cited by
-
COVID-19 Detection in Chest X-ray Images Using a New Channel Boosted CNN.Diagnostics (Basel). 2022 Jan 21;12(2):267. doi: 10.3390/diagnostics12020267. Diagnostics (Basel). 2022. PMID: 35204358 Free PMC article.
-
Biomarkers in Systemic Sclerosis: An Overview.Curr Issues Mol Biol. 2023 Sep 25;45(10):7775-7802. doi: 10.3390/cimb45100490. Curr Issues Mol Biol. 2023. PMID: 37886934 Free PMC article. Review.
-
Machine Learning Based Multi-Parameter Modeling for Prediction of Post-Inflammatory Lung Changes.Diagnostics (Basel). 2025 Mar 20;15(6):783. doi: 10.3390/diagnostics15060783. Diagnostics (Basel). 2025. PMID: 40150125 Free PMC article.
-
The Use and Utility of Machine Learning in Achieving Precision Medicine in Systemic Sclerosis: A Narrative Review.J Pers Med. 2022 Jul 23;12(8):1198. doi: 10.3390/jpm12081198. J Pers Med. 2022. PMID: 35893293 Free PMC article. Review.
-
Machine Learning Analysis of Electronic Health Records Identifies Interstitial Lung Disease and Predicts Mortality in Patients with Systemic Sclerosis.medRxiv [Preprint]. 2025 Jun 4:2025.06.02.25328786. doi: 10.1101/2025.06.02.25328786. medRxiv. 2025. PMID: 40502596 Free PMC article. Preprint.
References
LinkOut - more resources
Full Text Sources