Pulmonary emphysema subtypes defined by unsupervised machine learning on CT scans
- PMID: 37268414
- PMCID: PMC10592007
- DOI: 10.1136/thorax-2022-219158
Pulmonary emphysema subtypes defined by unsupervised machine learning on CT scans
Abstract
Background: Treatment and preventative advances for chronic obstructive pulmonary disease (COPD) have been slow due, in part, to limited subphenotypes. We tested if unsupervised machine learning on CT images would discover CT emphysema subtypes with distinct characteristics, prognoses and genetic associations.
Methods: New CT emphysema subtypes were identified by unsupervised machine learning on only the texture and location of emphysematous regions on CT scans from 2853 participants in the Subpopulations and Intermediate Outcome Measures in COPD Study (SPIROMICS), a COPD case-control study, followed by data reduction. Subtypes were compared with symptoms and physiology among 2949 participants in the population-based Multi-Ethnic Study of Atherosclerosis (MESA) Lung Study and with prognosis among 6658 MESA participants. Associations with genome-wide single-nucleotide-polymorphisms were examined.
Results: The algorithm discovered six reproducible (interlearner intraclass correlation coefficient, 0.91-1.00) CT emphysema subtypes. The most common subtype in SPIROMICS, the combined bronchitis-apical subtype, was associated with chronic bronchitis, accelerated lung function decline, hospitalisations, deaths, incident airflow limitation and a gene variant near DRD1, which is implicated in mucin hypersecretion (p=1.1 ×10-8). The second, the diffuse subtype was associated with lower weight, respiratory hospitalisations and deaths, and incident airflow limitation. The third was associated with age only. The fourth and fifth visually resembled combined pulmonary fibrosis emphysema and had distinct symptoms, physiology, prognosis and genetic associations. The sixth visually resembled vanishing lung syndrome.
Conclusion: Large-scale unsupervised machine learning on CT scans defined six reproducible, familiar CT emphysema subtypes that suggest paths to specific diagnosis and personalised therapies in COPD and pre-COPD.
Keywords: COPD epidemiology; Emphysema; Imaging/CT MRI etc.
© Author(s) (or their employer(s)) 2023. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: EDA, PPB, AM, YS, WS, JHMA, MHC, DC, EH, DRJ, SK, JDK, TL, JL, ECO, WP, MRP, SSR, EKS, KEW and AFL reports receiving grants from the National Institutes of Health (NIH). JY performed the work at Columbia University but is now an employee of Google. EAH reports receiving grants from the NIH; being a founder and shareholder of VIDA Diagnostics; and holding patents for an apparatus for analysing CT images to determine the presence of pulmonary tissue pathology, an apparatus for image display and analysis, and a method for multiscale meshing of branching biological structures. EBA reports receiving grants from the American Heart Association and the NIH. CBC reports receiving personal fees from GlaxoSmithKline. MTD reports receiving a grant from the NHLBI and personal fees from AstraZeneca, GlaxoSmithKline, Pulmonx, PneumRx/BTG and Quark. MKH reports consulting for GlaxoSmithKline, AstraZeneca and Boehringer Ingelheim receiving research support from Novartis and Sunovion. NNH reports receiving grants from the NIH, Boehringer Ingelheim, and the COPD Foundation. JDK reports receiving grants from US Environmental Protection Agency and the NIH. FJM reports serving on COPD advisory boards for AstraZeneca, Boehringer Ingelheim, Chiesi, GlaxoSmithKline, Sunovion and Teva; serving as a consultant for ProterixBio and Verona; serving on the steering committees of studies sponsored by the NHLBI, AstraZeneca, and GlaxoSmithKline; having served on data safety and monitoring boards of COPD studies supported by Genentech and GlaxoSmithKline. BMS reports receiving grants from the NIH, Canadian Institutes of Health Research (CIHR), Fonds de la recherche en santé du Québec (FRQS), the Research Institute of the McGill University Health Centre, the Quebec Lung Association and AstraZeneca. PGW reports receiving personal fees for consultancy from Theravance, AstraZeneca, Regeneron, Sanofi, Genentech, Roche and Janssen. RGB reports receiving grants from the COPD Foundation, the US Environmental Protection Agency (EPA), the American Lung Association and the NIH.
Figures





Comment in
-
Reimagining emphysema for a computational age.Thorax. 2023 Nov;78(11):1063-1064. doi: 10.1136/thorax-2023-220458. Epub 2023 Aug 17. Thorax. 2023. PMID: 37591700 No abstract available.
References
-
- World Health Organization. The top 10 causes of death, 2019. Geneva, Switzerland: WHO; 2020 [Available from: https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death accessed 7-7-21.
-
- Global Stategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Pulmonary Disease - 2023 Report: Global Initiative for Chronic Obstructive Lung Disease, 2022.
-
- Baldwin ED, Cournand A, Richards DW Jr. Pulmonary insufficiency; a study of 122 cases of chronic pulmonary emphysema. Medicine 1949;28:201–37. - PubMed
-
- Burrows B, Fletcher CM, Heard BE, et al. The emphysematous and bronchial types of chronic airways obstruction. A clinicopathological study of patients in London and Chicago. Lancet 1966;287:830–35. - PubMed
Publication types
MeSH terms
Grants and funding
- HHSN268201500003C/HL/NHLBI NIH HHS/United States
- HHSN268200900016C/HL/NHLBI NIH HHS/United States
- HHSN268200900015C/HL/NHLBI NIH HHS/United States
- N01 HC095159/HL/NHLBI NIH HHS/United States
- P30 DK063491/DK/NIDDK NIH HHS/United States
- R01 HL142028/HL/NHLBI NIH HHS/United States
- R01 HL155816/HL/NHLBI NIH HHS/United States
- UL1 TR001420/TR/NCATS NIH HHS/United States
- HHSN268201500003I/HL/NHLBI NIH HHS/United States
- R01 HL130506/HL/NHLBI NIH HHS/United States
- R01 HL113264/HL/NHLBI NIH HHS/United States
- HHSN268200900019C/HL/NHLBI NIH HHS/United States
- U24 HL141762/HL/NHLBI NIH HHS/United States
- T32 HL144442/HL/NHLBI NIH HHS/United States
- UL1 TR001079/TR/NCATS NIH HHS/United States
- N02 HL064278/HL/NHLBI NIH HHS/United States
- P30 ES005605/ES/NIEHS NIH HHS/United States
- R01 HL077612/HL/NHLBI NIH HHS/United States
- K24 HL140108/HL/NHLBI NIH HHS/United States
- RC1 HL100543/HL/NHLBI NIH HHS/United States
- HHSN268200900014C/HL/NHLBI NIH HHS/United States
- HHSN268200900020C/HL/NHLBI NIH HHS/United States
- HHSN268200900017C/HL/NHLBI NIH HHS/United States
- R01 HL093081/HL/NHLBI NIH HHS/United States
- U01 HL089856/HL/NHLBI NIH HHS/United States
- HHSN268200900018C/HL/NHLBI NIH HHS/United States
- P30 ES007033/ES/NIEHS NIH HHS/United States
- P01 HL105339/HL/NHLBI NIH HHS/United States
- R01 HL121270/HL/NHLBI NIH HHS/United States
- K12 HL120004/HL/NHLBI NIH HHS/United States
- P30 DK054759/DK/NIDDK NIH HHS/United States
- P30 DK026687/DK/NIDDK NIH HHS/United States
- HHSN268200900013C/HL/NHLBI NIH HHS/United States
- UL1 TR000040/TR/NCATS NIH HHS/United States
- R01 HL103676/HL/NHLBI NIH HHS/United States
- UL1 TR001873/TR/NCATS NIH HHS/United States
- UL1 TR001881/TR/NCATS NIH HHS/United States
- R01 HL131565/HL/NHLBI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Medical