Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
- PMID: 34876430
- PMCID: PMC8655590
- DOI: 10.1136/bmjopen-2021-053024
Abstract
Objectives: To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fractures or rib resections; subcutaneous emphysema; and erect versus non-erect positioning. The hypothesis was that performance in each of these subgroups would not differ significantly from performance on the overall test dataset.
Design: A retrospective case-control study was undertaken.
Setting: Community radiology clinics and hospitals in Australia and the USA.
Participants: A test dataset of 2557 chest radiography studies was ground-truthed by three subspecialty thoracic radiologists for the presence of simple or tension pneumothorax as well as each subgroup other than positioning. Radiograph positioning was derived from radiographer annotations on the images.
Outcome measures: DCNN performance for detecting simple and tension pneumothorax was evaluated over the entire test set, as well as within each subgroup, using the area under the receiver operating characteristic curve (AUC). A difference in AUC of more than 0.05 was considered clinically significant.
Results: When compared with the overall test set, performance of the DCNN for detecting simple and tension pneumothorax was statistically non-inferior in all subgroups. The DCNN had an AUC of 0.981 (0.976-0.986) for detecting simple pneumothorax and 0.997 (0.995-0.999) for detecting tension pneumothorax.
Conclusions: Hidden stratification has significant implications for potential failures of deep learning when applied in clinical practice. This study demonstrated that a comprehensively trained DCNN can be resilient to hidden stratification in several clinically meaningful subgroups in detecting pneumothorax.
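The subgroup comparison described under Outcome measures can be illustrated with a minimal sketch. It is not the authors' analysis code: the function names, the percentile bootstrap and the toy inputs are assumptions made for illustration, and only the 0.05 AUC margin comes from the abstract. The sketch computes AUC as the probability that a positive case is scored above a negative case, takes the gap between a subgroup's AUC and the overall AUC, and treats the subgroup as non-inferior when the bootstrap lower bound of that gap stays above -0.05.

```python
import random


def auc(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def subgroup_gap(labels, scores, in_subgroup):
    """Subgroup AUC minus overall AUC; negative means the subgroup does worse."""
    overall = auc(labels, scores)
    sub = [(y, s) for y, s, g in zip(labels, scores, in_subgroup) if g]
    return auc([y for y, _ in sub], [s for _, s in sub]) - overall


def is_non_inferior(labels, scores, in_subgroup, margin=0.05, n_boot=200, seed=0):
    """Hypothetical check: bootstrap lower bound of the AUC gap must exceed -margin."""
    rng = random.Random(seed)
    rows = list(zip(labels, scores, in_subgroup))
    gaps = []
    for _ in range(n_boot):
        sample = [rng.choice(rows) for _ in rows]
        try:
            gaps.append(subgroup_gap(*zip(*sample)))
        except ValueError:
            continue  # skip resamples that lack a positive or a negative case
    if not gaps:
        raise ValueError("no usable bootstrap resamples")
    gaps.sort()
    lower = gaps[int(0.025 * len(gaps))]  # approximate 2.5th percentile
    return lower > -margin
```

The pairwise AUC is quadratic in the number of cases, which is acceptable for a sketch on a test set of a few thousand studies; a rank-based implementation would scale better.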
Keywords: accident & emergency medicine; chest imaging; health informatics.
© Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: All authors have reviewed and approved this manuscript. Authors JS, CT, QDB, MRM, XH, HA, JL, PB and CMJ are employees of, or are seconded to, Annalise.ai. NE and LO-R have no interests to declare.
Similar articles
- Can AI outperform a junior resident? Comparison of deep neural network to first-year radiology residents for identification of pneumothorax. Emerg Radiol. 2020 Aug;27(4):367-375. doi: 10.1007/s10140-020-01767-4. Epub 2020 Jul 8. PMID: 32643070
- Evaluation of an Artificial Intelligence Model for Detection of Pneumothorax and Tension Pneumothorax in Chest Radiographs. JAMA Netw Open. 2022 Dec 1;5(12):e2247172. doi: 10.1001/jamanetworkopen.2022.47172. PMID: 36520432 Free PMC article.
- Automated detection of moderate and large pneumothorax on frontal chest X-rays using deep convolutional neural networks: A retrospective study. PLoS Med. 2018 Nov 20;15(11):e1002697. doi: 10.1371/journal.pmed.1002697. eCollection 2018 Nov. PMID: 30457991 Free PMC article.
- Deep Learning for Pneumothorax Detection on Chest Radiograph: A Diagnostic Test Accuracy Systematic Review and Meta Analysis. Can Assoc Radiol J. 2024 Aug;75(3):525-533. doi: 10.1177/08465371231220885. Epub 2024 Jan 8. PMID: 38189265
- Should we perform an inspiratory or an expiratory chest radiograph for the initial diagnosis of pneumothorax? Radiologia (Engl Ed). 2018 Sep-Oct;60(5):437-440. doi: 10.1016/j.rx.2017.10.004. Epub 2017 Dec 6. PMID: 29208316 Review. English, Spanish.
Cited by
- Deep learning for pneumothorax diagnosis: a systematic review and meta-analysis. Eur Respir Rev. 2023 Jun 7;32(168):220259. doi: 10.1183/16000617.0259-2022. Print 2023 Jun 30. PMID: 37286217 Free PMC article.
- Radiomics-based decision support tool assists radiologists in small lung nodule classification and improves lung cancer early diagnosis. Br J Cancer. 2023 Dec;129(12):1949-1955. doi: 10.1038/s41416-023-02480-y. Epub 2023 Nov 6. PMID: 37932513 Free PMC article.
- Deep learning for tubes and lines detection in critical illness: Generalizability and comparison with residents. Eur J Radiol Open. 2024 Jul 29;13:100593. doi: 10.1016/j.ejro.2024.100593. eCollection 2024 Dec. PMID: 39175597 Free PMC article.
- Better performance of deep learning pulmonary nodule detection using chest radiography with pixel level labels in reference to computed tomography: data quality matters. Sci Rep. 2024 Jul 10;14(1):15967. doi: 10.1038/s41598-024-66530-y. PMID: 38987309 Free PMC article.
- Analysis of Line and Tube Detection Performance of a Chest X-ray Deep Learning Model to Evaluate Hidden Stratification. Diagnostics (Basel). 2023 Jul 9;13(14):2317. doi: 10.3390/diagnostics13142317. PMID: 37510062 Free PMC article.