Medicine (Baltimore). 2022 Jul 22;101(29):e29587.

doi: 10.1097/MD.0000000000029587.

Multi-population generalizability of a deep learning-based chest radiograph severity score for COVID-19

Matthew D Li¹, Nishanth T Arun¹, Mehak Aggarwal¹, Sharut Gupta¹, Praveer Singh¹, Brent P Little², Dexter P Mendoza², Gustavo C A Corradi³, Marcelo S Takahashi³, Suely F Ferraciolli³, Marc D Succi⁴, Min Lang², Bernardo C Bizzo¹,⁵, Ittai Dayan⁵, Felipe C Kitamura³,⁶, Jayashree Kalpathy-Cramer¹,⁵

Affiliations

¹ Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
² Division of Thoracic Imaging and Intervention, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
³ Diagnósticos da América SA (DASA), São Paulo, Brazil.
⁴ Division of Emergency Radiology, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
⁵ MGH and BWH Center for Clinical Data Science, Mass General Brigham, Boston, MA, USA.
⁶ Department of Diagnostic Imaging, Universidade Federal de São Paulo, São Paulo, Brazil.

PMID: 35866818
PMCID: PMC9302282
DOI: 10.1097/MD.0000000000029587

Abstract

To tune and test the generalizability of a deep learning-based model for assessment of COVID-19 lung disease severity on chest radiographs (CXRs) from different patient populations. A published convolutional Siamese neural network-based model previously trained on hospitalized patients with COVID-19 was tuned using 250 outpatient CXRs. This model produces a quantitative measure of COVID-19 lung disease severity (pulmonary x-ray severity (PXS) score). The model was evaluated on CXRs from 4 test sets, including 3 from the United States (patients hospitalized at an academic medical center (N = 154), patients hospitalized at a community hospital (N = 113), and outpatients (N = 108)) and 1 from Brazil (patients at an academic medical center emergency department (N = 303)). Radiologists from both countries independently assigned reference standard CXR severity scores, which were correlated with the PXS scores as a measure of model performance (Pearson R). The Uniform Manifold Approximation and Projection (UMAP) technique was used to visualize the neural network results. Tuning the deep learning model with outpatient data showed high model performance in 2 United States hospitalized patient datasets (R = 0.88 and R = 0.90, compared to baseline R = 0.86). Model performance was similar, though slightly lower, when tested on the United States outpatient and Brazil emergency department datasets (R = 0.86 and R = 0.85, respectively). UMAP showed that the model learned disease severity information that generalized across test sets. A deep learning model that extracts a COVID-19 severity score on CXRs showed generalizable performance across multiple populations from 2 continents, including outpatients and hospitalized patients.
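
The performance measure described above, Pearson R between radiologist reference scores and model PXS scores, can be illustrated with a minimal sketch. The function name `pearson_r` and the score values below are hypothetical and chosen only for illustration; they are not data from the study, and this is not the authors' code.

```python
import math


def pearson_r(x, y):
    """Return the Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Deviations of each value from its own mean
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    # Sum of cross-products and sums of squares
    cross = sum(a * b for a, b in zip(dx, dy))
    ss_x = sum(a * a for a in dx)
    ss_y = sum(b * b for b in dy)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when one sequence is constant")
    return cross / math.sqrt(ss_x * ss_y)


# Hypothetical example values (NOT study data): one pair of scores per radiograph
reference_scores = [0.0, 2.0, 4.5, 8.0, 12.0, 16.5]  # radiologist-assigned severity
model_scores = [0.3, 1.1, 2.9, 4.2, 6.8, 8.5]        # model-derived PXS scores

r = pearson_r(reference_scores, model_scores)
print(f"Pearson R = {r:.2f}")
```

Pearson R depends only on how linearly the two scores vary together, so the reference score and the PXS score do not need to share a scale for the comparison to be meaningful.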

Conflict of interest statement

Conflicts of interest and sources of funding: This study was supported by sundry funds to J.K. This research was carried out in whole or in part at the Athinoula A. Martinos Center for Biomedical Imaging at the Massachusetts General Hospital, using resources provided by the Center for Functional Neuroimaging Technologies, P41EB015896, a P41 Biotechnology Resource Grant supported by the National Institute of Biomedical Imaging and Bioengineering (NIBIB), National Institutes of Health. GPU computing resources were provided by the MGH and BWH Center for Clinical Data Science. M.D.L., B.C.B., I.D., and J.K. report collaborating with Bayer Radiology on addressing regulatory requirements for potential clinical application of this technology (no funding provided for the work in this manuscript). M.D.L. reports funding from an RSNA R&E Fund Research Resident/Fellow Grant, outside of the submitted work. B.P.L. is a textbook associate editor and author for Elsevier, Inc. and receives royalties. F.C.K. reports consulting for MD.ai. J.K. reports grants from GE Healthcare, nonfinancial support from AWS, and grants from Genentech Foundation, outside the submitted work. The remaining authors declared no conflicts of interest.

Figures

**Figure 1.**
Schematic of study design. Previously published Siamese neural network-based model for extracting lung disease severity from CXRs^[7] was tuned using new CXR data and evaluated in 4 test sets.

**Figure 2.**
Boxplots show variable distributions in patient age (A) and lung disease severity by mRALE score (B) in the different CXR test sets. Boxplots show the median and interquartile range (IQR), where the whiskers extend up to 1.5 × IQR.
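
The boxplot elements named in this caption can be computed as in the sketch below. It assumes the common Tukey convention, in which each whisker ends at the most extreme data point lying within 1.5 × IQR of the nearer quartile; the caption does not state which quartile method or plotting library was used, so the `inclusive` quantile method, the function name `boxplot_summary`, and the age values are all hypothetical.

```python
import statistics


def boxplot_summary(values):
    """Compute median, quartiles, whisker ends, and outliers (Tukey convention, assumed)."""
    data = sorted(values)
    # Quartiles via the "inclusive" method; other methods give slightly different cut points
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    # Whiskers stop at the most extreme observations that are still inside the fences
    inside = [v for v in data if low_fence <= v <= high_fence]
    outliers = [v for v in data if v < low_fence or v > high_fence]
    return {
        "median": median,
        "q1": q1,
        "q3": q3,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        "outliers": outliers,
    }


# Hypothetical patient ages (NOT study data)
ages = [23, 35, 41, 48, 52, 57, 63, 70, 98]
summary = boxplot_summary(ages)
print(summary)
```

Points beyond the whiskers are reported separately as outliers, which is how most plotting libraries draw them by default.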

**Figure 3.**
Scatterplots show the correlation between radiologist-determined mRALE score and the deep learning-based PXS score in the Hospital 1 Inpatient Test Set (R = 0.88) (A), Hospital 1 Outpatient Test Set (R = 0.86) (B), Hospital 2 Emergency Test Set (R = 0.85) (C), and Hospital 3 Inpatient Test Set (R = 0.90) (D). Linear regression 95% confidence intervals are shown in each scatterplot.

**Figure 4.**
Dimensionality reduction using UMAP shows the relationships between CXR data passed through the deep learning-based PXS score model from all 4 test sets (total N = 678), color coded for PXS score (A), mRALE score (B), and test set (C). For the legend in (C), H indicates Hospital. Across the different test sets, a representation of lung disease severity is learned by the PXS score model.

Update of

Improvement and Multi-Population Generalizability of a Deep Learning-Based Chest Radiograph Severity Score for COVID-19.
Li MD, Arun NT, Aggarwal M, Gupta S, Singh P, Little BP, Mendoza DP, Corradi GCA, Takahashi MS, Ferraciolli SF, Succi MD, Lang M, Bizzo BC, Dayan I, Kitamura FC, Kalpathy-Cramer J. medRxiv [Preprint]. 2020 Sep 18:2020.09.15.20195453. doi: 10.1101/2020.09.15.20195453. Update in: Medicine (Baltimore). 2022 Jul 22;101(29):e29587. doi: 10.1097/MD.0000000000029587. PMID: 32995811.

References

1. Wong HYF, Lam HYS, Fong AH-T, et al. Frequency and Distribution of Chest Radiographic Findings in Patients Positive for COVID-19. Radiology. 2020;296:E72–8.
2. Smith DL, Grenier J-P, Batte C, et al. A characteristic chest radiographic pattern in the setting of COVID-19 pandemic. Radiol Cardiothorac Imaging. 2020;2:e200280.
3. Cozzi D, Albanesi M, Cavigli E, et al. Chest X-ray in new coronavirus disease 2019 (COVID-19) infection: findings and correlation with clinical outcome. Radiol Medica. 2020;125:730–7.
4. Toussie D, Voutsinas N, Finkelstein M, et al. Clinical and chest radiography features determine patient outcomes in young and middle-aged adults with COVID-19. Radiology. 2020;297:E197–206.
5. Joseph NP, Reid NJ, Som A, et al. Racial and ethnic disparities in disease severity on admission chest radiographs among patients admitted with confirmed coronavirus disease 2019: a retrospective cohort study. Radiology. 2020;297:E303–12.

Grants and funding

P41 EB015896/EB/NIBIB NIH HHS/United States
