A Computable Phenotype for Acute Respiratory Distress Syndrome Using Natural Language Processing and Machine Learning

Majid Afshar^{1

2}, Cara Joyce², Anthony Oakey³, Perry Formanek⁴, Philip Yang⁴, Matthew M Churpek⁵, Richard S Cooper², Susan Zelisko⁶, Ron Price⁶, Dmitriy Dligach^{2

3}

Affiliations

¹ Division of Pulmonary and Critical Care Medicine, Loyola University Medical Center, Maywood, IL.
² Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL.
³ Department of Computer Science, Loyola University Chicago, Chicago, IL.
⁴ Department of Medicine, Loyola University Medical Center, Maywood, IL.
⁵ Division of Pulmonary and Critical Care Medicine, University of Chicago, Chicago, IL.
⁶ Informatics and Systems Development, Health Sciences Division, Loyola University Chicago, Maywood, IL.

PMID: 30815053
PMCID: PMC6371271

A Computable Phenotype for Acute Respiratory Distress Syndrome Using Natural Language Processing and Machine Learning

Majid Afshar et al. AMIA Annu Symp Proc. 2018.

. 2018 Dec 5:2018:157-165.

eCollection 2018.

Authors

Majid Afshar^{1

2}, Cara Joyce², Anthony Oakey³, Perry Formanek⁴, Philip Yang⁴, Matthew M Churpek⁵, Richard S Cooper², Susan Zelisko⁶, Ron Price⁶, Dmitriy Dligach^{2

3}

Affiliations

¹ Division of Pulmonary and Critical Care Medicine, Loyola University Medical Center, Maywood, IL.
² Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL.
³ Department of Computer Science, Loyola University Chicago, Chicago, IL.
⁴ Department of Medicine, Loyola University Medical Center, Maywood, IL.
⁵ Division of Pulmonary and Critical Care Medicine, University of Chicago, Chicago, IL.
⁶ Informatics and Systems Development, Health Sciences Division, Loyola University Chicago, Maywood, IL.

PMID: 30815053
PMCID: PMC6371271

Abstract

Acute Respiratory Distress Syndrome (ARDS) is a syndrome of respiratory failure that may be identified using text from radiology reports. The objective of this study was to determine whether natural language processing (NLP) with machine learning performs better than a traditional keyword model for ARDS identification. Linguistic pre-processing of reports was performed and text features were inputs to machine learning classifiers tuned using 10-fold cross-validation on 80% of the sample size and tested in the remaining 20%. A cohort of 533 patients was evaluated, with a data corpus of 9,255 radiology reports. The traditional model had an accuracy of 67.3% (95% CI: 58.3-76.3) with a positive predictive value (PPV) of 41.7% (95% CI: 27.7-55.6). The best NLP model had an accuracy of 83.0% (95% CI: 75.9-90.2) with a PPV of 71.4% (95% CI: 52.1-90.8). A computable phenotype for ARDS with NLP may identify more cases than the traditional model.

PubMed Disclaimer

Figures

**Figure 1.**
Traditional model for ARDS identification

**Figure 2.**
Discrimination with Area Under the Receiver Operative Characteristic Curve and Calibration plots of NLP model with all radiology reports and Concept Unique Identifier features

See this image and copyright information in PMC

References

1. Bellani G, Laffey JG, Pham T, Fan E, Brochard L, Esteban A, Gattinoni L, van Haren F, Larsson A, McAuley DF, Ranieri M, Rubenfeld G, Thompson BT, Wrigge H, Slutsky AS, Pesenti A. Investigators LS and Group ET. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in intensive care units in 50 Countries. JAMA. 2016;315:788–800. - PubMed
1. Ranieri VM, Rubenfeld GD, Thompson BT, Ferguson ND, Caldwell E, Fan E, Camporota L, Slutsky AS. Acute respiratory distress syndrome: the Berlin Definition. JAMA. 2012;307:2526–33. - PubMed
1. Herasevich V, Yilmaz M, Khan H, Hubmayr RD, Gajic O. Validation of an electronic surveillance system for acute lung injury. Intensive Care Medicine. 2009;35:1018–1023. - PMC - PubMed
1. Koenig HC, Finkel BB, Khalsa SS, Lanken PN, Prasad M, Urbani R, Fuchs BD. Performance of an automated electronic acute lung injury screening system in intensive care unit patients. Crit Care Med. 2011;39:98–104. - PubMed
1. Azzam HC, Khalsa SS, Urbani R, Shah CV, Christie JD, Lanken PN, Fuchs BD. Validation Study of an Automated Electronic Acute Lung Injury Screening Tool. J Am Med Inform Assoc. 2009;16:503–508. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

K23 AA024503/AA/NIAAA NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Computable Phenotype for Acute Respiratory Distress Syndrome Using Natural Language Processing and Machine Learning

Affiliations

A Computable Phenotype for Acute Respiratory Distress Syndrome Using Natural Language Processing and Machine Learning

Authors

Affiliations

Abstract

Figures

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources