A Deep Language Model for Symptom Extraction From Clinical Text and its Application to Extract COVID-19 Symptoms From Social Media

Xiao Luo, Priyanka Gandhi, Susan Storey, Kun Huang

PMID: 34705659
PMCID: PMC9074854
DOI: 10.1109/JBHI.2021.3123192

Xiao Luo et al. IEEE J Biomed Health Inform. 2022 Apr;26(4):1737-1748.

doi: 10.1109/JBHI.2021.3123192. Epub 2022 Apr 14.

Abstract

Patients experience various symptoms when they have acute or chronic diseases or undergo treatment for them. Symptoms often indicate the severity of a disease and the need for hospitalization. However, they are usually described in free-text clinical notes in Electronic Health Records (EHR) and are not integrated with other clinical factors for disease prediction and healthcare outcome management. In this research, we propose a novel deep language model to extract patient-reported symptoms from clinical text. The model integrates syntactic and semantic analysis and distinguishes the symptoms patients actually report from conditional or negated symptom mentions. It can extract both complex and straightforward symptom expressions. We evaluated the model on a real-world clinical notes dataset and demonstrated that it achieves superior performance compared to three other state-of-the-art symptom extraction models. We also analyzed the model extensively, examining each component's contribution to its effectiveness. Finally, we applied the model to a COVID-19 tweets dataset to extract COVID-19 symptoms. The results show that the model can identify all the symptoms listed by the Centers for Disease Control and Prevention (CDC) ahead of their timeline, as well as many rare symptoms.
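To make the distinction between reported, negated, and conditional symptom mentions concrete, the following is a minimal rule-based sketch. It is not the deep language model proposed in the paper: the lexicon, the cue lists, the window size, and the function name `extract_symptoms` are all hypothetical choices made for illustration, and a simple cue-word lookup stands in for the syntactic and semantic analysis the abstract describes.

```python
import re

# Hypothetical toy lexicon and cue lists; none of these come from the paper.
SYMPTOMS = ["shortness of breath", "fever", "cough", "nausea"]
NEGATION_CUES = {"no", "denies", "without", "not"}
CONDITIONAL_CUES = {"if", "should", "unless"}
WINDOW = 4  # number of tokens before a mention that are checked for a cue


def extract_symptoms(sentence):
    """Return (symptom, status) pairs for one sentence.

    status is 'reported', 'negated', or 'conditional'.
    """
    text = sentence.lower()
    results = []
    for symptom in SYMPTOMS:
        pattern = r"\b" + re.escape(symptom) + r"\b"
        for match in re.finditer(pattern, text):
            # Tokens immediately preceding the symptom mention.
            preceding = re.findall(r"[a-z']+", text[:match.start()])[-WINDOW:]
            if any(tok in NEGATION_CUES for tok in preceding):
                status = "negated"
            elif any(tok in CONDITIONAL_CUES for tok in preceding):
                status = "conditional"
            else:
                status = "reported"
            results.append((symptom, status))
    return results


if __name__ == "__main__":
    print(extract_symptoms("Patient reports fever and denies cough."))
    print(extract_symptoms("Return if shortness of breath develops."))
```

A fixed cue window like this is easy to fool: in "no fever but cough", the cue "no" would also mark "cough" as negated. Handling such scope problems is one reason a model that uses sentence structure, rather than token distance alone, is attractive for this task.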

Figures

**Fig. 3:**
Symptom Distribution by N-grams

**Fig. 5:**
Sample Tweets with Symptom Mentions

**Fig. 6:**
Trends of the Symptoms listed by CDC

**Fig. 7:**
Trends of the Other Frequent Symptoms

Grants and funding

R15 GM139094/GM/NIGMS NIH HHS/United States
