Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2020 Nov 10:2020.11.06.20227165.
doi: 10.1101/2020.11.06.20227165.

ConceptWAS: a high-throughput method for early identification of COVID-19 presenting symptoms

Affiliations

ConceptWAS: a high-throughput method for early identification of COVID-19 presenting symptoms

Juan Zhao et al. medRxiv. .

Update in

Abstract

Objective: Identifying symptoms highly specific to COVID-19 would improve the clinical and public health response to infectious outbreaks. Here, we describe a high-throughput approach - Concept-Wide Association Study (ConceptWAS) that systematically scans a disease's clinical manifestations from clinical notes. We used this method to identify symptoms specific to COVID-19 early in the course of the pandemic.

Methods: Using the Vanderbilt University Medical Center (VUMC) EHR, we parsed clinical notes through a natural language processing pipeline to extract clinical concepts. We examined the difference in concepts derived from the notes of COVID-19-positive and COVID-19-negative patients on the PCR testing date. We performed ConceptWAS using the cumulative data every two weeks for early identifying specific COVID-19 symptoms.

Results: We processed 87,753 notes 19,692 patients (1,483 COVID-19-positive) subjected to COVID-19 PCR testing between March 8, 2020, and May 27, 2020. We found 68 clinical concepts significantly associated with COVID-19. We identified symptoms associated with increasing risk of COVID-19, including "absent sense of smell" (odds ratio [OR] = 4.97, 95% confidence interval [CI] = 3.21-7.50), "fever" (OR = 1.43, 95% CI = 1.28-1.59), "with cough fever" (OR = 2.29, 95% CI = 1.75-2.96), and "ageusia" (OR = 5.18, 95% CI = 3.02-8.58). Using ConceptWAS, we were able to detect loss sense of smell or taste three weeks prior to their inclusion as symptoms of the disease by the Centers for Disease Control and Prevention (CDC).

Conclusion: ConceptWAS is a high-throughput approach for exploring specific symptoms of a disease like COVID-19, with a promise for enabling EHR-powered early disease manifestations identification.

Keywords: COVID-19; EHR; Natural language processing.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors have no competing interests to declare.

Figures

Figure 1.
Figure 1.
Volcano plot of a ConceptWAS scan for 19, 692 patients that included COVID-19-positive group (cases) and negative group (controls). The points are colored by the semantic type of the concepts. Selected associations related to signs, symptoms, or diseases/syndromes are labeled. The volcano plot indicates -log 10 (p-value) for association (y-axis) plotted against their respective log 2 (fold change) (x-axis). The dashed line represents significance level using a Bonferroni correction.
Figure 2.
Figure 2.
Forest plot comparing individual concepts between COVID-19-positive (case) and COVID-19-negative (control) patients. Selected associations include the significant signals related to semantic types of symptoms that met Bonferroni-corrected significance (p-value < 2.55E-06). The odds ratio has been adjusted for age, gender, and race. The concepts are ordered by p-value.
Figure 3.
Figure 3.
Temporal ConceptWAS using bi-weekly cumulative data. For significant signals (related to signs, symptoms) using all data (labeled in Figure 2), the plot indicates their −log 10 (p-value) for association (y-axis) against using the cumulative data started between March 8, 2020 to n weeks (x-axis). The dashed line indicates a significant association using a Bonferroni correction.

References

    1. WHO Coronavirus Disease (COVID-19) Dashboard, (n.d.). https://covid19.who.int/ (accessed May 26, 2020).
    1. Guan W., Ni Z., Hu Y., Liang W., Ou C., He J., Liu L., Shan H., Lei C., Hui D.S.C., Du B., Li L., Zeng G., Yuen K.-Y., Chen R., Tang C., Wang T., Chen P., Xiang J., Li S., Wang J., Liang Z., Peng Y., Wei L., Liu Y., Hu Y., Peng P., Wang J., Liu J., Chen Z., Li G., Zheng Z., Qiu S., Luo J., Ye C., Zhu S., Zhong N., Clinical Characteristics of Coronavirus Disease 2019 in China, New England Journal of Medicine. (2020). 10.1056/NEJMoa2002032. - DOI - PMC - PubMed
    1. Makaronidis J., Mok J., Balogun N., Magee C.G., Omar R.Z., Carnemolla A., Batterham R.L., Seroprevalence of SARS-CoV-2 antibodies in people with an acute loss in their sense of smell and/or taste in a community-based population in London, UK: An observational cohort study, PLOS Medicine. 17 (2020) e1003358. 10.1371/journal.pmed.1003358. - DOI - PMC - PubMed
    1. Fritz A., Brice-Saddler M., Judkis M., CDC confirms six coronavirus symptoms showing up in patients over and over, Washington Post. (n.d.). https://www.washingtonpost.com/health/2020/04/27/six-new-coronavirus-sym... (accessed September 25, 2020).
    1. Statement from the UK Chief Medical Officers on an update to coronavirus symptoms: 18 May 2020, GOV.UK. (n.d.). https://www.gov.uk/government/news/statement-from-the-uk-chief-medical-o... (accessed June 5, 2020).

Publication types