Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 3;7(6):eabe5984.
doi: 10.1126/sciadv.abe5984. Print 2021 Feb.

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Affiliations

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Dianna L Ng et al. Sci Adv. .

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes coronavirus disease-19 (COVID-19), has emerged as the cause of a global pandemic. We used RNA sequencing to analyze 286 nasopharyngeal (NP) swab and 53 whole-blood (WB) samples from 333 patients with COVID-19 and controls. Overall, a muted immune response was observed in COVID-19 relative to other infections (influenza, other seasonal coronaviruses, and bacterial sepsis), with paradoxical down-regulation of several key differentially expressed genes. Hospitalized patients and outpatients exhibited up-regulation of interferon-associated pathways, although heightened and more robust inflammatory responses were observed in hospitalized patients with more clinically severe illness. Two-layer machine learning-based host classifiers consisting of complete (>1000 genes), medium (<100), and small (<20) gene biomarker panels identified COVID-19 disease with 85.1-86.5% accuracy when benchmarked using an independent test set. SARS-CoV-2 infection has a distinct biosignature that differs between NP swabs and WB and can be leveraged for COVID-19 diagnosis.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1. Overview of sample collection and metatranscriptomic analysis.
(A) Flowchart of NP swab and WB sample collection. (B) Box and whisker plots of RT-PCR cycle threshold (Ct) values of SARS-CoV-2–positive individuals who are outpatients (n = 55) were compared with those who are hospitalized, non-ICU (n = 17), or in the ICU (n = 7). There was no difference in viral load, inversely related to the Ct value, regardless of disease severity [P = 0.89 by analysis of variance (ANOVA)]. (C to E) Box and whisker plots of abundance (C), Chao richness (D), and Shannon diversity (E) of the viral metatranscriptome in patients with SARS-CoV-2 (COVID-19) (n = 137), respiratory viruses (“Other virus”) (n = 41), and without respiratory viruses (“No virus”) (n = 108), stratified by the inclusion (“Including respiratory viral reads”) or exclusion (“Exclusion respiratory viral reads”) of respiratory viral reads. (F to H) Box and whisker plots of abundance (F), Chao richness (G), and Shannon diversity (H) of the bacterial metatranscriptome. (I) Distribution of viral families in each group, expressed as log10-normalized RPM. (J) Distribution of the top 10 bacterial families in each group. For box and whisker plots, the median is represented by a dotted line, boxes represent the first to third quartiles, whiskers represent the minimum and maximum values, and jitters represent the distribution of the population. For (C) to (H), statistical analysis was conducted by Kruskal-Wallis test, followed by the Nemenyi test for post hoc analysis.
Fig. 2
Fig. 2. Comparison of canonical pathways predicted to be involved in COVID-19.
(A) Comparison of NP swab pathways in COVID-19, influenza, other respiratory viruses (human metapneumovirus, human rhinovirus, and human parainfluenza 2), and seasonal coronaviruses. (B) Comparison of NP swab pathways in COVID-19 outpatients and hospitalized patients (non-ICU and ICU). (C) Comparison of NP swab and WB pathways in COVID-19–positive patients. (D) Comparison of WB pathways in COVID-19, influenza, and bacterial sepsis. Pathway prediction is determined by z-score; a positive value denotes up-regulation, and a negative value denotes down-regulation. All pathways and z-scores were calculated relative to NP swab and WB donor controls.
Fig. 3
Fig. 3. Heatmap of pathways predicted to be involved in COVID-19 and other respiratory viruses from NP swabs.
Comparison of pathways for (A) all patients with COVID-19 compared with donor controls, (B) outpatients compared with donor controls, and (C) hospitalized (non-ICU and ICU) patients compared with donor controls. (D) Influenza compared with donor samples. (E) Seasonal coronaviruses (HCoV HKU-1, HCoV NL-63, HCoV OC43, and HCoV 229E) compared with donor samples. (F) Other respiratory viruses (human metapneumovirus and human rhinovirus) compared with donor samples. Dev, development; Fxn, function; GI, gastrointestinal; morph, morphology.
Fig. 4
Fig. 4. Heatmaps of NP swab and WB pathways.
(A) Hospitalized patients compared with outpatients with COVID-19 using NP swabs. Comparison of WB pathways in hospitalized patients with COVID-19 compared with (B) donor controls, (C) bacterial sepsis, and (D) influenza.
Fig. 5
Fig. 5. Hierarchically clustered heatmaps of DEGs.
(A) NP swab DEGs from patients with infection by SARS-CoV-2 compared with influenza, seasonal coronaviruses, and other respiratory viruses, or virus-negative patients. Samples are stratified by diagnosis (“Infection”) and highest level of care (“Level of care” including outpatient, hospitalized, non-ICU, and ICU). (B) WB DEGs from patients with infection by SARS-CoV-2 compared with influenza, bacterial, and bacterial sepsis. Samples are stratified by diagnosis (“Infection”). (C) Comparison of DEGs among outpatients and hospitalized patients admitted (“ICU”) or not admitted (“non-ICU”) to the ICU. For (A), DEGs with Bonferroni-corrected P value <0.001 were included. For (B) and (C), DEGs with Bonferroni-corrected P value <0.01 were included. All noncoding genes were removed. For (A), DEGs are calculated relative to individuals with nonviral ARIs, and the top 100 DEGs are included; for (B), DEGs are calculated relative to donor controls, and the top 100 DEGs are included; for (C), DEGs are calculated on the basis of pairwise comparisons between outpatients and hospitalized patients, and all DEGs are included. Up-regulated genes are colored in red, and down-regulated genes are colored in blue. The distribution of predicted pathways (“Host response pathways”) is shown by pie graphs above each group in the heatmap; cellular functions are indicated in shades of green, immune functions are indicated in shades of red, and other pathways are indicated in black. DEG groups (A to L) were identified on the basis of hierarchical clustering with complete linkage, after exclusion of noncoding genes. The lettering is presented in alphabetical order, starting with A in the top left corner. Dotted outlines highlight key DEG groups, which are distinctly up-regulated in SARS-CoV-2 infection compared with other disease states.
Fig. 6
Fig. 6. Venn diagrams of DEGs.
(A) Comparison of NP swab DEGs in patients with SARS-CoV-2 and influenza. (B) Comparison of NP swab and WB DEGs in patients with SARS-CoV-2. (C) Comparison of NP swab DEGs in COVID-19 outpatients and hospitalized patients. (D) Comparison of NP swab and WB DEGs in patients with influenza. The DEGs are calculated relative to donor controls.
Fig. 7
Fig. 7. Diagnostic classifier for COVID-19.
(A) Overview of classifier design and distribution of samples for training and test sets in layers 1 and 2. (B) Pie graphs showing the distribution of predicted pathways (“Host response pathways”) represented by the genes within the full gene panel (left), medium gene panel (middle), or the small gene panel (right).
Fig. 8
Fig. 8. Performance characteristics of the two-layered (combined layers 1 and layer 2) classifier for full, medium, and small gene panels.
(A to C) Training set ROC curve (left) and test set violin plot (middle) and confusion matrix (right) for the two-layer classifier, using either the full gene panel (A), medium gene panel (B), or the small gene panel (C).

References

    1. Dong E., Du H., Gardner L., An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 533–534 (2020). - PMC - PubMed
    1. Abdollahi E., Champredon D., Langley J. M., Galvani A. P., Moghadas S. M., Temporal estimates of case-fatality rate for COVID-19 outbreaks in Canada and the United States. CMAJ 192, E666–E670 (2020). - PMC - PubMed
    1. Guan W.-J., Ni Z.-Y., Hu Y., Liang W.-H., Ou C.-Q., He J.-X., Liu L., Shan H., Lei C.-L., Hui D. S. C., Du B., Li L.-J., Zeng G., Yuen K.-Y., Chen R.-C., Tang C.-L., Wang T., Chen P.-Y., Xiang J., Li S.-Y., Wang J.-L., Liang Z.-J., Peng Y.-X., Wei L., Liu Y., Hu Y.-H., Peng P., Wang J.-M., Liu J.-Y., Chen Z., Li G., Zheng Z.-J., Qiu S.-Q., Luo J., Ye C.-J., Zhu S.-Y., Zhong N.-S.; China Medical Treatment Expert Group for Covid-19 , Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 382, 1708–1720 (2020). - PMC - PubMed
    1. Russell T. W., Hellewell J., Jarvis C. I., van Zandvoort K., Abbott S., Ratnayake R.; Cmmid Covid-Working Group, Flasche S., Eggo R. M., Edmunds W. J., Kucharski A. J., Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Euro Surveill. 25, 2000256 (2020). - PMC - PubMed
    1. Verity R., Okell L. C., Dorigatti I., Winskill P., Whittaker C., Imai N., Cuomo-Dannenburg G., Thompson H., Walker P. G. T., Fu H., Dighe A., Griffin J. T., Baguelin M., Bhatia S., Boonyasiri A., Cori A., Cucunuba Z., FitzJohn R., Gaythorpe K., Green W., Hamlet A., Hinsley W., Laydon D., Nedjati-Gilani G., Riley S., van Elsland S., Volz E., Wang H., Wang Y., Xi X., Donnelly C. A., Ghani A. C., Ferguson N. M., Estimates of the severity of coronavirus disease 2019: A model-based analysis. Lancet Infect. Dis. 20, 669–677 (2020). - PMC - PubMed

Publication types