. 2021 Feb 3;7(6):eabe5984.

doi: 10.1126/sciadv.abe5984. Print 2021 Feb.

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Dianna L Ng^{1

2}, Andrea C Granados^{2

3}, Yale A Santos^{2

3}, Venice Servellita^{2

3}, Gregory M Goldgof², Cem Meydan^{4

5

6}, Alicia Sotomayor-Gonzalez^{2

3}, Andrew G Levine², Joanna Balcerek², Lucy M Han¹, Naomi Akagi¹, Kent Truong¹, Neil M Neumann¹, David N Nguyen⁷, Sagar P Bapat^{2

7

8}, Jing Cheng^{9

10}, Claudia Sanchez-San Martin^{2

3}, Scot Federman^{2

3}, Jonathan Foox^{4

5

6}, Allan Gopez^{2

3}, Tony Li¹¹, Ray Chan², Cynthia S Chu², Chiara A Wabl^{2

3}, Amelia S Gliwa^{2

3}, Kevin Reyes^{2

3}, Chao-Yang Pan¹², Hugo Guevara¹², Debra Wadford¹², Steve Miller^{2

3}, Christopher E Mason^{4

5

13

14}, Charles Y Chiu^{15

3

8}

Affiliations

¹ Department of Pathology, University of California, San Francisco, San Francisco, CA, USA.
² Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, USA.
³ UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, CA, USA.
⁴ Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.
⁵ The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA.
⁶ WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
⁷ Diabetes Center, University of California, San Francisco, San Francisco, CA, USA.
⁸ Department of Medicine, Division of Infectious Diseases, University of California, San Francisco, San Francisco, CA, USA.
⁹ Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, CA, USA.
¹⁰ Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA, USA.
¹¹ Department of Medicine, Division of Hematology and Oncology, University of California, San Francisco, San Francisco, CA, USA.
¹² Viral and Rickettsial Disease Laboratory, California Department of Health, Richmond, CA, USA.
¹³ New York Genome Center, New York, NY, USA.
¹⁴ Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA.
¹⁵ Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, USA. charles.chiu@ucsf.edu.

PMID: 33536218
PMCID: PMC7857687
DOI: 10.1126/sciadv.abe5984

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Dianna L Ng et al. Sci Adv. 2021.

. 2021 Feb 3;7(6):eabe5984.

doi: 10.1126/sciadv.abe5984. Print 2021 Feb.

Authors

Affiliations

¹ Department of Pathology, University of California, San Francisco, San Francisco, CA, USA.
² Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, USA.
³ UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, CA, USA.
⁴ Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.
⁵ The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA.
⁶ WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
⁷ Diabetes Center, University of California, San Francisco, San Francisco, CA, USA.
⁸ Department of Medicine, Division of Infectious Diseases, University of California, San Francisco, San Francisco, CA, USA.
⁹ Department of Preventive and Restorative Dental Sciences, University of California, San Francisco, San Francisco, CA, USA.
¹⁰ Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA, USA.
¹¹ Department of Medicine, Division of Hematology and Oncology, University of California, San Francisco, San Francisco, CA, USA.
¹² Viral and Rickettsial Disease Laboratory, California Department of Health, Richmond, CA, USA.
¹³ New York Genome Center, New York, NY, USA.
¹⁴ Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA.
¹⁵ Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, USA. charles.chiu@ucsf.edu.

PMID: 33536218
PMCID: PMC7857687
DOI: 10.1126/sciadv.abe5984

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes coronavirus disease-19 (COVID-19), has emerged as the cause of a global pandemic. We used RNA sequencing to analyze 286 nasopharyngeal (NP) swab and 53 whole-blood (WB) samples from 333 patients with COVID-19 and controls. Overall, a muted immune response was observed in COVID-19 relative to other infections (influenza, other seasonal coronaviruses, and bacterial sepsis), with paradoxical down-regulation of several key differentially expressed genes. Hospitalized patients and outpatients exhibited up-regulation of interferon-associated pathways, although heightened and more robust inflammatory responses were observed in hospitalized patients with more clinically severe illness. Two-layer machine learning-based host classifiers consisting of complete (>1000 genes), medium (<100), and small (<20) gene biomarker panels identified COVID-19 disease with 85.1-86.5% accuracy when benchmarked using an independent test set. SARS-CoV-2 infection has a distinct biosignature that differs between NP swabs and WB and can be leveraged for COVID-19 diagnosis.

Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

PubMed Disclaimer

Figures

**Fig. 1. Overview of sample collection and metatranscriptomic analysis.**
(A) Flowchart of NP swab and WB sample collection. (B) Box and whisker plots of RT-PCR cycle threshold (Ct) values of SARS-CoV-2–positive individuals who are outpatients (n = 55) were compared with those who are hospitalized, non-ICU (n = 17), or in the ICU (n = 7). There was no difference in viral load, inversely related to the Ct value, regardless of disease severity [P = 0.89 by analysis of variance (ANOVA)]. (C to E) Box and whisker plots of abundance (C), Chao richness (D), and Shannon diversity (E) of the viral metatranscriptome in patients with SARS-CoV-2 (COVID-19) (n = 137), respiratory viruses (“Other virus”) (n = 41), and without respiratory viruses (“No virus”) (n = 108), stratified by the inclusion (“Including respiratory viral reads”) or exclusion (“Exclusion respiratory viral reads”) of respiratory viral reads. (F to H) Box and whisker plots of abundance (F), Chao richness (G), and Shannon diversity (H) of the bacterial metatranscriptome. (I) Distribution of viral families in each group, expressed as log₁₀-normalized RPM. (J) Distribution of the top 10 bacterial families in each group. For box and whisker plots, the median is represented by a dotted line, boxes represent the first to third quartiles, whiskers represent the minimum and maximum values, and jitters represent the distribution of the population. For (C) to (H), statistical analysis was conducted by Kruskal-Wallis test, followed by the Nemenyi test for post hoc analysis.

**Fig. 2. Comparison of canonical pathways predicted to be involved in COVID-19.**
(A) Comparison of NP swab pathways in COVID-19, influenza, other respiratory viruses (human metapneumovirus, human rhinovirus, and human parainfluenza 2), and seasonal coronaviruses. (B) Comparison of NP swab pathways in COVID-19 outpatients and hospitalized patients (non-ICU and ICU). (C) Comparison of NP swab and WB pathways in COVID-19–positive patients. (D) Comparison of WB pathways in COVID-19, influenza, and bacterial sepsis. Pathway prediction is determined by z-score; a positive value denotes up-regulation, and a negative value denotes down-regulation. All pathways and z-scores were calculated relative to NP swab and WB donor controls.

**Fig. 3. Heatmap of pathways predicted to be involved in COVID-19 and other respiratory viruses from NP swabs.**
Comparison of pathways for (A) all patients with COVID-19 compared with donor controls, (B) outpatients compared with donor controls, and (C) hospitalized (non-ICU and ICU) patients compared with donor controls. (D) Influenza compared with donor samples. (E) Seasonal coronaviruses (HCoV HKU-1, HCoV NL-63, HCoV OC43, and HCoV 229E) compared with donor samples. (F) Other respiratory viruses (human metapneumovirus and human rhinovirus) compared with donor samples. Dev, development; Fxn, function; GI, gastrointestinal; morph, morphology.

**Fig. 4. Heatmaps of NP swab and WB pathways.**
(A) Hospitalized patients compared with outpatients with COVID-19 using NP swabs. Comparison of WB pathways in hospitalized patients with COVID-19 compared with (B) donor controls, (C) bacterial sepsis, and (D) influenza.

**Fig. 5. Hierarchically clustered heatmaps of DEGs.**
(A) NP swab DEGs from patients with infection by SARS-CoV-2 compared with influenza, seasonal coronaviruses, and other respiratory viruses, or virus-negative patients. Samples are stratified by diagnosis (“Infection”) and highest level of care (“Level of care” including outpatient, hospitalized, non-ICU, and ICU). (B) WB DEGs from patients with infection by SARS-CoV-2 compared with influenza, bacterial, and bacterial sepsis. Samples are stratified by diagnosis (“Infection”). (C) Comparison of DEGs among outpatients and hospitalized patients admitted (“ICU”) or not admitted (“non-ICU”) to the ICU. For (A), DEGs with Bonferroni-corrected P value <0.001 were included. For (B) and (C), DEGs with Bonferroni-corrected P value <0.01 were included. All noncoding genes were removed. For (A), DEGs are calculated relative to individuals with nonviral ARIs, and the top 100 DEGs are included; for (B), DEGs are calculated relative to donor controls, and the top 100 DEGs are included; for (C), DEGs are calculated on the basis of pairwise comparisons between outpatients and hospitalized patients, and all DEGs are included. Up-regulated genes are colored in red, and down-regulated genes are colored in blue. The distribution of predicted pathways (“Host response pathways”) is shown by pie graphs above each group in the heatmap; cellular functions are indicated in shades of green, immune functions are indicated in shades of red, and other pathways are indicated in black. DEG groups (A to L) were identified on the basis of hierarchical clustering with complete linkage, after exclusion of noncoding genes. The lettering is presented in alphabetical order, starting with A in the top left corner. Dotted outlines highlight key DEG groups, which are distinctly up-regulated in SARS-CoV-2 infection compared with other disease states.

**Fig. 6. Venn diagrams of DEGs.**
(A) Comparison of NP swab DEGs in patients with SARS-CoV-2 and influenza. (B) Comparison of NP swab and WB DEGs in patients with SARS-CoV-2. (C) Comparison of NP swab DEGs in COVID-19 outpatients and hospitalized patients. (D) Comparison of NP swab and WB DEGs in patients with influenza. The DEGs are calculated relative to donor controls.

**Fig. 7. Diagnostic classifier for COVID-19.**
(A) Overview of classifier design and distribution of samples for training and test sets in layers 1 and 2. (B) Pie graphs showing the distribution of predicted pathways (“Host response pathways”) represented by the genes within the full gene panel (left), medium gene panel (middle), or the small gene panel (right).

**Fig. 8. Performance characteristics of the two-layered (combined layers 1 and layer 2) classifier for full, medium, and small gene panels.**
(A to C) Training set ROC curve (left) and test set violin plot (middle) and confusion matrix (right) for the two-layer classifier, using either the full gene panel (A), medium gene panel (B), or the small gene panel (C).

See this image and copyright information in PMC

References

1. Dong E., Du H., Gardner L., An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 533–534 (2020). - PMC - PubMed
1. Abdollahi E., Champredon D., Langley J. M., Galvani A. P., Moghadas S. M., Temporal estimates of case-fatality rate for COVID-19 outbreaks in Canada and the United States. CMAJ 192, E666–E670 (2020). - PMC - PubMed
1. Guan W.-J., Ni Z.-Y., Hu Y., Liang W.-H., Ou C.-Q., He J.-X., Liu L., Shan H., Lei C.-L., Hui D. S. C., Du B., Li L.-J., Zeng G., Yuen K.-Y., Chen R.-C., Tang C.-L., Wang T., Chen P.-Y., Xiang J., Li S.-Y., Wang J.-L., Liang Z.-J., Peng Y.-X., Wei L., Liu Y., Hu Y.-H., Peng P., Wang J.-M., Liu J.-Y., Chen Z., Li G., Zheng Z.-J., Qiu S.-Q., Luo J., Ye C.-J., Zhu S.-Y., Zhong N.-S.; China Medical Treatment Expert Group for Covid-19 , Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 382, 1708–1720 (2020). - PMC - PubMed
1. Russell T. W., Hellewell J., Jarvis C. I., van Zandvoort K., Abbott S., Ratnayake R.; Cmmid Covid-Working Group, Flasche S., Eggo R. M., Edmunds W. J., Kucharski A. J., Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Euro Surveill. 25, 2000256 (2020). - PMC - PubMed
1. Verity R., Okell L. C., Dorigatti I., Winskill P., Whittaker C., Imai N., Cuomo-Dannenburg G., Thompson H., Walker P. G. T., Fu H., Dighe A., Griffin J. T., Baguelin M., Bhatia S., Boonyasiri A., Cori A., Cucunuba Z., FitzJohn R., Gaythorpe K., Green W., Hamlet A., Hinsley W., Laydon D., Nedjati-Gilani G., Riley S., van Elsland S., Volz E., Wang H., Wang Y., Xi X., Donnelly C. A., Ghani A. C., Ferguson N. M., Estimates of the severity of coronavirus disease 2019: A model-based analysis. Lancet Infect. Dis. 20, 669–677 (2020). - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations
Medical
- MedlinePlus Health Information
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Affiliations

A diagnostic host response biosignature for COVID-19 from RNA profiling of nasal swabs and blood

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Molecular Biology Databases

Miscellaneous