Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep;93(9):5544-5554.
doi: 10.1002/jmv.27093. Epub 2021 May 28.

Genome-wide screening of SARS-CoV-2 infection-related genes based on the blood leukocytes sequencing data set of patients with COVID-19

Affiliations

Genome-wide screening of SARS-CoV-2 infection-related genes based on the blood leukocytes sequencing data set of patients with COVID-19

Xin Gao et al. J Med Virol. 2021 Sep.

Abstract

Coronavirus disease 2019 (COVID-19) is a global epidemic disease caused by a novel virus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causing serious adverse effects on human health. In this study, we obtained a blood leukocytes sequencing data set of COVID-19 patients from the GEO database and obtained differentially expressed genes (DEGs). We further analyzed these DEGs by protein-protein interaction analysis and Gene Ontology enrichment analysis and identified the DEGs closely related to SARS-CoV-2 infection. Then, we constructed a six-gene model (comprising IFIT3, OASL, USP18, XAF1, IFI27, and EPSTI1) by logistic regression analysis and calculated the area under the ROC curve (AUC) for the diagnosis of COVID-19. The AUC values of the training group, testing group, and entire group were 0.930, 0.914, and 0.921, respectively. The six genes were highly expressed in patients with COVID-19 and positively correlated with the expression of SARS-CoV-2 invasion-related genes (ACE2, TMPRSS2, CTSB, and CTSL). The risk score calculated by this model was also positively correlated with the expression of TMPRSS2, CTSB, and CTSL, indicating that the six genes were closely related to SARS-CoV-2 infection. In conclusion, we comprehensively analyzed the functions of DEGs in the blood leukocytes of patients with COVID-19 and constructed a six-gene model that may contribute to the development of new diagnostic and therapeutic ideas for COVID-19. Moreover, these six genes may be therapeutic targets for COVID-19.

Keywords: COVID-19; SARS-CoV-2; bioinformatics; diagnosis; leukocyte.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflict of interests.

Figures

Figure 1
Figure 1
Differentially expressed genes (DEGs) screening. (A) Volcano map of DEGs. Green indicates downregulated DEGs, red indicates upregulated DEGs, and gray indicates genes without differential expression. (B) Heat map of DEGs
Figure 2
Figure 2
Module analysis and GO function analysis of differentially expressed genes. (A) PPI network of module 1 and module 2. (B) Top 10 results of GO functional annotation of module 1 and module 2 genes. (C) Relationship between module 2 genes and GO functional annotation. GO, Gene Ontology; PPI, protein–protein interaction
Figure 3
Figure 3
Lasso regression analysis and expression information visualization of module 2 genes. (A) The adjustment parameter (λ) selected in the lasso model is cross‐verified 10 times by the minimum standard. The Y‐axis represents the binomial deviation and the X‐axis represents log (λ). (B) Visualization of selected gene expression information. The number in the outermost circle and the size of the yellow circle indicate log2FC, and the blue and red data in the third circle indicate the average expression values of COVID‐19 and non‐COVID‐19 samples
Figure 4
Figure 4
Predictive ability evaluation of the six‐gene model in the training group, testing group, and entire group. (A–C) training group; (D–F) testing group; (G–I) entire group
Figure 5
Figure 5
Analysis of the prediction ability of each independent index for SARS‐CoV‐2 infection. (A) Analysis of the predictive ability of the six genes individually for SARS‐CoV‐2 infection. (B) Analysis of the predictive ability of six clinical indexes for SARS‐CoV‐2 infection. (C) Predictive ability analysis after fitting the six‐gene model with ferritin and fibrinogen
Figure 6
Figure 6
Expression analysis of six genes (A–D), Expression analysis of the six genes in the GSE157103 data set; (E, F) Expression analysis of the six genes in the GSE156063 data set; (G) Analysis of the coexpression of the six genes and SARS‐CoV‐2 infection‐related genes in the GSE157103 data set; (H) Analysis of the coexpression of the six genes and SARS‐CoV‐2 infection‐related genes in the GSE156063 data set; (I) Analysis of the expression of IFIT3, USP18, XAF1, IFI27, and EPSTI1 in the lung tissue of SARS‐CoV‐2‐infected mice in the GSE154104 data set
Figure 7
Figure 7
Analysis of the relationship between risk score and expression of genes related to SARS‐CoV‐2 infection in the GSE157103 data set. (A) Analysis of the correlation between risk score and ACE2 expression. (B) Analysis of the correlation between risk score and TMPRSS2 expression. (C) Analysis of the correlation between risk score and CTSB expression. (D) Analysis of the correlation between risk score and CTSL expression

Similar articles

Cited by

References

    1. Li Q, Guan X, Wu P, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus‐infected pneumonia. N Engl J Med. 2020. 2020;382(13):1199‐1207. 10.1056/NEJMoa2001316 - DOI - PMC - PubMed
    1. Ballaalla M, Merugu GP, Patel M, et al. COVID‐19, modern pandemic: a systematic review from front‐line health care providers' perspective. J Clin Med Res. 2020;12(4):215‐229. 10.14740/jocmr4142 - DOI - PMC - PubMed
    1. World Health Organization. WHO Coronavirus Disease (COVID‐19) Dashboard. 2021. [Data last updated: 2021/03/22]. https://covid19.who.int.
    1. Mann R, Perisetti A, Gajendran M, Gandhi Z, Umapathy C, Goyal H. Clinical characteristics, diagnosis, and treatment of major coronavirus outbreaks. Front Med. 2020;7:581521. 10.3389/fmed.2020.581521 - DOI - PMC - PubMed
    1. Ye ZW, Yuan S, Yuen KS, Fung SY, Chan CP, Jin DY. Zoonotic origins of human coronaviruses. Int J Biol Sci. 2020;16(10):1686‐1697. 10.7150/ijbs.45472 - DOI - PMC - PubMed

Publication types

MeSH terms