Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Sep 2;22(5):bbab120.
doi: 10.1093/bib/bbab120.

Bioinformatics and machine learning approach identifies potential drug targets and pathways in COVID-19

Affiliations

Bioinformatics and machine learning approach identifies potential drug targets and pathways in COVID-19

Md Rabiul Auwul et al. Brief Bioinform. .

Abstract

Current coronavirus disease-2019 (COVID-19) pandemic has caused massive loss of lives. Clinical trials of vaccines and drugs are currently being conducted around the world; however, till now no effective drug is available for COVID-19. Identification of key genes and perturbed pathways in COVID-19 may uncover potential drug targets and biomarkers. We aimed to identify key gene modules and hub targets involved in COVID-19. We have analyzed SARS-CoV-2 infected peripheral blood mononuclear cell (PBMC) transcriptomic data through gene coexpression analysis. We identified 1520 and 1733 differentially expressed genes (DEGs) from the GSE152418 and CRA002390 PBMC datasets, respectively (FDR < 0.05). We found four key gene modules and hub gene signature based on module membership (MMhub) statistics and protein-protein interaction (PPI) networks (PPIhub). Functional annotation by enrichment analysis of the genes of these modules demonstrated immune and inflammatory response biological processes enriched by the DEGs. The pathway analysis revealed the hub genes were enriched with the IL-17 signaling pathway, cytokine-cytokine receptor interaction pathways. Then, we demonstrated the classification performance of hub genes (PLK1, AURKB, AURKA, CDK1, CDC20, KIF11, CCNB1, KIF2C, DTL and CDC6) with accuracy >0.90 suggesting the biomarker potential of the hub genes. The regulatory network analysis showed transcription factors and microRNAs that target these hub genes. Finally, drug-gene interactions analysis suggests amsacrine, BRD-K68548958, naproxol, palbociclib and teniposide as the top-scored repurposed drugs. The identified biomarkers and pathways might be therapeutic targets to the COVID-19.

Keywords: COVID-19; differentially expressed genes; gene coexpression network; machine learning; protein–protein interaction; systems biology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart of this study.
Figure 2
Figure 2
Construction of WGCNA coexpression modules and hub modules selection. (A) The cluster dendrogram of COVID-19 infected samples. (B) Analysis of the scale-free fit index (left) and the mean connectivity (right) for various soft-thresholding powers. (C) Heatmap plot of all genes. (D) Dendrogram of all differentially expressed genes clustered based on a dissimilarity measure (1-TOM). (E) Module eigengene dendrogram and eigengene network heatmap summarize the modules yielded in the clustering analysis. (F) The median rank of the modules; the rank value close to zero indicates a high degree of module preservation. (G) The Z summary statistics plot over each module; the blue and green dashed lines indicate the thresholds Z = 2 and Z = 10, indicate moderate and strong preservation thresholds, respectively.
Figure 3
Figure 3
GO and KEGG enrichment analysis for four key modules. (A) biological process, (B) molecular function, (C) cellular components and (D) KEGG enrichment analysis.
Figure 4
Figure 4
Receiver operating curve (ROC) plot of the five classifier performance based on (A) accuracies, (B) AUC.
Figure 5
Figure 5
Hub gene expression profiles. (A) Venn diagram of common hub genes identified among the hub genes of GSE152418 identified via MM scores and PPI and the DEGs of CRA002390 data. (B) Heatmap of hub genes of GSE152418 dataset. (C) Bar chart of the log expression values of 52 common hub-genes in the GSE152418 dataset.
Figure 6
Figure 6
Network construction. (A) PPI network of the 52 common hub genes of COVID-19 data, (B) TFs-Gene interaction network of the 52 common hub genes, (C) gene-miRNAs interaction for the common hub genes of COVID-19.

Similar articles

Cited by

References

    1. Cucinotta D, Vanelli M. WHO declares COVID-19 a pandemic. Acta Biomed 2020;90:157–60. - PMC - PubMed
    1. Chen N, Zhou M, Dong X, et al. Epidemiological and clinical characteristics of 99 cases of 2019 coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet 2020;395:507–13. - PMC - PubMed
    1. Ahamad MM, Aktar S, Rashed-Al-Mahfuz M, et al. A machine learning model to identify early stage symptoms of SARS-Cov-2 infected patients. Expert systems with applications 2020;160:113661. - PMC - PubMed
    1. Blanco-Melo D, Nilsson-Payant B, Liu W-C, et al. SARS-CoV-2 launches a unique transcriptional signature from in vitro, ex vivo, and in vivo systems. bioRxiv 2020. March 24, 2020. doi: 10.1101/2020.03.24.004655 Arxiv biorxiv;2020.03.24.004655v1, preprint: not peer reviewed. - DOI
    1. Islam T, Rahman MR, Aydin B, et al. Integrative transcriptomics analysis of lung epithelial cells and identification of repurposable drug candidates for COVID-19. Eur J Pharmacol 2020;887:173594. - PMC - PubMed