Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2020 Dec 18;10(1):22303.
doi: 10.1038/s41598-020-79033-3.

Dynamic data-driven meta-analysis for prioritisation of host genes implicated in COVID-19

Affiliations
Meta-Analysis

Dynamic data-driven meta-analysis for prioritisation of host genes implicated in COVID-19

Nicholas Parkinson et al. Sci Rep. .

Abstract

The increasing body of literature describing the role of host factors in COVID-19 pathogenesis demonstrates the need to combine diverse, multi-omic data to evaluate and substantiate the most robust evidence and inform development of therapies. Here we present a dynamic ranking of host genes implicated in human betacoronavirus infection (SARS-CoV-2, SARS-CoV, MERS-CoV, seasonal coronaviruses). We conducted an extensive systematic review of experiments identifying potential host factors. Gene lists from diverse sources were integrated using Meta-Analysis by Information Content (MAIC). This previously described algorithm uses data-driven gene list weightings to produce a comprehensive ranked list of implicated host genes. From 32 datasets, the top ranked gene was PPIA, encoding cyclophilin A, a druggable target using cyclosporine. Other highly-ranked genes included proposed prognostic factors (CXCL10, CD4, CD3E) and investigational therapeutic targets (IL1A) for COVID-19. Gene rankings also inform the interpretation of COVID-19 GWAS results, implicating FYCO1 over other nearby genes in a disease-associated locus on chromosome 3. Researchers can search and review the gene rankings and the contribution of different experimental methods to gene rank at https://baillielab.net/maic/covid19 . As new data are published we will regularly update the list of genes as a resource to inform and prioritise future studies.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Overview of MAIC approach. (A) Schematic showing the operation of MAIC. Each entity in a list is given a score, based on overlap with other lists and rank where relevant, and each list is given a weight determined by the scores of its constituent entities. Entity scores are iteratively updated using list weights, and list weights are updated using entity scores, until convergence occurs. (B) Circular plot showing overlap between different data sources included in MAIC. Size of data source blocks is proportional to the summed information content (MAIC scores) of the input list. Lines are coloured according to the dominant data source. Data source categories share the same colour; the largest categories and data sources are labelled (see Supplementary Information for full source data). (C) Relative information contributions (determined by sum of MAIC score contributions) of each experimental category to the evidence base for the top 100 genes in the MAIC output. (D) Distribution of MAIC scores by gene rank. The shaded region indicates the range of possible scores for a gene supported by a single gene list only. Beyond ranks around 700 in this study, gene scores approach baseline, indicating they have little corroborative evidence.
Figure 2
Figure 2
Highest ranked genes in the MAIC output and overlap with other conditions. (A) Heatmap of the top 50 genes implicated in SARS-CoV-2 infection, as ranked by the MAIC algorithm. The heatmap shows the information sources contributing to each of the top genes, by experimental category. Full details of all scored genes, including specific studies contributing to each, are given in Supplementary Table S1. (B) Venn diagram of overlap between the top 500 hits from this study and the top 500 hits from our previous MAIC analysis of Influenza A virus. (C) Venn diagram of overlap between the top 500 hits from this study and manually curated lists from available literature on HLH and ARDS.
Figure 3
Figure 3
Gene Set Enrichment Analysis of MAIC rankings. (A) Violin plots of MAIC score distributions of top enriched pathways significant with both FGSEA and Enrichr algorithms, from the KEGG 2019 (Human) and WikiPathways 2019 (Human) databases. Highly similar pathways and irrelevant specific disease terms are not shown. n: number of gene set members included in the overall MAIC output; NES: normalised enrichment score from FGSEA. (B): Information contribution by methodology for selected enriched KEGG terms. Relative contributions of different information sources vary between functional annotations, but no single methodology predominates to drive enrichment.
Figure 4
Figure 4
Cellular functions of the 100 highest ranked genes in the MAIC output. Protein products of these genes have diverse cellular locations and are associated with numerous processes relevant to the viral life cycle and host immune system. Stages of the betacoronavirus life cycle: (1) S protein-mediated attachment to the cell surface. (2) Endocytosis. (3) Membrane fusion and viral genome release into the cytoplasm. (4) Assembly of the replication-transcription complex, translation of mRNA. (5) Viral replication and virion assembly. (6) Virion maturation, budding and translocation of vesicles.

References

    1. Baillie JK. Translational genomics. Targeting the host immune response to fight infection. Science. 2014;344:807–808. doi: 10.1126/science.1255074. - DOI - PubMed
    1. Beigel JH, Tomashek KM, Dodd LE, Mehta AK, Zingman BS, Kalil AC, Hohmann E, Chu HY, Luetkemeyer A, Kline S, de Castilla DL, Finberg RW, Dierberg K, Tapson V, Hsieh L, Patterson TF, Paredes R, Sweeney DA, Short WR, Touloumi G, Lye DC, Ohmagari N, Oh M-D, Ruiz-Palacios GM, Benfield T, Fätkenheuer G, Kortepeter MG, Atmar RL, Creech CB, Lundgren J, Babiker AG, Pett S, Neaton JD, Burgess TH, Bonnett T, Green M, Makowski M, Osinusi A, Nayak S, Lane HC. Remdesivir for the treatment of covid-19: preliminary report. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2007764. - DOI - PMC - PubMed
    1. Li B, Clohisey SM, Chia BS, Wang B, Cui A, Eisenhaure T, Schweitzer LD, Hoover P, Parkinson NJ, Nachshon A, Smith N, Regan T, Farr D, Gutmann MU, Bukhari SI, Law A, Sangesland M, Gat-Viks I, Digard P, Vasudevan S, Lingwood D, Dockrell DH, Doench JG, Baillie JK, Hacohen N. Genome-wide CRISPR screen identifies host dependency factors for influenza A virus infection. Nat. Commun. 2020;11:164. doi: 10.1038/s41467-019-13965-x. - DOI - PMC - PubMed
    1. Horby P, Lim WS, Emberson JR, Mafham M, Bell JL, Linsell L, Staplin N, Brightling C, Ustianowski A, Elmahi E, Prudon B, Green C, Felton T, Chadwick D, Rege K, Fegan C, Chappell LC, Faust SN, Jaki T, Jeffery K, Montgomery A, Rowan K, Juszczak E, Baillie JK, Haynes R, Landray MJ. Dexamethasone in hospitalized patients with covid-19 - preliminary report. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2021436. - DOI - PMC - PubMed
    1. Dorward DA, et al. Tissue-specific immunopathology in fatal Covid-19. Am. J. Respir. Crit. Care Med. 2020 doi: 10.1164/rccm.202008-3265OC. - DOI - PMC - PubMed

Publication types