Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2023 May;617(7962):764-768.
doi: 10.1038/s41586-023-06034-3. Epub 2023 May 17.

GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19

Erola Pairo-Castineira #  1   2   3 Konrad Rawlik #  1 Andrew D Bretherick  1   2   4 Ting Qi  5   6 Yang Wu  7 Isar Nassiri  8 Glenn A McConkey  9 Marie Zechner  1   3 Lucija Klaric  2 Fiona Griffiths  1   3 Wilna Oosthuyzen  1   3 Athanasios Kousathanas  10 Anne Richmond  2 Jonathan Millar  1   3   11 Clark D Russell  1 Tomas Malinauskas  8 Ryan Thwaites  12 Kirstie Morrice  13 Sean Keating  11 David Maslove  14 Alistair Nichol  15 Malcolm G Semple  16   17 Julian Knight  8 Manu Shankar-Hari  11   18 Charlotte Summers  19 Charles Hinds  20 Peter Horby  21 Lowell Ling  22 Danny McAuley  23   24 Hugh Montgomery  25 Peter J M Openshaw  12   26 Colin Begg  27 Timothy Walsh  11 Albert Tenesa  2   3   28 Carlos Flores  29   30   31   32 José A Riancho  33   34   35 Augusto Rojas-Martinez  36 Pablo Lapunzina  37   38   39 GenOMICC InvestigatorsSCOURGE ConsortiumISARICC Investigators23andMe COVID-19 TeamJian Yang  5   6 Chris P Ponting  2 James F Wilson  2   28 Veronique Vitart  2 Malak Abedalthagafi  40   41 Andre D Luchessi  42   43 Esteban J Parra  43 Raquel Cruz  37   44 Angel Carracedo  37   44   45   46 Angie Fawkes  13 Lee Murphy  13 Kathy Rowan  47 Alexandre C Pereira  48 Andy Law  3 Benjamin Fairfax  8 Sara Clohisey Hendry  1   3 J Kenneth Baillie  49   50   51   52
Collaborators, Affiliations
Meta-Analysis

GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19

Erola Pairo-Castineira et al. Nature. 2023 May.

Erratum in

  • Author Correction: GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19.
    Pairo-Castineira E, Rawlik K, Bretherick AD, Qi T, Wu Y, Nassiri I, McConkey GA, Zechner M, Klaric L, Griffiths F, Oosthuyzen W, Kousathanas A, Richmond A, Millar J, Russell CD, Malinauskas T, Thwaites R, Morrice K, Keating S, Maslove D, Nichol A, Semple MG, Knight J, Shankar-Hari M, Summers C, Hinds C, Horby P, Ling L, McAuley D, Montgomery H, Openshaw PJM, Begg C, Walsh T, Tenesa A, Flores C, Riancho JA, Rojas-Martinez A, Lapunzina P; GenOMICC Investigators; SCOURGE Consortium; ISARICC Investigators; 23andMe COVID-19 Team; Yang J, Ponting CP, Wilson JF, Vitart V, Abedalthagafi M, Luchessi AD, Parra EJ, Cruz R, Carracedo A, Fawkes A, Murphy L, Rowan K, Pereira AC, Law A, Fairfax B, Hendry SC, Baillie JK. Pairo-Castineira E, et al. Nature. 2023 Jul;619(7971):E61. doi: 10.1038/s41586-023-06383-z. Nature. 2023. PMID: 37433877 Free PMC article. No abstract available.

Abstract

Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A).

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Functional genomics analyses for SLC22A31 and SFTPD.
a, Effect-size plot for the effect of multiple variants on SLC22A31 expression (eQTLgen, x axis) against increasing susceptibility to critical COVID-19 (βxy = 0.11; Pxy = 1.3 × 10−9). The colour shows linkage disequilibrium (LD) with the missense variant rs117169628. b, Three cartoon views of an AlphaFold model of putative solute carrier family 22 member 31 (SLC22A31; UniProtKB: A6NKX4). The side chains of Pro474 and interacting amino acids are shown as connected spheres. A putative channel for small-molecule transport across the cell membrane is indicated by a dashed circle. Pro474 is predicted to be located in the transmembrane helix and point towards a putative transport pathway of a small molecule. The risk variant, P474L (Ala at rs117169628) would be expected to introduce more flexibility to the transmembrane helix and might therefore affect the transport properties of SLC22A31. Pro474 is predicted to be in a tightly packed environment, and may therefore affect the folding of SLC22A31. c, Effect-size plot for effect of multiple variants on SFTPD expression (eQTLgen, x axis) against increasing susceptibility to critical COVID-19 (βxy = 0.16; Pxy = 9.7 × 10−6). Colour shows linkage disequilibrium with the missense variant rs721917. d, Three cartoon views of an AlphaFold model of pulmonary surfactant-associated protein D (SFTPD; UniProtKB: P35247). The side chain of the variant Met31 is shown as connected spheres. Met31 is predicted to be located in the secondary-structure-lacking region of SFTPD. In the diagram on the right, oxygen and nitrogen atoms are coloured red and blue respectively, and the sulfur atom is coloured yellow.
Fig. 2
Fig. 2. GSMR effect sizes.
a,b, The predicted effect of change in protein concentration (a) and gene expression (b) on the risk of critical COVID-19 is shown for proteins and genes significantly linked to critical COVID-19 by GSMR (false-discovery rate (FDR) < 0.01). The bars show 95% confidence intervals.
Extended Data Fig. 1
Extended Data Fig. 1. Pipeline of meta-analysis and post-GWAS analyses.
Red border indicates that the data is only available for the hospitalized phenotype, while a black border indicates that the analysis was performed for the critical illness phenotype.
Extended Data Fig. 2
Extended Data Fig. 2. Miami plots.
Meta-analysis results are shown for a) critical and b) hospitalized phenotypes. In each plot results obtained using all cohorts are shown at the top and using GenOMICC data only at the bottom. Independent lead variants in the analyses of all cohorts are annotated with associated genes. Genome-wide significant associations that have not been previously reported are indicated in bold.
Extended Data Fig. 3
Extended Data Fig. 3. Comparison of effect size estimates.
GenOMICC is compared with the critical and hospitalized phenotype definitions in the SCOURGE, 23andMe, and HGI analyses. The black line indicates the best linear fit, given by the equation in each plot, obtained using Orthogonal Distance Regression to account for estimation errors in both sets of effects in the comparison.
Extended Data Fig. 4
Extended Data Fig. 4. Study weightings for (a) critical and (b) hospitalized COVID-19.
Mean +/− standard deviation of weights assigned to each data source in meta-analyses for all significant SNPs.
Extended Data Fig. 5
Extended Data Fig. 5. Cartoon showing postulated roles for genes and mediators implicated in the pathogenesis of critical COVID-19 by GenOMICC GWAS, TWAS and Mendelian randomization.
Postulated roles for genetic variants are shown in a highly simplified format to illustrate potential roles in pathogenesis, with the shaded background indicating the hypothetical impact of the host immune response over time. Host immune processes are divided into those that are thought to play a role in controlling viral replication early in disease (orange section, showing “adaptive” response), and those implicated in driving hypoxaemic respiratory failure later in disease (green section, showing “maladaptive” response). Bold type gene names indicate a higher level of confidence in both the gene identification and the biological role.
Extended Data Fig. 6
Extended Data Fig. 6. Functional genomics analyses for TYK2.
(a) Effect size plot for effect of multiple variants on TYK2 expression (eQTLgen, x-axis) against increasing susceptibility to critical COVID-19 (βxy = 0.53; Pxy=1.2×1023). Colour shows linkage disequilibrium (LD) with the missense variant rs34536443. (b) Crystal structure of TYK2 kinase domain (Protein Data Bank ID 4GVJ) in two views that differ by a 45° rotation around a horizontal axis. The side chain of P1104 is shown as connected spheres with a nitrogen atom coloured in blue. Carbon, oxygen, nitrogen and phosphorus atoms of ATP are shown as magenta, red, blue and orange connected spheres, respectively. The N-terminal region of the kinase domain is not shown in the second view for clarity. The right-most panel shows a close view of P1104 and neighbouring residues with their side chains shown as sticks. Numbering of residues corresponds to UniProtKB entry P29597. P1104 is in the catalytic kinase domain and proximal to the ATP-binding site; TYK2 P1104A is catalytically impaired.
Extended Data Fig. 7
Extended Data Fig. 7. Steroid treatment and vaccination status.
Data are shown for a subset of GenOMICC cases who were also recruited to the ISARIC4C study in the UK.

References

    1. Pairo-Castineira E, et al. Genetic mechanisms of critical illness in COVID-19. Nature. 2021;591:92–98. doi: 10.1038/s41586-020-03065-y. - DOI - PubMed
    1. Kousathanas A, et al. Whole-genome sequencing reveals host factors underlying critical COVID-19. Nature. 2022;607:97–103. doi: 10.1038/s41586-022-04576-6. - DOI - PMC - PubMed
    1. Abani O, et al. Baricitinib in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial and updated meta-analysis. Lancet. 2022;400:359–368. doi: 10.1016/S0140-6736(22)01109-6. - DOI - PMC - PubMed
    1. Pathak GA, et al. A first update on mapping the human genetic architecture of COVID-19. Nature. 2022;608:E1–E10. doi: 10.1038/s41586-022-04826-7. - DOI - PMC - PubMed
    1. Zhu Z, et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat. Commun. 2018;9:224. doi: 10.1038/s41467-017-02317-2. - DOI - PMC - PubMed

Publication types