Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb;29(2):309-324.
doi: 10.1038/s41431-020-00730-8. Epub 2020 Oct 27.

An expanded analysis framework for multivariate GWAS connects inflammatory biomarkers to functional variants and disease

Collaborators, Affiliations

An expanded analysis framework for multivariate GWAS connects inflammatory biomarkers to functional variants and disease

Sanni E Ruotsalainen et al. Eur J Hum Genet. 2021 Feb.

Abstract

Multivariate methods are known to increase the statistical power to detect associations in the case of shared genetic basis between phenotypes. They have, however, lacked essential analytic tools to follow-up and understand the biology underlying these associations. We developed a novel computational workflow for multivariate GWAS follow-up analyses, including fine-mapping and identification of the subset of traits driving associations (driver traits). Many follow-up tools require univariate regression coefficients which are lacking from multivariate results. Our method overcomes this problem by using Canonical Correlation Analysis to turn each multivariate association into its optimal univariate Linear Combination Phenotype (LCP). This enables an LCP-GWAS, which in turn generates the statistics required for follow-up analyses. We implemented our method on 12 highly correlated inflammatory biomarkers in a Finnish population-based study. Altogether, we identified 11 associations, four of which (F5, ABO, C1orf140 and PDGFRB) were not detected by biomarker-specific analyses. Fine-mapping identified 19 signals within the 11 loci and driver trait analysis determined the traits contributing to the associations. A phenome-wide association study on the 19 representative variants from the signals in 176,899 individuals from the FinnGen study revealed 53 disease associations (p < 1 × 10-4). Several reported pQTLs in the 11 loci provided orthogonal evidence for the biologically relevant functions of the representative variants. Our novel multivariate analysis workflow provides a powerful addition to standard univariate GWAS analyses by enabling multivariate GWAS follow-up and thus promoting the advancement of powerful multivariate methods in genomics.

PubMed Disclaimer

Conflict of interest statement

VS has received honoraria from Novo Nordisk and Sanofi for consultations and has ongoing research collaboration with Bayer AG (all unrelated to this study). All other authors have no conflict of interest.

Figures

Fig. 1
Fig. 1. Study workflow.
The novel LCP-GWAS method that enables follow-up analyses such as fine-mapping for multivariate GWAS is illustrated in the violet panel on the right.
Fig. 2
Fig. 2. Power comparison between multivariate and univariate methods.
Red and blue dots represent genetic variants reaching genome-wide significance only by the multivariate (metaCCA) or univariate method, respectively. Black dots reach the genome-wide significance threshold by both methods and gray dots do not by either method. Respective numbers are reported in the accompanying table.
Fig. 3
Fig. 3. Manhattan plot of the multivariate GWAS results on 12 inflammatory biomarkers.
Gene names colored in orange represent associations only detected by the multivariate method while black are detected by both multivariate and univariate methods. 13 genome-wide significant nonsynonymous and splice-region variants are denoted with diamonds.

References

    1. Ahola-Olli AV, Würtz P, Havulinna AS, Aalto K, Pitkänen N, Lehtimäki T, et al. Genome-wide association study identifies 27 loci influencing concentrations of circulating cytokines and growth factors. Am J Human Genet. 2017;100:40–50. - PMC - PubMed
    1. Astle WJ, Elding H, Jiang T, Allen D, Ruklisa D, Mann AL, et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell. 2016;167:1415–29. e19. - PMC - PubMed
    1. Liu DJ, Peloso GM, Yu H, Butterworth AS, Wang X, Mahajan A, et al. Exome-wide association study of plasma lipids in> 300,000 individuals. Nat Genet. 2017;49:1758. - PMC - PubMed
    1. Sinnott-Armstrong N, Tanigawa Y, Amar D, Mars NJ, Aguirre M, Venkataraman GR, et al. Genetics of 38 blood and urine biomarkers in the UK Biobank. 2019. Preprint at https://www.biorxiv.org/content/10.1101/660506v1. - DOI - PMC - PubMed
    1. Suhre K, Arnold M, Bhagwat AM, Cotton RJ, Engelke R, Raffler J, et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nat Commun. 2017;8:14357. - PMC - PubMed

Publication types