Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Oct 16;10(10):e1003876.
doi: 10.1371/journal.pcbi.1003876. eCollection 2014 Oct.

Canonical correlation analysis for gene-based pleiotropy discovery

Affiliations

Canonical correlation analysis for gene-based pleiotropy discovery

Jose A Seoane et al. PLoS Comput Biol. .

Abstract

Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy) and for testing multiple variants for association with a single phenotype (gene-based association tests). Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA) measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study), we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1) with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Hive plot for single gene/single phenotype.
The vertical axis represents the association value (higher, more association). The left axis represents phenotypes and the right axis represents genes. An interactive hive plot is published on the project webpage (http://pleioexp.epi.bris.ac.uk/cca/1gene1phenhive.html).
Figure 2
Figure 2. Heatmap single gene/multiple phenotype.
This figure shows the 22 most important association rules as columns in a heatmap. Note that only one member of the UGT1A family is listed and rules for the same gene are summarized in the same column.
Figure 3
Figure 3. Hive plot single gene/multiple phenotype.
This figure shows a hive plot for gene/phenotype association rules. The vertical axis represents the association rules (higher, more association). The left axis represents phenotypes and the right axis represents genes. An interactive hive plot is published on the project webpage (http://pleioexp.epi.bris.ac.uk/cca/geneNphenHive.html).

References

    1. Stearns FW (2010) One hundred years of pleiotropy: a retrospective. Genetics 186: 767–773. - PMC - PubMed
    1. He X, Zhang J (2006) Toward a molecular understanding of pleiotropy. Genetics 173: 1885–1891. - PMC - PubMed
    1. Tyler AL, Asselbergs FW, Williams SM, Moore JH (2009) Shadows of complexity: what biological networks reveal about epistasis and pleiotropy. Bioessays 31: 220–227. - PMC - PubMed
    1. Hadorn E, MITTWOCH U (1961) Developmental genetics and lethal factors. The American Journal of the Medical Sciences 242: 522.
    1. Li SY, Williams BL, Cui YH (2011) A combined p-value approach to infer pathway regulations in eQTL mapping. Statistics and Its Interface 4: 389–401.

Publication types

Substances

LinkOut - more resources