Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Oct 24:16:124.
doi: 10.1186/s12863-015-0283-z.

Leveraging local ancestry to detect gene-gene interactions in genome-wide data

Affiliations

Leveraging local ancestry to detect gene-gene interactions in genome-wide data

Hugues Aschard et al. BMC Genet. .

Abstract

Background: Although genome-wide association studies have successfully identified thousands of variants associated to complex traits, these variants only explain a small amount of the entire heritability of the trait. Gene-gene interactions have been proposed as a source to explain a significant percentage of the missing heritability. However, detecting gene-gene interactions has proven to be very difficult due to computational and statistical challenges. The vast number of possible interactions that can be tested induces very stringent multiple hypotheses corrections that limit the power of detection. These issues have been mostly highlighted for the identification of pairwise effects and are even more challenging when addressing higher order interaction effects. In this work we explore the use of local ancestry in recently admixed individuals to find signals of gene-gene interaction on human traits and diseases.

Results: We introduce statistical methods that leverage the correlation between local ancestry and the hidden unknown causal variants to find distant gene-gene interactions. We show that the power of this test increases with the number of causal variants per locus and the degree of differentiation of these variants between the ancestral populations. Overall, our simulations confirm that local ancestry can be used to detect gene-gene interactions, solving the computational bottleneck. When compared to a single nucleotide polymorphism (SNP)-based interaction screening of the same sample size, the power of our test was lower on all settings we considered. However, accounting for the dramatic increase in sample size that can be achieve when genotyping only a set of ancestry informative markers instead of the whole genome, we observe substantial gain in power in several scenarios.

Conclusion: Local ancestry-based interaction tests offer a new path to the detection of gene-gene interaction effects. It would be particularly useful in scenarios where multiple differentiated variants at the interacting loci act in a synergistic manner.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Simulation schemes. Main and interaction effects are simulated assuming either a single genetic variant per locus (a) or multiple genetic variants per locus (b). In the latter case, the main and interaction effects on the outcome Y are moderated through two latent variables Z1 and Z2 that directly depend on the causal variants. Example of local ancestry derived for the two haplotypes of three individuals (c)
Fig. 2
Fig. 2
Power comparison for a single causal SNP per locus. Upper panels show the sample size required for 80 % power for the interaction test based on 1 M genotyped GWAS SNPs (S G) (a), and the interaction test based on local ancestry segment (S L) assuming a total of 1 K local ancestry segments (b) against the interaction test based on full sequencing data (S S) assuming a total of 20 M genetic variants (blue curve). Sample size is plotted for increasing ρ GC and ρ GL (defined by the red gradient), the correlation between the true interaction term and the best tag from 1 M genotyped SNPs, and the best tag from local ancestries, respectively. The variance explained by the interaction effect is unrealistically large for illustration purposes and varied between 1 and 10 %. Lower panels show the observed distribution of ρ GC (c) and ρ GL (d) for a randomly selected region from the 1 M Illumina chip and local ancestry, respectively
Fig. 3
Fig. 3
Tagging interaction effects in a multiple causal model. A latent variable Z is generated as a function of an increasing number of SNPs at a single locus, explaining altogether 50 % of its variance. The average value of Z across 20,000 replicates of 10,000 admixed samples is plotted for each three local ancestry classes. The effect of the SNPs is drawn from a normal (a) and left-truncated normal (b) distribution with a mean of 0 (upper panel). When the SNP effects are null on average, the average values of Z do not differ by local ancestry and ρ ZL, the correlation between Z and local ancestry, is also null on average. Conversely, when the average effect of the SNPs is not null, ρ ZL increases with an increasing number of causal variants (lower panel)
Fig. 4
Fig. 4
Power comparison for multiple causal SNPs per locus. Power across 25,000 replicates using a Bonferroni correction resulting in p-value thresholds of 1 × 10−7 and 1 × 10−15 for the local ancestry-based interaction test (S L) and the SNP-based interaction test (S G), respectively. One to five common causals SNPs were selected per interacting locus while assuming either low (a), moderate (b) or high (c) differentiation of those SNPs between the two admixed populations. We considered three case scenarios for the additional increase in sample size that would be achieve when using local ancestry derived from AIMs, no increase (pink), a lower bound of six fold increase (light red) and an upper bound of 10 fold increase. We varied the baseline sample size (for S G) across scenarios to emphasize the differences between the tests

Similar articles

Cited by

References

    1. Aschard H, Lutz S, Maus B, Duell EJ, Fingerlin TE, Chatterjee N, Kraft P, Van Steen K. Challenges and opportunities in genome-wide environmental interaction (GWEI) studies. Hum Genet. 2012;131(10):1591–1613. doi: 10.1007/s00439-012-1192-0. - DOI - PMC - PubMed
    1. Cordell HJ. Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet. 2009;10(6):392–404. doi: 10.1038/nrg2579. - DOI - PMC - PubMed
    1. Mackay TF. Epistasis and quantitative traits: using model organisms to study gene-gene interactions. Nat Rev Genet. 2014;15(1):22–33. doi: 10.1038/nrg3627. - DOI - PMC - PubMed
    1. Aldrich MC, Kumar R, Colangelo LA, Williams LK, Sen S, Kritchevsky SB, Meibohm B, Galanter J, Hu D, Gignoux CR, et al. Genetic ancestry-smoking interactions and lung function in African Americans: a cohort study. PLoS One. 2012;7(6):e39541. doi: 10.1371/journal.pone.0039541. - DOI - PMC - PubMed
    1. Powell R, Davidson D, Divers J, Manichaikul A, Carr JJ, Detrano R, Hoffman EA, Jiang R, Kronmal RA, Liu K, et al. Genetic ancestry and the relationship of cigarette smoking to lung function and per cent emphysema in four race/ethnic groups: a cross-sectional study. Thorax. 2013;68(7):634–642. doi: 10.1136/thoraxjnl-2012-202116. - DOI - PMC - PubMed

Publication types