Rare variants create synthetic genome-wide associations
- PMID: 20126254
- PMCID: PMC2811148
- DOI: 10.1371/journal.pbio.1000294
Rare variants create synthetic genome-wide associations
Abstract
Genome-wide association studies (GWAS) have now identified at least 2,000 common variants that appear associated with common diseases or related traits (http://www.genome.gov/gwastudies), hundreds of which have been convincingly replicated. It is generally thought that the associated markers reflect the effect of a nearby common (minor allele frequency >0.05) causal site, which is associated with the marker, leading to extensive resequencing efforts to find causal sites. We propose as an alternative explanation that variants much less common than the associated one may create "synthetic associations" by occurring, stochastically, more often in association with one of the alleles at the common site versus the other allele. Although synthetic associations are an obvious theoretical possibility, they have never been systematically explored as a possible explanation for GWAS findings. Here, we use simple computer simulations to show the conditions under which such synthetic associations will arise and how they may be recognized. We show that they are not only possible, but inevitable, and that under simple but reasonable genetic models, they are likely to account for or contribute to many of the recently identified signals reported in genome-wide association studies. We also illustrate the behavior of synthetic associations in real datasets by showing that rare causal mutations responsible for both hearing loss and sickle cell anemia create genome-wide significant synthetic associations, in the latter case extending over a 2.5-Mb interval encompassing scores of "blocks" of associated variants. In conclusion, uncommon or rare genetic variants can easily create synthetic associations that are credited to common variants, and this possibility requires careful consideration in the interpretation and follow up of GWAS signals.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures








Comment in
-
Synthetic associations created by rare variants do not explain most GWAS results.PLoS Biol. 2011 Jan 18;9(1):e1000579. doi: 10.1371/journal.pbio.1000579. PLoS Biol. 2011. PMID: 21267061 Free PMC article. No abstract available.
-
Synthetic associations are unlikely to account for many common disease genome-wide association signals.PLoS Biol. 2011 Jan 18;9(1):e1000580. doi: 10.1371/journal.pbio.1000580. PLoS Biol. 2011. PMID: 21267062 Free PMC article.
-
Rare treasures.Nat Rev Genet. 2010 Mar;11(3):170. doi: 10.1038/nrg2754. Nat Rev Genet. 2010. PMID: 21485427 No abstract available.
Similar articles
-
Predicting signatures of "synthetic associations" and "natural associations" from empirical patterns of human genetic variation.PLoS Comput Biol. 2012;8(7):e1002600. doi: 10.1371/journal.pcbi.1002600. Epub 2012 Jul 5. PLoS Comput Biol. 2012. PMID: 22792059 Free PMC article.
-
Synthetic associations in the context of genome-wide association scan signals.Hum Mol Genet. 2010 Oct 15;19(R2):R137-44. doi: 10.1093/hmg/ddq368. Epub 2010 Aug 30. Hum Mol Genet. 2010. PMID: 20805105 Free PMC article. Review.
-
Genome-wide significant associations for variants with minor allele frequency of 5% or less--an overview: A HuGE review.Am J Epidemiol. 2010 Oct 15;172(8):869-89. doi: 10.1093/aje/kwq234. Epub 2010 Sep 28. Am J Epidemiol. 2010. PMID: 20876667 Free PMC article. Review.
-
Common genetic variants regulating ADD3 gene expression alter biliary atresia risk.J Hepatol. 2013 Dec;59(6):1285-91. doi: 10.1016/j.jhep.2013.07.021. Epub 2013 Jul 19. J Hepatol. 2013. PMID: 23872602
-
Interpretation of association signals and identification of causal variants from genome-wide association studies.Am J Hum Genet. 2010 May 14;86(5):730-42. doi: 10.1016/j.ajhg.2010.04.003. Epub 2010 Apr 29. Am J Hum Genet. 2010. PMID: 20434130 Free PMC article.
Cited by
-
The genetics of attention deficit/hyperactivity disorder in adults, a review.Mol Psychiatry. 2012 Oct;17(10):960-87. doi: 10.1038/mp.2011.138. Epub 2011 Nov 22. Mol Psychiatry. 2012. PMID: 22105624 Free PMC article. Review.
-
Computer simulations: tools for population and evolutionary genetics.Nat Rev Genet. 2012 Jan 10;13(2):110-22. doi: 10.1038/nrg3130. Nat Rev Genet. 2012. PMID: 22230817 Review.
-
Properties and modeling of GWAS when complex disease risk is due to non-complementing, deleterious mutations in genes of large effect.PLoS Genet. 2013;9(2):e1003258. doi: 10.1371/journal.pgen.1003258. Epub 2013 Feb 21. PLoS Genet. 2013. PMID: 23437004 Free PMC article.
-
An exploration of linkage fine-mapping on sequences from case-control studies.Genet Epidemiol. 2023 Feb;47(1):78-94. doi: 10.1002/gepi.22502. Epub 2022 Sep 1. Genet Epidemiol. 2023. PMID: 36047334 Free PMC article.
-
A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing.Front Genet. 2015 Apr 20;6:149. doi: 10.3389/fgene.2015.00149. eCollection 2015. Front Genet. 2015. PMID: 25941534 Free PMC article. Review.
References
-
- Lowe C. E, Cooper J. D, Brusko T, Walker N. M, Smyth D. J, et al. Large-scale genetic fine mapping and genotype-phenotype associations implicate polymorphism in the IL2RA region in type 1 diabetes. Nat Genet. 2007;39:1074–1082. - PubMed
-
- Burfoot R. K, Jensen C. J, Field J, Stankovich J, Varney M. D, et al. SNP mapping and candidate gene sequencing in the class I region of the HLA complex: searching for multiple sclerosis susceptibility genes in Tasmanians. Tissue Antigens. 2008;71:42–50. - PubMed
-
- Deloukas P on behalf of the Wellcome Trust Case Control Consortium. High throughput approaches to fine mapping in regions of confirmed association. 2008. Presentation at the 58th Annual Meeting of the American Society of Human Genetics, November 13, 2008; Philadelphia, Pennsylvania.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous