Disease-associated alleles in genome-wide association studies are enriched for derived low frequency alleles relative to HapMap and neutral expectations
- PMID: 21143973
- PMCID: PMC3017004
- DOI: 10.1186/1755-8794-3-57
Abstract
Background: Genome-wide association studies give insight into the genetic basis of common diseases. An open question is whether the allele frequency distributions and ancestral vs. derived states of disease-associated alleles differ from the rest of the genome. Characteristics of disease-associated alleles can be used to increase the yield of future studies.
Methods: The set of all common disease-associated alleles found in genome-wide association studies prior to January 2010 was analyzed and compared with HapMap and theoretical null expectations. In addition, allele frequency distributions of different disease classes were assessed. Ages of HapMap and disease-associated alleles were also estimated.
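As a hedged illustration of what a neutral null expectation for allele frequencies can look like, the sketch below computes the standard neutral site frequency spectrum, in which the expected share of segregating sites carrying i derived copies in a sample of n chromosomes is proportional to 1/i. The abstract does not state which null model or sample size the authors used. The function names and the n = 120 example (60 unrelated individuals) are assumptions made for illustration, and the sketch ignores SNP ascertainment and the probability of detection.

```python
from fractions import Fraction


def neutral_sfs(n):
    """Expected proportion of segregating sites with i derived copies
    (i = 1..n-1) in a sample of n chromosomes under the standard neutral
    model, where the expectation is proportional to 1/i.

    Hypothetical helper for illustration; not the authors' code.
    """
    if n < 2:
        raise ValueError("need at least two chromosomes")
    weights = [Fraction(1, i) for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


def mean_derived_frequency(n):
    """Mean derived allele frequency implied by neutral_sfs(n)."""
    sfs = neutral_sfs(n)
    return float(sum(Fraction(i, n) * p for i, p in enumerate(sfs, start=1)))


if __name__ == "__main__":
    # Assumed sample size: 120 chromosomes (60 unrelated individuals).
    sfs = neutral_sfs(120)
    print("share of singletons:", float(sfs[0]))
    print("mean derived frequency:", mean_derived_frequency(120))
```

Under this null model most derived alleles are rare, so an observed set of alleles can be compared against it by asking whether low-frequency derived alleles are over- or under-represented.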
Results: The allele frequency distribution of HapMap alleles was qualitatively similar to neutral expectations. However, disease-associated alleles were more likely to be low-frequency derived alleles relative to null expectations. Of the disease-associated alleles, 43.7% were ancestral. The mean frequency of disease-associated alleles was lower than that of randomly chosen CEU HapMap alleles (0.394 vs. 0.610, after accounting for probability of detection). Similar patterns were observed for the subset of disease-associated alleles that have been verified in multiple studies. SNPs implicated in genome-wide association studies were enriched for young SNPs compared to randomly selected HapMap loci. Odds ratios of disease-associated alleles tended to be less than 1.5 and varied by frequency, confirming previous studies.
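To make the link between derived allele frequency and allele age concrete, the sketch below uses the classical Kimura-Ohta expectation for the age of a neutral derived allele at frequency x, namely -4 Ne x ln(x) / (1 - x) generations. The abstract does not say which age estimator was used, so this formula is a swapped-in textbook choice. The effective population size of 10,000 and the function name are assumptions for illustration only.

```python
import math


def expected_allele_age(x, ne=10_000):
    """Kimura-Ohta expected age, in generations, of a neutral derived
    allele currently at frequency x in a population of effective size ne.

    Hypothetical helper; ne=10_000 is an assumed value, not taken from
    the abstract.
    """
    if not 0.0 < x < 1.0:
        raise ValueError("x must lie strictly between 0 and 1")
    return -4.0 * ne * x * math.log(x) / (1.0 - x)


if __name__ == "__main__":
    for freq in (0.05, 0.25, 0.50, 0.75, 0.95):
        print(f"derived frequency {freq:.2f}: "
              f"{expected_allele_age(freq):,.0f} generations")
```

Under this expectation, age rises with derived allele frequency, so a set of alleles enriched for low-frequency derived alleles is also expected to be enriched for young alleles.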
Conclusions: Alleles associated with genetic disease differ from randomly selected HapMap alleles and neutral expectations. The evolutionary history of alleles (frequency and ancestral vs. derived state) influences whether they are implicated in genome-wide association studies.
Similar articles
- Functional and Structural Consequence of Rare Exonic Single Nucleotide Polymorphisms: One Story, Two Tales. Genome Biol Evol. 2015 Oct 9;7(10):2929-40. doi: 10.1093/gbe/evv191. PMID: 26454016 Free PMC article.
- Allelic Spectra of Risk SNPs Are Different for Environment/Lifestyle Dependent versus Independent Diseases. PLoS Genet. 2015 Jul 22;11(7):e1005371. doi: 10.1371/journal.pgen.1005371. eCollection 2015 Jul. PMID: 26201053 Free PMC article.
- Genomic variations and distinct evolutionary rate of rare alleles in Arabidopsis thaliana. BMC Evol Biol. 2016 Jan 27;16:25. doi: 10.1186/s12862-016-0590-7. PMID: 26817829 Free PMC article.
- Genome-wide significant associations for variants with minor allele frequency of 5% or less--an overview: A HuGE review. Am J Epidemiol. 2010 Oct 15;172(8):869-89. doi: 10.1093/aje/kwq234. Epub 2010 Sep 28. PMID: 20876667 Free PMC article. Review.
- The pursuit of genome-wide association studies: where are we now? J Hum Genet. 2010 Apr;55(4):195-206. doi: 10.1038/jhg.2010.19. Epub 2010 Mar 19. PMID: 20300123 Review.
Cited by
- The influence of evolutionary history on human health and disease. Nat Rev Genet. 2021 May;22(5):269-283. doi: 10.1038/s41576-020-00305-9. Epub 2021 Jan 6. PMID: 33408383 Free PMC article. Review.
- Genetic Hitchhiking and Population Bottlenecks Contribute to Prostate Cancer Disparities in Men of African Descent. Cancer Res. 2018 May 1;78(9):2432-2443. doi: 10.1158/0008-5472.CAN-17-1550. Epub 2018 Feb 8. PMID: 29438991 Free PMC article.
- Derived SNP alleles are used more frequently than ancestral alleles as risk-associated variants in common human diseases. J Bioinform Comput Biol. 2012 Apr;10(2):1241008. doi: 10.1142/S0219720012410089. PMID: 22809343 Free PMC article.
- Population Levels Assessment of the Distribution of Disease-Associated Variants With Emphasis on Armenians - A Machine Learning Approach. Front Genet. 2019 Apr 26;10:394. doi: 10.3389/fgene.2019.00394. eCollection 2019. PMID: 31105750 Free PMC article.
- Tackle characteristics associated with concussion in elite men's rugby union: unpicking the differences between tacklers and ball-carriers. BMJ Open Sport Exerc Med. 2025 Aug 4;11(3):e002612. doi: 10.1136/bmjsem-2025-002612. eCollection 2025. PMID: 40766043 Free PMC article.