Correction of population stratification in large multi-ethnic association studies
- PMID: 18196181
- PMCID: PMC2198793
- DOI: 10.1371/journal.pone.0001382
Abstract
Background: The vast majority of genetic risk factors for complex diseases have, taken individually, a small effect on the end phenotype. Population-based association studies therefore need very large sample sizes to detect significant differences between affected and non-affected individuals. Including thousands of affected individuals in a study requires recruitment in numerous centers, possibly from different geographic regions. Unfortunately such a recruitment strategy is likely to complicate the study design and to generate concerns regarding population stratification.
Methodology/principal findings: We analyzed 9,751 individuals representing three main ethnic groups - Europeans, Arabs and South Asians - who had been enrolled at 154 centers in 52 countries for a global case/control study of acute myocardial infarction. All individuals were genotyped at 103 candidate genes using 1,536 SNPs selected with a tagging strategy that captures most of the genetic diversity in different populations. We show that relying solely on self-reported ethnicity is not sufficient to exclude population stratification, and we present additional methods to identify and correct for stratification.
Conclusions/significance: Our results highlight the importance of carefully addressing population stratification and of "cleaning" the sample prior to analyses, both to obtain stronger signals of association and to avoid spurious results.
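The abstract does not say which methods were used to identify stratification, so the following is only an illustrative sketch of one widely used generic approach, principal component analysis of the genotype matrix, and should not be read as the authors' pipeline. The data are simulated, and every name and parameter here (`simulate_two_groups`, `top_principal_components`, the allele-frequency shift, the sample sizes) is a hypothetical choice made for the example. It shows how two groups with different allele frequencies can separate along the first principal component, which is the kind of structure that a self-reported label alone might miss.

```python
import numpy as np


def top_principal_components(genotypes, n_components=2):
    """Return per-individual scores on the leading principal components.

    genotypes: (n_individuals, n_snps) array of minor-allele counts (0, 1, 2).
    """
    g = np.asarray(genotypes, dtype=float)
    g = g - g.mean(axis=0)             # center each SNP
    sd = g.std(axis=0)
    keep = sd > 0                      # drop SNPs with no variation
    g = g[:, keep] / sd[keep]          # scale remaining SNPs to unit variance
    u, s, _ = np.linalg.svd(g, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def simulate_two_groups(n_per_group=100, n_snps=500, seed=0):
    """Simulate genotypes for two groups with shifted allele frequencies.

    All values are assumptions for illustration, not taken from the study.
    """
    rng = np.random.default_rng(seed)
    base = rng.uniform(0.1, 0.9, size=n_snps)
    shift = rng.normal(0.0, 0.15, size=n_snps)
    freq_a = np.clip(base + shift, 0.05, 0.95)
    freq_b = np.clip(base - shift, 0.05, 0.95)
    geno_a = rng.binomial(2, freq_a, size=(n_per_group, n_snps))
    geno_b = rng.binomial(2, freq_b, size=(n_per_group, n_snps))
    labels = np.array([0] * n_per_group + [1] * n_per_group)
    return np.vstack([geno_a, geno_b]), labels


genotypes, labels = simulate_two_groups()
pcs = top_principal_components(genotypes)
pc1 = pcs[:, 0]

# Compare the distance between group means on PC1 with the within-group spread.
gap = abs(pc1[labels == 0].mean() - pc1[labels == 1].mean())
spread = max(pc1[labels == 0].std(), pc1[labels == 1].std())
print(f"PC1 group separation: {gap / spread:.1f} within-group SDs")
```

In a real analysis, individuals whose component scores fall far from the rest of their self-reported group could be flagged for review, and the leading components could be included as covariates in the association test; both steps are common practice rather than details confirmed by this abstract.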