Accounting for population stratification in DNA methylation studies
- PMID: 24478250
- PMCID: PMC4090102
- DOI: 10.1002/gepi.21789
Accounting for population stratification in DNA methylation studies
Abstract
DNA methylation is an important epigenetic mechanism that has been linked to complex diseases and is of great interest to researchers as a potential link between genome, environment, and disease. As the scale of DNA methylation association studies approaches that of genome-wide association studies, issues such as population stratification will need to be addressed. It is well-documented that failure to adjust for population stratification can lead to false positives in genetic association studies, but population stratification is often unaccounted for in DNA methylation studies. Here, we propose several approaches to correct for population stratification using principal components (PCs) from different subsets of genome-wide methylation data. We first illustrate the potential for confounding due to population stratification by demonstrating widespread associations between DNA methylation and race in 388 individuals (365 African American and 23 Caucasian). We subsequently evaluate the performance of our PC-based approaches and other methods in adjusting for confounding due to population stratification. Our simulations show that (1) all of the methods considered are effective at removing inflation due to population stratification, and (2) maximum power can be obtained with single-nucleotide polymorphism (SNP)-based PCs, followed by methylation-based PCs, which outperform both surrogate variable analysis and genomic control. Among our different approaches to computing methylation-based PCs, we find that PCs based on CpG sites chosen for their potential to proxy nearby SNPs can provide a powerful and computationally efficient approach to adjust for population stratification in DNA methylation studies when genome-wide SNP data are unavailable.
Keywords: DNA methylation; association studies; population stratification; principal components.
© 2014 WILEY PERIODICALS, INC.
Figures
References
Publication types
MeSH terms
Grants and funding
- R56 MH071537/MH/NIMH NIH HHS/United States
- R35 CA197449/CA/NCI NIH HHS/United States
- K01 MH085806/MH/NIMH NIH HHS/United States
- R01 MH071537/MH/NIMH NIH HHS/United States
- T32 ES007142/ES/NIEHS NIH HHS/United States
- R01 MH094757/MH/NIMH NIH HHS/United States
- R01 MH096764/MH/NIMH NIH HHS/United States
- T32 GM074897/GM/NIGMS NIH HHS/United States
- HG007508/HG/NHGRI NIH HHS/United States
- MH085806/MH/NIMH NIH HHS/United States
- MH096764/MH/NIMH NIH HHS/United States
- MH071537/MH/NIMH NIH HHS/United States
- R01 HG007508/HG/NHGRI NIH HHS/United States
- UL1 TR000454/TR/NCATS NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
