Ascertainment biases in SNP chips affect measures of population divergence
- PMID: 20558595
- PMCID: PMC3107607
- DOI: 10.1093/molbev/msq148
Ascertainment biases in SNP chips affect measures of population divergence
Abstract
Chip-based high-throughput genotyping has facilitated genome-wide studies of genetic diversity. Many studies have utilized these large data sets to make inferences about the demographic history of human populations using measures of genetic differentiation such as F(ST) or principal component analyses. However, the single nucleotide polymorphism (SNP) chip data suffer from ascertainment biases caused by the SNP discovery process in which a small number of individuals from selected populations are used as discovery panels. In this study, we investigate the effect of the ascertainment bias on inferences regarding genetic differentiation among populations in one of the common genome-wide genotyping platforms. We generate SNP genotyping data for individuals that previously have been subject to partial genome-wide Sanger sequencing and compare inferences based on genotyping data to inferences based on direct sequencing. In addition, we also analyze publicly available genome-wide data. We demonstrate that the ascertainment biases will distort measures of human diversity and possibly change conclusions drawn from these measures in some times unexpected ways. We also show that details of the genotyping calling algorithms can have a surprisingly large effect on population genetic inferences. We not only present a correction of the spectrum for the widely used Affymetrix SNP chips but also show that such corrections are difficult to generalize among studies.
Figures
References
-
- Affymetrix. 2006. BRLMM: an improved genotype calling method for the GeneChip Human Mapping 500K Array Set. Technical Report. Santa Clara (CA): Affymetrix, Inc. Available from: http://www.affymetrix.com/support/technical/whitepapers/brlmm_whitepaper....
-
- Albrechtsen A, Sand Korneliussen T, Moltke I, van Overseem Hansen T, Nielsen FC, Nielsen R. Relatedness mapping and tracts of relatedness for genome-wide data in the presence of linkage disequilibrium. Genet Epidemiol. 2009;33:266–274. - PubMed
-
- Bustamante CD, Fledel-Alon A, Williamson S, et al. (14 co-authors) Natural selection on protein-coding genes in the human genome. Nature. 2005;437:1153–1157. - PubMed
-
- Cann HM, de Toma C, Cazes L, et al. (41 co-authors) A human genome diversity cell line panel. Science. 2002;296:261–262. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
