Population structure and eigenanalysis
- PMID: 17194218
- PMCID: PMC1713260
- DOI: 10.1371/journal.pgen.0020190
Population structure and eigenanalysis
Abstract
Current methods for inferring population structure from genetic data do not provide formal significance tests for population differentiation. We discuss an approach to studying population structure (principal components analysis) that was first applied to genetic data by Cavalli-Sforza and colleagues. We place the method on a solid statistical footing, using results from modern statistics to develop formal significance tests. We also uncover a general "phase change" phenomenon about the ability to detect structure in genetic data, which emerges from the statistical theory we use, and has an important implication for the ability to discover structure in genetic data: for a fixed but large dataset size, divergence between two populations (as measured, for example, by a statistic like FST) below a threshold is essentially undetectable, but a little above threshold, detection will be easy. This means that we can predict the dataset size needed to detect structure.
Conflict of interest statement
Competing interests. The authors have declared that no competing interests exist.
Figures










Similar articles
-
Principal component analysis under population genetic models of range expansion and admixture.Mol Biol Evol. 2010 Jun;27(6):1257-68. doi: 10.1093/molbev/msq010. Epub 2010 Jan 21. Mol Biol Evol. 2010. PMID: 20097660
-
A spectral theory for Wright's inbreeding coefficients and related quantities.PLoS Genet. 2021 Jul 19;17(7):e1009665. doi: 10.1371/journal.pgen.1009665. eCollection 2021 Jul. PLoS Genet. 2021. PMID: 34280184 Free PMC article.
-
Population genetics, diversity and forensic characteristics of Tai-Kadai-speaking Bouyei revealed by insertion/deletions markers.Mol Genet Genomics. 2019 Oct;294(5):1343-1357. doi: 10.1007/s00438-019-01584-6. Epub 2019 Jun 13. Mol Genet Genomics. 2019. PMID: 31197471
-
Genetic relatedness analysis: modern data and new challenges.Nat Rev Genet. 2006 Oct;7(10):771-80. doi: 10.1038/nrg1960. Nat Rev Genet. 2006. PMID: 16983373 Review.
-
Genetic markers in the playground of multivariate analysis.Heredity (Edinb). 2009 Apr;102(4):330-41. doi: 10.1038/hdy.2008.130. Epub 2009 Jan 21. Heredity (Edinb). 2009. PMID: 19156164 Review.
Cited by
-
Genome-wide association mapping for resistance to bacterial blight and bacterial leaf streak in rice.Planta. 2021 Apr 8;253(5):94. doi: 10.1007/s00425-021-03612-5. Planta. 2021. PMID: 33830376
-
The genetic prehistory of southern Africa.Nat Commun. 2012;3:1143. doi: 10.1038/ncomms2140. Nat Commun. 2012. PMID: 23072811 Free PMC article.
-
New insights into the fine-scale history of western-eastern admixture of the northwestern Chinese population in the Hexi Corridor via genome-wide genetic legacy.Mol Genet Genomics. 2021 May;296(3):631-651. doi: 10.1007/s00438-021-01767-0. Epub 2021 Mar 1. Mol Genet Genomics. 2021. PMID: 33650010
-
Inferring admixture histories of human populations using linkage disequilibrium.Genetics. 2013 Apr;193(4):1233-54. doi: 10.1534/genetics.112.147330. Epub 2013 Feb 14. Genetics. 2013. PMID: 23410830 Free PMC article.
-
Single-Cell RNA Sequencing in Parkinson's Disease.Biomedicines. 2021 Apr 1;9(4):368. doi: 10.3390/biomedicines9040368. Biomedicines. 2021. PMID: 33916045 Free PMC article. Review.
References
-
- Devlin B, Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. - PubMed
-
- Menozzi P, Piazza A, Cavalli-Sforza L. Synthetic maps of human gene frequencies in Europeans. Science. 1978;201:786–792. - PubMed
-
- Cavalli-Sforza LL, Feldman MW. The application of molecular genetic approaches to the study of human evolution. Nat Genet. 2003;33(Supplement):266–275. Historical article. - PubMed
-
- Chakraborty R, Jin L. A unified approach to study hypervariable polymorphisms: Statistical considerations of determining relatedness and population distances. In: Pena S, Jeffreys A, Epplen J, Chakraborty R, editors. DNA fingerprinting, current state of the science. Basel: Birkhauser; 1993. pp. 153–175. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous