Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations
- PMID: 23973732
- PMCID: PMC3825268
- DOI: 10.1016/j.tpb.2013.08.004
Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations
Abstract
We present a mathematical model, and the corresponding mathematical analysis, that justifies and quantifies the use of principal component analysis of biallelic genetic marker data for a set of individuals to detect the number of subpopulations represented in the data. We indicate that the power of the technique relies more on the number of individuals genotyped than on the number of markers.
Keywords: Eigenanalysis; Eigenvalues; Number of subpopulations; Population structure; Principal components analysis.
Copyright © 2013 Elsevier Inc. All rights reserved.
Figures
References
-
- Bai Z, Silverstein J. Spectral analysis of large dimensional random matrices. 2 Springer; 2009.
-
- Balding DJ, Nichols RA. A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica. 1995;96:3–12. - PubMed
-
- Benaych-Georges F, Nadakuditi RR. The eigenvalues and eigenvectors of finite, low rank perturbations of large random matrices. Advances in Mathematics. 2011;227:494–521.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
