CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure
- PMID: 17485429
- DOI: 10.1093/bioinformatics/btm233
CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure
Abstract
Motivation: Clustering of individuals into populations on the basis of multilocus genotypes is informative in a variety of settings. In population-genetic clustering algorithms, such as BAPS, STRUCTURE and TESS, individual multilocus genotypes are partitioned over a set of clusters, often using unsupervised approaches that involve stochastic simulation. As a result, replicate cluster analyses of the same data may produce several distinct solutions for estimated cluster membership coefficients, even though the same initial conditions were used. Major differences among clustering solutions have two main sources: (1) 'label switching' of clusters across replicates, caused by the arbitrary way in which clusters in an unsupervised analysis are labeled, and (2) 'genuine multimodality,' truly distinct solutions across replicates.
Results: To facilitate the interpretation of population-genetic clustering results, we describe three algorithms for aligning multiple replicate analyses of the same data set. We have implemented these algorithms in the computer program CLUMPP (CLUster Matching and Permutation Program). We illustrate the use of CLUMPP by aligning the cluster membership coefficients from 100 replicate cluster analyses of 600 chickens from 20 different breeds.
Availability: CLUMPP is freely available at http://rosenberglab.bioinformatics.med.umich.edu/clumpp.html.
Similar articles
-
Crimp: An efficient tool for summarizing multiple clusterings in population structure analysis and beyond.Mol Ecol Resour. 2023 Apr;23(3):705-711. doi: 10.1111/1755-0998.13734. Epub 2022 Nov 22. Mol Ecol Resour. 2023. PMID: 36349867
-
Clumppling: cluster matching and permutation program with integer linear programming.Bioinformatics. 2024 Jan 2;40(1):btad751. doi: 10.1093/bioinformatics/btad751. Bioinformatics. 2024. PMID: 38096585 Free PMC article.
-
Clumpak: a program for identifying clustering modes and packaging population structure inferences across K.Mol Ecol Resour. 2015 Sep;15(5):1179-91. doi: 10.1111/1755-0998.12387. Epub 2015 Feb 27. Mol Ecol Resour. 2015. PMID: 25684545 Free PMC article.
-
Emergent unsupervised clustering paradigms with potential application to bioinformatics.Front Biosci. 2008 Jan 1;13:677-90. doi: 10.2741/2711. Front Biosci. 2008. PMID: 17981579 Review.
-
Comparison of algorithms to infer genetic population structure from unlinked molecular markers.Stat Appl Genet Mol Biol. 2014 Aug;13(4):391-402. doi: 10.1515/sagmb-2013-0006. Stat Appl Genet Mol Biol. 2014. PMID: 24964261 Review.
Cited by
-
Testing the consistency of connectivity patterns for a widely dispersing marine species.Heredity (Edinb). 2013 Oct;111(4):345-54. doi: 10.1038/hdy.2013.58. Epub 2013 Jul 3. Heredity (Edinb). 2013. PMID: 23820580 Free PMC article.
-
Applying genomic approaches to delineate conservation strategies using the freshwater mussel Margaritifera margaritifera in the Iberian Peninsula as a model.Sci Rep. 2022 Oct 7;12(1):16894. doi: 10.1038/s41598-022-20947-5. Sci Rep. 2022. PMID: 36207367 Free PMC article.
-
The genetic legacy of the pre-colonial period in contemporary Bolivians.PLoS One. 2013;8(3):e58980. doi: 10.1371/journal.pone.0058980. Epub 2013 Mar 20. PLoS One. 2013. PMID: 23527064 Free PMC article.
-
Genetic Characterization of Legionella pneumophila Isolated from a Common Watershed in Comunidad Valenciana, Spain.PLoS One. 2013 Apr 25;8(4):e61564. doi: 10.1371/journal.pone.0061564. Print 2013. PLoS One. 2013. PMID: 23634210 Free PMC article.
-
Population abundance in arctic grayling using genetics and close-kin mark-recapture.Ecol Evol. 2021 Apr 2;11(9):4763-4773. doi: 10.1002/ece3.7378. eCollection 2021 May. Ecol Evol. 2021. PMID: 33976846 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials