Structure_threader: An improved method for automation and parallelization of programs structure, fastStructure and MavericK on multicore CPU systems
- PMID: 28776963
- DOI: 10.1111/1755-0998.12702
Abstract
Structure_threader is a program that parallelizes multiple runs of genetic clustering software that does not itself use multithreading (structure, fastStructure and MavericK) on multicore computers. Our approach was benchmarked across multiple systems and showed large speed improvements relative to the single-threaded implementation, scaling very close to linearly with the number of physical cores used. Structure_threader was compared to previous software written for the same task, ParallelStructure and StrAuto, and proved to be the faster wrapper (up to 25% faster) under all tested scenarios. Furthermore, Structure_threader performs several convenient operations automatically: it assists the user in assessing the most biologically likely value of 'K' via implementations of the "Evanno" and "Thermodynamic Integration" tests, and it draws the "meanQ" plots (static or interactive) for each value of K (or even combined plots). Structure_threader is written in Python 3 and licensed under the GPLv3. It can be downloaded free of charge at https://github.com/StuntsPT/Structure_threader.
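The general idea of running many independent single-threaded jobs side by side can be illustrated with a minimal Python 3 sketch. This is not Structure_threader's actual code: the function names (`build_jobs`, `make_command`, `run_one`, `run_all`) are hypothetical, and the command is a placeholder that only prints its arguments, standing in for a real call to a clustering program. The sketch assumes that each (K, replicate) run is independent of the others, so runs can be handed to a fixed-size pool of workers.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def build_jobs(k_values, replicates):
    """Return one (K, replicate) pair per independent run."""
    return list(product(k_values, range(1, replicates + 1)))


def make_command(k, rep):
    """Build the command line for one run.

    Placeholder (assumption): a tiny Python one-liner stands in for the
    real clustering program, whose command line is not reproduced here.
    """
    script = f"print('K={k} rep={rep}')"
    return [sys.executable, "-c", script]


def run_one(job):
    """Launch one single-threaded child process and wait for it to finish."""
    k, rep = job
    result = subprocess.run(
        make_command(k, rep), capture_output=True, text=True, check=True
    )
    return k, rep, result.stdout.strip()


def run_all(jobs, threads):
    """Run all jobs with at most `threads` child processes alive at once.

    Each worker thread simply blocks on its child process, so the heavy
    computation happens in the children, one per worker.
    """
    with ThreadPoolExecutor(max_workers=threads) as pool:
        # pool.map returns results in the same order as `jobs`.
        return list(pool.map(run_one, jobs))


if __name__ == "__main__":
    jobs = build_jobs(k_values=range(1, 4), replicates=2)
    for k, rep, output in run_all(jobs, threads=2):
        print(k, rep, output)
```

In a sketch like this, setting `threads` to the number of physical cores would be the natural choice, given that the benchmarks above report near-linear scaling with physical cores.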
Keywords: bioinformatics/phyloinformatics; clustering; empirical; genomics/proteomics; molecular evolution; parallel computing; population genetics.
© 2017 John Wiley & Sons Ltd.