ParallelStructure: a R package to distribute parallel runs of the population genetics program STRUCTURE on multi-core computers
- PMID: 23923012
- PMCID: PMC3726640
- DOI: 10.1371/journal.pone.0070651
ParallelStructure: a R package to distribute parallel runs of the population genetics program STRUCTURE on multi-core computers
Abstract
This software package provides an R-based framework to make use of multi-core computers when running analyses in the population genetics program STRUCTURE. It is especially addressed to those users of STRUCTURE dealing with numerous and repeated data analyses, and who could take advantage of an efficient script to automatically distribute STRUCTURE jobs among multiple processors. It also consists of additional functions to divide analyses among combinations of populations within a single data set without the need to manually produce multiple projects, as it is currently the case in STRUCTURE. The package consists of two main functions: MPI_structure() and parallel_structure() as well as an example data file. We compared the performance in computing time for this example data on two computer architectures and showed that the use of the present functions can result in several-fold improvements in terms of computation time. ParallelStructure is freely available at https://r-forge.r-project.org/projects/parallstructure/.
Conflict of interest statement
Figures

References
-
- Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK et al. (2002) Genetic structure of human populations. Science 298: 2381-2385. doi:10.1126/science.1078311. PubMed: 12493913. - DOI - PubMed
-
- Harter AV, Gardner KA, Lentz Falush D, Bye RA et al. (2004) Origin of extant domesticated sunflowers in eastern North America. Nature 430: 201-205. doi:10.1038/nature02710. PubMed: 15241413. - DOI - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases