GENOMEPOP: a program to simulate genomes in populations
- PMID: 18447924
- PMCID: PMC2386491
- DOI: 10.1186/1471-2105-9-223
Abstract
Background: Simulating DNA sequences is useful in several areas of population biology research. Biological populations can be simulated under different evolutionary genetic models using either backward or forward strategies. Backward simulations, also called coalescent-based simulations, are computationally efficient because they track only the history of lineages with surviving offspring in the current population. In contrast, forward simulations are less efficient because the entire population is simulated from past to present. However, the coalescent framework imposes some limitations that forward simulation does not. Hence, there is increasing interest in forward population genetic simulation, and efficient new tools have been developed recently. Software tools that allow efficient simulation of large DNA fragments under complex evolutionary models will be very helpful for understanding the trace left on DNA by the different interacting evolutionary forces. Here I introduce GenomePop, a forward simulation program that fulfills these requirements. The use of the program is demonstrated by studying the impact of intracodon recombination on global and site-specific dN/dS estimation.
Results: I have developed algorithms and written software to efficiently simulate, forward in time, different Markovian nucleotide or codon models of DNA mutation. Such models can be combined with recombination at the inter- and intra-codon levels, with fitness-based selection, and with complex demographic scenarios.
Conclusion: GenomePop has many useful characteristics for simulating SNPs or DNA sequences under complex evolutionary and demographic models. Several features make it unique with respect to other simulation tools: forward simulation under the General Time Reversible (GTR) mutation model or GTRxMG94 codon models with intra-codon recombination; arbitrary, user-defined migration patterns; diploid or haploid models; and constant or variable population sizes, among others. It also allows simulation of fitness-based selection under different distributions of mutational effects. Under the 2-allele model, it allows the simulation of recombination hot-spots and the definition of different allele frequencies in different populations, among other options. GenomePop can also manage large DNA fragments. In addition, it has a scaling option to save computation time when simulating large sequences and population sizes under complex demographic and evolutionary situations. These and many other features are detailed on its web page [1].
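To make the forward strategy described in the abstract concrete, the following is a minimal sketch of a forward-in-time simulation, not GenomePop's actual algorithm. It assumes a haploid population of constant size, no recombination or migration, a Jukes-Cantor-style mutation step (the simplest special case of a GTR model), and a hypothetical multiplicative fitness function. All function and parameter names (`simulate_forward`, `mutate`, `fitness`, `mu`, `s`) are illustrative and do not come from the program.

```python
import random

NUCLEOTIDES = "ACGT"

def mutate(seq, mu, rng):
    """One Markov mutation step: each site changes with probability mu
    to one of the three other bases, chosen uniformly (Jukes-Cantor-like)."""
    out = []
    for base in seq:
        if rng.random() < mu:
            out.append(rng.choice([b for b in NUCLEOTIDES if b != base]))
        else:
            out.append(base)
    return "".join(out)

def fitness(seq, ancestor, s):
    """Hypothetical fitness: each difference from the ancestral sequence
    multiplies fitness by (1 - s). Use s < 1 so weights stay positive."""
    diffs = sum(a != b for a, b in zip(seq, ancestor))
    return (1.0 - s) ** diffs

def simulate_forward(pop_size, seq_len, generations, mu, s, seed=0):
    """Simulate the whole population from past to present.

    Every generation, parents are sampled with replacement in proportion
    to fitness (Wright-Fisher style), then offspring sequences mutate.
    """
    rng = random.Random(seed)
    ancestor = "".join(rng.choice(NUCLEOTIDES) for _ in range(seq_len))
    population = [ancestor] * pop_size
    for _ in range(generations):
        weights = [fitness(ind, ancestor, s) for ind in population]
        parents = rng.choices(population, weights=weights, k=pop_size)
        population = [mutate(p, mu, rng) for p in parents]
    return ancestor, population

if __name__ == "__main__":
    ancestor, pop = simulate_forward(
        pop_size=50, seq_len=60, generations=100, mu=1e-3, s=0.02, seed=42
    )
    print("ancestor:", ancestor)
    print("distinct haplotypes in final generation:", len(set(pop)))
```

Because every individual is carried through every generation, the cost grows with population size, sequence length, and number of generations. This is the efficiency trade-off relative to coalescent simulation noted in the Background, and it is the kind of cost that a scaling option is meant to reduce.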
Similar articles
- Recodon: coalescent simulation of coding DNA sequences with recombination, migration and demography. BMC Bioinformatics. 2007 Nov 20;8:458. doi: 10.1186/1471-2105-8-458. PMID: 18028540 Free PMC article.
- Forward-time simulations of human populations with complex diseases. PLoS Genet. 2007 Mar 23;3(3):e47. doi: 10.1371/journal.pgen.0030047. Epub 2007 Feb 15. PMID: 17381243 Free PMC article.
- Critical assessment of coalescent simulators in modeling recombination hotspots in genomic sequences. BMC Bioinformatics. 2014 Jan 3;15:3. doi: 10.1186/1471-2105-15-3. PMID: 24387001 Free PMC article.
- An overview of population genetic data simulation. J Comput Biol. 2012 Jan;19(1):42-54. doi: 10.1089/cmb.2010.0188. Epub 2011 Dec 9. PMID: 22149682 Free PMC article. Review.
- Advancements and prospects in reconstructing the genetic genealogies of ancient and modern human populations using ancestral recombination graphs. Yi Chuan. 2024 Oct;46(10):849-859. doi: 10.16288/j.yczz.24-150. PMID: 39443313 Review.
Cited by
- Efficient simulation of epistatic interactions in case-parent trios. Hum Hered. 2013;75(1):12-22. doi: 10.1159/000348789. Epub 2013 Mar 27. PMID: 23548797 Free PMC article.
- Forward-time simulation of realistic samples for genome-wide association studies. BMC Bioinformatics. 2010 Sep 1;11:442. doi: 10.1186/1471-2105-11-442. PMID: 20809983 Free PMC article.
- A survey of genetic simulation software for population and epidemiological studies. Hum Genomics. 2008 Sep;3(1):79-86. doi: 10.1186/1479-7364-3-1-79. PMID: 19129092 Free PMC article. Review.
- Boosting forward-time population genetic simulators through genotype compression. BMC Bioinformatics. 2013 Jun 14;14:192. doi: 10.1186/1471-2105-14-192. PMID: 23763838 Free PMC article.
- Simulating within host human immunodeficiency virus 1 genome evolution in the persistent reservoir. Virus Evol. 2020 Nov 23;6(2):veaa089. doi: 10.1093/ve/veaa089. eCollection 2020 Jul. PMID: 34040795 Free PMC article.
References
- Carvajal-Rodríguez A. GenomePop: software to simulate the evolution of genomes and populations. http://webs.uvigo.es/acraaj/GenomePop.htm
- Caballero A, Cusi E, Garcia C, Garcia-Dorado A. Accumulation of deleterious mutations: Additional Drosophila melanogaster estimates and a simulation of the effects of selection. Evolution. 2002;56:1150–1159.
- Carvajal-Rodriguez A, Rolan-Alvarez E, Caballero A. Quantitative variation as a tool for detecting human-induced impacts on genetic diversity. Biological Conservation. 2005;124:1–13. doi: 10.1016/j.biocon.2004.12.008.