Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Nov;177(3):1725-31.
doi: 10.1534/genetics.106.069088. Epub 2007 Oct 18.

Sequence-level population simulations over large genomic regions

Affiliations

Sequence-level population simulations over large genomic regions

Clive J Hoggart et al. Genetics. 2007 Nov.

Abstract

Simulation is an invaluable tool for investigating the effects of various population genetics modeling assumptions on resulting patterns of genetic diversity, and for assessing the performance of statistical techniques, for example those designed to detect and measure the genomic effects of selection. It is also used to investigate the effectiveness of various design options for genetic association studies. Backward-in-time simulation methods are computationally efficient and have become widely used since their introduction in the 1980s. The forward-in-time approach has substantial advantages in terms of accuracy and modeling flexibility, but at greater computational cost. We have developed flexible and efficient simulation software and a rescaling technique to aid computational efficiency that together allow the simulation of sequence-level data over large genomic regions in entire diploid populations under various scenarios for demography, mutation, selection, and recombination, the latter including hotspots and gene conversion. Our forward evolution of genomic regions (FREGENE) software is freely available from www.ebi.ac.uk/projects/BARGEN together with an ancillary program to generate phenotype labels, either binary or quantitative. In this article we discuss limitations of coalescent-based simulation, introduce the rescaling technique that makes large-scale forward-in-time simulation feasible, and demonstrate the utility of various features of FREGENE, many not previously available.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Trajectories of selected variants that reached fixation in simulation studies (Table 1). Red curves correspond to negatively selected alleles (s < 0).
F<sc>igure</sc> 2.—
Figure 2.—
Per-site homozygosity over generations for simulations described in Table 1. The dashed horizontal line shows the theoretical equilibrium value for the neutral simulations. Vertical dotted lines indicate the generations at which, in the scaled directional simulation, selected sites with s > 0.05 went to fixation.
F<sc>igure</sc> 3.—
Figure 3.—
Tajima's D in 50-kb windows after 300,000 generations (top) and 150,000 generations (bottom). The vertical dotted line (top) indicates the location of the unique site under positive selection that went to fixation within the final 3000 generations. The vertical dashed lines (bottom) indicate selected sites that were polymorphic throughout the preceding 10,000 generations.

References

    1. Carlson, C., D. Thomas, M. Eberle, J. Swanson, R. Livingston et al., 2005. Genomic regions exhibiting positive selection identified from dense genotype data. Genome Res. 15: 1553–1565. - PMC - PubMed
    1. Carvajal-Rodriguez, A., K. A. Crandall and D. Posada, 2006. Recombination estimation under complex evolutionary models with the coalescent composite-likelihood method. Mol. Biol. Evol. 23: 817–827. - PMC - PubMed
    1. Charlesworth, D., 2006. Balancing selection and its effects on sequences in nearby genome regions. PLoS Genet. 2: e64. - PMC - PubMed
    1. Crawford, D. C., T. Bhangale, N. Li, G. Hellenthal, M. J. Rieder et al., 2004. Evidence for substantial fine-scale variation in recombination rates across the human genome. Nat. Genet. 36: 700–706. - PubMed
    1. Ewens, W., 2004. Mathematical Population Genetics: I. Theoretical Introduction. Springer-Verlag, New York.

Publication types