Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013;5(3):606-20.
doi: 10.1093/gbe/evs116.

Distinct mutational behaviors differentiate short tandem repeats from microsatellites in the human genome

Affiliations

Distinct mutational behaviors differentiate short tandem repeats from microsatellites in the human genome

Guruprasad Ananda et al. Genome Biol Evol. 2013.

Abstract

A tandem repeat's (TR) propensity to mutate increases with repeat number, and can become very pronounced beyond a critical boundary, transforming it into a microsatellite (MS). However, a clear understanding of the mutational behavior of different TR classes and motifs and related mechanisms is lacking, as is a consensus on the existence of a boundary separating short TRs (STRs) from MSs. This hinders our understanding of MSs' mutational properties and their effective use as genetic markers. Using indel calls for 179 individuals from 1000 Genomes Pilot-1 Project, we determined polymorphism incidence for four major TR classes, and formalized its varying relationship with repeat number using segmented regression. We observed a biphasic regime with a transition from a faster to a slower exponential growth at 9, 5, 4, and 4 repeats for mono-, di-, tri-, and tetranucleotide TRs, respectively. We used an in vitro mutagenesis assay to evaluate the contribution of strand slippage errors to mutability. STRs and MSs differ in their absolute polymorphism levels, but more importantly in their rates of mutability growth. Although strand slippage is a major factor driving mononucleotide polymorphism incidence, dinucleotide polymorphism incidence is greater than that expected due to strand slippage alone, indicating that additional cellular factors might be driving dinucleotide mutability in the human genome. Leveraging on hundreds of human genomes, we present the first comprehensive, genome-wide analysis of TR mutational behavior, encompassing several motif sizes and compositions.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
East Asian (JPTCHB), European (CEU), and African (YRI) populations: polymorphism incidence (PI) curves (proportion of polymorphic TRs by increasing repeat number), separately for mono-, di-, tri-, and tetranucleotide TRs—on the log-scale.
F<sc>ig</sc>. 2.—
Fig. 2.—
European (CEU) population: log of polymorphism incidence (see also fig. 1, black symbols) against repeat number, with fits from segmented regression (blue), for (A) mono-, (B) di-, (C) tri-, and (D) tetranucleotide TRs. Dotted vertical red lines show the location of the change points. Horizontal red lines represent 90% confidence intervals for change points. For mononucleotides, values at repeat number 2 and 3 were not included in the segmented regression fit due to convergence issue (see Materials and Methods for details).
F<sc>ig</sc>. 3.—
Fig. 3.—
European (CEU) population: Polymorphism incidence curves (see also fig. 1, black symbols) and experimentally based Pol EF values (as fractions, red), both on log scale, for (A) mono- and (B) dinucleotide TRs.
F<sc>ig</sc>. 4.—
Fig. 4.—
European (CEU) population: Polymorphism incidence curves (log scale) with bootstrap bands for (A) mononucleotide TRs and (B) dinucleotide TRs, separately for different motif composition. (C) Pol β unit-based indel error frequencies (log scale) with bootstrap bands, separately for different motif composition; data for GT and AC motifs are taken from (Kelkar et al. 2010). (D) Polymorphism incidence curves (log scale) with bootstrap bands for trinucleotide TRs, separately for different secondary structures.

References

    1. 1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. - PMC - PubMed
    1. Albers CA, et al. Dindel: accurate indel calls from short-read data. Genome Res. 2010;21:961–973. - PMC - PubMed
    1. Amos W, Hoffman JI. Evidence that two main bottleneck events shaped modern human genetic diversity. Proc R Soc B Biol Sci. 2009;277:131–137. - PMC - PubMed
    1. Ananda G, Chiaromonte F, Makova KD. A genome-wide view of mutation rate co-variation using multivariate analyses. Genome Biol. 2011;12:R27. - PMC - PubMed
    1. Bell GI, Jurka J. The length distribution of perfect dimer repetitive DNA is consistent with its evolution by an unbiased single-step mutation process. J Mol Evol. 1997;44:414–421. - PubMed

Publication types