Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 17;21(1):563.
doi: 10.1186/s12864-020-06949-5.

Relatively semi-conservative replication and a folded slippage model for short tandem repeats

Affiliations

Relatively semi-conservative replication and a folded slippage model for short tandem repeats

Hongxi Zhang et al. BMC Genomics. .

Abstract

Background: The ubiquitous presence of short tandem repeats (STRs) in virtually all genomes implicates their functional relevance, while a widely-accepted definition of STR is yet to be established. Previous studies majorly focus on relatively longer STRs, while shorter repeats were generally excluded. Herein, we have adopted a more generous criteria to define shorter repeats, which has led to the definition of a much larger number of STRs that lack prior analysis. Using this definition, we analyzed the short repeats in 55 randomly selected segments in 55 randomly selected genomic sequences from a fairly wide range of species covering animals, plants, fungi, protozoa, bacteria, archaea and viruses.

Results: Our analysis reveals a high percentage of short repeats in all 55 randomly selected segments, indicating that the universal presence of high-content short repeats could be a common characteristic of genomes across all biological kingdoms. Therefore, it is reasonable to assume a mechanism for continuous production of repeats that can make the replicating process relatively semi-conservative. We have proposed a folded replication slippage model that considers the geometric space of nucleotides and hydrogen bond stability to explain the mechanism more explicitly, with improving the existing straight-line slippage model. The folded slippage model can explain the expansion and contraction of mono- to hexa- nucleotide repeats with proper folding angles. Analysis of external forces in the folding template strands also suggests that expansion exists more commonly than contraction in the short tandem repeats.

Conclusion: The folded replication slippage model provides a reasonable explanation for the continuous occurrences of simple sequence repeats in genomes. This model also contributes to the explanation of STR-to-genome evolution and is an alternative model that complements semi-conservative replication.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
A high percentage of STRs in genomes and genomes probably tend to produce repeats. a STR percentages of 55 randomly-selected reported segments and the control group, which were the sequences generated with the same nucleotide numbers and components as those of the 55 selected reported segments but the random nucleotide orders by a program written in C language. b Contradiction analysis of disappearance and high percentage of STRs in the genomes
Fig. 2
Fig. 2
Straight strand models of semi-conservative replication and slippage. a The space of a nucleotide was drawn. * indicates that those number is the theoretical values (top); The stable straight model of semi-conservative replication (middle); The comparison of hydrogen bond and 3′-5′ phosphodiester bonds (bottom) [–57]. # indicates the strength ratio was calculated by the strength of hydrogen bond dividing that of phosphodiester bond. b The impossible straight slippage models of mononucleotide, dinucleotide and trinucleotide repeats according to the strict geometric calculation of the space of a nucleotide and the stability of hydrogen and phosphodiester bonds
Fig. 3
Fig. 3
The DNA chain is highly curved or folded in the nucleus and the impossible curved slippage model. a Schematic diagram of the size of the nuclear space (top) [61]; The normal replicating enzymes complex straighten the DNA chain, while the disturbed replicating enzymes complex may cause the DNA molecule return to curved state (bottom). b Impossible curved template slippage model according to the strict geometric calculation of the space of a nucleotide and the stability of hydrogen and phosphodiester bonds (top); Mono- and dinucleotide repeats may be impossibly produced in curved replicating strands (middle and bottom)
Fig. 4
Fig. 4
Stable folded slippage models of mononucleotide to hexanucleotide repeats amplification according to the strict geometric calculation of the space of a nucleotide and the stability of hydrogen and phosphodiester bonds. Repeat units tend to be expanded in the replicating strands when the template strands are on the inner side of the folded slippage models respectively. The bottom 3 sub-figures were the folded slippage models in three-dimensional helix form
Fig. 5
Fig. 5
Stable folded slippage models of mononucleotide to hexanucleotide repeats contraction according to the strict geometric calculation of the space of a nucleotide and the stability of hydrogen and phosphodiester bonds. Repeat units tend to be subtracted in the replicating strands when the template strands are on the outside of the folded slippage models respectively. The bottom 3 sub-figures were the folded slippage models in three-dimensional helix form
Fig. 6
Fig. 6
Repeat production incline to expansion. Fo, Fi refer to the force required for the two template strands to bend, respectively. Fo > Fi means that the force of the template strand bending downward is greater than the bending upward, and Pe > Pc means that the possibility of the template strand bending upward is greater than the downward bending

References

    1. Mandal R, Samstein RM, Lee KW, Havel JJ, Wang H, Krishna C, Sabio EY, Makarov V, Kuo FS, Blecua P, et al. CANCER genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science. 2019;364(6439):485–491. - PMC - PubMed
    1. Zhao X, Tian Y, Yang R, Feng H, Ouyang Q, Tian Y, Tan Z, Li M, Niu Y, Jiang J. Coevolution between simple sequence repeats (SSRs) and virus genome size. BMC Genomics. 2012;13(1):435. - PMC - PubMed
    1. Chen M, Tan Z, Zeng G, Peng J. Comprehensive analysis of simple sequence repeats in pre-miRNAs. Mol Biol Evol. 2010;27(10):2227–2232. - PubMed
    1. Vinces MD, Legendre M, Caldara M, Hagihara M, Verstrepen KJ. Unstable tandem repeats in promoters confer transcriptional evolvability. Science. 2009;324(5931):1213–1236. - PMC - PubMed
    1. Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 2004;5(6):435–445. - PubMed

LinkOut - more resources