Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012:8:301-19.
doi: 10.4137/EBO.S9758. Epub 2012 Jun 19.

Transposon-derived and satellite-derived repetitive sequences play distinct functional roles in Mammalian intron size expansion

Affiliations

Transposon-derived and satellite-derived repetitive sequences play distinct functional roles in Mammalian intron size expansion

Dapeng Wang et al. Evol Bioinform Online. 2012.

Abstract

Background: Repetitive sequences (RSs) are redundant, complex at times, and often lineage-specific, representing significant "building" materials for genes and genomes. According to their origins, sequence characteristics, and ways of propagation, repetitive sequences are divided into transposable elements (TEs) and satellite sequences (SSs) as well as related subfamilies and subgroups hierarchically. The combined changes attributable to the repetitive sequences alter gene and genome architectures, such as the expansion of exonic, intronic, and intergenic sequences, and most of them propagate in a seemingly random fashion and contribute very significantly to the entire mutation spectrum of mammalian genomes.

Principal findings: Our analysis is focused on evolutional features of TEs and SSs in the intronic sequence of twelve selected mammalian genomes. We divided them into four groups-primates, large mammals, rodents, and primary mammals-and used four non-mammalian vertebrate species as the out-group. After classifying intron size variation in an intron-centric way based on RS-dominance (TE-dominant or SS-dominant intron expansions), we observed several distinct profiles in intron length and positioning in different vertebrate lineages, such as retrotransposon-dominance in mammals and DNA transposon-dominance in the lower vertebrates, amphibians and fishes. The RS patterns of mouse and rat genes are most striking, which are not only distinct from those of other mammals but also different from that of the third rodent species analyzed in this study-guinea pig. Looking into the biological functions of relevant genes, we observed a two-dimensional divergence; in particular, genes that possess SS-dominant and/or RS-free introns are enriched in tissue-specific development and transcription regulation in all mammalian lineages. In addition, we found that the tendency of transposons in increasing intron size is much stronger than that of satellites, and the combined effect of both RSs is greater than either one of them alone in a simple arithmetic sum among the mammals and the opposite is found among the four non-mammalian vertebrates.

Conclusions: TE- and SS-derived RSs represent major mutational forces shaping the size and composition of vertebrate genes and genomes, and through natural selection they either fine-tune or facilitate changes in size expansion, position variation, and duplication, and thus in functions and evolutionary paths for better survival and fitness. When analyzed globally, not only are such changes significantly diversified but also comprehensible in lineages and biological implications.

Keywords: intron size; mammalian genomes; satellite sequences; transposable elements.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Percentage of introns with retrotransposons, DNA transposons, and satellites. Notes: The fractions of introns with repeats are displayed over intron length intervals. The ten intervals of intron lengths are defined as: 1, (50–150); 2, (151–300); 3, (301–600); 4, (601–1000); 5, (1001–1400); 6, (1401–2000); 7, (2001–3000); 8, (3001–5000); 9, (5001–10000); and 10, (10001+).
Figure 2
Figure 2
Percentage of introns with selected repeat families. Note: The intron length intervals are defined in the same way as what in Figure 1.
Figure 3
Figure 3
Percentage of the numbers of the four intron classes. Note: TS, T, S, and N stand for introns with TE and SS, TE only, SS only, and without any of the two basic types, respectively.
Figure 4
Figure 4
Length comparison of the four intron classes. Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 5
Figure 5
Position index comparisons for the four intron classes. Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 6
Figure 6
Length comparisons of the four TE-containing intron classes. Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 7
Figure 7
Percentage of genes in four classes. Note: TS, T, S, and N denote genes with TE and SS, TE only, SS only, and with none of the two repeat types, respectively.
Figure 8
Figure 8
Topological trees constructed based on TE (A) and SS (B). Note: A detailed procedure is described in Methods.

Similar articles

Cited by

References

    1. Toth G, Gaspari Z, Jurka J. Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res. 2000 Jul;10(7):967–81. - PMC - PubMed
    1. Kidwell MG. Transposable elements and the evolution of genome size in eukaryotes. Genetica. 2002 May;115(1):49–63. - PubMed
    1. Hua-Van A, Le Rouzic A, Boutin TS, Filee J, Capy P. The struggle for life of the genome’s selfish architects. Biol Direct. 2011;6:19. - PMC - PubMed
    1. Slotkin RK, Martienssen R. Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet. 2007 Apr;8(4):272–85. - PubMed
    1. Simons C, Pheasant M, Makunin IV, Mattick JS. Transposon-free regions in mammalian genomes. Genome Res. 2006 Feb;16(2):164–72. - PMC - PubMed

LinkOut - more resources