Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2013 Jan 31:14:71.
doi: 10.1186/1471-2164-14-71.

Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes

Affiliations
Comparative Study

Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes

Yujun Han et al. BMC Genomics. .

Abstract

Background: Class 2 transposable elements (TEs) are the predominant elements in and around plant genes where they generate significant allelic diversity. Using the complete sequences of four grasses, we have performed a novel comparative analysis of class 2 TEs. To ensure consistent comparative analyses, we re-annotated class 2 TEs in Brachypodium distachyon, Oryza sativa (rice), Sorghum bicolor and Zea mays and assigned them to one of the five cut-and-paste superfamilies found in plant genomes (Tc1/mariner, PIF/Harbinger, hAT, Mutator, CACTA). We have focused on noncoding elements because of their abundance, and compared superfamily copy number, size and genomic distribution as well as correlation with the level of nearby gene expression.

Results: Our comparison revealed both unique and conserved features. First, the average length or size distribution of elements in each superfamily is largely conserved, with the shortest always being Tc1/mariner elements, followed by PIF/Harbinger, hAT, Mutator and CACTA. This order also holds for the ratio of the copy numbers of noncoding to coding elements. Second, with the exception of CACTAs, noncoding TEs are enriched within and flanking genes, where they display conserved distribution patterns, having the highest peak in the promoter region. Finally, our analysis of microarray data revealed that genes associated with Tc1/mariner and PIF/Harbinger noncoding elements have significantly higher expression levels than genes without class 2 TEs. In contrast, genes with CACTA elements have significantly lower expression than genes without class 2 TEs.

Conclusions: We have achieved the most comprehensive annotation of class 2 TEs to date in these four grass genomes. Comparative analysis of this robust dataset led to the identification of several previously unknown features of each superfamily related to copy number, element size, genomic distribution and correlation with the expression levels of nearby genes. These results highlight the importance of distinguishing TE superfamilies when assessing their impact on gene and genome evolution.

PubMed Disclaimer

Figures

Figure 1
Figure 1
TE annotation results in four grass genomes. Each superfamily is represented by a specific color (CACTA: blue, hAT: dark orange, Mutator: green, PIF/Harbinger: purple and Tc1/mariner: red). A) The number of transposase genes identified by TARGeT. B) Copy number of nTEs. A copy that has only one end was counted as 0.5. C) Total length of nTEs. D) Percentages of nTEs in the genomes.
Figure 2
Figure 2
The double-ended copy number and length of the consensus nTEs of Tc1/mariner and PIF/Harbinger in Brachypodium, rice, sorghum and maize. The y-axis is in log10 scale.
Figure 3
Figure 3
The nTE distribution in four grass genomes. A) The length percentage of nTEs in the whole genome, 1kb 5 gene flanking regions (5F), and 1kb 3 gene flanking regions (3F) of Brachypodium, rice, sorghum and maize. B) High resolution distribution of nTE frequency around the genes of Brachypodium, rice, sorghum and maize. The highest peaks of nTE frequency are marked by vertical gray lines and the distance from the peak to the TSS (transcription start site) or TTS (transcription stop site) is shown above the lines. Colors of superfamilies are the same as in Figure  1.
Figure 4
Figure 4
Comparison between the expression levels of genes with class 2 TEs and without class 2 TEs, in rice root (A) and stigma (B), maize ear (C) and leaf (D). The y-axis is log10 gene expression values. P-values of pairwise comparison are shown in shadow areas. Colors of superfamilies are the same as in Figure  1.

References

    1. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA. et al.The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326(5956):1112–1115. - PubMed
    1. Nene V, Wortman JR, Lawson D, Haas B, Kodira C, Tu ZJ, Loftus B, Xi Z, Megy K, Grabherr M. et al.Genome sequence of Aedes aegypti, a major arbovirus vector. Science. 2007;316(5832):1718–1723. - PMC - PubMed
    1. Bennetzen JL. Transposable element contributions to plant gene and genome evolution. Plant Mol Biol. 2000;42(1):251–269. - PubMed
    1. Yuan YW, Wessler SR. The catalytic domain of all eukaryotic cut-and-paste transposase superfamilies. Proc Natl Acad Sci USA. 2011;108(19):7884–7889. - PMC - PubMed
    1. Feschotte C, Pritham EJ. DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet. 2007;41:331–368. - PMC - PubMed

Publication types

Substances