Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 24:10:1194.
doi: 10.12688/f1000research.74524.1. eCollection 2021.

An Atlas of Plant Transposable Elements

Affiliations

An Atlas of Plant Transposable Elements

Daniel Longhi Fernandes Pedro et al. F1000Res. .

Abstract

Advances in genomic sequencing have recently offered vast opportunities for biological exploration, unraveling the evolution and improving our understanding of Earth biodiversity. Due to distinct plant species characteristics in terms of genome size, ploidy and heterozygosity, transposable elements (TEs) are common characteristics of many genomes. TEs are ubiquitous and dispersed repetitive DNA sequences that frequently impact the evolution and composition of the genome, mainly due to their redundancy and rearrangements. For this study, we provided an atlas of TE data by employing an easy-to-use portal ( APTE website ). To our knowledge, this is the most extensive and standardized analysis of TEs in plant genomes. We evaluated 67 plant genomes assembled at chromosome scale, recovering a total of 49,802,023 TE records, representing a total of 47,992,091,043 (~47,62%) base pairs (bp) of the total genomic space. We observed that new types of TEs were identified and annotated compared to other data repositories. By establishing a standardized catalog of TE annotation on 67 genomes, new hypotheses, exploration of TE data and their influences on the genomes may allow a better understanding of their function and processes. All original code and an example of how we developed the TE annotation strategy is available on GitHub ( Extended data).

Keywords: atlas; genome-wide; large-scale; mobile elements; plants; standardized.

PubMed Disclaimer

Conflict of interest statement

No competing interests were disclosed.

Figures

Figure 1.
Figure 1.. Steps in transposable elements identification.
Dataset: Genome assemblies were downloaded from Ensembl Plants. Identification: 1A) RepeatScout was used to search for putative repetitive sequences and further classification by PASTEClassifier, resulting in a library. 1B) RepeatModeler was also used to find a consensus of TEs sequences. 2) RepeatMasker was run with Repbase library and libraries from RepeatModeler and RepeatScout. 3) For Class II - Subclass 2 TEs, we also used HelitronScanner and MITE-Hunter. 4) In order to find LTR and Non-LTR retrotransposons, we used LTR_retriever and MGEScan-non-LTR, respectively. Filter: A cut-off filter was applied to remove low complexities, simple repeats and other nomenclatures that were not classified into TEs. Annotation: In result of the pipeline, we have a Transposable Element annotation for each genome analyzed.
Figure 2.
Figure 2.. The TE Score: the average amount of sequence identification made by programs in all genomes.
Figure 3.
Figure 3.. Class, order and superfamilies identified among the 67 plant genomes used in this study.
Figure 4.
Figure 4.. Overview of Class I and Class II composition of TEs in each genome organized in a phylogenetic tree.
Figure 5.
Figure 5.. Correlation between genome size and TE content.
On the left, the bar chart in blue, the genome size (in Gb), and, in green, the transposable elements distribution in analyzed genomes (in percentage). On the right, we normalized, in base pair, genome size and TE using log(10) and then we correlated (Pearson) the genome size by transposable elements. r and p-value are shown in the top-left of each chart. A) Using all the 67 annotated genomes; B) For all genomes with recent WGD (Whole Genome Duplication) events, blue circles; C) Excluding genomes that experienced recent WGD, red circles.

References

    1. Michael TP, VanBuren R: Building near-complete plant genomes. Curr. Opin. Plant Biol. 2020;54:26–33. 10.1016/j.pbi.2019.12.009 - DOI - PubMed
    1. Silva SR, Moraes AP, Penha HA, et al. : The terrestrial carnivorous plant utricularia reniformis sheds light on environmental and life-form genome plasticity. Int. J. Mol. Sci. 2020;21(1):3. 10.3390/ijms21010003 - DOI - PMC - PubMed
    1. Lan T, Renner T, Ibarra-Laclette E, et al. : Long-read sequencing uncovers the adaptive topography of a carnivorous plant genome. Proc. Natl. Acad. Sci. 2017;114(22):E4435–E4441. 10.1073/pnas.1702072114 - DOI - PMC - PubMed
    1. Wicker T, Sabot F, Hua-Van A, et al. : A unified classification system for eukaryotic transposable elements. Nat. Rev. Genet. 2007;8:973–982. 10.1038/nrg2165 - DOI - PubMed
    1. Charles M, Belcram H, Just J, et al. : Dynamics and differential proliferation of transposable elements during the evolution of the B and A genomes of wheat. Genetics. 2008;180:1071–1086. 10.1534/genetics.108.092304 - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources