Considering transposable element diversification in de novo annotation approaches
- PMID: 21304975
- PMCID: PMC3031573
- DOI: 10.1371/journal.pone.0016526
Abstract
Transposable elements (TEs) are mobile, repetitive DNA sequences that are almost ubiquitous in prokaryotic and eukaryotic genomes. They have a large impact on genome structure, function and evolution. With the recent development of high-throughput sequencing methods, many genome sequences have become available, making possible comparative studies of TE dynamics at an unprecedented scale. Several methods have been proposed for the de novo identification of TEs in sequenced genomes. Most begin with the detection of genomic repeats, but the subsequent steps for defining TE families differ. High-quality TE annotations are available for the Drosophila melanogaster and Arabidopsis thaliana genome sequences, providing a solid basis for benchmarking such methods. We compared the performance of specific algorithms for the clustering of interspersed repeats and found that only a particular combination of algorithms detected TE families with good recovery of the reference sequences. We then applied a new procedure for reconciling the different clustering results and classifying TE sequences. The whole approach was implemented in a pipeline using the REPET package. Finally, we show that our combined approach highlights the dynamics of well-defined TE families by making it possible to identify structural variations among their copies. This approach makes it possible to annotate TE families and to study their diversification in a single analysis, improving our understanding of TE dynamics at the whole-genome scale and for diverse species.
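The abstract does not say how "recovery of the reference sequences" was measured. As a minimal sketch only, one plausible reading is the fraction of each reference TE that is covered by alignments to de novo consensus sequences. Everything in the example below is an assumption made for illustration and not the REPET implementation: the function names, the half-open interval convention, the 0.95 threshold, and the element names are all hypothetical.

```python
from typing import Dict, List, Tuple

# Half-open interval [start, end) on a reference TE sequence (assumed convention).
Interval = Tuple[int, int]


def covered_length(intervals: List[Interval], ref_length: int) -> int:
    """Return the number of reference bases covered by at least one interval."""
    # Clip every interval to the reference bounds and drop empty ones.
    clipped = sorted(
        (max(0, start), min(ref_length, end))
        for start, end in intervals
        if min(ref_length, end) > max(0, start)
    )
    total = 0
    cur_start, cur_end = None, None
    for start, end in clipped:
        if cur_end is None or start > cur_end:
            # Disjoint from the current run: close it and open a new one.
            if cur_end is not None:
                total += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            # Overlapping or adjacent: extend the current run.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start
    return total


def recovery(
    ref_lengths: Dict[str, int],
    hits: Dict[str, List[Interval]],
    min_fraction: float = 0.95,  # hypothetical threshold, not taken from the paper
) -> Tuple[Dict[str, float], List[str]]:
    """Compute per-reference coverage fractions and list the recovered references."""
    fractions = {
        name: covered_length(hits.get(name, []), length) / length
        for name, length in ref_lengths.items()
    }
    recovered = sorted(n for n, f in fractions.items() if f >= min_fraction)
    return fractions, recovered


if __name__ == "__main__":
    # Toy data: two hypothetical reference elements and their alignment intervals.
    ref_lengths = {"elementA": 1000, "elementB": 500}
    hits = {
        "elementA": [(0, 600), (500, 1000)],
        "elementB": [(0, 100), (50, 200)],
    }
    fractions, recovered = recovery(ref_lengths, hits)
    print(fractions)
    print(recovered)
```

Overlapping alignments are merged before counting so that a reference base matched by several consensus sequences is counted once. A stricter benchmark might instead require a single consensus to span the reference, which this sketch does not attempt.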