RelocaTE2: a high resolution transposable element insertion site mapping tool for population resequencing
- PMID: 28149701
- PMCID: PMC5274521
- DOI: 10.7717/peerj.2942
RelocaTE2: a high resolution transposable element insertion site mapping tool for population resequencing
Abstract
Background: Transposable element (TE) polymorphisms are important components of population genetic variation. The functional impacts of TEs in gene regulation and generating genetic diversity have been observed in multiple species, but the frequency and magnitude of TE variation is under appreciated. Inexpensive and deep sequencing technology has made it affordable to apply population genetic methods to whole genomes with methods that identify single nucleotide and insertion/deletion polymorphisms. However, identifying TE polymorphisms, particularly transposition events or non-reference insertion sites can be challenging due to the repetitive nature of these sequences, which hamper both the sensitivity and specificity of analysis tools.
Methods: We have developed the tool RelocaTE2 for identification of TE insertion sites at high sensitivity and specificity. RelocaTE2 searches for known TE sequences in whole genome sequencing reads from second generation sequencing platforms such as Illumina. These sequence reads are used as seeds to pinpoint chromosome locations where TEs have transposed. RelocaTE2 detects target site duplication (TSD) of TE insertions allowing it to report TE polymorphism loci with single base pair precision.
Results and discussion: The performance of RelocaTE2 is evaluated using both simulated and real sequence data. RelocaTE2 demonstrate high level of sensitivity and specificity, particularly when the sequence coverage is not shallow. In comparison to other tools tested, RelocaTE2 achieves the best balance between sensitivity and specificity. In particular, RelocaTE2 performs best in prediction of TSDs for TE insertions. Even in highly repetitive regions, such as those tested on rice chromosome 4, RelocaTE2 is able to report up to 95% of simulated TE insertions with less than 0.1% false positive rate using 10-fold genome coverage resequencing data. RelocaTE2 provides a robust solution to identify TE insertion sites and can be incorporated into analysis workflows in support of describing the complete genotype from light coverage genome sequencing.
Keywords: Annotation; Bioinformatics; Diversity; Parallel processing; Population genomics; Resequencing; Rice; Short read; Transposons.
Conflict of interest statement
The authors declare there are no competing interests.
Figures



Similar articles
-
Transposable element finder (TEF): finding active transposable elements from next generation sequencing data.BMC Bioinformatics. 2022 Nov 22;23(1):500. doi: 10.1186/s12859-022-05011-3. BMC Bioinformatics. 2022. PMID: 36418944 Free PMC article.
-
Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.Mob DNA. 2023 Jul 14;14(1):8. doi: 10.1186/s13100-023-00296-4. Mob DNA. 2023. PMID: 37452430 Free PMC article.
-
Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.bioRxiv [Preprint]. 2023 Mar 21:2023.02.13.528343. doi: 10.1101/2023.02.13.528343. bioRxiv. 2023. Update in: Mob DNA. 2023 Jul 14;14(1):8. doi: 10.1186/s13100-023-00296-4. PMID: 36824955 Free PMC article. Updated. Preprint.
-
Identification and Genotyping of Transposable Element Insertions From Genome Sequencing Data.Curr Protoc Hum Genet. 2020 Sep;107(1):e102. doi: 10.1002/cphg.102. Curr Protoc Hum Genet. 2020. PMID: 32662945 Free PMC article. Review.
-
Use of retrotransposon-derived genetic markers to analyse genomic variability in plants.Funct Plant Biol. 2018 Jan;46(1):15-29. doi: 10.1071/FP18098. Funct Plant Biol. 2018. PMID: 30939255 Review.
Cited by
-
Asexual Experimental Evolution of Yeast Does Not Curtail Transposable Elements.Mol Biol Evol. 2021 Jun 25;38(7):2831-2842. doi: 10.1093/molbev/msab073. Mol Biol Evol. 2021. PMID: 33720342 Free PMC article.
-
Exploring transposable element-based markers to identify allelic variations underlying agronomic traits in rice.Plant Commun. 2022 May 9;3(3):100270. doi: 10.1016/j.xplc.2021.100270. Epub 2021 Dec 20. Plant Commun. 2022. PMID: 35576152 Free PMC article.
-
Finding and Characterizing Repeats in Plant Genomes.Methods Mol Biol. 2022;2443:327-385. doi: 10.1007/978-1-0716-2067-0_18. Methods Mol Biol. 2022. PMID: 35037215
-
InMut-finder: a software tool for insertion identification in mutagenesis using Nanopore long reads.BMC Genomics. 2021 Dec 19;22(1):908. doi: 10.1186/s12864-021-08206-9. BMC Genomics. 2021. PMID: 34923956 Free PMC article.
-
Targeted identification of TE insertions in a Drosophila genome through hemi-specific PCR.Mob DNA. 2017 Jul 28;8:10. doi: 10.1186/s13100-017-0092-1. eCollection 2017. Mob DNA. 2017. PMID: 28775768 Free PMC article.
References
-
- Campbell PJ, Stephens PJ, Pleasance ED, O’Meara S, Li H, Santarius T, Stebbings LA, Leroy C, Edkins S, Hardy C, Teague JW, Menzies A, Goodhead I, Turner DJ, Clee CM, Quail MA, Cox A, Brown C, Durbin R, Hurles ME, Edwards PA, Bignell GR, Stratton MR, Futreal PA. Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nature Genetics. 2008;40:722–729. doi: 10.1038/ng.128. - DOI - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources