PLoS One. 2014 May 2;9(5):e91929.
doi: 10.1371/journal.pone.0091929. eCollection 2014.

PASTEC: an automatic transposable element classification tool

Claire Hoede et al. PLoS One. 2014.

Abstract

Summary: The classification of transposable elements (TEs) is a key step towards deciphering their potential impact on the genome. However, this process is often based on manual sequence inspection by TE experts. With the wealth of genomic sequences now available, this task requires automation to make it accessible to most scientists. We propose a new tool, PASTEC, which classifies TEs by searching for structural features and similarities. This tool outperforms currently available software for TE classification. The main innovation of PASTEC is the search for HMM profiles, which is useful for inferring the classification of unknown TEs on the basis of conserved functional domains of the proteins. In addition, PASTEC is the only tool providing an exhaustive spectrum of possible classifications to the order level of the Wicker hierarchical TE classification system. It can also automatically classify other repeated elements, such as SSRs (Simple Sequence Repeats), rDNA or potential repeated host genes. Finally, the output of this new tool is designed to facilitate manual curation by providing biologists with all the evidence accumulated for each TE consensus.

Availability: PASTEC is available as a REPET module or standalone software (http://urgi.versailles.inra.fr/download/repet/REPET_linux-x64-2.2.tar.gz). It requires a Unix-like system. There are two standalone versions: a parallelized one, which requires Sun Grid Engine or Torque, and a non-parallelized one.

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1. Agents implemented in the system.
Orange agents are retriever agents; blue agents are classifier and filter agents. The super-agent is shown in green. The arrows indicate the principal communications between the different agents, with only requests shown.

References

    1. Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, et al. (2007) A unified classification system for eukaryotic transposable elements. Nature Rev. Genet. 8: 973–982.
    2. Permal E, Flutre T, Quesneville H (2012) Roadmap for annotating transposable elements in eukaryote genomes. Methods Mol. Biol. 859: 53–68.
    3. Bergman CM, Quesneville H (2007) Discovering and detecting transposable elements in genome sequences. Brief. Bioinform. 8: 382–392.
    4. Abrusán G, Grundmann N, DeMeester L, Makalowski W (2009) TEclass: a tool for automated classification of unknown eukaryotic transposable elements. Bioinformatics 25: 1329–1330.
    5. Feschotte C, Keswani U, Ranganathan N, Guibotsy ML, Levine D (2009) Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes. Genome Biol. Evol. 1: 205–220.
