Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;38(Web Server issue):W7-13.
doi: 10.1093/nar/gkq291. Epub 2010 Apr 30.

TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations

Affiliations

TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations

Federico Abascal et al. Nucleic Acids Res. 2010 Jul.

Abstract

We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Example illustrating the different performance of the direct and back-translated nucleotide alignments (multiple alignments were built with Muscle with default parameters).
Figure 2.
Figure 2.
Screen capture of a fragment of the results of TranslatorX. The nucleotide back-translated alignment and the corresponding amino acid alignment are shown with Jalview.
Figure 3.
Figure 3.
Comparison of the phylogenetic trees inferred from the sub-alignments that comprise positions whose alignment differed between the back-translated and direct Mafft (A, B) and Muscle (C, D) alignments.

References

    1. Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol. 2007;56:564–577. - PubMed
    1. Yang Z. Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites. Mol. Biol. Evol. 1993;10:1396–1401. - PubMed
    1. Bininda-Emonds OR. transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences. BMC Bioinformatics. 2005;6:156. - PMC - PubMed
    1. Schuerer K, Letondal C. 2003. [(26 April, 2010, date last accessed)]. PROTAL2DNA: align DNA sequences given the corresponding protein alignment. http://www.pasteur.fr/recherche/unites/sis/formation/bioperl/lecture_cod....
    1. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. - PubMed