Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Jul 1;31(13):3537-9.
doi: 10.1093/nar/gkg609.

RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences

Affiliations

RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences

Rasmus Wernersson et al. Nucleic Acids Res. .

Abstract

The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit from the information that is implicit in empirical substitution matrices such as BLOSUM-62. Taken together with the generally higher rate of synonymous mutations over non-synonymous ones, this means that the phylogenetic signal disappears much more rapidly from DNA sequences than from the encoded proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA alignment by 'reverse translation' of the aligned protein sequences. In the resulting DNA alignment, gaps occur in groups of three corresponding to entire codons, and analogous codon positions are therefore always lined up. These features are useful when constructing multiple DNA alignments for phylogenetic analysis. RevTrans also accepts user-provided protein alignments for greater control of the alignment process. The RevTrans web server is freely available at http://www.cbs.dtu.dk/services/RevTrans/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Multiple alignment of coding DNA. (A) How alignment at the DNA level may lead to incorrectly aligned codon–codon boundaries. (B) How alignment of coding DNA at the amino acid level yields an alignment where analogous codon positions are properly lined up. The encoded amino acids are indicated at the bottom of (B).

References

    1. States D.J., Gish,W. and Altschul,S.F. (1991) Improved sensitivity of nucleic acid database searches using application-specific scoring matrices. Methods, 3, 66–70.
    1. Yang Z. and Nielsen,R.J. (1998) Synonymous and nonsynonymous rate variation in nuclear genes of mammals. Mol. Evol., 46, 409–418. - PubMed
    1. Henikoff S. and Henikoff,J.G. (1992) Amino acid substitution matrices from protein blocks. Proc. Natl Acad. Sci. USA, 89, 10915–10919. - PMC - PubMed
    1. Hein J. (1994) An algorithm combining DNA and protein alignment. J. Theor. Biol., 167, 169–174. - PubMed
    1. Hein J. and Støvlbæk,J. (1996) Combined DNA and protein alignment. Methods Enzymol., 266, 402–418. - PubMed

Publication types