Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul;38(Web Server issue):W23-8.
doi: 10.1093/nar/gkq443. Epub 2010 May 23.

GUIDANCE: a web server for assessing alignment confidence scores

Affiliations

GUIDANCE: a web server for assessing alignment confidence scores

Osnat Penn et al. Nucleic Acids Res. 2010 Jul.

Abstract

Evaluating the accuracy of multiple sequence alignment (MSA) is critical for virtually every comparative sequence analysis that uses an MSA as input. Here we present the GUIDANCE web-server, a user-friendly, open access tool for the identification of unreliable alignment regions. The web-server accepts as input a set of unaligned sequences. The server aligns the sequences and provides a simple graphic visualization of the confidence score of each column, residue and sequence of an alignment, using a color-coding scheme. The method is generic and the user is allowed to choose the alignment algorithm (ClustalW, MAFFT and PRANK are supported) as well as any type of molecular sequences (nucleotide, protein or codon sequences). The server implements two different algorithms for evaluating confidence scores: (i) the heads-or-tails (HoT) method, which measures alignment uncertainty due to co-optimal solutions; (ii) the GUIDANCE method, which measures the robustness of the alignment to guide-tree uncertainty. The server projects the confidence scores onto the MSA and points to columns and sequences that are unreliably aligned. These can be automatically removed in preparation for downstream analyses. GUIDANCE is freely available for use at http://guidance.tau.ac.il.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An example of the GUIDANCE output. (a) Residue confidence scores are projected onto the MAFFT alignment of Vpu protein sequences from human and simian immunodeficiency viruses (HIV and SIV). Confidently aligned residues are colored in shades of magenta and pink, while uncertain residues are colored in shades of blue. Column scores are plotted below the alignment. (b) Dramatically improved alignment confidence after filtering low-scoring sequences and re-running GUIDANCE. Note the color-coding next to the sequence names before and after re-alignment.
Figure 2.
Figure 2.
Average run-time performance as a function of the number of sequences. Simulated protein sequences roughly 300 amino acids long were aligned using MAFFT and analyzed by GUIDANCE (blue diamonds) or HoT (red squares). In addition, running time for GUIDANCE on PRANK alignments is plotted with green triangles. Each data point represents ten replicates.

Similar articles

Cited by

References

    1. Penn O., Privman E., Landan G., Graur D., Pupko T. An alignment confidence score capturing robustness to guide-tree uncertainty. Mol. Biol. Evol. 2010 doi:10.1093/molbev/msq066. - PMC - PubMed
    1. Landan G., Graur D. Local reliability measures from sets of co-optimal multiple sequence alignments. Pacific Symp. Biocomput. 2008;13:15–24. - PubMed
    1. Landan G., Graur D. Heads or tails: a simple reliability check for multiple sequence alignments. Mol. Biol. Evol. 2007;24:1380–1383. - PubMed
    1. Poirot O., O’Toole E., Notredame C. Tcoffee@igs: a web server for computing, evaluating and combining multiple sequence alignments. Nucleic Acids Res. 2003;31:3503–3506. - PMC - PubMed
    1. Loytynoja A., Milinkovitch M.C. SOAP, cleaning multiple alignments from unstable blocks. Bioinformatics. 2001;17:573–574. - PubMed

Publication types

MeSH terms

Substances