Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments
- PMID: 11152139
- PMCID: PMC2144484
- DOI: 10.1110/ps.9.11.2278
Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments
Abstract
The PSI-BLAST algorithm has been acknowledged as one of the most powerful tools for detecting remote evolutionary relationships by sequence considerations only. This has been demonstrated by its ability to recognize remote structural homologues and by the greatest coverage it enables in annotation of a complete genome. Although recognizing the correct fold of a sequence is of major importance, the accuracy of the alignment is crucial for the success of modeling one sequence by the structure of its remote homologue. Here we assess the accuracy of PSI-BLAST alignments on a stringent database of 123 structurally similar, sequence-dissimilar pairs of proteins, by comparing them to the alignments defined on a structural basis. Each protein sequence is compared to a nonredundant database of the protein sequences by PSI-BLAST. Whenever a pair member detects its pair-mate, the positions that are aligned both in the sequential and structural alignments are determined, and the alignment sensitivity is expressed as the percentage of these positions out of the structural alignment. Fifty-two sequences detected their pair-mates (for 16 pairs the success was bi-directional when either pair member was used as a query). The average percentage of correctly aligned residues per structural alignment was 43.5+/-2.2%. Other properties of the alignments were also examined, such as the sensitivity vs. specificity and the change in these parameters over consecutive iterations. Notably, there is an improvement in alignment sensitivity over consecutive iterations, reaching an average of 50.9+/-2.5% within the five iterations tested in the current study.
Similar articles
-
Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre.Proteins. 2008 Feb 15;70(3):611-25. doi: 10.1002/prot.21688. Proteins. 2008. PMID: 17876813
-
A comparison of scoring functions for protein sequence profile alignment.Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12. Bioinformatics. 2004. PMID: 14962936
-
Benchmarking PSI-BLAST in genome annotation.J Mol Biol. 1999 Nov 12;293(5):1257-71. doi: 10.1006/jmbi.1999.3233. J Mol Biol. 1999. PMID: 10547299
-
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389. Nucleic Acids Res. 1997. PMID: 9254694 Free PMC article. Review.
-
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.Nucleic Acids Res. 2001 Jul 15;29(14):2994-3005. doi: 10.1093/nar/29.14.2994. Nucleic Acids Res. 2001. PMID: 11452024 Free PMC article. Review.
Cited by
-
Prediction and visualization data for the interpretation of sarcomeric and non-sarcomeric DNA variants found in patients with hypertrophic cardiomyopathy.Data Brief. 2016 Mar 10;7:607-13. doi: 10.1016/j.dib.2016.03.004. eCollection 2016 Jun. Data Brief. 2016. PMID: 27054166 Free PMC article.
-
Three globin lineages belonging to two structural classes in genomes from the three kingdoms of life.Proc Natl Acad Sci U S A. 2005 Aug 9;102(32):11385-9. doi: 10.1073/pnas.0502103102. Epub 2005 Aug 1. Proc Natl Acad Sci U S A. 2005. PMID: 16061809 Free PMC article.
-
An efficient algorithm for protein structure comparison using elastic shape analysis.Algorithms Mol Biol. 2016 Sep 29;11:27. doi: 10.1186/s13015-016-0089-1. eCollection 2016. Algorithms Mol Biol. 2016. PMID: 27708689 Free PMC article.
-
AlignHUSH: alignment of HMMs using structure and hydrophobicity information.BMC Bioinformatics. 2011 Jul 5;12:275. doi: 10.1186/1471-2105-12-275. BMC Bioinformatics. 2011. PMID: 21729312 Free PMC article.
-
A directed approach to improving the solubility of Moloney murine leukemia virus reverse transcriptase.Protein Sci. 2001 Oct;10(10):1936-41. doi: 10.1110/ps.16301. Protein Sci. 2001. PMID: 11567084 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials