CAALIGN: a program for pairwise and multiple protein-structure alignment
- PMID: 17372357
- DOI: 10.1107/S0907444907000844
Abstract
Coordinate superposition of proteins provides a structural basis for assessing protein similarity and therefore complements sequence alignment. Structure-alignment methods must contend with the large number of trials needed to determine the optimal alignment. This article presents a method for rapid (subsecond) structure alignment between pairs of proteins based on a maximal Cα-atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non-overlapping solutions for a pair of proteins, and these solutions are independent of fold connectivity and secondary-structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary-structure elements, as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives, as judged by SCOP classification. It can find alignments between topologically different folds and returns sequence-alignment information derived from the structure alignment. Additionally, the algorithm has been extended to carry out multiple structure alignment in order to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented in the program CAALIGN, and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.
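The abstract does not describe how CAALIGN searches for its Cα superpositions, so the following is only an illustrative sketch of the general idea of finding structurally equivalent fragments of at least 12 residues. It swaps in a simple, rotation-invariant comparison of internal Cα-Cα distances rather than an explicit superposition, and it is not the published algorithm. The function names, the 0.5 Å tolerance and the idealised helix coordinates are all assumptions made for the example.

```python
import math

def internal_distances(coords):
    """All-against-all C-alpha distance matrix for one chain."""
    return [[math.dist(p, q) for q in coords] for p in coords]

def matching_fragments(ca_a, ca_b, min_len=12, tol=0.5):
    """Return (i, j) start indices of min_len-residue windows whose
    internal C-alpha distances agree within tol (angstroms).

    Hypothetical helper, not CAALIGN's method. Distances are unchanged
    by rotation and translation, so no superposition is needed, but
    mirror-image fragments are not distinguished.
    """
    da = internal_distances(ca_a)
    db = internal_distances(ca_b)
    hits = []
    for i in range(len(ca_a) - min_len + 1):
        for j in range(len(ca_b) - min_len + 1):
            if all(
                abs(da[i + p][i + q] - db[j + p][j + q]) <= tol
                for p in range(min_len)
                for q in range(p + 1, min_len)
            ):
                hits.append((i, j))
    return hits

# Assumed test data: an idealised 12-residue alpha-helix
# (radius 2.3 A, rise 1.5 A, 100 degrees per residue).
helix_a = [
    (2.3 * math.cos(math.radians(100 * k)),
     2.3 * math.sin(math.radians(100 * k)),
     1.5 * k)
    for k in range(12)
]

# The same helix rotated 90 degrees about the x axis and translated,
# preceded by three unrelated residues placed far away.
moved = [(x + 10.0, -z - 5.0, y + 3.0) for (x, y, z) in helix_a]
protein_b = [(100.0 + 3.8 * k, 0.0, 0.0) for k in range(3)] + moved

print(matching_fragments(helix_a, protein_b))
```

Because the comparison uses only intra-chain distances, the match is found wherever the fragment sits in either chain, which mirrors the abstract's point that alignments can be independent of fold connectivity. A real implementation would need a far cheaper search than this exhaustive window scan to reach subsecond run times.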