Do aligned sequences share the same fold?
- PMID: 9367768
- DOI: 10.1006/jmbi.1997.1287
Do aligned sequences share the same fold?
Abstract
Sequence comparison remains a powerful tool to assess the structural relatedness of two proteins. To develop a sensitive sequence-based procedure for fold recognition, we performed an exhaustive global alignment (with zero end gap penalties) between sequences of protein domains with known three-dimensional folds. The subset of 1.3 million alignments between sequences of structurally unrelated domains was used to derive a set of analytical functions that represent the probability of structural significance for any sequence alignment at a given sequence identity, sequence similarity and alignment score. Analysis of overlap between structurally significant and insignificant alignments shows that sequence identity and sequence similarity measures are poor indicators of structural relatedness in the "twilight zone", while the alignment score allows much better discrimination between alignments of structurally related and unrelated sequences for a wide variety of alignment settings. A fold recognition benchmark was used to compare eight different substitution matrices with eight sets of gap penalties. The best performing matrices were Gonnet and Blosum50 with normalized gap penalties of 2.4/0.15 and 2.0/0.15, respectively, while the positive matrices were the worst performers. The derived functions and parameters can be used for fold recognition via a multilink chain of probability weighted pairwise sequence alignments.
Similar articles
-
An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited.J Mol Biol. 1995 Jun 16;249(4):816-31. doi: 10.1006/jmbi.1995.0340. J Mol Biol. 1995. PMID: 7602593
-
Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments.J Mol Biol. 1996 Dec 13;264(4):823-38. doi: 10.1006/jmbi.1996.0679. J Mol Biol. 1996. PMID: 8980688
-
Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation.J Mol Biol. 1997 Jun 13;269(3):423-39. doi: 10.1006/jmbi.1997.1019. J Mol Biol. 1997. PMID: 9199410
-
Sensitive methods for determining the relatedness of proteins with limited sequence homology.Curr Opin Biotechnol. 1994 Aug;5(4):361-71. doi: 10.1016/0958-1669(94)90044-2. Curr Opin Biotechnol. 1994. PMID: 7765168 Review.
-
Assigning amino acid sequences to 3-dimensional protein folds.FASEB J. 1996 Jan;10(1):126-36. doi: 10.1096/fasebj.10.1.8566533. FASEB J. 1996. PMID: 8566533 Review.
Cited by
-
Biophysical Compatibility of a Heterotrimeric Tyrosinase-TYRP1-TYRP2 Metalloenzyme Complex.Front Pharmacol. 2021 Apr 28;12:602206. doi: 10.3389/fphar.2021.602206. eCollection 2021. Front Pharmacol. 2021. PMID: 33995009 Free PMC article.
-
Predicting protein interface residues using easily accessible on-line resources.Brief Bioinform. 2015 Nov;16(6):1025-34. doi: 10.1093/bib/bbv009. Epub 2015 Mar 21. Brief Bioinform. 2015. PMID: 25797794 Free PMC article. Review.
-
All are not equal: a benchmark of different homology modeling programs.Protein Sci. 2005 May;14(5):1315-27. doi: 10.1110/ps.041253405. Protein Sci. 2005. PMID: 15840834 Free PMC article.
-
Orientation-dependent backbone-only residue pair scoring functions for fixed backbone protein design.BMC Bioinformatics. 2010 Apr 16;11:192. doi: 10.1186/1471-2105-11-192. BMC Bioinformatics. 2010. PMID: 20398384 Free PMC article.
-
Protein secondary structure appears to be robust under in silico evolution while protein disorder appears not to be.Bioinformatics. 2010 Mar 1;26(5):625-31. doi: 10.1093/bioinformatics/btq012. Epub 2010 Jan 16. Bioinformatics. 2010. PMID: 20081223 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous