Comparison of sequence and structure alignments for protein domains
- PMID: 12112669
- DOI: 10.1002/prot.10163
Comparison of sequence and structure alignments for protein domains
Abstract
Profile search methods based on protein domain alignments have proven to be useful tools in comparative sequence analysis. Domain alignments used by currently available search methods have been computed by sequence comparison. With the growth of the protein structure database, however, alignments of many domain pairs have also been computed by structure comparison. Here, we examine the extent to which information from these two sources agrees. We measure agreement with respect to identification of homologous regions in each protein, that is, with respect to the location of domain boundaries. We also measure agreement with respect to identification of homologous residue sites by comparing alignments and assessing the accuracy of the molecular models they predict. We find that domain alignments in publicly available collections based on sequence and structure comparison are largely consistent. However, the homologous regions identified by sequence comparison are often shorter than those identified by 3D structure comparison. In addition, when overall sequence similarity is low alignments from sequence comparison produce less accurate molecular models, suggesting that they less accurately identify homologous sites. These observations suggest that structure comparison results might be used to improve the overall accuracy of domain alignment collections and the performance of profile search methods based on them.
Copyright 2002 Wiley-Liss, Inc.
Similar articles
-
A Shannon entropy-based filter detects high- quality profile-profile alignments in searches for remote homologues.Proteins. 2004 Feb 1;54(2):351-60. doi: 10.1002/prot.10564. Proteins. 2004. PMID: 14696197
-
Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.J Mol Biol. 2002 Feb 1;315(5):1257-75. doi: 10.1006/jmbi.2001.5293. J Mol Biol. 2002. PMID: 11827492
-
Consistency matrices: quantified structure alignments for sets of related proteins.Proteins. 2003 Apr 1;51(1):1-9. doi: 10.1002/prot.10293. Proteins. 2003. PMID: 12596259
-
Methods for sequence-structure alignment.Methods Mol Biol. 2012;857:55-82. doi: 10.1007/978-1-61779-588-6_3. Methods Mol Biol. 2012. PMID: 22323217 Review.
-
Selection of soluble protein expression constructs: the experimental determination of protein domain boundaries.Biochem Soc Trans. 2010 Aug;38(4):908-13. doi: 10.1042/BST0380908. Biochem Soc Trans. 2010. PMID: 20658975 Review.
Cited by
-
Molecular modeling of the membrane targeting of phospholipase C pleckstrin homology domains.Protein Sci. 2003 Sep;12(9):1934-53. doi: 10.1110/ps.0358803. Protein Sci. 2003. PMID: 12930993 Free PMC article.
-
Salivary gland transcripts of the kissing bug, Panstrongylus chinai, a vector of Chagas disease.Acta Trop. 2017 Oct;174:122-129. doi: 10.1016/j.actatropica.2017.06.022. Epub 2017 Jul 6. Acta Trop. 2017. PMID: 28690145 Free PMC article.
-
TSOL18/HP6-Tsol, an immunogenic Taenia solium oncospheral adhesion protein and potential protective antigen.Parasitol Res. 2008 Apr;102(5):921-6. doi: 10.1007/s00436-007-0853-8. Epub 2008 Jan 24. Parasitol Res. 2008. PMID: 18214543
-
A repertoire of the dominant transcripts from the salivary glands of the blood-sucking bug, Triatoma dimidiata, a vector of Chagas disease.Infect Genet Evol. 2010 Mar;10(2):184-91. doi: 10.1016/j.meegid.2009.10.012. Epub 2009 Nov 10. Infect Genet Evol. 2010. PMID: 19900580 Free PMC article.
-
The effect of disease associated point mutations on 5β-reductase (AKR1D1) enzyme function.Chem Biol Interact. 2011 May 30;191(1-3):250-4. doi: 10.1016/j.cbi.2010.12.020. Epub 2010 Dec 24. Chem Biol Interact. 2011. PMID: 21185810 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources