Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking
- PMID: 11478868
- DOI: 10.1006/jmbi.2001.4870
Abstract
A major problem in genome annotation is whether it is valid to transfer the function from a characterised protein to a homologue of unknown activity. Here, we show that one can employ a strategy that uses a structure-based prediction of protein functional sites to assess the reliability of functional inheritance. We have automated and benchmarked a method based on the evolutionary trace approach. Using a multiple sequence alignment, we identified invariant polar residues, which were then mapped onto the protein structure. Spatial clusters of these invariant residues formed the predicted functional site. For 68 of 86 proteins examined, the method yielded information about the observed functional site. This algorithm for functional site prediction was then used to assess the validity of transferring the function between homologues. This procedure was tested on 18 pairs of homologous proteins with unrelated function and 70 pairs of proteins with related function, and was shown to be 94 % accurate. This automated method could be linked to schemes for genome annotation. Finally, we examined the use of functional site prediction in protein-protein and protein-DNA docking. The use of predicted functional sites was shown to filter putative docked complexes with a discrimination similar to that obtained by manually including biological information about active sites or DNA-binding residues.
Copyright 2001 Academic Press.
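The site-prediction step described in the abstract (find invariant polar residues in a multiple sequence alignment, map them onto the structure, and take spatial clusters as the predicted site) can be illustrated with a small sketch. This is not the authors' implementation. The polar residue set, the 8 Å distance cutoff, the single-linkage clustering rule, the use of one representative coordinate per residue, and the function names `invariant_polar_columns` and `spatial_clusters` are all assumptions made for illustration, and the toy data are invented.

```python
from itertools import combinations
from math import dist

# Assumed polar residue set (one-letter codes); the abstract does not list it.
POLAR = set("DEHKNQRSTY")


def invariant_polar_columns(alignment):
    """Return indices of alignment columns holding one identical polar residue."""
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    columns = []
    for i in range(length):
        residues = {seq[i] for seq in alignment}
        # Gap characters ("-") are not in POLAR, so gapped columns are skipped.
        if len(residues) == 1 and residues <= POLAR:
            columns.append(i)
    return columns


def spatial_clusters(coords, cutoff=8.0):
    """Group residues by single linkage: two residues join if within `cutoff`.

    coords maps residue index -> (x, y, z) of one representative atom.
    The cutoff (in angstroms) is an assumed value, not taken from the paper.
    Clusters are returned largest first.
    """
    parent = {r: r for r in coords}

    def find(r):
        # Union-find root lookup with path halving.
        while parent[r] != r:
            parent[r] = parent[parent[r]]
            r = parent[r]
        return r

    for a, b in combinations(coords, 2):
        if dist(coords[a], coords[b]) <= cutoff:
            parent[find(a)] = find(b)

    clusters = {}
    for r in coords:
        clusters.setdefault(find(r), []).append(r)
    return sorted((sorted(c) for c in clusters.values()),
                  key=lambda c: (-len(c), c))


# Toy data, invented for illustration. Alignment column i is assumed to
# correspond to structure residue i (no gaps, no numbering offset).
alignment = ["MKDHAS", "LKDHGS", "MKDHVT"]
coords = {
    0: (0.0, 0.0, 0.0), 1: (30.0, 0.0, 0.0), 2: (3.0, 0.0, 0.0),
    3: (5.0, 2.0, 0.0), 4: (40.0, 0.0, 0.0), 5: (50.0, 0.0, 0.0),
}
conserved = invariant_polar_columns(alignment)
clusters = spatial_clusters({i: coords[i] for i in conserved})
predicted_site = clusters[0] if clusters else []
```

In this toy case the largest cluster stands in for the predicted functional site. A real pipeline would also need to map alignment columns to structure residue numbers and decide how to rank or merge clusters, neither of which the abstract specifies.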
Similar articles
- Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity. J Mol Biol. 1998 Oct 2;282(4):703-11. doi: 10.1006/jmbi.1998.2061. PMID: 9743619
- Predicting functional sites with an automated algorithm suitable for heterogeneous datasets. BMC Bioinformatics. 2005 May 13;6:116. doi: 10.1186/1471-2105-6-116. PMID: 15890082 Free PMC article.
- [Computational method for prediction of protein functional sites using specificity determinants]. Mol Biol (Mosk). 2007 Jan-Feb;41(1):151-62. PMID: 17380902 Russian.
- Automated protein function prediction--the genomic challenge. Brief Bioinform. 2006 Sep;7(3):225-42. doi: 10.1093/bib/bbl004. Epub 2006 May 23. PMID: 16772267 Review.
- Protein-protein docking dealing with the unknown. J Comput Chem. 2010 Jan 30;31(2):317-42. doi: 10.1002/jcc.21276. PMID: 19462412 Review.
Cited by
- Structure-based function inference using protein family-specific fingerprints. Protein Sci. 2006 Jun;15(6):1537-43. doi: 10.1110/ps.062189906. PMID: 16731985 Free PMC article.
- Prediction of functional sites by analysis of sequence and structure conservation. Protein Sci. 2004 Apr;13(4):884-92. doi: 10.1110/ps.03465504. Epub 2004 Mar 9. PMID: 15010543 Free PMC article.
- How accurate and statistically robust are catalytic site predictions based on closeness centrality? BMC Bioinformatics. 2007 May 11;8:153. doi: 10.1186/1471-2105-8-153. PMID: 17498304 Free PMC article.
- Accuracy of structure-derived properties in simple comparative models of protein structures. Nucleic Acids Res. 2005 Jan 12;33(1):244-59. doi: 10.1093/nar/gki162. Print 2005. PMID: 15647507 Free PMC article.
- Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications. J Comput Aided Mol Des. 2009 Nov;23(11):785-97. doi: 10.1007/s10822-009-9277-0. Epub 2009 Jun 23. PMID: 19548090