Modeling of globular proteins. A distance-based data search procedure for the construction of insertion/deletion regions and Pro----non-Pro mutations
- PMID: 2266566
- DOI: 10.1016/S0022-2836(99)80016-3
Modeling of globular proteins. A distance-based data search procedure for the construction of insertion/deletion regions and Pro----non-Pro mutations
Abstract
A distance-based database search scheme is proposed for modeling Pro----in non-Pro and insertion/deletion regions of homologous globular proteins up to six residues in length. In the first step, geometric descriptors, the number of residues involved and target distances corresponding to the separation of C alpha atom positions adjacent to the "missing" segment, are chosen. In the second step, a database of high-resolution X-ray structures is scanned for segments with similar descriptors and selected segments are binned according to conformational type. In the third and fourth steps, the selected conformations are docked into the protein, and geometric and energetic criteria are used to determine their viability as segment models. The fifth step consists of an interaction scheme in which the geometric descriptors are redefined. This compensates for the use of a limited database and/or for the use of a poor original protein model adjacent to the missing segment. The procedure has been tested on Pro----non-Pro mutations in the homologous proteins penicillopepsin and endothiapepsin, and on the insertion/deletion regions of the homologs penicillopepsin and endothiapepsin, trypsin and gamma-chymotrypsin and hen and human lysozyme. The test cases represent a wide variety of secondary structural elements (helix, sheet, turn and coil) and insertion/deletion lengths (0 to 4 residues). It is shown that 79% of the test cases are accurately modeled (within 0.54 A root-mean-square (r.m.s.) deviation for main-chain atoms) using the proposed scheme. Failure of the scheme (main-chain atom r.m.s. deviations greater than 1.29 A) in 21% of the cases appears to be related to the presence of infrequently observed conformations or locally unique folds of the target proteins with respect to the database (18% of the test cases); the remaining 3% are unexplained. Geometric and energetic criteria are able to discriminate between trial conformations that correspond to the X-ray structures and those that are different in 97% of the conformations generated by the distance-weighted database search scheme. The scheme is shown to be relatively insensitive to uncertainty in the template co-ordinates, since the geometric descriptors were taken from the homologous protein (r.m.s. deviations in the position of descriptors range from 0.18 to 1.35 A for the accurately modeled test cases). It is demonstrated that the scheme can be used to correct local sequence misalignments.
Similar articles
-
X-ray analyses of aspartic proteinases. V. Structure and refinement at 2.0 A resolution of the aspartic proteinase from Mucor pusillus.J Mol Biol. 1993 Mar 5;230(1):260-83. J Mol Biol. 1993. PMID: 8450540
-
X-ray analyses of aspartic proteinases. The three-dimensional structure at 2.1 A resolution of endothiapepsin.J Mol Biol. 1990 Feb 20;211(4):919-41. doi: 10.1016/0022-2836(90)90084-Y. J Mol Biol. 1990. PMID: 2179568
-
An algorithm for determining the conformation of polypeptide segments in proteins by systematic search.Proteins. 1986 Oct;1(2):146-63. doi: 10.1002/prot.340010207. Proteins. 1986. PMID: 3130622
-
Comparison of the three-dimensional structures of a humanized and a chimeric Fab of an anti-gamma-interferon antibody.J Mol Recognit. 1999 Jan-Feb;12(1):19-32. doi: 10.1002/(SICI)1099-1352(199901/02)12:1<19::AID-JMR445>3.0.CO;2-Y. J Mol Recognit. 1999. PMID: 10398393 Review.
-
[A turning point in the knowledge of the structure-function-activity relations of elastin].J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French.
Cited by
-
Homology modeling of cephalopod lens S-crystallin: a natural mutant of sigma-class glutathione transferase with diminished endogenous activity.Biophys J. 1999 Feb;76(2):679-90. doi: 10.1016/S0006-3495(99)77235-8. Biophys J. 1999. PMID: 9929473 Free PMC article.
-
Multiple copy sampling in protein loop modeling: computational efficiency and sensitivity to dihedral angle perturbations.Protein Sci. 1994 Mar;3(3):493-506. doi: 10.1002/pro.5560030315. Protein Sci. 1994. PMID: 8019420 Free PMC article.
-
The Ramachandran plots of glycine and pre-proline.BMC Struct Biol. 2005 Aug 16;5:14. doi: 10.1186/1472-6807-5-14. BMC Struct Biol. 2005. PMID: 16105172 Free PMC article.
-
Chemical shift prediction for protein structure calculation and quality assessment using an optimally parameterized force field.Prog Nucl Magn Reson Spectrosc. 2012 Jan;60:1-28. doi: 10.1016/j.pnmrs.2011.05.002. Epub 2011 May 23. Prog Nucl Magn Reson Spectrosc. 2012. PMID: 22293396 Free PMC article. Review.
-
SMS 2.0: an updated database to study the structural plasticity of short peptide fragments in non-redundant proteins.Genomics Proteomics Bioinformatics. 2012 Feb;10(1):44-50. doi: 10.1016/S1672-0229(11)60032-6. Genomics Proteomics Bioinformatics. 2012. PMID: 22449400 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources