PROGEN: an automated modelling algorithm for the generation of complete protein structures from the alpha-carbon atomic coordinates
- PMID: 8320557
- DOI: 10.1007/BF00126445
PROGEN: an automated modelling algorithm for the generation of complete protein structures from the alpha-carbon atomic coordinates
Abstract
A modelling algorithm (PROGEN) for the generation of complete protein atomic coordinates from only the alpha-carbon coordinates is described. PROGEN utilizes an optimal geometry parameter (OGP) database for the positioning of atoms for each amino acid of the polypeptide model. The OGP database was established by examining the statistical correlations between 23 different intra-peptide and inter-peptide geometric parameters relative to the alpha-carbon distances for each amino acid in a library of 19 known proteins from the Brookhaven Protein Database (BPDB). The OGP files for specific amino acids and peptides were used to generate the atomic positions, with respect to alpha-carbons, for main-chain and side-chain atoms in the modelled structure. Refinement of the initial model was accomplished using energy minimization (EM) and molecular dynamics techniques. PROGEN was tested using 60 known proteins in the BPDB, representing a wide spectrum of primary and secondary structures. Comparison between PROGEN models and BPDB crystal reference structures gave r.m.s.d. values for peptide main-chain atoms between 0.29 and 0.76 A, with a grand average of 0.53 A for all 60 models. The r.m.s.d. for all non-hydrogen atoms ranged between 1.44 and 1.93 A for the 60 polypeptide models. PROGEN was also able to make the correct assignment of cis- or trans-proline configurations in the protein structures examined. PROGEN offers a fully automatic building and refinement procedure and requires no special or specific structural considerations for the protein to be modelled.
Similar articles
-
Application of a directed conformational search for generating 3-D coordinates for protein structures from alpha-carbon coordinates.Proteins. 1992 Dec;14(4):465-74. doi: 10.1002/prot.340140407. Proteins. 1992. PMID: 1438184
-
Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA.J Mol Biol. 2002 May 24;319(1):209-27. doi: 10.1016/s0022-2836(02)00241-3. J Mol Biol. 2002. PMID: 12051947
-
Activated conformations of the ras-gene-encoded p21 protein. 1. An energy-refined structure for the normal p21 protein complexed with GDP.J Biomol Struct Dyn. 1992 Jun;9(6):1025-44. doi: 10.1080/07391102.1992.10507977. J Biomol Struct Dyn. 1992. PMID: 1637501
-
Reconstruction of protein conformations from estimated positions of the C alpha coordinates.Protein Sci. 1993 Mar;2(3):315-24. doi: 10.1002/pro.5560020303. Protein Sci. 1993. PMID: 8453371 Free PMC article.
-
Current progress, challenges, and future perspectives of language models for protein representation and protein design.Innovation (Camb). 2023 May 21;4(4):100446. doi: 10.1016/j.xinn.2023.100446. eCollection 2023 Jul 10. Innovation (Camb). 2023. PMID: 37485078 Free PMC article. Review.
Cited by
-
Discrete restraint-based protein modeling and the Calpha-trace problem.Protein Sci. 2003 Sep;12(9):2032-46. doi: 10.1110/ps.0386903. Protein Sci. 2003. PMID: 12931001 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Miscellaneous