Three-dimensional profiles for analysing protein sequence-structure relationships
- PMID: 1290936
Three-dimensional profiles for analysing protein sequence-structure relationships
Abstract
In the method of 3D (three-dimensional) profiles, each residue position in a protein is characterized by its environment and is represented by a row of 20 numbers in a table, the profile. These numbers are the statistical preferences (called 3D-1D scores) of each of the 20 amino acids for this environment. A profile is computed from the coordinates of a protein model, and it gives a score S for any amino acid sequence folded as the model. To date 3D profiles have found three applications. The first is to identify other protein sequences which are folded in the same general pattern as the structure from which the profile was prepared. These are sequences which have high scores for the profile computed from the model. The second is to assess the validity of protein models, however determined. Correct models are found to give profiles that have high scores for their own amino acid sequences, and incorrect models are found to have lower scores. The example of the X-ray structure determination of diphtheria toxin is discussed. The third application is to assess which is the stable oligomeric state of a folded protein. Several examples suggest that the highest profile score for a sequence is achieved when the protein is aggregated into its most stable oligomeric state.
Similar articles
-
Assessment of protein models with three-dimensional profiles.Nature. 1992 Mar 5;356(6364):83-5. doi: 10.1038/356083a0. Nature. 1992. PMID: 1538787
-
A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924. J Mol Biol. 1997. PMID: 9135128
-
The 1.7 A crystal structure of BPI: a study of how two dissimilar amino acid sequences can adopt the same fold.J Mol Biol. 2000 Jun 16;299(4):1019-34. doi: 10.1006/jmbi.2000.3805. J Mol Biol. 2000. PMID: 10843855
-
Potential implications of availability of short amino acid sequences in proteins: an old and new approach to protein decoding and design.Biotechnol Annu Rev. 2008;14:109-41. doi: 10.1016/S1387-2656(08)00004-5. Biotechnol Annu Rev. 2008. PMID: 18606361 Review.
-
[Structured proteins and proteins with internal disorder].Mol Biol (Mosk). 2007 Mar-Apr;41(2):297-313. Mol Biol (Mosk). 2007. PMID: 17514898 Review. Russian.
Cited by
-
Inhibition of IRF5 cellular activity with cell-penetrating peptides that target homodimerization.Sci Adv. 2020 May 15;6(20):eaay1057. doi: 10.1126/sciadv.aay1057. eCollection 2020 May. Sci Adv. 2020. PMID: 32440537 Free PMC article.
-
Genome bioinformatic analysis of nonsynonymous SNPs.BMC Bioinformatics. 2007 Aug 20;8:301. doi: 10.1186/1471-2105-8-301. BMC Bioinformatics. 2007. PMID: 17708757 Free PMC article.
-
EvDTree: structure-dependent substitution profiles based on decision tree classification of 3D environments.BMC Bioinformatics. 2005 Jan 10;6:4. doi: 10.1186/1471-2105-6-4. BMC Bioinformatics. 2005. PMID: 15638949 Free PMC article.
-
Accurate prediction of peptide binding sites on protein surfaces.PLoS Comput Biol. 2009 Mar;5(3):e1000335. doi: 10.1371/journal.pcbi.1000335. Epub 2009 Mar 27. PLoS Comput Biol. 2009. PMID: 19325869 Free PMC article.
-
The blind watchmaker and rational protein engineering.J Biotechnol. 1994 Aug 31;36(3):185-220. doi: 10.1016/0168-1656(94)90152-x. J Biotechnol. 1994. PMID: 7765263 Free PMC article. Review.