Protein structure prediction begins well but ends badly
- PMID: 20014025
- DOI: 10.1002/prot.22646
Protein structure prediction begins well but ends badly
Abstract
The accurate prediction of protein structure, both secondary and tertiary, is an ongoing problem. Over the years, many approaches have been implemented and assessed. Most prediction algorithms start with the entire amino acid sequence and treat all residues in an identical fashion independent of sequence position. Here, we analyze blind prediction data to investigate whether predictive capability varies along the chain. Free modeling results from recent critical assessment of techniques for protein structure prediction (CASP) experiments are evaluated; as is the most up-to-date data from EVA, a fully automated blind test of secondary structure prediction servers. The results demonstrate that structure prediction accuracy is dependent on sequence position. Both secondary structure and tertiary structure predictions are more accurate in regions near the amino(N)-terminus when compared with analogous regions near the carboxy(C)-terminus. Eight of 10 secondary structure prediction algorithms assessed by EVA perform significantly better in regions at the N-terminus. CASP data shows a similar bias, with N-terminal fragments being predicted more accurately than fragments from the C-terminus. Two analogous fragments are taken from each model, the N-terminal fragment begins at the start of the most N-terminal secondary structure element (SSE), whereas the C-terminal fragment finishes at the end of the most C-terminal SSE. Each fragment is locally superimposed onto its respective native fragment. The relative terminal prediction accuracy (RMSD) is calculated on an intramodel basis. At a fragment length of 20 residues, the N-terminal fragment is predicted with greater accuracy in 79% of cases.
Similar articles
-
Improved prediction for N-termini of alpha-helices using empirical information.Proteins. 2004 Nov 1;57(2):322-30. doi: 10.1002/prot.20218. Proteins. 2004. PMID: 15340919
-
SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles.Bioinformatics. 2006 Jan 15;22(2):181-7. doi: 10.1093/bioinformatics/bti751. Epub 2005 Nov 2. Bioinformatics. 2006. PMID: 16267083
-
HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.Bioinformatics. 2005 Aug 1;21(15):3227-33. doi: 10.1093/bioinformatics/bti524. Epub 2005 Jun 2. Bioinformatics. 2005. PMID: 15932901
-
A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction.Curr Opin Struct Biol. 2005 Jun;15(3):285-9. doi: 10.1016/j.sbi.2005.05.011. Curr Opin Struct Biol. 2005. PMID: 15939584 Review.
-
Protein structure prediction: recognition of primary, secondary, and tertiary structural features from amino acid sequence.Crit Rev Biochem Mol Biol. 1995;30(1):1-94. doi: 10.3109/10409239509085139. Crit Rev Biochem Mol Biol. 1995. PMID: 7587278 Review.
Cited by
-
Identification of distant Agouti-like sequences and re-evaluation of the evolutionary history of the Agouti-related peptide (AgRP).PLoS One. 2012;7(7):e40982. doi: 10.1371/journal.pone.0040982. Epub 2012 Jul 16. PLoS One. 2012. PMID: 22815886 Free PMC article.
-
Decoding mechanisms by which silent codon changes influence protein biogenesis and function.Int J Biochem Cell Biol. 2015 Jul;64:58-74. doi: 10.1016/j.biocel.2015.03.011. Epub 2015 Mar 26. Int J Biochem Cell Biol. 2015. PMID: 25817479 Free PMC article. Review.
-
Robustness by intrinsically disordered C-termini and translational readthrough.Nucleic Acids Res. 2018 Nov 2;46(19):10184-10194. doi: 10.1093/nar/gky778. Nucleic Acids Res. 2018. PMID: 30247639 Free PMC article.
-
From protein engineering to immobilization: promising strategies for the upgrade of industrial enzymes.Int J Mol Sci. 2013 Jan 10;14(1):1232-77. doi: 10.3390/ijms14011232. Int J Mol Sci. 2013. PMID: 23306150 Free PMC article. Review.
-
Conjugation with the Carrier Helped to Reveal acidification-Induced Structural Shift in the Peptide from Phospholipase Domain of Parvovirus B19.Protein J. 2024 Aug;43(4):805-818. doi: 10.1007/s10930-024-10209-w. Epub 2024 Jul 9. Protein J. 2024. PMID: 38980534
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources