Improved prediction for N-termini of alpha-helices using empirical information

Claire L Wilson¹, Paul E Boardman, Andrew J Doig, Simon J Hubbard

Affiliations

PMID: 15340919
DOI: 10.1002/prot.20218

Improved prediction for N-termini of alpha-helices using empirical information

Claire L Wilson et al. Proteins. 2004.

. 2004 Nov 1;57(2):322-30.

doi: 10.1002/prot.20218.

Authors

Claire L Wilson¹, Paul E Boardman, Andrew J Doig, Simon J Hubbard

Affiliation

¹ Department of Biomolecular Sciences, University of Manchester Institute of Science and Technology, Manchester, United Kingdom.

PMID: 15340919
DOI: 10.1002/prot.20218

Abstract

The prediction of the secondary structure of proteins from their amino acid sequences remains a key component of many approaches to the protein folding problem. The most abundant form of regular secondary structure in proteins is the alpha-helix, in which specific residue preferences exist at the N-terminal locations. Propensities derived from these observed amino acid frequencies in the Protein Data Bank (PDB) database correlate well with experimental free energies measured for residues at different N-terminal positions in alanine-based peptides. We report a novel method to exploit this data to improve protein secondary structure prediction through identification of the correct N-terminal sequences in alpha-helices, based on existing popular methods for secondary structure prediction. With this algorithm, the number of correctly predicted alpha-helix start positions was improved from 30% to 38%, while the overall prediction accuracy (Q3) remained the same, using cross-validated testing. Although the algorithm was developed and tested on multiple sequence alignment-based secondary structure predictions, it was also able to improve the predictions of start locations by methods that use single sequences to make their predictions. Furthermore, the residue frequencies at N-terminal positions of the improved predictions better reflect those seen at the N-terminal positions of alpha-helices in proteins. This has implications for areas such as comparative modeling, where a more accurate prediction of the N-terminal regions of alpha-helices should benefit attempts to model adjacent loop regions. The algorithm is available as a Web tool, located at http://rocky.bms.umist.ac.uk/elephant.

PubMed Disclaimer

Cited by

Position-specific propensities of amino acids in the β-strand.
Bhattacharjee N, Biswas P. Bhattacharjee N, et al. BMC Struct Biol. 2010 Sep 28;10:29. doi: 10.1186/1472-6807-10-29. BMC Struct Biol. 2010. PMID: 20920153 Free PMC article.
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?
Yang Y, Gao J, Wang J, Heffernan R, Hanson J, Paliwal K, Zhou Y. Yang Y, et al. Brief Bioinform. 2018 May 1;19(3):482-494. doi: 10.1093/bib/bbw129. Brief Bioinform. 2018. PMID: 28040746 Free PMC article.
Folding by numbers: primary sequence statistics and their use in studying protein folding.
Wathen B, Jia Z. Wathen B, et al. Int J Mol Sci. 2009 Apr 8;10(4):1567-1589. doi: 10.3390/ijms10041567. Int J Mol Sci. 2009. PMID: 19468326 Free PMC article. Review.
Synonymous codon usage influences the local protein structure observed.
Saunders R, Deane CM. Saunders R, et al. Nucleic Acids Res. 2010 Oct;38(19):6719-28. doi: 10.1093/nar/gkq495. Epub 2010 Jun 8. Nucleic Acids Res. 2010. PMID: 20530529 Free PMC article.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
- Wiley

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Improved prediction for N-termini of alpha-helices using empirical information

Affiliation

Improved prediction for N-termini of alpha-helices using empirical information

Authors

Affiliation

Abstract

Similar articles

Cited by

MeSH terms

Substances

LinkOut - more resources

Full Text Sources