Deciphering the preference and predicting the viability of circular permutations in proteins
- PMID: 22359629
- PMCID: PMC3281007
- DOI: 10.1371/journal.pone.0031791
Deciphering the preference and predicting the viability of circular permutations in proteins
Abstract
Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology.
Conflict of interest statement
Figures





Similar articles
-
CPred: a web server for predicting viable circular permutations in proteins.Nucleic Acids Res. 2012 Jul;40(Web Server issue):W232-7. doi: 10.1093/nar/gks529. Epub 2012 Jun 11. Nucleic Acids Res. 2012. PMID: 22693212 Free PMC article.
-
Protein engineering using circular permutation - structure, function, stability, and applications.FEBS J. 2024 Aug;291(16):3581-3596. doi: 10.1111/febs.17146. Epub 2024 Apr 27. FEBS J. 2024. PMID: 38676939 Review.
-
[A turning point in the knowledge of the structure-function-activity relations of elastin].J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French.
-
High-resolution structure prediction of a circular permutation loop.Protein Sci. 2011 Nov;20(11):1929-34. doi: 10.1002/pro.725. Epub 2011 Sep 30. Protein Sci. 2011. PMID: 21898647 Free PMC article.
-
CPDB: a database of circular permutation in proteins.Nucleic Acids Res. 2009 Jan;37(Database issue):D328-32. doi: 10.1093/nar/gkn679. Epub 2008 Oct 8. Nucleic Acids Res. 2009. PMID: 18842637 Free PMC article.
Cited by
-
Protein switch engineering by domain insertion.Methods Enzymol. 2013;523:369-88. doi: 10.1016/B978-0-12-394292-0.00017-5. Methods Enzymol. 2013. PMID: 23422439 Free PMC article.
-
CirPred, the first structure modeling and linker design system for circularly permuted proteins.BMC Bioinformatics. 2021 Oct 12;22(Suppl 10):494. doi: 10.1186/s12859-021-04403-1. BMC Bioinformatics. 2021. PMID: 34641789 Free PMC article.
-
A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction.PLoS One. 2021 Jul 28;16(7):e0255076. doi: 10.1371/journal.pone.0255076. eCollection 2021. PLoS One. 2021. PMID: 34320027 Free PMC article.
-
Tandem domain swapping: determinants of multidomain protein misfolding.Curr Opin Struct Biol. 2019 Oct;58:97-104. doi: 10.1016/j.sbi.2019.05.012. Epub 2019 Jun 28. Curr Opin Struct Biol. 2019. PMID: 31260947 Free PMC article. Review.
-
A simple strategy to enhance the speed of protein secondary structure prediction without sacrificing accuracy.PLoS One. 2020 Jun 30;15(6):e0235153. doi: 10.1371/journal.pone.0235153. eCollection 2020. PLoS One. 2020. PMID: 32603341 Free PMC article.
References
-
- Carrington DM, Auffret A, Hanke DE. Polypeptide ligation occurs during post-translational modification of concanavalin A. Nature. 1985;313:64–67. - PubMed
-
- Ponting CP, Russell RB. Swaposins: circular permutations within genes encoding saposin homologues. Trends Biochem Sci. 1995;20:179–180. - PubMed
-
- Lindqvist Y, Schneider G. Circular permutations of natural protein sequences: structural evidence. Curr Opin Struct Biol. 1997;7:422–427. - PubMed
-
- Uliel S, Fliess A, Unger R. Naturally occurring circular permutations in proteins. Protein Eng. 2001;14:533–542. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous