Understanding the relationship between the primary structure of proteins and its propensity to be soluble on overexpression in Escherichia coli
- PMID: 15689506
- PMCID: PMC2279285
- DOI: 10.1110/ps.041009005
Understanding the relationship between the primary structure of proteins and its propensity to be soluble on overexpression in Escherichia coli
Abstract
Solubility of proteins on overexpression in Escherichia coli is a manifestation of the net effect of several sequence-dependent and sequence-independent factors. This study aims to delineate the relationship between the primary structure and solubility on overexpression. The amino acid sequences of proteins reported to be soluble or to form inclusion bodies on overexpression in E. coli under normal growth conditions were analyzed. The results show a positive correlation between thermostability and solubility of proteins, and an inverse correlation between the in vivo half-life of proteins and solubility. The amino acid (Asn, Thr, Tyr) composition and the tripeptide frequency of the protein were also found to influence its solubility on overexpression. The amino acids that were seen to be present in a comparatively higher frequency in inclusion body-forming proteins have a higher sheet propensity, whereas those that are seen more in soluble proteins have a higher helix propensity; this is indicative of a possible correlation between sheet propensity and inclusion body formation. Thus, the present analysis shows that thermostability, in vivo half-life, Asn, Thr, and Tyr content, and tripeptide composition of a protein are correlated to the propensity of a protein to be soluble on overexpression in E. coli. The precise mechanism by which these properties affect the solubility status of the overexpressed protein remains to be understood.
Similar articles
-
Correlation between the structural stability and aggregation propensity of proteins.In Silico Biol. 2007;7(2):225-37. In Silico Biol. 2007. PMID: 17688448
-
A support vector machine-based method for predicting the propensity of a protein to be soluble or to form inclusion body on overexpression in Escherichia coli.Bioinformatics. 2006 Feb 1;22(3):278-84. doi: 10.1093/bioinformatics/bti810. Epub 2005 Dec 6. Bioinformatics. 2006. PMID: 16332713
-
Inclusion body anatomy and functioning of chaperone-mediated in vivo inclusion body disassembly during high-level recombinant protein production in Escherichia coli.J Biotechnol. 2007 Jan 1;127(2):244-57. doi: 10.1016/j.jbiotec.2006.07.004. Epub 2006 Jul 16. J Biotechnol. 2007. PMID: 16945443
-
Predicting the solubility of recombinant proteins in Escherichia coli.Methods Mol Biol. 2015;1258:403-8. doi: 10.1007/978-1-4939-2205-5_23. Methods Mol Biol. 2015. PMID: 25447878 Review.
-
Optimization of culture parameters and novel strategies to improve protein solubility.Methods Mol Biol. 2015;1258:45-63. doi: 10.1007/978-1-4939-2205-5_3. Methods Mol Biol. 2015. PMID: 25447858 Review.
Cited by
-
Discrimination of soluble and aggregation-prone proteins based on sequence information.Mol Biosyst. 2013 Apr 5;9(4):806-11. doi: 10.1039/c3mb70033j. Epub 2013 Feb 25. Mol Biosyst. 2013. PMID: 23440081 Free PMC article.
-
Scoring function to predict solubility mutagenesis.Algorithms Mol Biol. 2010 Oct 7;5:33. doi: 10.1186/1748-7188-5-33. Algorithms Mol Biol. 2010. PMID: 20929563 Free PMC article.
-
A consensus-guided approach yields a heat-stable alkane-producing enzyme and identifies residues promoting thermostability.J Biol Chem. 2018 Jun 15;293(24):9148-9161. doi: 10.1074/jbc.RA117.000639. Epub 2018 Apr 9. J Biol Chem. 2018. PMID: 29632075 Free PMC article.
-
Questing functions and structures of hypothetical proteins from Campylobacter jejuni: a computer-aided approach.Biosci Rep. 2020 Jun 26;40(6):BSR20193939. doi: 10.1042/BSR20193939. Biosci Rep. 2020. PMID: 32458979 Free PMC article.
-
Solubility-Weighted Index: fast and accurate prediction of protein solubility.Bioinformatics. 2020 Sep 15;36(18):4691-4698. doi: 10.1093/bioinformatics/btaa578. Bioinformatics. 2020. PMID: 32559287 Free PMC article.
References
-
- Bertone, P., Kluger, Y., Lan, N., Zheng, D., Christendat, D., Yee, A., Edwards, A.M., Arrowsmith, C.H., Montelione, G.T., and Gerstein, M. 2001. SPINE: An integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics. Nucleic Acids Res. 29 2884–2898. - PMC - PubMed
-
- Boeggeman, E.E., Ramakrishnan, B., and Qasba, P.K. 2003. The N-terminal stem region of bovine and human β1,4-galactosyltransferase I increases the in vitro folding efficiency of their catalytic domain from inclusion bodies. Protein Expr. Purif. 30 219–229. - PubMed
-
- Carrio, M.M. and Villaverde, A. 2001. Protein aggregation as bacterial inclusion bodies is reversible. FEBS Lett. 489 29–33. - PubMed
-
- Cha, H.J., Wu, C.F., Valdes, J.J., Rao, G., and Bentley, W.E. 2000. Observations of green fluorescent protein as a fusion partner in genetically engineered Escherichia coli: Monitoring protein expression and solubility. Biotechnol. Bioeng. 67 565–574. - PubMed
-
- Chan, H.S. and Dill, K.A. 1994. Transition states and folding dynamics of proteins and heteropolymers. J. Chem. Phys. 100 9238–9257.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources