Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar 20;5(12):6487-6493.
doi: 10.1021/acsomega.9b04105. eCollection 2020 Mar 31.

Evaluating Protein Engineering Thermostability Prediction Tools Using an Independently Generated Dataset

Affiliations

Evaluating Protein Engineering Thermostability Prediction Tools Using an Independently Generated Dataset

Peishan Huang et al. ACS Omega. .

Abstract

Engineering proteins to enhance thermal stability is a widely utilized approach for creating industrially relevant biocatalysts. The development of new experimental datasets and computational tools to guide these engineering efforts remains an active area of research. Thus, to complement the previously reported measures of T 50 and kinetic constants, we are reporting an expansion of our previously published dataset of mutants for β-glucosidase to include both measures of T M and ΔΔG. For a set of 51 mutants, we found that T 50 and T M are moderately correlated, with a Pearson correlation coefficient and Spearman's rank coefficient of 0.58 and 0.47, respectively, indicating that the two methods capture different physical features. The performance of predicted stability using nine computational tools was also evaluated on the dataset of 51 mutants, none of which are found to be strong predictors of the observed changes in T 50, T M, or ΔΔG. Furthermore, the ability of the nine algorithms to predict the production of isolatable soluble protein was examined, which revealed that Rosetta ΔΔG, FoldX, DeepDDG, PoPMuSiC, and SDM were capable of predicting if a mutant could be produced and isolated as a soluble protein. These results further highlight the need for new algorithms for predicting modest, yet important, changes in thermal stability as well as a new utility for current algorithms for prescreening designs for the production of mutants that maintain fold and soluble production properties.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Structure of BglB (PDB ID: 2JIE) from the bacterium Paenibacillus polymyxa. PyMOL rendering of BglB showing the 28 sequence-positions (teal spheres) of the 51 mutants chosen out of the original 92 previously expressed proteins for the TM analysis. The reaction scheme of the hydrolysis of 4-nitrophenyl β-d-glucopyranoside by BglB used in the T50 study.
Figure 2
Figure 2
Comparison of two different experimental thermal stability datasets and experimentally derived ΔΔG. (A) Relationship for each mutant between T50 and TM. The PCC of 0.58 illustrates that the two methods are modestly positively correlated with mutations that are in the extreme ends of the temperature range (±5 °C). (B) Evaluation of ΔTM with experimentally derived ΔΔG shows the two qualities are highly correlated (PCC = −0.76), unlike (C) where the relationship between ΔT50 and experimentally derived ΔΔG has a PCC of −0.35.
Figure 3
Figure 3
Evaluation of the algorithms ΔTSE versus the experimentally derived ΔΔG and the TM and T50 datasets. The Pearson correlation coefficient and Spearman’s rank correlation for each performance against three types of experimental data were determined. Five representative comparisons are illustrated above, with four additional algorithms, SDM, AUTO-MUTE (DDG), AUTO-MUTE (ΔTM), and HoTMuSiC provided in Figure SI 1-3. No algorithm resulted in a significant correlation between the calculated energies and the observed TM, T50, or ΔΔG for this dataset.
Figure 4
Figure 4
Computational prediction for the effect on mutant soluble protein production using nine different algorithms. From left to right: Rosetta ΔΔG, FoldX PSSM, ELASPIC, DeepDDG, PoPMuSiC, SDM, AUTO-MUTE (DDG), AUTO-MUTE (ΔTM), and HoTMuSiC of soluble (green) and nonisolated protein (blue). In this case, mutants that resulted in a significant (>10-fold) decrease in yield of purified soluble protein are considered nonisolatable. Significance in population differences was determined using a Student’s t-test. The units (ΔTSE and ΔTM) of all algorithms are individually normalized between 1 to −1. For visualization purposes, outliers were omitted after normalization. Each graph without normalization and with outliers can be found in Figure SI 1-4 and all raw values in Figure SI 4.

References

    1. Iyer P. V.; Ananthanarayan L. Enzyme Stability and Stabilization-Aqueous and Non-Aqueous Environment. Process Biochem. 2008, 43, 1019–1032. 10.1016/j.procbio.2008.06.004. - DOI
    1. Turner P.; Mamo G.; Karlsson E. N. Potential and Utilization of Thermophiles and Thermostable Enzymes in Biorefining. Microb. Cell Fact. 2007, 6, 9.10.1186/1475-2859-6-9. - DOI - PMC - PubMed
    1. Ferdjani S.; Ionita M.; Roy B.; Dion M.; Djeghaba Z.; Rabiller C.; Tellier C. Correlation between Thermostability and Stability of Glycosidases in Ionic Liquid. Biotechnol. Lett. 2011, 33, 1215–1219. 10.1007/s10529-011-0560-5. - DOI - PubMed
    1. Xie Y.; An J.; Yang G.; Wu G.; Zhang Y.; Cui L.; Feng Y. Enhanced Enzyme Kinetic Stability by Increasing Rigidity within the Active Site. J. Biol. Chem. 2014, 289, 7994–8006. 10.1074/jbc.M113.536045. - DOI - PMC - PubMed
    1. Wu I.; Arnold F. H. Engineered Thermostable Fungal Cel6A and Cel7A Cellobiohydrolases Hydrolyze Cellulose Efficiently at Elevated Temperatures. Biotechnol. Bioeng. 2013, 110, 1874–1883. 10.1002/bit.24864. - DOI - PubMed