Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering
- PMID: 38435522
- PMCID: PMC10906252
- DOI: 10.1021/acscentsci.3c01275
Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering
Abstract
Enzymes can be engineered at the level of their amino acid sequences to optimize key properties such as expression, stability, substrate range, and catalytic efficiency-or even to unlock new catalytic activities not found in nature. Because the search space of possible proteins is vast, enzyme engineering usually involves discovering an enzyme starting point that has some level of the desired activity followed by directed evolution to improve its "fitness" for a desired application. Recently, machine learning (ML) has emerged as a powerful tool to complement this empirical process. ML models can contribute to (1) starting point discovery by functional annotation of known protein sequences or generating novel protein sequences with desired functions and (2) navigating protein fitness landscapes for fitness optimization by learning mappings between protein sequences and their associated fitness values. In this Outlook, we explain how ML complements enzyme engineering and discuss its future potential to unlock improved engineering outcomes.
© 2024 The Authors. Published by American Chemical Society.
Conflict of interest statement
The authors declare no competing financial interest.
Figures




Similar articles
-
Active learning-assisted directed evolution.Nat Commun. 2025 Jan 16;16(1):714. doi: 10.1038/s41467-025-55987-8. Nat Commun. 2025. PMID: 39821082 Free PMC article.
-
Data-Driven Protein Engineering for Improving Catalytic Activity and Selectivity.Chembiochem. 2024 Feb 1;25(3):e202300754. doi: 10.1002/cbic.202300754. Epub 2023 Dec 11. Chembiochem. 2024. PMID: 38029350
-
High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering.Chem Commun (Camb). 2022 Feb 17;58(15):2455-2467. doi: 10.1039/d1cc04635g. Chem Commun (Camb). 2022. PMID: 35107442 Free PMC article. Review.
-
Machine learning-assisted enzyme engineering.Methods Enzymol. 2020;643:281-315. doi: 10.1016/bs.mie.2020.05.005. Epub 2020 Jun 12. Methods Enzymol. 2020. PMID: 32896285 Review.
-
Current status and emerging frontiers in enzyme engineering: An industrial perspective.Heliyon. 2024 Jun 7;10(11):e32673. doi: 10.1016/j.heliyon.2024.e32673. eCollection 2024 Jun 15. Heliyon. 2024. PMID: 38912509 Free PMC article. Review.
Cited by
-
Substrate Activation Efficiency in Active Sites of Hydrolases Determined by QM/MM Molecular Dynamics and Neural Networks.Int J Mol Sci. 2025 May 26;26(11):5097. doi: 10.3390/ijms26115097. Int J Mol Sci. 2025. PMID: 40507908 Free PMC article.
-
The Nobel Prize in Chemistry: past, present, and future of AI in biology.Commun Biol. 2024 Oct 29;7(1):1409. doi: 10.1038/s42003-024-07113-5. Commun Biol. 2024. PMID: 39472680 Free PMC article.
-
Computational stabilization of a non-heme iron enzyme enables efficient evolution of new function.bioRxiv [Preprint]. 2024 Jul 25:2024.04.18.590141. doi: 10.1101/2024.04.18.590141. bioRxiv. 2024. Update in: Angew Chem Int Ed Engl. 2025 Jan 10;64(2):e202414705. doi: 10.1002/anie.202414705. PMID: 39091854 Free PMC article. Updated. Preprint.
-
Ancestral Sequence Reconstruction for Designing Biocatalysts and Investigating their Functional Mechanisms.JACS Au. 2024 Oct 25;4(12):4571-4591. doi: 10.1021/jacsau.4c00653. eCollection 2024 Dec 23. JACS Au. 2024. PMID: 39735918 Free PMC article. Review.
-
DEKP: a deep learning model for enzyme kinetic parameter prediction based on pretrained models and graph neural networks.Brief Bioinform. 2025 Mar 4;26(2):bbaf187. doi: 10.1093/bib/bbaf187. Brief Bioinform. 2025. PMID: 40273427 Free PMC article.
References
-
- Bell E. L.; Finnigan W.; France S. P.; Green A. P.; Hayes M. A.; Hepworth L. J.; Lovelock S. L.; Niikura H.; Osuna S.; Romero E.; Ryan K. S.; Turner N. J.; Flitsch S. L. Biocatalysis. Nat. Rev. Methods Primer 2021, 1 (1), 46.10.1038/s43586-021-00044-z. - DOI
Publication types
LinkOut - more resources
Full Text Sources