Bridging Chemical Knowledge and Machine Learning for Performance Prediction of Organic Synthesis
- PMID: 36206170
- PMCID: PMC10099903
- DOI: 10.1002/chem.202202834
Abstract
Recent years have witnessed a boom in machine learning (ML) applications in chemistry, revealing the potential of data-driven prediction of synthesis performance. Digitalization and ML modelling are key strategies for fully exploiting the synergistic interplay between experimental data and the robust prediction of performance and selectivity. A series of studies has demonstrated the importance of implementing chemical knowledge in ML, which improves a model's capability to make predictions that are challenging and often beyond human ability. This Minireview summarizes the cutting-edge embedding techniques and model designs for synthetic performance prediction reported up to June 2022, elaborating how chemical knowledge can be incorporated into ML. By merging organic synthesis tactics and chemical informatics, we hope this Minireview can provide a guide map and encourage chemists to revisit the digitalization and computerization of organic chemistry principles.
Keywords: machine learning; molecular embedding; organic synthesis; performance prediction; synthetic dataset.
© 2022 The Authors. Chemistry - A European Journal published by Wiley-VCH GmbH.
Conflict of interest statement
The authors declare no conflict of interest.
Grants and funding
- 21873081, 22122109 and 22103070/National Natural Science Foundation of China
- SN-ZJU-SIAS-006/Zhejiang University Shanghai Institute for Advanced Study
- BNLMS202102/Beijing National Laboratory for Molecular Sciences
- PSFM 2021-01/Center of Chemistry for Frontier Technologies and Key Laboratory of Precise Synthesis of Functional Molecules of Zhejiang Province
- ZJUCEU2020007/State Key Laboratory of Clean Energy Utilization