Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 2;4(1):112.
doi: 10.1038/s42004-021-00550-x.

Data-science driven autonomous process optimization

Affiliations

Data-science driven autonomous process optimization

Melodie Christensen et al. Commun Chem. .

Abstract

Autonomous process optimization involves the human intervention-free exploration of a range process parameters to improve responses such as product yield and selectivity. Utilizing off-the-shelf components, we develop a closed-loop system for carrying out parallel autonomous process optimization experiments in batch. Upon implementation of our system in the optimization of a stereoselective Suzuki-Miyaura coupling, we find that the definition of a set of meaningful, broad, and unbiased process parameters is the most critical aspect of successful optimization. Importantly, we discern that phosphine ligand, a categorical parameter, is vital to determination of the reaction outcome. To date, categorical parameter selection has relied on chemical intuition, potentially introducing bias into the experimental design. In seeking a systematic method for selecting a diverse set of phosphine ligands, we develop a strategy that leverages computed molecular feature clustering. The resulting optimization uncovers conditions to selectively access the desired product isomer in high yield.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Phosphine ligand influence on a palladium-catalyzed stereoselective Suzuki–Miyaura coupling to generate the stereoinversion product (2-Z) or stereoretention product (2-E).
aConditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 20 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.4 µmol Pd(ACN)2Cl2, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 25 °C. b,cConditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 11 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.2 µmol Pd(ACN)2Cl2, 0.4 µmol L, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 25 °C. Ligand structures are provided in Fig. 4. Tabulated results are provided in Supplementary Information Table SI-9.
Fig. 2
Fig. 2. Closed-loop system for autonomous optimization in batch.
The three main components to enable this closed loop include (1) ChemOS to coordinate experiments and data-driven approaches, (2) Chemspeed SWING for automated experimental setup, and (3) Agilent 1100 to characterize the experimental outcomes.
Fig. 3
Fig. 3. Standard experimental conditions for reproducibility testing.
Conditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 15 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.3 µmol Pd(ACN)2Cl2, 0.4 µmol L2, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 20 °C. Average yields for first set of replicates: 2-E: 30(±2)%; 2-Z: 19(±1)% and second set of replicates: 2-E: 28(±2)%; 2-Z: 17(±1)%. Tabulated results are provided in Supplementary Information Table SI-13.
Fig. 4
Fig. 4. Parameters and results of optimization with ligands selected through chemical intuition in campaign 1.
Conditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 10–20 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.1–0.5 µmol Pd(ACN)2Cl2, 0.05–2 µmol L, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 10–40 °C. Tabulated results are provided in Supplementary Information Table SI-10.
Fig. 5
Fig. 5. K-means clustering on the first four principal components of the molecular descriptor set for 365 commercial monodentate phosphines.
The chemical space is represented by a two-dimensional plot of the first two principal components. Each cluster is represented by color and highlighted boxes indicate selected ligands. Selected ligand structures are provided in Fig. 6.
Fig. 6
Fig. 6. Parameters and results of optimization with ligands selected through descriptor clustering in campaign 2.
Conditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 15 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.1–0.5 µmol Pd(ACN)2Cl2, 0.05–2 µmol L, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 10–40 °C. Tabulated results are provided in Supplementary Information Table SI-11. An animated chart of E-product yield over time is provided as Supplementary Movie 2.
Fig. 7
Fig. 7. Three-dimensional plot of three continuous parameter selections color coded for 2-E yield in campaign 2.
PhSPhos results are outlined; all other ligand results are not outlined.
Fig. 8
Fig. 8. Two-dimensional plots of 2-E yield for each experiment ID and average 2-E yield for each sample bias value in campaign 2.
a Green indicates positive, exploitative sample bias values, while purple indicates negative, explorative sample bias values. PhSPhos results are outlined; all other ligand results are not outlined. b Green indicates positive, exploitative sample bias values, while purple indicates negative, explorative sample bias values.
Fig. 9
Fig. 9. The maximum yield of 2-E obtained for each monodentate ligand explored in campaigns 1 and 2 mapped onto the chemical space of 365 commercial monodentate phosphines.
The chemical space is represented by a two-dimensional plot of the first two principal components. Color indicates the maximum yield of 2-E obtained for each evaluated monodentate phosphine ligand.
Fig. 10
Fig. 10. Manual experiments under optimized conditions with predicted E-selective ligands.
Conditions: 10 µmol 1-E, 1 µmol 1,3,5-trimethoxybenzene, 15 µmol (3-(benzyloxy)phenyl)boronic acid 3, 0.4 µmol Pd(ACN)2Cl2, 0.9 µmol L, 30 µmol K3PO4 (0.5 M aq) in ACN (0.05 M), 2 h at 35 °C. Comparisons of the predicted and experimental yields are provided in Supplementary Information Table SI-16.

References

    1. Jensen, K. F., Coley, C. W. & Eyke, N. S. Autonomous discovery in the chemical sciences part I: progress. Angew. Chem. Int. Ed. 59, 22858–22893 (2019). - PubMed
    1. Coley, C. W., Eyke, N. S. & Jensen, K. F. Autonomous discovery in the chemical sciences part II: outlook. Angew. Chem. Int. Ed. 59, 23414–23436 (2019). - PubMed
    1. Häse F, Roch LM, Aspuru-Guzik A. Next-generation experimentation with self-driving laboratories. Trends Chem. 2019;1:282–291. doi: 10.1016/j.trechm.2019.02.007. - DOI
    1. Stein HS, Gregoire JM. Progress and prospects for accelerating materials science with automated and autonomous workflows. Chem. Sci. 2019;10:9640–9649. doi: 10.1039/C9SC03766G. - DOI - PMC - PubMed
    1. Desai B, et al. Rapid discovery of a novel series of Abl kinase inhibitors by application of an integrated microfluidic synthesis and screening platform. J. Med. Chem. 2013;56:3033–3047. doi: 10.1021/jm400099d. - DOI - PubMed