Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 9;9(2):307-317.
doi: 10.1021/acscentsci.2c01042. eCollection 2023 Feb 22.

Open-Source Chromatographic Data Analysis for Reaction Optimization and Screening

Affiliations

Open-Source Chromatographic Data Analysis for Reaction Optimization and Screening

Christian P Haas et al. ACS Cent Sci. .

Abstract

Automation and digitalization solutions in the field of small molecule synthesis face new challenges for chemical reaction analysis, especially in the field of high-performance liquid chromatography (HPLC). Chromatographic data remains locked in vendors' hardware and software components, limiting their potential in automated workflows and data science applications. In this work, we present an open-source Python project called MOCCA for the analysis of HPLC-DAD (photodiode array detector) raw data. MOCCA provides a comprehensive set of data analysis features, including an automated peak deconvolution routine of known signals, even if overlapped with signals of unexpected impurities or side products. We highlight the broad applicability of MOCCA in four studies: (i) a simulation study to validate MOCCA's data analysis features; (ii) a reaction kinetics study on a Knoevenagel condensation reaction demonstrating MOCCA's peak deconvolution feature; (iii) a closed-loop optimization study for the alkylation of 2-pyridone without human control during data analysis; (iv) a well plate screening of categorical reaction parameters for a novel palladium-catalyzed cyanation of aryl halides employing O-protected cyanohydrins. By publishing MOCCA as a Python package with this work, we envision an open-source community project for chromatographic data analysis with the potential of further advancing its scope and capabilities.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Proposed analytical workflow starting with HPLC systems controlled by vendor-specific software. HPLC–DAD raw data are exported in nonproprietary and open data formats, preferably, a metadata-enriched standardized format (Allotrope) implementing FAIR data principles. After parsing in Python, HPLC–DAD data sets are analyzed in context to each other by MOCCA. From the analysis results, structured data sets are generated for data-based decision making.
Figure 2
Figure 2
Summary of the data analysis features implemented in MOCCA.
Scheme 1
Scheme 1. Knoevenagel Condensation Reactions of Benzaldehyde (1a), 4-Methoxybenzaldehyde (1b) and 4-(Dimethylamino)benzaldehyde (1c) with Malononitrile (2) in Methanol (MeOH) to Yield Benzylidenemalononitriles 3ac
Figure 3
Figure 3
(a) Results of the competition experiment with two benzaldehydes (1a and 1b). (b) Results of the competition experiment with three benzaldehydes (1ac). Top: Chromatographic signals of the benzaldehydes using different gradient lengths. MOCCA indicates results of purity checks (green passed, red failed) and centers of retention profiles modeled by the deconvolution algorithm (vertical black dashed lines). Bottom: Deconvolution results of the overlapping signal recorded with a gradient length of 0.5 min. The modeled retention profiles (left, colored lines) described the retention profile of the impure peak (black dashed line). The modeled UV–Vis traces (right, colored lines) correspond to the UV–Vis spectra of the benzaldehydes as exemplified for 1a (black dashed line).
Figure 4
Figure 4
Second-order reaction kinetics plots of benzaldehyde (1a) in the Knoevenagel condensation recorded with five different HPLC methods employing varying gradient lengths. (a) Competition experiment with two benzaldehydes (1a and 1b). (b) Competition experiment with with three benzaldehydes (1ac).
Figure 5
Figure 5
Closed-loop optimization cycle employed in this work. Blue: Experimental Design via Bayesian Optimization (EDBO) Python package from the Doyle group and translation of the suggested parameters to a LabVIEW experimental protocol. Yellow: Experimental execution by a microfluidic reactor platform employing an oscillatory droplet reactor design. 0.02 μL HPLC samples are taken out of the droplet after diltution with acetonitrile. Green: HPLC system with a photodiode array detector (DAD) and an automated HPLC–DAD raw data export routine. Red: Data analysis by the MOCCA tool and a project-specific script for the extraction of objective values and process control values.
Scheme 2
Scheme 2. Optimization Campaign on the Alkylation of 2-Pyridone (4) with 1-Iodobutane (5) Yielding 1-Butylpyridone (6)
The domain space of the optimization campaign spans over two continuous variables, reaction time (low boundary: 10 min, high boundary: 60 min), and temperature (low boundary: 35 °C, high boundary: 100 °C), as well as two categorical variables, identity of base (DBU, TMG, DIPEA) and solvent (DMF, toluene, n-butanol). The objective value of the optimization is the yield of 6. Two side products were identified with 2-butoxypyridine (7) and butylated DBU (8).
Figure 6
Figure 6
Results of the closed-loop optimization on the alkylation of 2-pyridone (4). (a) Objective values as a function of optimizer choices in each round. Top: Objective value (yield of 6) with marker shape indicating the chosen solvent and marker color indicating the chosen base; middle: Chosen reaction temperature; bottom: Chosen reaction time. (b) Chromatogram of the reaction under optimal conditions with an impure product peak (∼1.7 min). (c) Modeled retention profiles (dashed line: impure peak) and UV–Vis spectra (dashed line: reference UV–Vis spectrum of (6) of the product 6 (yellow) and the unexpected impurity 8 (blue).
Scheme 3
Scheme 3. (a) Palladium-Catalyzed Cyanation of Aryl Chlorides Developed by Guimond et al. Based on the Slow Addition of Acetone Cyanohydrin via Syringe Pump. (b) Newly Developed Cyanation Method Using Protected Cyanohydrins (PG: protecting group) for in Situ Release of Cyanide
Figure 7
Figure 7
(a) Reaction conditions for the well plate screening of the cyanation of 2-chlorotoluene (9) yielding o-tolunitrile (11) using palladium(π-cinnamyl) chloride dimer as the catalyst precursor. (b) Screened O-protected cyanohydrins. (c) Screened bases. (d) Screened ligands. (e) Yield of o-tolunitrile (11) in dependency on the employed protected cyanide-releasing agent 10 as well as the chosen ligand and base. CPME: cyclopentyl methyl ether; Bz: benzoyl group; Ac: acetyl group; TMS: trimethylsilyl group.

Similar articles

Cited by

References

    1. Andersson S.; Armstrong A.; Björe A.; Bowker S.; Chapman S.; Davies R.; Donald C.; Egner B.; Elebring T.; Holmqvist S.; Inghardt T.; Johannesson P.; Johansson M.; Johnstone C.; Kemmitt P.; Kihlberg J.; Korsgren P.; Lemurell M.; Moore J.; Pettersson J. A.; Pointon H.; Pontén F.; Schofield P.; Selmi N.; Whittamore P. Making Medicinal Chemistry More Effective-Application of Lean Sigma to Improve Processes, Speed and Quality. Drug Discovery Today 2009, 14 (11–12), 598–604. 10.1016/j.drudis.2009.03.005. - DOI - PubMed
    1. Whitesides G. M. Reinventing Chemistry. Angew. Chemie - Int. Ed. 2015, 54 (11), 3196–3209. 10.1002/anie.201410884. - DOI - PubMed
    1. Matlin S. A.; Mehta G.; Hopf H.; Krief A. One-World Chemistry and Systems Thinking. Nat. Chem. 2016, 8 (5), 393–398. 10.1038/nchem.2498. - DOI - PubMed
    1. Marion P.; Bernela B.; Piccirilli A.; Estrine B.; Patouillard N.; Guilbot J.; Jérôme F. Sustainable Chemistry: How to Produce Better and More from Less?. Green Chem. 2017, 19 (21), 4973–4989. 10.1039/C7GC02006F. - DOI
    1. Keijer T.; Bakker V.; Slootweg J. C. Circular Chemistry to Enable a Circular Economy. Nat. Chem. 2019, 11 (3), 190–195. 10.1038/s41557-019-0226-9. - DOI - PubMed