Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep;119(9):2447-2458.
doi: 10.1002/bit.28132. Epub 2022 May 27.

Multi-information source Bayesian optimization of culture media for cellular agriculture

Affiliations

Multi-information source Bayesian optimization of culture media for cellular agriculture

Zachary Cosenza et al. Biotechnol Bioeng. 2022 Sep.

Abstract

Culture media used in industrial bioprocessing and the emerging field of cellular agriculture is difficult to optimize due to the lack of rigorous mathematical models of cell growth and culture conditions, as well as the complexity of the design space. Rapid growth assays are inaccurate yet convenient, while robust measures of cell number can be time-consuming to the point of limiting experimentation. In this study, we optimized a cell culture media with 14 components using a multi-information source Bayesian optimization algorithm that locates optimal media conditions based on an iterative refinement of an uncertainty-weighted desirability function. As a model system, we utilized murine C2C12 cells, using AlamarBlue, LIVE stain, and trypan blue exclusion cell counting assays to determine cell number. Using this experimental optimization algorithm, we were able to design media with 181% more cells than a common commercial variant with a similar economic cost, while doing so in 38% fewer experiments than an efficient design-of-experiments method. The optimal medium generalized well to long-term growth up to four passages of C2C12 cells, indicating the multi-information source assay improved measurement robustness relative to rapid growth assays alone.

Keywords: Bayesian optimization; Gaussian process; cellular agriculture; expected improvement; media optimization; multi-information source.

PubMed Disclaimer

Figures

Figure 1
Figure 1
BO algorithm. This loop describes the Bayesian Optimization algorithm to maximize some acquisition function α(x) for a process Y=f(X) given q0 high‐fidelity IS and qq0 low‐fidelity IS samples per batch of experiments. After each batch, the process is repeated until process is optimized or resources are exhausted. BO, Bayesian optimization
Figure 2
Figure 2
Simulation results. (a) Number of cumulative high‐fidelity simulations used plotted against average (with standard deviation for five runs of the entire optimization loop) optimal output from f across five sequential iterations of the optimization framework. The multi‐IS GP (solid) had access to q=15 total simulations with q0=2 high‐fidelity and qq0=13 low fidelity simulations per iteration (multi‐IS GP has stopped one iteration early to reduce computational burden). The regular GP (dotted) only had access to the q0=2 high‐fidelity simulations per iteration. Each test function {f1,f2,f3,f4} had two biased low‐fidelity versions whose correlations are described by plots (b). Squares and triangles represent a given fbias1 and fbias2 respectively. The solid line represents the underlying high fidelity IS f. Hyperparameter and acquisition function optimization was done using multistart L‐BFGS‐B implemented in botorch/scipy.
Figure 3
Figure 3
Learning curve and trade‐off curve of BO method. (a) Learning curve of D(x) shows BO and DOE method designing experiments over the course of the optimization study. The line and dots represent the high‐fidelity IS optima and designs, the dashed and dotted lines represent the DMEM control and DOE optima values for D(x) respectively. Each IS experiment is shown in (b) the trade‐off curve indicating a clear tradeoff between cost c(x) and cell number y(x), where the dots, triangles, squares, and “x's” represent Passage 2, Passage 1, AlamarBlue, and Live Stain respectively. (c) Simulated trade‐off curve also shown for high‐fidelity IS also showing a predicted parabolic relationship between competing objectives y(x) and c(x). BO, Bayesian optimization; DMEM, Dulbecco's modified Eagle's medium; DOE, design‐of‐experiments
Figure 4
Figure 4
Learned optimal concentration. The conditions of each experiment (concentration ranges in Table 1) are shown plotted as a function of the cumulative number of experiments in the BO (circle) and DOE (box) study. The moving average (solid and dashed line for BO and DOE respectively) shows how each method searches for optimal concentrations. The horizontal line represents the final BO optimal concentration. BO, Bayesian optimization; DOE, design‐of‐experiments
Figure 5
Figure 5
Long‐term validation of optima media. The optimal BO‐designed (dots), DOE (triangles), and DMEM control (squares) media performance up to Passage 4 Each passage was 72 h of growth at 37°C and 5% CO2. Trypsinization took place after each 72 h period to count cells and replate them to allow for further growth (standard deviations indicated). The BO method designed an optimal media with substantially improved long‐term growth capacity than the DOE or DMEM control. BO, Bayesian optimization; DMEM, Dulbecco's modified Eagle's medium; DOE, design‐of‐experiments
Figure 6
Figure 6
Predicted first and second‐order effects. First‐order predicted effects of each component of the high‐fidelity IS are shown on diagonal plots (y‐axis is not to scale) with solid and dashed lines representing predicted cell number y(x) and desirability D(x) respectively. The “above” diagonal plots are second‐order plots for cell number y(x) and “below” are those for desirability D(x). The range of all components as described in Table 1. Labels are left off for clarity; to find the axis labels read the x‐axis labels horizontal from the diagonal label and read the y‐axis labels vertical from the label.
Figure 7
Figure 7
Variogram sensitivity analysis. The local (horizontal hatching), global (diagonal hatching), and mid‐range sensitivity of each component on D(x) is indicated by the height of the bars. Albumin, FBS, dexamethasone, and glutamine have the largest effect on D(x), with FBS being by far the most critical component with respect to global sensitivity. Predicted variogram γi for each component was formed from R=300 random samples from domain [0,1]. FBS, fetal bovine serum.
Figure 8
Figure 8
Kernel plots and IS distributions. (a) and (b) Show the output of the kernel Σ(xm,xm) for all data collected {XN,YN} and a simulated data set where only xFBS is varied from [0,1], respectively. Darker regions indicate large values of Σ, and thus a correlation. Also (c) the various IS cell number/correlate distributions (diagonal histograms) are shown. Above the diagonal (squares) are the actual inter‐IS correlations for each IS with their respective R 2 values, and below the diagonal (circles) are the predicted inter‐IS correlations for a random data set.

References

    1. Akteke‐Ozturk, B. , Koksal, G. , & Weber, G. W. (2018). Nonconvex optimization of desirability functions. Quality Engineering, 30(2), 293–310. 10.1080/08982112.2017.1315136 - DOI
    1. Ankenman, B. , Nelson, B. L. , & Staum, J. (2010). Stochastic kriging for simulation metamodeling. Operations Research, 58(2), 371–382. 10.1287/opre.1090.0754 - DOI
    1. Belakaria, S. , Deshwal, A. , & Doppa, J. R. (2019). Max‐value entropy search for multi‐objective Bayesian optimization. Advances in Neural Information Processing Systems, 32(NeurIPS)​.
    1. Brunner, D. , Appl, H. , Pfaller, W. , & Gstraunthaler, G. (2010). Serum‐free cell culture: the serum‐free media interactive online. Database, 27(December 2009), 53–62. - PubMed
    1. Coleman, M. C. , & Block, D. E. (2006). Bayesian parameter estimation with informative priors for nonlinear systems, AIChE Journal, 52(2), 651–667. 10.1002/aic.10667 - DOI

Publication types

LinkOut - more resources