Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 13;10(1):5150.
doi: 10.1038/s41467-019-13189-z.

Towards a fully automated algorithm driven platform for biosystems design

Affiliations

Towards a fully automated algorithm driven platform for biosystems design

Mohammad HamediRad et al. Nat Commun. .

Abstract

Large-scale data acquisition and analysis are often required in the successful implementation of the design, build, test, and learn (DBTL) cycle in biosystems design. However, it has long been hindered by experimental cost, variability, biases, and missed insights from traditional analysis methods. Here, we report the application of an integrated robotic system coupled with machine learning algorithms to fully automate the DBTL process for biosystems design. As proof of concept, we have demonstrated its capacity by optimizing the lycopene biosynthetic pathway. This fully-automated robotic platform, BioAutomata, evaluates less than 1% of possible variants while outperforming random screening by 77%. A paired predictive model and Bayesian algorithm select experiments which are performed by Illinois Biological Foundry for Advanced Biomanufacturing (iBioFAB). BioAutomata excels with black-box optimization problems, where experiments are expensive and noisy and the success of the experiment is not dependent on extensive prior knowledge of biological mechanisms.

PubMed Disclaimer

Conflict of interest statement

HamediRad and Chao are co-founders of LifeFoundry, which aims to develop automated workflows for pathway engineering and metabolic engineering. All the other authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The overall workflow of BioAutomata. After setting the initial parameters, designing the sequence space of variable regions (such as promoter variants in a combinatorial pathway assembly), and defining the objective function, BioAutomata selects which experiments are expected to result in the highest improvement of yield, performs those experiments, generates data and learns from it, updating its predictive model given the newly presented evidence. It will then decide on the next experiments to perform to reach the goal set by the user while trying to minimize the number of experiments and the cost of the project
Fig. 2
Fig. 2
Testing Bayesian optimization by finding the maximum of a two-dimensional function. a The acquisition function decides the next input to test and the output is used to refine the predictive model. Iterations 5, 10, 15 and 20 of this process are shown. b With increasing rounds of iteration, the predictive model grows more confident of the location of the global maximum and the distance between tested inputs decreases with each iteration. c The algorithm evaluated 9 points before finding the location of the maximum. Subsequent iterations tuned this approximation toward the true optimum. The algorithm evaluated 12 points before finding the maximum. The order in which each point is evaluated is shown on the graph
Fig. 3
Fig. 3
Change in sampling behavior of Baysesian optimization of the lycopene production pathway. In the first round (a), all points were chosen to uniformally explore the landscape since it is completely unexplored and unknown (n = 46). In the second round (b), some information is acquired and the points picked by the algorithm are clearly skewed from the unifrom distribution (n = 45). However, since there is some uncertainty, it is still exploring the landscape. Finally, in the third round (c), a clear pattern is observed where the algorithm has determined the points in a particular area are more likely to be closer to the global optima and is actively exploring that area but still doing some minimal exploration (n = 45). Source data are provided as a Source Data file
Fig. 4
Fig. 4
Lycopene production in different rounds of pathway optimization and random screening. The average and maximum points have increased after each round of pathway optimization. Moreover, although the average and maximum of evaluating 46 and 136 random points are a little more than the uniform distribution in round 1, they are significantly lower than the points picked by the algorithm in the subsequent rounds. The boxes of the plots contain data within the interquartile range (IQR), while whiskers spread from the boxes to 1.5 times IQR. The center line in the boxes is the median of the data and points above the whiskers are values which are higher than 1.5 times IQR above the third quartile. n = 46, 45, 45, 46 and 136 for each plot, respectively. Source data are provided as a Source Data file
Fig. 5
Fig. 5
The overall fully automated pathway optimization workflow. The machine learning algorithm picks the plasmids to be assembled and returns the list to iBioFAB to perform the assembly. The assembled products are then transformed, and four single colonies are isolated for lycopene quantification and OD measurement. The resulting data are then given to the machine learning algorithm to pick the next set of points to be evaluated

References

    1. Bornscheuer UT, et al. Engineering the third wave of biocatalysis. Nature. 2012;485:185–194. doi: 10.1038/nature11117. - DOI - PubMed
    1. Nielsen J, Keasling JD. Engineering cellular metabolism. Cell. 2016;164:1185–1197. doi: 10.1016/j.cell.2016.02.004. - DOI - PubMed
    1. Chao R, Mishra S, Si T, Zhao H. Engineering biological systems using automated biofoundries. Metab. Eng. 2017;42:98–108. doi: 10.1016/j.ymben.2017.06.003. - DOI - PMC - PubMed
    1. Du J, Shao Z, Zhao H. Engineering microbial factories for synthesis of value-added products. J. Ind. Microbiol. Biotechnol. 2011;38:873–890. doi: 10.1007/s10295-011-0970-3. - DOI - PMC - PubMed
    1. Liu Y, Shin H, Li J, Liu L. Toward metabolic engineering in the context of system biology and synthetic biology: advances and prospects. Appl. Microbiol. Biotechnol. 2015;99:1109–1118. doi: 10.1007/s00253-014-6298-y. - DOI - PubMed

Publication types