Adaptive Machine Learning Framework for Optimizing the Affinity Purification of Adeno-Associated Viral Vectors
- PMID: 41553185
- DOI: 10.1002/bit.70159
Adaptive Machine Learning Framework for Optimizing the Affinity Purification of Adeno-Associated Viral Vectors
Abstract
Adeno-associated viral (AAV) vectors for gene therapy are becoming integral to modern medicine, providing therapeutic options for diseases once deemed incurable. Currently, viral vector purification is a critical bottleneck in the gene therapy industry, impacting product efficacy and safety as well as accessibility and cost to patients. Traditional methods for improving viral vector purity are resource-intensive and often fail to adjust the purification process parameters to maximize the resulting product yield and quality. To address this challenge, we developed a machine learning framework that leverages Bayesian optimization to systematically refine affinity chromatography parameters (sample load, flow rate, and the formulation of chromatographic media) to improve AAV purification. The efficiency of this closed-loop workflow in iteratively optimizing the vector's yield, purity, and transduction efficiency was demonstrated by purifying clinically relevant serotypes AAV2, AAV5, AAV6, and AAV9 from HEK293 cell lysates using the affinity adsorbent AvXcel. We show that in three (or fewer) cycles of Bayesian optimization, we elevated yields from a baseline of 70% to a remarkable 97%-99%, while reducing host cell impurities by 230- to 400-fold across all serotypes. Performing the purification process with optimized parameters consistently produced vectors with high purity and preserved high transduction activity, essential for therapeutic efficacy and safety, demonstrating the applicability of the framework across multiple serotypes-a key challenge in AAV manufacturing. This study represents the first reported application of closed-loop, data-driven Bayesian optimization for enhancing AAV productivity and quality at the affinity capture step, with demonstrated transferability of historical purification data and process knowledge. The proposed adaptive machine learning framework is efficient and applicable across serotypes, enabling rapid process development, reduced costs, and advancing the accessibility and clinical translation of AAV-based gene therapies.
Keywords: Bayesian optimization; Gaussian process; bioseparations; closed‐loop process development; gene therapy.
© 2026 The Author(s). Biotechnology and Bioengineering published by Wiley Periodicals LLC.
Update of
-
Adaptive Machine Learning Framework enables Unprecedented Yield and Purity of Adeno-Associated Viral Vectors for Gene Therapy.bioRxiv [Preprint]. 2025 May 24:2025.05.23.655859. doi: 10.1101/2025.05.23.655859. bioRxiv. 2025. Update in: Biotechnol Bioeng. 2026 Jan 19. doi: 10.1002/bit.70159. PMID: 40475650 Free PMC article. Updated. Preprint.
References
-
- Au, H. K. E., M. Isalan, and M. Mielcarek. 2022. “Gene Therapy Advances: A Meta‐Analysis of AAV Usage in Clinical Settings.” Frontiers in Medicine 8: 809118.
-
- BioSpace. 2019. “AveXis Data Reinforce Effectiveness of Zolgensma® in Treating Spinal Muscular Atrophy (SMA) Type 1.” Published May 7. https://www.biospace.com/avexis-data-reinforce-effectiveness-of-zolgensm....
-
- Behere, K., and S. Yoon. 2020. “Chromatography Bioseparation Technologies and In‐Silico Modelings for Continuous Production of Biotherapeutics.” Journal of Chromatography A 1627: 461376.
-
- Bernau, C. R., M. Knödler, J. Emonts, R. C. Jäpel, and J. F. Buyel. 2022. “The Use of Predictive Models to Develop Chromatography‐Based Purification Processes.” Frontiers in Bioengineering and Biotechnology 10: 1009102.
-
- Boelrijk, J., B. Ensing, P. Forré, and B. W. J. Pirok. 2023. “Closed‐Loop Automatic Gradient Design for Liquid Chromatography Using Bayesian Optimization.” Analytica Chimica Acta 1242: 340789.
Grants and funding
LinkOut - more resources
Full Text Sources
