. 2022 Nov 3:13:1029167.

doi: 10.3389/fimmu.2022.1029167. eCollection 2022.

Moving the needle: Employing deep reinforcement learning to push the boundaries of coarse-grained vaccine models

Jonathan G Faris¹, Daniel Orbidan¹, Charles Wells², Brenden K Petersen³, Kayla G Sprenger¹

Affiliations

¹ Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, United States.
² Department of Computer Science, Rice University, TX, Houston, United States.
³ Computational Engineering Division, Lawrence Livermore National Laboratory, Livermore, CA, United States.

PMID: 36405722
PMCID: PMC9670804
DOI: 10.3389/fimmu.2022.1029167

Moving the needle: Employing deep reinforcement learning to push the boundaries of coarse-grained vaccine models

Jonathan G Faris et al. Front Immunol. 2022.

. 2022 Nov 3:13:1029167.

doi: 10.3389/fimmu.2022.1029167. eCollection 2022.

Authors

Jonathan G Faris¹, Daniel Orbidan¹, Charles Wells², Brenden K Petersen³, Kayla G Sprenger¹

Affiliations

¹ Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, United States.
² Department of Computer Science, Rice University, TX, Houston, United States.
³ Computational Engineering Division, Lawrence Livermore National Laboratory, Livermore, CA, United States.

PMID: 36405722
PMCID: PMC9670804
DOI: 10.3389/fimmu.2022.1029167

Abstract

Highly mutable infectious disease pathogens (hm-IDPs) such as HIV and influenza evolve faster than the human immune system can contain them, allowing them to circumvent traditional vaccination approaches and causing over one million deaths annually. Agent-based models can be used to simulate the complex interactions that occur between immune cells and hm-IDP-like proteins (antigens) during affinity maturation-the process by which antibodies evolve. Compared to existing experimental approaches, agent-based models offer a safe, low-cost, and rapid route to study the immune response to vaccines spanning a wide range of design variables. However, the highly stochastic nature of affinity maturation and vast sequence space of hm-IDPs render brute force searches intractable for exploring all pertinent vaccine design variables and the subset of immunization protocols encompassed therein. To address this challenge, we employed deep reinforcement learning to drive a recently developed agent-based model of affinity maturation to focus sampling on immunization protocols with greater potential to improve the chosen metrics of protection, namely the broadly neutralizing antibody (bnAb) titers or fraction of bnAbs produced. Using this approach, we were able to coarse-grain a wide range of vaccine design variables and explore the relevant design space. Our work offers new testable insights into how vaccines should be formulated to maximize protective immune responses to hm-IDPs and how they can be minimally tailored to account for major sources of heterogeneity in human immune responses and various socioeconomic factors. Our results indicate that the first 3 to 5 immunizations, depending on the metric of protection, should be specially tailored to achieve a robust protective immune response, but that beyond this point further immunizations require only subtle changes in formulation to sustain a durable bnAb response.

Keywords: HIV - human immunodeficiency virus; affinity maturation; agent-based modelling; deep reinforcement learning (Deep RL); immunovirology; multiscale (MS) modelling; vaccine design protocol.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Overview of how deep reinforcement learning (DRL) is coupled to the agent-based model of affinity maturation (AM) in the current work. The *DRL agent* chooses an *action* (here, the level of vaccine-imposed frustration, modulated by changing antigen sequence and/or concentration) that is fed into the *environment* (here, the AM process), after which the agent observes the *state* of the system (here, properties of the vaccine-induced memory B cell receptor/antibody population). The agent then receives a *reward* based on how well the observed state meets a user-defined reward metric (here, the quality and quantity of the resulting plasma B cell receptor/antibody population). Over time, the agent learns a robust mapping between the states and actions, leading to an optimal *policy* (here, a vaccination/temporal frustration protocol) for maximizing the chosen reward.

**Figure 2**
Broad overview of the affinity maturation (AM) process by which antibodies (Abs) evolve against vaccine-candidate antigens (Ags) in a germinal center (GC) reaction (see text for details). Here, we administered between one and ten total immunizations, N _T , of a single Ag, varying only the Ag concentration, c _Ag , in each immunization (indicated by the different shades of the Ags).

**Figure 3**
Convergence profiles of **(A)** the level of frustration imposed on GC reactions in the first vaccine immunization, as chosen by the DRL agent, and **(B, C)** the corresponding scaled reward values obtained by the DRL agent. Results are shown in blue for the bnAb titers reward function, and in pink for the bnAb fraction reward function. The data shown represents a rolling average with 500 DRL steps for both the mean and standard deviation (shown in lighter colors).

**Figure 4**
Convergence of frustration values F ₁ (green), F ₂ (orange), F ₃ (purple), and F ₄ (pink) for **(A)** the bnAb titers reward function (RF) and **(B)** the bnAb fraction RF, across four total immunizations; **(C)** bnAb titers produced per successful GC (out of n=100 GCs) for both RFs after each of the four immunizations; **(D)** number of successfully terminating GCs after each immunization for both RFs; **(E)** distribution of bnAb titer response for a given F _i after the four immunizations; and **(F)** fractional bnAb response for a given F _i after the four immunizations. In **(A, B)**, the rolling average +/- the rolling standard deviation is plotted using 500 DRL steps. In **(C, D)**, error bars are +/- the standard deviation of the respective metric (bnAb titers/successful GC and total successful GCs, respectively). In **(E, F)**, the y-values shown are the responses after administering all four immunizations; that is, the y-values are plotted as (F _i∈4, ℛ ₄) for ℛ ₄≡ total bnAb titers and bnAb fraction, respectively.

**Figure 5**
Frustration values of a given immunization, i , out of a total number of immunizations, N _T , where the column corresponds to the i ^th immunization, and the row corresponds to the total immunizations for both the **(A)** bnAb titers reward function and **(B)** bnAb fraction reward function; the average frustration for a given immunization, F _i , averaged across all N _T for **(C)** the bnAb titers reward function (purple) and **(D)** the bnAb fraction reward function (pink), with their respective Michaelis-Menten saturation fits (red) +/- the standard deviation (gray); the average reward for a given immunization, ℛ _i , for the bnAb titers reward function (E, blue) and bnAb fraction reward function (F, green), with their respective fits (red) and standard deviation of the fit (gray). In **(C–F)**, the error bars represent the standard deviation. Fitted parameters for the Michaelis-Menten model are shown in **(C–F)**.

See this image and copyright information in PMC

Cited by

Broadly neutralizing antibodies for HIV prevention: a comprehensive review and future perspectives.
Mahomed S. Mahomed S. Clin Microbiol Rev. 2024 Jun 13;37(2):e0015222. doi: 10.1128/cmr.00152-22. Epub 2024 Apr 30. Clin Microbiol Rev. 2024. PMID: 38687039 Free PMC article. Review.
Retrospective SARS-CoV-2 human antibody development trajectories are largely sparse and permissive.
Kirby MB, Petersen BM, Faris JG, Kells SP, Sprenger KG, Whitehead TA. Kirby MB, et al. Proc Natl Acad Sci U S A. 2025 Jan 28;122(4):e2412787122. doi: 10.1073/pnas.2412787122. Epub 2025 Jan 22. Proc Natl Acad Sci U S A. 2025. PMID: 39841142 Free PMC article.
The application of machine learning in clinical microbiology and infectious diseases.
Xu C, Zhao LY, Ye CS, Xu KC, Xu KY. Xu C, et al. Front Cell Infect Microbiol. 2025 May 1;15:1545646. doi: 10.3389/fcimb.2025.1545646. eCollection 2025. Front Cell Infect Microbiol. 2025. PMID: 40375898 Free PMC article. Review.
Deep learning in next-generation vaccine development for infectious diseases.
Bhattacharya M, Lo YH, Chatterjee S, Das A, Wen ZH, Chakraborty C. Bhattacharya M, et al. Mol Ther Nucleic Acids. 2025 Jun 4;36(3):102586. doi: 10.1016/j.omtn.2025.102586. eCollection 2025 Sep 9. Mol Ther Nucleic Acids. 2025. PMID: 40641804 Free PMC article. Review.

References

1. Vaccine-preventable diseases. Health.mil; (2020). Available at: https://www.health.mil/Military-Health-Topics/Health-Readiness/Immunizat....
1. HIV/AIDS . Available at: https://www.who.int/data/gho/data/themes/hiv-aids.
1. WHO . WHO coronavirus (COVID-19) dashboard with vaccination data. Available at: https://covid19.who.int/.
1. History of 1918 flu pandemic. In: Pandemic influenza (Flu) (2018). CDC. Available at: https://www.cdc.gov/flu/pandemic-resources/1918-commemoration/1918-pande....
1. Janeway CJ, Travers P, Walport M. Pathogens have evolved various means of evading or subverting normal host defenses. Immunobiology: Immune System Health Dis 5th edition (2001). Available at: https://www.ncbi.nlm.nih.gov/books/NBK27176/

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Moving the needle: Employing deep reinforcement learning to push the boundaries of coarse-grained vaccine models

Affiliations

Moving the needle: Employing deep reinforcement learning to push the boundaries of coarse-grained vaccine models

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Medical