Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Dec;18(12):e70428.
doi: 10.1111/cts.70428.

AAV Gene Therapy Drug Development and Translation of Engineered Ocular and Neurotropic Capsids: A Systematic Review Using Natural Language Processing

Affiliations

AAV Gene Therapy Drug Development and Translation of Engineered Ocular and Neurotropic Capsids: A Systematic Review Using Natural Language Processing

Chinaza Agbim et al. Clin Transl Sci. 2025 Dec.

Abstract

Natural AAV serotypes often lack specificity and efficiency, leading to off-target effects and a low therapeutic index. To overcome these limitations of naturally occurring serotypes, there has been a keen interest in the field to engineer novel capsids to enhance tissue and cell-specific targeting, resulting in a high number of published literature reports over the past few years. To ensure a systematic review and illustrate advances in engineered capsids that enhance specificity and efficiency, we used Natural Language Processing with Linguamatics i2E to identify neurotropic and ocular AAV capsids tested in non-human primates. By querying PubMed abstracts for specific mentions of AAVs, administration routes, and organ/tissue/species, we obtained 5907 hits, refined through an optimized process to 36 relevant and unique abstracts. Notable findings include numerous novel capsids summarized by route of administration: (1) systemic administration, targeting the central nervous system (e.g., AAV-PHP.eB, AAV X1.1, and AAV.PAL2), (2) direct central nervous system injection (e.g., AAV2.Retro, Olig001, and AAV2.1A), and (3) ocular administration (e.g., AAV.44.9 (E531D), rAAV2tYF, and Anc80L65). Such engineered capsids exhibit enhanced tissue specificity, improved pharmacokinetics and pharmacodynamics, or reduced off-target effects compared to the parent serotypes. Our study provides insight into state-of-the-art translational and drug-development considerations for engineered neurotropic and ocular capsids. We also highlight the effectiveness of Natural Language Processing and Large Language Models as tools in identifying and characterizing engineered neurotropic and ocular AAV capsids to summarize this rapidly growing class of drugs and area of therapeutics.

Keywords: CNS; adeno‐associated virus; drug development; engineered capsids; gene therapy; natural language processing; ocular.

PubMed Disclaimer

Conflict of interest statement

At the time of this research, M.W., C.K., E.M.‐R., S.S., and M.M. were salaried employees of Genentech Inc. (a member of the Roche Group) and were stockholders in Roche. C.A. declares no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Summarized NLP workflow. NLP, natural language processing, LLM, large language model (1) NLP Criteria—The NLP software searches PubMed abstracts with the following criteria: [AAV] AND [Route of Administration] AND [Organ OR Tissue OR Species]. (2) This workflow produced 5907 hits, where each hit is an instance of the previous criteria within 150 words and/or 5 sentences. AAVs mentioned in the same abstract were typically grouped. (3) The data was further filtered into unique abstracts by removing duplicates, and then filtered by any mentions of an NHP species in its respective column, yielding 458 unique abstracts. (4a) The NLP‐generated data were filtered by researcher‐identified engineered AAV capsids. (5a) Remaining abstracts were finally filtered by NLP‐generated data in the tissue column, producing a total of 33 abstracts fitting the filtration criteria. (4b) The NHP abstracts were revisited, and the exact engineered AAV capsids mentioned in the abstracts were manually identified, producing 28 abstracts in addition to those from 4A. (5b) Manually identified abstracts and NLP‐generated data abstracts were combined into 65 total engineered capsid abstracts relating to NHPs. (6) The full‐text papers from all 65 engineered capsid abstracts were manually screened to identify papers that targeted the CNS and the eye. (7) These papers were finally filtered to ensure that they injected engineered capsids into NHPs and to remove cases where NHPs were simply mentioned. Genentech's internal LLM software aided in producing the data in steps 4B and 7 through the identification of engineered capsids and the validation of papers fitting all the criteria.
FIGURE 2
FIGURE 2
NLP generated results.

References

    1. Wang J.‐H., Gessler D. J., Zhan W., Gallagher T. L., and Gao G., “Adeno‐Associated Virus as a Delivery Vector for Gene Therapy of Human Diseases,” Signal Transduction and Targeted Therapy 9 (2024): 1–33. - PMC - PubMed
    1. Sun K. and Liao M. Z., “Clinical Pharmacology Considerations on Recombinant Adeno‐Associated Virus–Based Gene Therapy,” Journal of Clinical Pharmacology 62 (2022): S79–S94. - PubMed
    1. Kavita U., Sun K., Braun M., et al., “PK/PD and Bioanalytical Considerations of AAV‐Based Gene Therapies: An IQ Consortium Industry Position Paper,” AAPS Journal 25 (2023): 78. - PubMed
    1. Chowdhury E. A., Meno‐Tetang G., Chang H. Y., et al., “Current Progress and Limitations of AAV Mediated Delivery of Protein Therapeutic Genes and the Importance of Developing Quantitative Pharmacokinetic/Pharmacodynamic (PK/PD) Models,” Advanced Drug Delivery Reviews 170 (2021): 214–237. - PubMed
    1. Banou L., Sarrafpour S., Teng C. C., and Liu J., “Ocular Gene Therapy: An Overview of Viral Vectors, Immune Responses, and Future Directions,” Yale Journal of Biology and Medicine 97 (2024): 491–503. - PMC - PubMed

Publication types

LinkOut - more resources