Building a neural network model to define DNA sequence specificity in V(D)J recombination
- PMID: 40548941
- PMCID: PMC12205992
- DOI: 10.1093/nar/gkaf551
Building a neural network model to define DNA sequence specificity in V(D)J recombination
Abstract
In developing lymphocytes, V(D)J recombination assembles functional antigen receptor (AgR) genes through rearrangement of the AgR loci to adjoin component gene segments. Each candidate gene segment for recombination is flanked by a recombination signal sequence (RSS), composed of heptamer and nonamer motifs separated by 12 or 23 base pairs. To initiate V(D)J recombination, the recombination activating proteins RAG1 and RAG2 create DNA double-stranded breaks between a 12/23-RSS pair and their adjoining gene segments. The basis for selection of individual RSSs during each V(D)J recombination event is not well understood due, in part, to the wide-spread distribution of the semi-conserved RSSs across the AgR loci. Using publicly-available data for V(D)J recombination efficiencies on randomized 12-RSSs, we first built a neural network model that delineates how changes in sequence at certain positions in the RSS affects recombination efficiency. Second, to interpret the model's decision-making process, we repurposed the game theoretic SHapley Additive exPlanations (SHAP) approach, with the results illustrating how nucleotides at pairwise positions in the heptamer provide synergistic contributions to recombination efficiency. Third, we trained a nonamer-informed neural network model with varied nonamer RSS substrates, and subsequently identified interdependent effects between the heptamer and nonamer regions on recombination efficiency.
© The Author(s) 2025. Published by Oxford University Press on behalf of Nucleic Acids Research.
Conflict of interest statement
None declared.
Figures










Similar articles
-
How RAG1/2 evolved from ancestral transposases to initiate V(D)J recombination without transposition.Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2512362122. doi: 10.1073/pnas.2512362122. Epub 2025 Jul 29. Proc Natl Acad Sci U S A. 2025. PMID: 40729386
-
An updated definition of V(D)J recombination signal sequences revealed by high-throughput recombination assays.Nucleic Acids Res. 2022 Nov 11;50(20):11696-11711. doi: 10.1093/nar/gkac1038. Nucleic Acids Res. 2022. PMID: 36370096 Free PMC article.
-
Generating combinatorial diversity via engineered V(D)J-like recombination in Saccharomyces cerevisiae.Nat Commun. 2025 Jul 1;16(1):5688. doi: 10.1038/s41467-025-61206-1. Nat Commun. 2025. PMID: 40592886 Free PMC article.
-
Structural gymnastics of RAG-mediated DNA cleavage in V(D)J recombination.Curr Opin Struct Biol. 2018 Dec;53:178-186. doi: 10.1016/j.sbi.2018.11.001. Epub 2018 Nov 23. Curr Opin Struct Biol. 2018. PMID: 30476719 Free PMC article. Review.
-
Molecular Mechanisms of DNA Sequence Selectivity in V(D)J Recombination.ACS Omega. 2023 Sep 15;8(38):34206-34214. doi: 10.1021/acsomega.3c05601. eCollection 2023 Sep 26. ACS Omega. 2023. PMID: 37779976 Free PMC article. Review.
References
-
- Schatz DG, Zhang Y, Xiao J et al.. Honjo T, Reth M, Radbruch A, Alt F, Martin A Molecular Biology of B Cells. 2024; 3rd editionCambridge, Massachusetts, United States: Academic Press; 13–57.
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources