Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Apr 4;123(13):3030-3037.
doi: 10.1021/acs.jpca.9b00910. Epub 2019 Mar 21.

Random Walk Enzymes: Information Theory, Quantum Isomorphism, and Entropy Dispersion

Random Walk Enzymes: Information Theory, Quantum Isomorphism, and Entropy Dispersion

Chi H Mak et al. J Phys Chem A. .

Abstract

Activation-induced deoxycytidine deaminase (AID) is a key enzyme in the human immune system. AID binds to and catalyzes random point mutations on the immunoglobulin (Ig) gene, leading to diversification of the Ig gene sequence by random walk motions, scanning for cytidines and turning them to uracils. The mutation patterns deposited by AID on its substrate DNA sequences can be interpreted as random binary words, and the information content of this stochastically generated library of mutated DNA sequences can be measured by its entropy. In this paper, we derive an analytical formula for this entropy and show that the stochastic scanning + catalytic dynamics of AID is controlled by a characteristic length that depends on the diffusion coefficient of AID and the catalytic rate. Experiments showed that the deamination rates have a sequence context dependence, where mutations are generated at higher intensities on DNA sequences with higher densities of mutable sites. We derive an isomorphism between this classical system and a quantum mechanical model and use this isomorphism to explain why AID appears to focus its scanning on regions with higher concentrations of deaminable sites. Using path integral Monte Carlo simulations of the quantum isomorphic system, we demonstrate how AID's scanning indeed depends on the context of the DNA sequence and how this affects the entropy of the library of generated mutant clones. Examining detailed features in the entropy of the experimentally generated clone library, we provide clear evidence that the random walk of AID on its substrate DNA is focused near hot spots. The model calculations applied to the experimental data show that the observed per-site mutation frequencies display similar contextual dependences as observed in the experiments, in which hot motifs are located adjacent to several different types of hot and cold motifs.

PubMed Disclaimer

Figures

FIGURE 1.
FIGURE 1.
Experimental setup and results. (A) Deamination assay reports AID-catalyzed deaminations on target cassette with multiple trinucleotide motifs NNC embedded in lacZα reporter gene, and examples of mutation patterns where each “T” indicates a C→U mutation. (B) Intrinsic deamination rates for each trinucleotide motif in the (hot hot’)-(hot cold) cassette consisting of a (AAC AGC)15-sss-(AAC GTC)15 sequence. (C) Per-site mutation frequencies observed in experiments. The hot motifs (red) are hotter when amongst hot’ motifs (orange) than amongst cold motifs (blue). (D) A hypothetical path and a set of deamination events along this path, illustrating how the convolution between the scanning dynamics and catalysis of AID generates mutant DNA.
FIGURE 2.
FIGURE 2.
Results from Monte Carlo path integral simulations on a (hot hot’)-(hot-cold) mixed cassette. (A) Footprints of AID on the sequence. (B) Observed per-site mutation frequencies, showing similar contextual dependence observed in the experiments shown in Fig. 1.
FIGURE 3.
FIGURE 3.
Results from Monte Carlo path integral simulations on a (hot hot’)-(hot frigid)-(hot frigid) cassette. (A) Footprints of AID on the sequence. (B) Observed per-site mutation frequencies. (C) Far left: Entropy densities of a word starting at the center of the (hot hot’) region going to the right (5’ to 3’) or the left (3’ to 5’) as a function of word length. Blue solids lines are results for a 60 s incubation time. Dashed and dotted dashed lines are for 30 s and 15 s, respectively. Middle: Orange line shows entropy densities of words staring at the center of the (hot frigid) region in the middle of the cassette for incubation times of 60 s (solid orange), 30 s (dashed) and 15 s (dotted dashed). Far right: Green line, dashed and dotted dashed line show entropy densities of words staring at the center of the (hot frigid) region on the far right of the cassette after 60 s, 30 s and 15 s, similar to the other two panels. The length of the word is shortest on the far right, indicating that AID’s diffusion is slow there. In the middle of the cassette, correlations among mutations are weak generating long words, suggesting that AID’s diffusion is fast here.
FIGURE 4.
FIGURE 4.
Entropy density obtained from the simulation results for the (hot hot’)-(hot frigid)-(hot frigid) cassette, in units of bits of information divided by the length of the word read in the 5’ to 3’ direction, for words of different lengths and at different starting positions along the sequence. In the middle of the cassette where the (hot frigid) cluster is, the entropy shows almost no dependence on word length, indicating that long words are preferentially generated here due to the fast diffusion of AID.
FIGURE 5.
FIGURE 5.
Experimental results corresponding to the cassette studied by the simulations shown in Fig. 3. (A) Per-site mutation frequencies. (B) Decay of entropy density as a function of word length, for words centered at the center of the (hot hot’) cluster on the far left (blue), the middle (orange) and the far right (green). Consistent with the simulations, the words are shortest on the far left, suggesting that AID diffusion is slowest here, while the words are longest in the middle of the cassette, indicating that AID diffusion is fastest there.

Similar articles

Cited by

References

    1. Muramatsu M; Kinoshita K; Fagarasan S; Yamada S; Shinkai Y; Honjo T, Class switch recombination and hypermutation require activation-induced cytidine deaminase (AID), a potential RNA editing enzyme. Cell 2000, 102, 553–563. - PubMed
    1. Bransteitter R; Pham P; Scharff MD; Goodman MF, Activation-induced cytidine deaminase deaminates deoxycytidine on single-stranded DNA but requires the action of RNase. Proc. Natl. Acad. Sci. USA 2003, 100, 4102–4107. - PMC - PubMed
    1. Pham P; Bransteitter R; Petruska J; Goodman MF, Processive AID-catalysed cytosine deamination on single-stranded DNA simulates somatic hypermutation. Nature 2003, 424 (6944), 103–7. - PubMed
    1. Sohail A; Klapacz J; Samaranayake M; Ullah A; Bhagwat AS, Human activation-induced cytidine deaminase causes transcription-dependent, strand-biased C to U deaminations. Nucleic Acids Res 2003, 31 (12), 2990–4. - PMC - PubMed
    1. Conticello SG, The AID/APOBEC family of nucleic acid mutators. Genome Biol 2008, 9 (6), 229. - PMC - PubMed