Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May 2;19(1):163.
doi: 10.1186/s12859-018-2161-y.

The cytidine deaminase under-representation reporter (CDUR) as a tool to study evolution of sequences under deaminase mutational pressure

Affiliations

The cytidine deaminase under-representation reporter (CDUR) as a tool to study evolution of sequences under deaminase mutational pressure

Maxwell Shapiro et al. BMC Bioinformatics. .

Erratum in

Abstract

Background: Activation induced deaminase (AID) and apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3 (APOBEC3) are deaminases that mutate C to U on single-stranded DNA (ssDNA). AID is expressed primarily in germinal center B-cells, where it facilitates affinity maturation and class-switch recombination. APOBEC3 are a family of anti-viral proteins that act as part of the intrinsic immune response. In both cases, there are particular sequence motifs, also known as "mutation motifs", to which these deaminases prefer to bind and mutate.

Results: We present a program, the cytidine deaminase under-representation reporter (CDUR) designed to statistically determine whether a given sequence has an under/over-representation of these mutation motifs. CDUR shows consitency with other studies of mutation motifs, as we show by analyzing sequences from the adeno-associated virus 2 (AAV2) and human papillomavirus (HPV).

Conclusion: Using various shuffling mechanisms to generate different null model distributions, we can tailor CDUR to correct for metrics such as GC-content, dinucleotide frequency, and codon bias.

Keywords: AID; APOBEC; Deaminase; Mutation motifs; Virus.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Not applicable

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
gc3 shuffle method. The choice of codons in the 4th nucleotide in the sequence (Ile) was determined by the probabilities as follows: since there is an overall GC content of 60% at the 3rd position of the codons in the subject sequence, the ATC codon will be chosen with 0.6 probability. Since the AT content is then 0.4, the other two codons ATT and ATA are chosen randomly with equal probability, conditional on the 40% AT content. Note that the shuffling occurs iteratively throughout sequence, not just one codon at a time
Fig. 2
Fig. 2
n3 shuffle method. Third codon positions are first categorized into whether they belong to Y, R, H, or N nucleotides, then assigned to the corresponding set (Type vs Set table). Then nucleotides within each set are shuffled (Type vs Shuffle table) to produce the shuffled sequence. Certain nucleotides can belong to two sets (here two nucleotides belong to both H and N) and are shuffled twice
Fig. 3
Fig. 3
dn23 shuffle method. First the dinucleotide frequency is calculated for the 2nd and 3rd codon positions of the original sequence. Then for each amino acid, codons are chosen based on the appropriately normalized probabilities for the dinucleotides available for that amino acid
Fig. 4
Fig. 4
Rep-68 Under-representation. Histograms showing the results of CDUR applied to the AAV2 Rep-68 gene using the gc3 shuffle method. In each graph, the filled black bars represent the “below” portion, that is, the number of shuffled sequences with that given metric below that of the one computed for the input. The red line shows the Normal distribution approximation given the mean and standard deviation (SD) from that calculation. a The mutation motif counting graph (belowTAC). The observed number of TAC mutation motifs is 20 (which includes the GTA mutation motifs on the bottom strand). The black bars are all shuffled sequences with fewer observed mutation motifs than our input. b The replacement transition graph (RepTr_belowTAC), also shown with the Normal approximation. c The replacement transition fraction graph (RepTrFrac_belowTAC). Our sequence had an observed RepTrFrac of 0.25, so the sequences with a lower RepTrFrac are filled in with black (see Table 2 for complete results)
Fig. 5
Fig. 5
E6 Under-representation. Histograms showing the results of CDUR applied to the HPV E6 gene using the gc3 shuffle method. All subplots (a-c) and annotations are equivalent to those of Fig. 4, but for this gene. In contrast to Fig. 4 however, rather than under-representation, we see a neutral representation for TAC mutation motifs. (See Table 3 for complete results)

References

    1. Petersen-Mahrt SK, Harris RS, Neuberger MS. AID mutates E. coli suggesting a DNA deamination mechanism for antibody diversification. Nature. 2002;418(6893):99–104. doi: 10.1038/nature00862. - DOI - PubMed
    1. Muramatsu M, Kinoshita K, Fagarasan S, Yamada S, Shinkai Y, Honjo T. Class Switch Recombination and Hypermutation Require Activation-Induced Cytidine Deaminase (AID), a Potential RNA Editing Enzyme. Cell. 2000;102(5):553–63. doi: 10.1016/S0092-8674(00)00078-7. - DOI - PubMed
    1. Conticello SG, Thomas CJF, Petersen-Mahrt SK, Neuberger MS. Evolution of the AID/APOBEC family of polynucleotide (deoxy)cytidine deaminases. Mol Biol Evol. 2005;22(2):367–77. doi: 10.1093/molbev/msi026. - DOI - PubMed
    1. King JJ, Larijani M. A novel regulator of activation-induced cytidine deaminase/APOBECs in immunity and cancer: Schrödinger’s catalytic pocket. Front Immunol. 2017;8:351. doi: 10.3389/fimmu.2017.00351. - DOI - PMC - PubMed
    1. Conticello SG. The AID/APOBEC family of nucleic acid mutators. Genome Biol. 2008;9(6):229. doi: 10.1186/gb-2008-9-6-229. - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources