Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Aug 7:arXiv:2308.03714v1.

The role of APOBEC3-induced mutations in the differential evolution of monkeypox virus

Affiliations

The role of APOBEC3-induced mutations in the differential evolution of monkeypox virus

Xiangting Li et al. ArXiv. .

Update in

Abstract

Recent studies show that newly sampled monkeypox virus (MPXV) genomes exhibit mutations consistent with Apolipoprotein B mRNA Editing Catalytic Polypeptide-like3 (APOBEC3)-mediated editing, compared to MPXV genomes collected earlier. It is unclear whether these single nucleotide polymorphisms (SNPs) result from APOBEC3-induced editing or are a consequence of genetic drift within one or more MPXV animal reservoirs. We develop a simple method based on a generalization of the General-Time-Reversible (GTR) model to show that the observed SNPs are likely the result of APOBEC3-induced editing. The statistical features allow us to extract lineage information and estimate evolutionary events.

PubMed Disclaimer

Conflict of interest statement

Disclosure and competing interests The authors have no competing interests.

Figures

Figure 1:
Figure 1:
Schematic of the stochastic evolution model. (a) There are different potential synonymous mutation sites along the genome. Some potential mutations are APOBEC3-induced (red), some are reverse-APOBEC3-induced (blue), and some are neither (green). The transitions (from open circles to hash-filled circles) are indicated by arrows. Transitions of each type occur independently at their type-specific transition rate. (b) An example of two lineages (i) and (j) arising from a common ancestor that shared a common evolutionary path until time t(ij). Each black dot represents a mutation. An open circle represents the time a lineage is sampled. The waiting time between two successive mutations are independent and randomly distributed with the same total rate Nv. The specific type of mutation is also randomly chosen according to the relative rates of the different mutation types. (c) Counts of mutations using the common ancestor as a reference. (d) Counting the numbers of mutations of lineage (i) at time t(i) using the lineage (j) at time t(j) as a reference. Since sequences share part of their evolutionary paths starting with the common ancestor, mutations α1 and α2 are not identified by using lineage (j) as a reference. The subsequent mutations of lineage (j) count towards n(ij). For completeness, we included all mutations instead of only synonymous ones in the schematic sample sequences in (c) and (d).
Figure 2:
Figure 2:
Simulations of different scenarios. (a-b) Four representative trajectories of an iid substitution process of the JC69 limit. A total of 106 nucleotides are used for each simulation and both TC → TT and A → C mutations are linearly correlated with the total number of point mutations.
Figure 3:
Figure 3:
Simulations of scenarios that include mutation and selection following Eqs. 11.13. We used β0=1, μ0=0.5 and assumed that with each birth, one daughter has equal probability 0.1 of acquiring each type of mutation. The total initial numbers of possible mutations Ni for synonymous non-APOBEC3 mutations, synonymous APOBEC3 mutations, and hidden nonsynonymous mutations is set to 105,103, and 103, respectively. (a) Numbers of synonymous APOBEC3-relevant mutations na associated with the numbers of synonymous non-APOBEC3-relevant mutations n-na from samples of simulation trajectories. Here, no selection for hidden mutations σh=0 is present. (b) Each diamond symbol represents the slope of an independently simulated selection-mutation process, as shown on (a). Slopes for different σa and 25 uniformly sampled values of logK are shown. (a) and (b) show that large population size K and strong selection coefficient σa can lead to a larger effective mutation rate of APOBEC3-driven mutations. By design, under weak selection strength σa, the effective mutation rate of APOBEC3-driven mutations is identical to the non-APOBEC3-driven mutations. (c) Values of na associated with n-na under different strengths σh of selection of hidden beneficial mutations. Strong selection of hidden beneficial mutations can restore the increased effective mutation rate of APOBEC3-driven mutations to the same level as the pure mutation processes. (d) For large σh>σa, the expansion of na is limited (red diamonds).
Figure 4:
Figure 4:
Relative molecular clocks for different types of mutations with respect to an arbitrarily chosen reference genome, instead of time, exhibit linearity. (a) The number of synonymous APOBEC3-relevant mutations (TC → TT) is plotted against the number of synonymous mutations for the genomes collected before 2016. (b) The number of synonymous APOBEC3-relevant mutations (TC → TT) is plotted against the number of synonymous mutations for the genomes collected after 2016.
Figure 5:
Figure 5:
(a) The number of observed APOBEC3-relevant mutations na of all genomes plotted against the number of observed synonymous mutations n, using the genome KP849470 as a reference. The two genomes KJ642617 and MK783029 are highlighted since they are the closest genomes to the intersection of the two lines corresponding to the pre-2016 and post-2016 genomes. (b) The number of observed APOBEC3-relevant mutations na, using genome KJ642617 as the reference, plotted against the number of observed synonymous mutations n. The open circle represents the location of the hypothesized most recent common ancestor (MRCA) of the post-2016 genomes in the animal reservoir. The inset shows that the line of the post-2016 group does not intersect the origin, suggesting that KJ642617 is not a common ancestor of the post-2016 group. (c) The number of observed APOBEC3-relevant mutations na plotted against the number of observed synonymous mutations n, using MK783029 as reference. The inset shows that the line of the pre-2016 group does not pass through the origin, suggesting that MK783029 has already undergone some APOBEC3-editing after its ancestor’s transmission to human.
Figure 6:
Figure 6:
The number of synonymous APOBEC3-induced mutations plotted against the collection date of the post-2016 genomes, using KJ642617 as the reference. The red diamond represents the genomes within the lineage with the longest branch length, starting with MK783028 and ending with ON694329.
Figure 7:
Figure 7:
Different evolutionary environments shape the shared features of pre- and post-2016 groups of MPXV genomes. Pre-2016 cross-species transmission did not persist in human hosts. The blue dots represent the shared features due to evolution in the animal reservoir. The red dots represent the shared features due to evolution in the human hosts. We hypothesize an unobserved stage of continuous and persistent evolution in the human hosts between the 2017 cross-species transmission and the 2022 global outbreak.

Similar articles

References

    1. Áine O’Toole and Andrew Rambaut. An APOBEC3 molecular clock to estimate the date of emergence of hMPXV. https://virological.org/t/an-apobec3-molecular-clock-to-estimate-the-dat..., 2022. (Accessed on 10/16/2022).
    1. Áine O’Toole and Andrew Rambaut. Initial observations about putative APOBEC3 deaminase editing driving short-term evolution of MPXV since 2017. https://virological.org/t/initial-observations-about-putative-apobec3-de..., 2022. (Accessed on 10/16/2022).
    1. Áine O’Toole and Andrew Rambaut. Update to observations about putative APOBEC3 deaminase editing in the light of new genomes from USA. https://virological.org/t/update-to-observations-about-putative-apobec3-..., 2022. (Accessed on 10/16/2022).
    1. Pablo Aja-Macaya Soraya Rumbo-Feal, Poza Margarita, Angelina Cañizares Juan A. Vallejo, and Bou Germán. A new and efficient enrichment method for metagenomic sequencing of monkeypox virus. medRxiv, 2022. - PMC - PubMed
    1. Arita Isao and Henderson Donald A.. Monkeypox and whitepox viruses in west and central africa. Bulletin of the World Health Organization, 53(4):347, 1976. - PMC - PubMed

Publication types