Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr;91(2):214-224.
doi: 10.1007/s00239-023-10097-1. Epub 2023 Feb 17.

Evidence Supporting That C-to-U RNA Editing Is the Major Force That Drives SARS-CoV-2 Evolution

Affiliations

Evidence Supporting That C-to-U RNA Editing Is the Major Force That Drives SARS-CoV-2 Evolution

Jinxiang Wang et al. J Mol Evol. 2023 Apr.

Abstract

Mutations of DNA organisms are introduced by replication errors. However, SARS-CoV-2, as an RNA virus, is additionally subjected to rampant RNA editing by hosts. Both resources contributed to SARS-CoV-2 mutation and evolution, but the relative prevalence of the two origins is unknown. We performed comparative genomic analyses at intra-species (world-wide SARS-CoV-2 strains) and inter-species (SARS-CoV-2 and RaTG13 divergence) levels. We made prior predictions of the proportion of each mutation type (nucleotide substitution) under different scenarios and compared the observed versus the expected. C-to-T alteration, representing C-to-U editing, is far more abundant that all other mutation types. Derived allele frequency (DAF) as well as novel mutation rate of C-to-T are the highest in SARS-CoV-2 population, and C-T substitution dominates the divergence sites between SARS-CoV-2 and RaTG13. This is compelling evidence suggesting that C-to-U RNA editing is the major source of SARS-CoV-2 mutation. While replication errors serve as a baseline of novel mutation rate, the C-to-U editing has elevated the mutation rate for orders of magnitudes and accelerates the evolution of the virus.

Keywords: Allele frequency; C-to-U RNA editing; Evolution; Mutation; SARS-CoV-2.

PubMed Disclaimer

Conflict of interest statement

The authors declare they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Introduction of basic evolutionary concepts. A Sequence of outgroup species is used for inference of ancestral state of the target species. Then the direction and derived allele frequency (DAF) of polymorphic sites could be determined. B Site1 has higher novel mutation rate than site2, then site1 eventually has higher DAF. C Site1 has similar novel mutation rate with site2, but site1 is positively selected and finally has higher DAF than site2. D Synonymous mutations are (believed to be) free from natural selection. The final DAF of synonymous sites might mirror the initial mutation rate
Fig. 2
Fig. 2
Prior prediction of mutation profile based on different mutation sources. A The three possible sources of mutations in SARS-CoV-2. RDRP-mediated replication errors, ADAR-mediated A-to-I editing, and APOBEC-mediated C-to-U editing. B If replication error is the major mutation source, then the mutation profile should be symmetric. SNP denotes “single nucleotide polymorphism”. C If A-to-I editing is the major mutation source, then the A-to-G mutations should be dominant. D If C-to-U editing is the major mutation source, then the C-to-T mutations should be dominant
Fig. 3
Fig. 3
The numbers of SNV (single nucleotide variation) in the SARS-CoV-2 genome under different thresholds of DAF (derived allele frequency). A When DAF is low, the mutation profile resembles the SNP profile. B The fraction of C-to-T substitution increases with DAF. (C-D) When DAF is high, C-to-U editing sites become dominant. E DAF spectrum of C-to-T substitutions. F DAF spectrum of non-C-to-T mutations. G Comparison of C-to-U and non-C-to-U mutations. The difference of DAF was determined by KS test
Fig. 4
Fig. 4
The DAF of each mutation type. A The median and mean DAF of each mutation type. B Synonymous and missense sites are shown separately. Synonymous sites have obviously higher DAF than missense sites
Fig. 5
Fig. 5
The definition of inter-species divergence. A How mutations within a population lead to speciation and the sequence divergence. B Calculating the numbers of each substitution type when outgroup (MERS-CoV) is absent or present
Fig. 6
Fig. 6
The divergence sites between SARS-CoV-2 and RaTG13. Synonymous and missense sites were displayed separately. Non-directional mutations were counted without outgroup information. Directional mutations were inferred based on other coronaviruses as outgroup
Fig. 7
Fig. 7
The divergence between five entomopoxviruses (EPV). A Phylogeny of the five viral sequences. B, C, D The directional inter-species divergence between two viral sequences

References

    1. Alonso-Blanco C, Andrade J, Becker C, Bemm F, Bergelson J, Borgwardt KM, Cao J, Chae E, Dezwaan TM, Ding W, et al. 1,135 genomes reveal the global pattern of polymorphism in arabidopsis thaliana. Cell. 2016;166:481–491. doi: 10.1016/j.cell.2016.05.063. - DOI - PMC - PubMed
    1. Bass BL, Weintraub H. A developmentally regulated activity that unwinds RNA duplexes. Cell. 1987;48:607–613. doi: 10.1016/0092-8674(87)90239-X. - DOI - PubMed
    1. Bass BL, Weintraub H. An unwinding activity that covalently modifies its double-stranded RNA substrate. Cell. 1988;55:1089–1098. doi: 10.1016/0092-8674(88)90253-X. - DOI - PubMed
    1. Byng JW, Chase MW, Christenhusz MJM, Fay MF, Judd WS, Mabberley DJ, Sennikov AN, Soltis DE, Soltis PS, Stevens PF, et al. An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG IV. Bot J Linn Soc. 2016;181:1–20. doi: 10.1111/boj.12385. - DOI
    1. Cai H, Liu X, Zheng X. RNA editing detection in SARS-CoV-2 transcriptome should be different from traditional SNV identification. J Appl Genet. 2022;63:587–594. doi: 10.1007/s13353-022-00706-y. - DOI - PMC - PubMed