Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 2;13(3):394.
doi: 10.3390/v13030394.

The Mutation Profile of SARS-CoV-2 Is Primarily Shaped by the Host Antiviral Defense

Affiliations

The Mutation Profile of SARS-CoV-2 Is Primarily Shaped by the Host Antiviral Defense

Cem Azgari et al. Viruses. .

Abstract

Understanding SARS-CoV-2 evolution is a fundamental effort in coping with the COVID-19 pandemic. The virus genomes have been broadly evolving due to the high number of infected hosts world-wide. Mutagenesis and selection are two inter-dependent mechanisms of virus diversification. However, which mechanisms contribute to the mutation profiles of SARS-CoV-2 remain under-explored. Here, we delineate the contribution of mutagenesis and selection to the genome diversity of SARS-CoV-2 isolates. We generated a comprehensive phylogenetic tree with representative genomes. Instead of counting mutations relative to the reference genome, we identified each mutation event at the nodes of the phylogenetic tree. With this approach, we obtained the mutation events that are independent of each other and generated the mutation profile of SARS-CoV-2 genomes. The results suggest that the heterogeneous mutation patterns are mainly reflections of host (i) antiviral mechanisms that are achieved through APOBEC, ADAR, and ZAP proteins, and (ii) probable adaptation against reactive oxygen species.

Keywords: ADAR; APOBEC; COVID-19; ROS; SARS-CoV-2; ZAP; evolution; mutation; phylogenetics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
Schematic representation of the methodology.
Figure 2
Figure 2
Mutation profile of SARS-CoV-2 genomes. Mutation counts are normalized by the trinucleotide content for each trinucleotide generated from 33,540 (representative sequences and nodes) sequences (A). From unstable positions where more than 8 mutations of the same type at the same position, highly occurred mutations are retrieved. The occurrences of these mutations are divided by the total number of the mutations of the same type and observed in the same trinucleotide. The calculated ratio is used to visualize the impact of highly occurring mutations on signature profile, as percentages (B). The bars are labelled with their positions in the genome. Number of mutations observed in the phylogenetic tree, per position (C). Mutations which have a significant contribution to their signature visualized in part B are marked according to the mutation count they represent at the position. Marked mutations are colored and reshaped with respect to their mutation type and impact range on their signatures.
Figure 3
Figure 3
Comparison of codon variations in SARS-CoV-2 phylogenetic tree, human genome, and SARS-CoV-2reference genome. (A) Codon variations from mutations in the phylogenetic tree, represented by the ratio of formations over deformations per codon in log2 formation. (B) Relative codon usage percentage between Human and SARS-CoV-2 reference genome.
Figure 4
Figure 4
Comparison of dinucleotide formations and deformations retrieved from phylogenetic trees. Deformation ratio of dinucleotides is represented as the ratio of deformation count in the tree over dinucleotide’s abundance in the reference genome (A). As a result of the mutations, the relative dinucleotides are formed (B).
Figure 5
Figure 5
Mechanisms that can alter the sequence context of SARS-CoV-2. (i) APOBEC-caused mutations correlated with the enzyme signature dominantly on the plus RNA strand; (ii) ADAR-caused mutations equivalently affecting both RNA strands due to its mechanism of action; (iii) drop of CG dinucleotide targeted by ZAP through selection; (iv) ROS effect shown on the plus RNA strand.

References

    1. Hu B., Guo H., Zhou P., Shi Z.L. Characteristics of SARS-CoV-2 and COVID-19. Nat. Rev. Microbiol. 2020:1–14. - PMC - PubMed
    1. Shah B., Modi P., Sagar S.R. In silico studies on therapeutic agents for COVID-19: Drug repurposing approach. Life Sci. 2020;252:117652. doi: 10.1016/j.lfs.2020.117652. - DOI - PMC - PubMed
    1. Li C., Yang Y., Ren L. Genetic evolution analysis of 2019 novel coronavirus and coronavirus from other species. Infect. Genet. Evol. 2020;82:104285. doi: 10.1016/j.meegid.2020.104285. - DOI - PMC - PubMed
    1. Wu F., Zhao S., Yu B., Chen Y.-M., Wang W., Song Z.-G., Hu Y., Tao Z.-W., Tian J.-H., Pei Y.-Y., et al. Author Correction: A new coronavirus associated with human respiratory disease in China. Nature. 2020;580:E7. doi: 10.1038/s41586-020-2202-3. - DOI - PMC - PubMed
    1. Zhou P., Yang X.-L., Wang X.-G., Hu B., Zhang L., Zhang W., Si H.-R., Zhu Y., Li B., Huang C.-L., et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. - DOI - PMC - PubMed

Publication types

LinkOut - more resources