Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec;4(2):114-120.
doi: 10.1016/j.jobb.2022.06.006. Epub 2022 Jul 12.

"Mutation blacklist" and "mutation whitelist" of SARS-CoV-2

Affiliations

"Mutation blacklist" and "mutation whitelist" of SARS-CoV-2

Yamin Sun et al. J Biosaf Biosecur. 2022 Dec.

Abstract

Over the past two years, scientists throughout the world have completed more than 6 million SARS-CoV-2 genome sequences. Today, the number of SARS-CoV-2 genomes exceeds the total number of all other viral genomes. These genomes are a record of the evolution of SARS-CoV-2 in the human host, and provide information on the emergence of mutations. In this study, analysis of these sequenced genomes identified 296,728 de novo mutations (DNMs), and found that six types of base substitutions reached saturation in the sequenced genome population. Based on this analysis, a "mutation blacklist" of SARS-CoV-2 was compiled. The loci on the "mutation blacklist" are highly conserved, and these mutations likely have detrimental effects on virus survival, replication, and transmission. This information is valuable for SARS-CoV-2 research on gene function, vaccine design, and drug development. Through association analysis of DNMs and viral transmission rates, we identified 185 DNMs that positively correlated with the SARS-CoV-2 transmission rate, and these DNMs where classified as the "mutation whitelist" of SARS-CoV-2. The mutations on the "mutation whitelist" are beneficial for SARS-CoV-2 transmission and could therefore be used to evaluate the transmissibility of new variants. The occurrence of mutations and the evolution of viruses are dynamic processes. To more effectively monitor the mutations and variants of SARS-CoV-2, we built a SARS-CoV-2 mutation and variant monitoring and pre-warning system (MVMPS), which can monitor the occurrence and development of mutations and variants of SARS-CoV-2, as well as provide pre-warning for the prevention and control of SARS-CoV-2 (https://www.omicx.cn/). Additionally, this system could be used in real-time to update the "mutation whitelist" and "mutation blacklist" of SARS-CoV-2.

Keywords: De novo mutations; Mutation saturation; SARS-CoV-2; Transmission.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
Mutation saturation curves of 12 types of base substitution. Identified DNMs in the SARS-CoV-2 sequenced genomes were grouped by base type and the cumulative saturation percentage of C-> T and G-A (A), C->A and G->T (B), T->C and A->G (C), A->T and T->A (D), C->G and G->C (E), T->G and A->C (F), and were plotted over time. The curve plateau reflects 100% saturation, which is marked in the figure by a vertical line.
Fig. 2
Fig. 2
A Sequence conservation analysis in the 5ʹ-UTR. The x-axis is the position on the genome, and the y-axis is the number of mutations that have never been detected. B is the conservative analysis of the sequence in the intergenic region of the orf1ab and spike genes. The x-axis is the position on the genome, and the y-axis is the number of mutations that have never been detected.
Fig. 3
Fig. 3
Association between DNMs and transmission rate. The x-axis represents the SARS-CoV-2 genome position, and the y-axis represents the weekly growth slope of each DNM in the SARS-CoV-2 population. Mutations with a growth slope greater than 0.001 are marked in red and the corresponding amino acid mutations are noted.
Fig. 4
Fig. 4
Features of the mutation and variant monitoring and pre-warning system (MVMPS). A is the website portal. B is the variant monitoring module, which shows the global distribution of each variant by time frame and geography. C is the mutation monitoring module, which shows the global distribution of each mutation by time frame and geography. D is the mutation blacklist and whitelist module.

References

    1. Pal M., Berhanu G., Desalegn C., Kandi V. Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2): an update. Cureus. 2020;12(3) - PMC - PubMed
    1. Nakagawa S., Miyazawa T. Genome evolution of SARS-CoV-2 and its virological characteristics. Inflamm Regener. 2020;40(1):1–7. - PMC - PubMed
    1. Zhu N, Zhang D, Wang W, et al. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med. 2020.
    1. Lu R., Zhao X., Li J., et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet. 2020;395(10224):565–574. - PMC - PubMed
    1. Wu F., Zhao S., Yu B., et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579(7798):265–269. - PMC - PubMed

LinkOut - more resources