Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep;6(9):e05001.
doi: 10.1016/j.heliyon.2020.e05001. Epub 2020 Sep 21.

Selective pressure on SARS-CoV-2 protein coding genes and glycosylation site prediction

Affiliations

Selective pressure on SARS-CoV-2 protein coding genes and glycosylation site prediction

Alessandra Lo Presti et al. Heliyon. 2020 Sep.

Abstract

Background: An outbreak of a febrile respiratory illness due to the newly discovered Coronavirus, SARS-CoV-2, was initially detected in mid-December 2019 in the city of Wuhan, Hubei province (China). The virus then spread to most countries in the world. As an RNA virus, SARS-CoV-2 may acquire mutations that may be fixed. The aim of this study was to evaluate the selective pressure acting on SARS-CoV-2 protein coding genes.

Methods: Mutations and glycosylation site prediction were analyzed in SARS-CoV-2 genomes (from 464 to 477 sequences).

Results: Selective pressure on surface glycoprotein (S) revealed one positively selected site (AA 943), located outside the receptor binding domain (RBD). Mutation analysis identified five residues on the surface glycoprotein, with variations (AA positions 367, 458, 477, 483, 491) located inside the RDB. Positive selective pressure was identified in nsp2, nsp3, nsp4, nsp6, nsp12, helicase, ORF3a, ORF8, and N sub-sets. A total of 22 predicted N-glycosylation positions were found in the SARS-CoV-2 surface glycoprotein; one of them, 343N, was located within the RBD. One predicted N-glycosylation site was found in the M protein and 4 potential O-glycosylation sites in specific protein 3a sequences.

Conclusion: Overall, the data showed positive pressure and mutations acting on specific protein coding genes. These findings may provide useful information on: i) markers for vaccine design, ii) new therapeutic approach, iii) information to implement mutagenesis experiments to inhibit SARS-CoV-2 cell entry. The negative selection identified in SARS-CoV-2 protein coding genes may help the identification of highly conserved regions useful to implement new future diagnostic protocols.

Keywords: Bioinformatics; Evolutionary biology; Glycosylation; Infectious disease; Microbiology; Mutation; Public health; Receptor binding domain; SARS-CoV-2; Selective pressure; Virology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The representative alignment for the comparison of the surface glycoprotein between SARS-CoV-2, SARS-CoV and Bat SARS - like virus (including the first 20 SARS-CoV-2 sequences, in addition to SARS-CoV-2 references), focusing the attention on the relevant positions 472 (amino acid L or P in SARS COV), 479 (amino acid N in SARS CoV) and 487 (amino acid T or S) of SARS CoV.
Figure 2
Figure 2
A. The predicted N-glycosylation sites in SARS-CoV-2 surface glycoprotein sub-set, obtained by using N-GlycoSite tool. The positions, number and fraction of the predicted N-glycosylation sites were reported. B. The predicted N-glycosylation sites in SARS-CoV-2 M protein sub-set, obtained by using N-GlycoSite tool. The position, number and fraction of the predicted N-glycosylation sites were reported.
Figure 3
Figure 3
The predicted N-glycosylation sites in SARS-CoV-2 E protein sub-set obtained by using N-GlycoSite tool. The positions, number and fraction of the predicted N-glycosylation sites were reported.

References

    1. Afgan E., Baker D., Batut B., van den Beek M., Bouvier D., Čech M. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018;46(W1):W537–W544. - PMC - PubMed
    1. Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C., Garry R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020 - PMC - PubMed
    1. Bagdonaite I., Wandall H.H. Global aspects of viral glycosylation. Glycobiology. 2018;28:443–467. - PMC - PubMed
    1. Benvenuto D., Angeletti S., Giovanetti M., Bianchi M., Pascarella S., Cauda R., Ciccozzi M., Cassone A. Evolutionary analysis of SARS-CoV-2: how mutation of Non-Structural Protein 6 (NSP6) could affect viral autophagy. J. Infect. 2020;10 pii: S0163-4453(20)30186-9. - PMC - PubMed
    1. Chakraborti S., Prabakaran P., Xiao X., Dimitrov D.S. The SARS coronavirus S glycoprotein receptor binding domain: fine mapping and functional characterization. Virol. J. 2005;2:73. - PMC - PubMed