Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 4:9:e10666.
doi: 10.7717/peerj.10666. eCollection 2021.

Identification and molecular characterization of mutations in nucleocapsid phosphoprotein of SARS-CoV-2

Affiliations

Identification and molecular characterization of mutations in nucleocapsid phosphoprotein of SARS-CoV-2

Gajendra Kumar Azad. PeerJ. .

Abstract

SARS-CoV-2 genome encodes four structural proteins that include the spike glycoprotein, membrane protein, envelope protein and nucleocapsid phosphoprotein (N-protein). The N-protein interacts with viral genomic RNA and helps in packaging. As SARS-CoV-2 spread to almost all countries worldwide within 2-3 months, it also acquired mutations in its RNA genome. Therefore, this study was conducted with an aim to identify the variations present in N-protein of SARS-CoV-2. Here, we analysed 4,163 reported sequence of N-protein from United States of America (USA) and compared them with the first reported sequence from Wuhan, China. Our study identified 107 mutations that reside all over the N-protein. Further, we show the high rate of mutations in intrinsically disordered regions (IDRs) of N-protein. Our study show 45% residues of IDR2 harbour mutations. The RNA-binding domain (RBD) and dimerization domain of N-protein also have mutations at key residues. We further measured the effect of these mutations on N-protein stability and dynamicity and our data reveals that multiple mutations can cause considerable alterations. Altogether, our data strongly suggests that N-protein is one of the mutational hotspot proteins of SARS-CoV-2 that is changing rapidly and these mutations can potentially interferes with various aspects of N-protein functions including its interaction with RNA, oligomerization and signalling events.

Keywords: COVID-19; Infectious disease; Mutations; Nucleocapsid Phosphoprotein (N protein); SARS-CoV-2; USA.

PubMed Disclaimer

Conflict of interest statement

The author declares that they have no competing interests.

Figures

Figure 1
Figure 1. The schematic structure of Nucleocapsid Phosphoprotein (N protein) of SARS-CoV-2.
(A) The N-protein is comprised of 419 residues. The RNA-binding domain, dimerization domain, intrinsically disordered regions including IRD1, IRD2 and IRD3 are labelled. (B and C) Cartoon representation of crystal structure of the RNA-binding domain (RCSB protein ID-6VYO) and dimerization domain (RCSB protein ID-6WJI) of N-protein. (B) Demonstrates the RBD while (C) represents dimerization domain of N-protein. The identified mutated amino acids (single letter code) along with its respective position in polypeptide sequence are shown. The structural representations were made using UCSF chimera software tool.
Figure 2
Figure 2. The weblogo diagram showing the conservation status of polypeptide sequence of N-protein.
The sequence weblogo was generated by multiple sequence alignment of 4,164 N-protein sequences. The overall height of the stack indicates the sequence conservation at that position.
Figure 3
Figure 3. Visual representation of Δ vibrational entropy energy between wild-type and mutant N protein.
The amino acids residues are colored according to the vibrational entropy change as a consequence of mutation of N-protein. Blue (A–C) represents a rigidification of the structure and red (D–F) represents, a gain in flexibility. (A–C) The top three mutants that show rigidification in structure upon mutation. (D–F) The top three mutants that show gain in flexibility upon mutation. Each panel also shows the mutation and the location of the residues.
Figure 4
Figure 4. Analysis of interatomic interactions.
Visual representation of interatomic interactions contributed by T271I and I292T of N-protein. Both of these mutants showed maximum positive and negative ΔΔG among mutants present in RBD and dimerization domain of N-protein. (A & B) Threonine to isoleucine substitution at 271st position; (C & D) isoleucine to threonine substitution at 292nd position. Wild-type and mutant residues are represented in light-greencolor. The interactions made by wild type and mutant residues are highlighted in each panel. The polar interactions are depicted in red dotted line, hydrophobic interaction in green and weak hydrogen bonds in orange.

References

    1. Ahmed SF, Quadeer AA, McKay MR. Preliminary identification of potential vaccine targets for the COVID-19 coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies. Viruses. 2020;12(3):254. doi: 10.3390/v12030254. - DOI - PMC - PubMed
    1. Astuti I, Ysrafil Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2): an overview of viral structure and host response. Diabetes and Metabolic Syndrome: Clinical Research and Reviews. 2020;14(4):407–412. doi: 10.1016/j.dsx.2020.04.020. - DOI - PMC - PubMed
    1. Azad GK. Identification of novel mutations in the methyltransferase complex (Nsp10-Nsp16) of SARS-CoV-2. Biochemistry and Biophysics Reports. 2020;24:100833. doi: 10.1016/j.bbrep.2020.100833. - DOI - PMC - PubMed
    1. Blanco-Melo D, Nilsson-Payant BE, Liu WC, Uhl S, Hoagland D, Møller R, Jordan TX, Oishi K, Panis M, Sachs D, Wang TT, Schwartz RE, Lim JK, Albrecht RA, TenOever BR. Imbalanced host response to SARS-CoV-2 drives development of COVID-19. Cell. 2020;181(5):1036–1045.e9. doi: 10.1016/j.cell.2020.04.026. - DOI - PMC - PubMed
    1. Chand GB, Banerjee A, Azad GK. Identification of twenty-five mutations in surface glycoprotein (Spike) of SARS-CoV-2 among Indian isolates and their impact on protein dynamics. Gene Reports. 2020a;21:100891. doi: 10.1016/j.genrep.2020.100891. - DOI - PMC - PubMed

LinkOut - more resources