Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Sep:1:53-61.
doi: 10.1016/j.crmicr.2020.06.003. Epub 2020 Jun 29.

Minireview of progress in the structural study of SARS-CoV-2 proteins

Affiliations
Review

Minireview of progress in the structural study of SARS-CoV-2 proteins

Guoliang Zhu et al. Curr Res Microb Sci. 2020 Sep.

Abstract

A severe form of pneumonia, named coronavirus disease 2019 (COVID-19) by the World Health Organization, broke out in China and rapidly developed into a global pandemic, with millions of cases and hundreds of thousands of deaths reported globally. The novel coronavirus, which was designated as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was identified as the etiological agent of COVID-19. On the basis of experience accumulated following previous SARS-CoV and MERS-CoV outbreaks and research, a series of studies have been conducted rapidly, and major progress has been achieved with regard to the understanding of the phylogeny and genomic organization of SARS-CoV-2 in addition its molecular mechanisms of infection and replication. In the present review, we summarized crucial developments in the elucidation of the structure and function of key SARS-CoV-2 proteins, especially the main protease, RNA-dependent RNA polymerase, spike glycoprotein, and nucleocapsid protein. Results of studies on their associated inhibitors and drugs have also been highlighted.

Keywords: 3CLpro, 3C-like protease; 6-HB, six-helix bundle; ACE2, angiotensin-converting enzyme 2; COVID-19; COVID-19, coronavirus disease 2019; CatB/L, cysteine proteases-cathepsin B and L; Drug-screening; E protein, Envelope protein; Genome-encoded proteins; HR1, heptad repeat 1; HR2, heptad repeat 2; M protein, Membrane protein; MERS-CoV, the Middle Eastern respiratory syndrome coronavirus; Mpro, Main protease; N protein, Nucleocapsid protein; NSP, non-structural protein; ORF, Open reading frame; PD, peptidase domain; RBD, receptor-binding domain; RBM, receptor-binding motif; RMP, The remdesivir monophosphate; RdRp, RNA-dependent RNA polymerase; S protein, Spike glycoprotein; SARS-CoV, severe acute respiratory syndrome coronavirus; SARS-CoV-2; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; Structure-based screening; gRNA, genomic RNA; sgRNA, subgenomic RNA.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image, graphical abstract
Graphical abstract
Fig. 1
Fig. 1
Schematic presentation of the SARS-CoV-2 genome organization, and the primary structures of Mpro, NSP12, S protein and N protein. The SARS-CoV-2 genome consists of ∼30 Kb RNA strand. There are 14 ORFs. The first two ORFs at 5′ untranslated regions code for polyprotein (pp1a/ab) that are segmented into 16 NSPs required for virus replication, followed by four structural proteins for spike glycoprotein(S), envelope protein(E), membrane protein(M), and nucleocapsid protein(N). At the 3′ terminus, there are nine accessory proteins (3a, 3b, 6, 7a, 7b, 8, 9b, 9c, and 10). Mpro consists of three domains, Domains I (8–101 aa), II (102–184 aa) and III (201–303 aa). NSP12 has three domains, the RdRp domain (367–920 aa), NiRAN domain (4–28 aa and 69–249 aa) and interface domain (250–365 aa). The RdRp domain consists of three subdomains: the finger subdomain (66–581 and 621–679 aa), the palm subdomain (582–620 and 680–815 aa), and the thumb subdomain (816–920 aa). S glycoprotein is divided into two subunits by protease at the S1/S2 protease cleavage site, Its S1 subunit contains NTD (14–305 aa), RBD (319 –541 aa), and RBM (437–508 aa). Its S2 subunit contains FP (788–806 aa), HR1 (912–984 aa), HR2 (1163–1213 aa), TM (1214–1237 aa) and CP (1238–1273 aa). N protein encompasses two conserved domains, namely the N1b domain (49–175aa), and N2b domain (247–365aa), and three short regions [N1a (1–49aa), N2a (174–247aa), and spacer B/N3 (365–419aa)]. The white box represents loop or non-special structural domain connecting the two domains on either side.
Fig. 2
Fig. 2
General structure of a SARS-CoV-2 virion, the pre- and post-fusion states of the SARS-CoV-2 S glycoprotein, and some effective inhibitors targeting different proteins to impair SARS-CoV-2 infection. SARS-CoV-2 enters host cells mainly through the trimeric S glycoprotein (PDB ID: 6VSB, 6VYB/6VXX) containing RBD which binds to the human receptor ACE2 (PDB ID: 6M17). A previously characterized SARS-CoV monoclonal antibody CR3022 could bind to the SARS-CoV-2 RBD (PDB ID: 6W41); the epitope can potentially confer in vivo protection. The recombinant soluble human ACE2 can bind to RBD of the S protein to block the pre-fusion stage of SARS-CoV-2 infection. The TMPRSS2 for S protein priming can be inhibited by camostat mesylate, which in turn would block SARS-CoV-2 infection. After receptor binding of the S1 subunit to support membrane fusion and viral infection, the S2 subunit is exposed and undergoes a conformational change from a pre-fusion to a post-fusion state (PDB ID: 6LXT). The inhibitor EK1C4 targets the S2 to inhibit viral infection. In cells, SARS-CoV-2 RNA translates to different proteins, such as NSP12 (PDB ID: 7BV1), N protein (PDB ID: 6M3M & 6WZO) and Mpro (PDB ID: 6Y2G). Several compounds (N3, α-ketoamide,11a and 11b) have been identified to exhibit high inhibitory activity against SARS-CoV-2 Mpro activity and viral infection.

Similar articles

Cited by

References

    1. Ahmed S.F., Quadeer A.A., McKay M.R. Preliminary Identification of Potential Vaccine Targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies. Viruses. 2020;12 doi: 10.3390/v12030254. - DOI - PMC - PubMed
    1. Ahn D.G., Choi J.K., Taylor D.R., Oh J.W. Biochemical characterization of a recombinant SARS coronavirus nsp12 RNA-dependent RNA polymerase capable of copying viral RNA templates. Arch Virol. 2012;157:2095–2104. doi: 10.1007/s00705-012-1404-x. - DOI - PMC - PubMed
    1. Anand K., Palm G.J., Mesters J.R., Siddell S.G., Ziebuhr J., Hilgenfeld R. Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain. EMBO J. 2002;21:3213–3224. doi: 10.1093/emboj/cdf327. - DOI - PMC - PubMed
    1. Anand K., Ziebuhr J., Wadhwani P., Mesters J.R., Hilgenfeld R. Coronavirus main proteinase (3CLpro) structure: basis for design of anti-SARS drugs. Science. 2003;300:1763–1767. doi: 10.1126/science.1085658. - DOI - PubMed
    1. Belouzard S., Chu V.C., Whittaker G.R. Activation of the SARS coronavirus spike protein via sequential proteolytic cleavage at two distinct sites. Proc. Natl. Acad. Sci. USA. 2009;106:5871–5876. doi: 10.1073/pnas.0809524106. - DOI - PMC - PubMed