Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Jul 19;11(1):136.
doi: 10.1186/s13578-021-00643-z.

SARS-CoV-2: from its discovery to genome structure, transcription, and replication

Affiliations
Review

SARS-CoV-2: from its discovery to genome structure, transcription, and replication

Ayslan Castro Brant et al. Cell Biosci. .

Abstract

SARS-CoV-2 is an extremely contagious respiratory virus causing adult atypical pneumonia COVID-19 with severe acute respiratory syndrome (SARS). SARS-CoV-2 has a single-stranded, positive-sense RNA (+RNA) genome of ~ 29.9 kb and exhibits significant genetic shift from different isolates. After entering the susceptible cells expressing both ACE2 and TMPRSS2, the SARS-CoV-2 genome directly functions as an mRNA to translate two polyproteins from the ORF1a and ORF1b region, which are cleaved by two viral proteases into sixteen non-structural proteins (nsp1-16) to initiate viral genome replication and transcription. The SARS-CoV-2 genome also encodes four structural (S, E, M and N) and up to six accessory (3a, 6, 7a, 7b, 8, and 9b) proteins, but their translation requires newly synthesized individual subgenomic RNAs (sgRNA) in the infected cells. Synthesis of the full-length viral genomic RNA (gRNA) and sgRNAs are conducted inside double-membrane vesicles (DMVs) by the viral replication and transcription complex (RTC), which comprises nsp7, nsp8, nsp9, nsp12, nsp13 and a short RNA primer. To produce sgRNAs, RTC starts RNA synthesis from the highly structured gRNA 3' end and switches template at various transcription regulatory sequence (TRSB) sites along the gRNA body probably mediated by a long-distance RNA-RNA interaction. The TRS motif in the gRNA 5' leader (TRSL) is responsible for the RNA-RNA interaction with the TRSB upstream of each ORF and skipping of the viral genome in between them to produce individual sgRNAs. Abundance of individual sgRNAs and viral gRNA synthesized in the infected cells depend on the location and read-through efficiency of each TRSB. Although more studies are needed, the unprecedented COVID-19 pandemic has taught the world a painful lesson that is to invest and proactively prepare future emergence of other types of coronaviruses and any other possible biological horrors.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Genome structure and coding potentials of human coronaviruses. The viral genome is a single-stranded, positive-sense RNA with a cap (grey circle) at the 5′-end and a poly-A tail (A30-60) at the 3′ end. The genome encodes 16 non-structural proteins (ORF1a → nsp1-11 and ORF1b → nsp12-16) from the left three-fourth of the genome, and 4–5 structural proteins (S, spike; E, envelope; M, membrane; N, nucleocapsid; HE, hemagglutinin esterase) and various number of accessory proteins (numbered boxes) from the right one-fourth of the genome
Fig. 2
Fig. 2
Coronavirus genome replication and transcription. Diagram showing the key steps in coronavirus entry (1), initial translation of incoming viral +gRNA to express viral non-structural proteins (nsp1-16) (2), genome replication in double-membrane vesicles (DMVs), continuous transcription of gRNA through a −gRNA-intermediate by viral replication and transcription complex (RTC) (3), generation of sgRNA by discontinuous transcription RTC (4), the expression of structural and accessory proteins from +sgRNA (S, spike; M, membrane; E, envelope; N proteins) (5), and virion assembly and release (6)
Fig. 3
Fig. 3
A proposed model of viral RNA transcription and template switch during SARS-CoV-2 infection. A Continuous 5′–3′ transcription of viral genomic +gRNA leads to synthesis of the full-length, negative-sense viral genomic RNA (−gRNA) (left). Because RTC-mediated RNA transcription starts from the highly structured viral gRNA 3′ end, this transcription often leads to discontinuous 5′–3′ transcription by proposed template switch (right). Through interactions between transcription regulatory sequences (TRS) located in the leader (TRSL) and the genome body (TRSB), the template switch results in the production of viral subgenomic RNAs (−sgRNAs). B Diagram of SARS-CoV-2 genome with predicted ORFs (colored boxes) and TRS (smaller red boxes) upstream of individual ORFs. Above are the canonical TRSL-dependent junctions detected in the individual sgRNAs from SARS-CoV-2-infected cells by RNA-seq, with the junction reads corresponding to the sgRNA encoding N protein being the most abundant. Below are the TRSB-independent interactions of TRSL (red) and non-TRS dependent (blue) junctions detected by RNA-seq with unknown function [35, 38]
Fig. 4
Fig. 4
Structures of SARS-CoV-2 replication and transcription complex (RTC). A A composite structure of RTC from three PDB coordinates, 7CXM (architecture of nsp7, nsp8 X2, nsp12 and nsp13 X2 bound to RNA template and primer), 6XEZ (the ADP·AlF3, bound in the nsp13 helicase active site), and 7CYQ (nsp9 associated with nsp12 and GDP in the active site of RNA capping). The RNA template pieces bound to nsp13 and nsp12 are not connected and would be pulled by the two enzymes in opposite directions as indicated by the yellow double arrowheads. B, C Zoom-in views of RTC bound to inhibitors, Favipiravir (PDB: 7AAP), Remedisivir (RMP) (PDB: 7B3B), and Suramin (PDB: 7D4F). RdRP (nsp12) is shown in grey in B, C, the three inhibitors are in distinct colors. With several SO4 groups mimicking the phosphate backbone of RNA, two Sumarin molecules (cyan) compete for the RNA template and primer binding. Remedisivir (blue) is already incorporated in the RNA primer strand at − 3 position. Favipiravir RTD (magenta) occupies the incoming nucleotide position, but the phosphates are in a non-productive conformation. The active site residues are shown in pink-red sticks and Mg2+ ions are shown as green spheres
Fig. 5
Fig. 5
The expression of sgRNAs during human coronavirus infection. On the left are the diagrams of SARS-CoV-2 (A), hCoV-OC43 (B) and hCoV-NL63 (C) genomes and their coding potentials. Individual sgRNAs (lines) with a 5′ leader (small red box) obtained through the template switch are illustrated below and named by their corresponding proteins encoded. The viral gRNA (vgRNA) is generated by continuous transcription of the entire viral genome. On the right are the sgRNA expression profiles in African monkey kidney Vero E6 cells infected for 24 h with SARS-CoV-2 (A), 189 h with hCoV-NL63 (C), or human colorectal adenocarcinoma HCT-8 cells infected for 48 h with hCoV-OC43 (B). The sgRNAs detected by Northern blot analysis of total RNA extracted from infected cells using the individual antisense probes specific to each viral N gene. The Northern blot gel of SARS-CoV-2 sgRNA in A was modified with permission from a reference [38]

References

    1. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. doi: 10.1016/S0140-6736(20)30183-5. - DOI - PMC - PubMed
    1. Wang C, Horby PW, Hayden FG, Gao GF. A novel coronavirus outbreak of global health concern. Lancet. 2020;395(10223):470–473. doi: 10.1016/S0140-6736(20)30185-9. - DOI - PMC - PubMed
    1. Pekar J, Worobey M, Moshiri N, Scheffler K, Wertheim JO. Timing the SARS-CoV-2 index case in Hubei province. Science. 2021;372(6540):412–417. doi: 10.1126/science.abf8003. - DOI - PMC - PubMed
    1. Zhou P, Yang XL, Wang XG, Hu B, Zhang L, Zhang W, Si HR, Zhu Y, Li B, Huang CL, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579(7798):270–273. doi: 10.1038/s41586-020-2012-7. - DOI - PMC - PubMed
    1. Wu F, Zhao S, Yu B, Chen YM, Wang W, Song ZG, Hu Y, Tao ZW, Tian JH, Pei YY, et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579(7798):265–269. doi: 10.1038/s41586-020-2008-3. - DOI - PMC - PubMed

LinkOut - more resources