Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Sep 24;13(10):1923.
doi: 10.3390/v13101923.

SARS-CoV-2 Subgenomic RNAs: Characterization, Utility, and Perspectives

Affiliations
Review

SARS-CoV-2 Subgenomic RNAs: Characterization, Utility, and Perspectives

Samuel Long. Viruses. .

Erratum in

Abstract

SARS-CoV-2, the etiologic agent at the root of the ongoing COVID-19 pandemic, harbors a large RNA genome from which a tiered ensemble of subgenomic RNAs (sgRNAs) is generated. Comprehensive definition and investigation of these RNA products are important for understanding SARS-CoV-2 pathogenesis. This review summarizes the recent progress on SARS-CoV-2 sgRNA identification, characterization, and application as a viral replication marker. The significance of these findings and potential future research areas of interest are discussed.

Keywords: COVID-19; SARS-CoV-2; sgRNA; subgenomic RNA.

PubMed Disclaimer

Conflict of interest statement

The author declares no known competing financial interests or personal relationships that could have appeared to influence the work reported in this manuscript.

Figures

Figure 1
Figure 1
Schematic representation of SARS-CoV-2 genome organization and the canonical subgenomic mRNAs. The genome features two large genes, ORF1a and ORF1b, which encode a total of 16 non-structural proteins (nsp1-nsp16) (primary translation); structural genes encoding structural proteins include spike (S), envelope (E), membrane (M), and nucleocapsid (N), respectively; and genes encoding several small accessory proteins (3a, 6, 7a, 7b, 8 and 10). Depicted in the lower right are 10 canonical subgenomic mRNAs. Figure is adapted from [12].
Figure 2
Figure 2
Schematic depiction of SARS-CoV-2 replication and discontinuous transcription. In addition to serving as a template for producing (−) genomic RNA (which enables genome replication), the full length (+) genomic RNA also serves as a template to produce (−) subgenomic RNAs (sgRNAs), which are subsequently used to synthesize (+) subgenomic mRNAs encoding structural and accessory proteins. (−) sgRNA synthesis involves a template switch from a body transcription regulatory sequence (TRS-B) (located upstream of most open reading frames in the 3′ one-third of the viral genome) to the leader TRS (TRS-L, located at about 70 nucleotides from the 5′ end of the genome). This discontinuous transcription process, leading to leader-body fusion, can occur at any TRS-B, and eventually results in the synthesis of a characteristic nested set of (+) subgenomic mRNAs. SARS-CoV-2 subgenomic mRNAs are structurally polycistronic, but are assumed to be functionally monocistronic [3,13], in that only the first open reading frame in each sgRNA, which is absent in the next smaller sgRNA, is translated. Depicted in the lower left and lower middle is the conserved TRS motif (ACGAAC) in the leader and body sequences. In addition, in SARS-CoV-2, extensive base pairing with 7–12 consecutive base pairs beyond the conserved motif between TRS-L and anti-TRS-B has been observed [6]. Figure is adapted from [14].
Figure 3
Figure 3
SARS-CoV-2 sgRNA recombination sites. Depicted are three types of fusion/junction sites. (Green, turquoise, or purple bracket lines represent the 5′ and 3′ locations of junctions.) (A) TRS-L- and TRS-B-dependent discontinuous transcription, which gives rise to canonical sgRNAs. Note that each canonical junction represents a group of subgenomes that have similar, yet distinct fusion junction sites upstream of a common first annotated gene downstream of the junction [6]. (B) TRS-L-dependent noncanonical fusions between TRS-L and unanticipated 3′ sites in the middle of ORFs or UTR (i.e., noncanonical 3′ sites) in the body. (C) TRS-L-independent fusion between sequences that share no similarity to the leader, resulting in long-distance fusions and smaller deletions mainly in the structural and accessory genes when the fusion occurs between proximal sites. Hundreds of noncanonical sgRNAs have been identified [5,6,12,20], and in (B,C), only several representative fusion patterns are illustrated. In addition, both in-frame and out-of-frame fusion products can be generated in (B,C), with out-of-frame noncanonical sgRNAs significantly outnumbering in-frame noncanonical sgRNAs (by ~60%) [12].

References

    1. Zhou P., Yang X.L., Wang X.G., Hu B., Zhang L., Zhang W., Si H.R., Zhu Y., Li B., Huang C.L., et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. - DOI - PMC - PubMed
    1. Zhu N., Zhang D., Wang W., Li X., Yang B., Song J., Zhao X., Huang B., Shi W., Lu R., et al. A Novel Coronavirus from Patients with Pneumonia in China, 2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. - DOI - PMC - PubMed
    1. Sawicki S.G., Sawicki D.L. Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands. Adv. Exp. Med. Biol. 1995;380:499–506. - PubMed
    1. Zúñiga S., Sola I., Alonso S., Enjuanes L. Sequence motifs involved in the regulation of discontinuous coronavirus subgenomic RNA synthesis. J. Virol. 2004;78:980–994. doi: 10.1128/JVI.78.2.980-994.2004. - DOI - PMC - PubMed
    1. Parker M.D., Lindsey B.B., Leary S., Gaudieri S., Chopra A., Wyles M., Angyal A., Green L.R., Parsons P., Tucker R.M., et al. Subgenomic RNA identification in SARS-CoV-2 genomic sequencing data. Genome Res. 2021;31:645–658. doi: 10.1101/gr.268110.120. - DOI - PMC - PubMed

Publication types