Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul;15(7):409-417.
doi: 10.2217/fvl-2020-0163.

In silico identification of conserved cis-acting RNA elements in the SARS-CoV-2 genome

Affiliations

In silico identification of conserved cis-acting RNA elements in the SARS-CoV-2 genome

Bader Y Alhatlani. Future Virol. 2020 Jul.

Abstract

Aim: The aim of this study was to computationally predict conserved RNA sequences and structures known as cis-acting RNA elements (CREs) in the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome.

Materials & methods: Bioinformatics tools were used to analyze and predict CREs by obtaining viral sequences from available databases.

Results: Computational analysis revealed the presence of RNA stem-loop structures within the 3' end of the ORF1ab region analogous to previously identified SARS-CoV genomic packaging signals. Alignment-based RNA secondary structure predictions of the 5' end of the SARS-CoV-2 genome also identified conserved CREs.

Conclusion: These CREs may be potential vaccine and/or antiviral therapeutic targets; however, further studies are warranted to confirm their roles in the SARS-CoV-2 life cycle.

Keywords: RNA structure; RNA virus; SARS-CoV-2; cis-acting RNA elements.

PubMed Disclaimer

Conflict of interest statement

Financial & competing interests disclosure The author has no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties. No writing assistance was utilized in the production of this manuscript.

Figures

Figure 1.
Figure 1.. Prediction of minimum free energy structures within the severe acute respiratory syndrome coronavirus 2 ORF1b region.
The SARS-CoV-2 genome is about 30 kb in length and organized in 13 ORFs. The viral genome is flanked by the 5′ and 3′ UTRs. The ORF1b region is highlighted in red (upper panel) and a mounting plot represents the prediction of the MFE structures spanning from nt position 15,000–21,541 of the SARS-CoV-2 genome sequences (NC_045512.2) is shown with the highest peak of MFE prediction located between 19,000 and about 20,300 nt of the SARS-CoV-2 ORF1b region (lower panel). Note that the mountain plot indicates secondary structure in a plot of height versus sequence position, where the peaks represent the probabilities of RNA base pairing. MFE: Minimum free energy; nt: Nucleotide; ORF: Open reading frame; SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2; UTR: Untranslated region.
Figure 2.
Figure 2.. Prediction of ORF1b RNA secondary structures.
RNA secondary structures located at the 3′ end of the SARS-CoV-2 ORF1b region (nucleotide positions: 19,712–20,294) were predicted using the Mfold web server (A) and compared with RNA secondary structures at the same position for SARS-CoV (NC_004718.3) (B) and bat SARS-like CoV (MG772933) (C). The two identical RNA stem-loops (SL1 and SL2) are indicated in a rectangle among the three viruses. SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.
Figure 3.
Figure 3.. Comparison of the predicted RNA secondary structures among Severe Acute Respiratory Syndrome Coronavirus 2.
The viral RNA sequences in the 3′ end of the ORF1b region from 19,900 to 20,019 nucleotide for both SARS-CoV-2 and bat SARS-like CoV and in the region of 19,830–19,949 nucleotide for SARS-CoV, were folded using the Mfold web server and drawn using the VARNA web applet to compare predicted RNA stem-loops. The similarities of the RNA structures of the predicted SL1 and SL2 among the three viruses are highlighted in green and blue, respectively. In addition, the sequence variation in these two stem-loops are indicated in black. These two RNA stem-loops are part of the previously identified SARS-CoV genomic PS, with SL2 being identified as the PScore [21,22]. SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.
Figure 4.
Figure 4.. Computational prediction of RNA secondary structures model at the severe acute respiratory syndrome coronavirus 2 5′-terminal region.
Schematic representation of the RNA secondary structures at the 5′ region of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome (nucleotide positions: 1–474) was generated using the Mfold web server and drawn using the VARNA web applet (ΔG = -155.50 kcal/mol). The eight RNA stem-loops (SL1–SL8) are indicated and the conserved core leader transcriptional regulatory sequence (TRS) region located within the SL3 is highlighted in gray. In addition, the conserved 5′-UUUCGU-3′ motifs found within SL5A and SL5B are indicated in boxes and the AUG start codon of ORF1ab is indicated in red. The SARS-CoV-2 reference genome sequence obtained from GenBank (NC_045512.2) was used.
Figure 5.
Figure 5.. Alignment-based prediction of the RNA structures at the 5′-terminal regions of severe acute respiratory syndrome coronavirus 2 genomes.
The first 474 nucleotides of SARS-CoV-2, bat SARS-like CoV and SARS-CoV genomes were aligned using LocARNA to predict conserved RNA stem-loops at the 5′ ends. Colors indicate the conservation of base pair types in which red represents conservation of one base pair and yellow and green indicate conservation of two and three base pairs, respectively. Dark color represents that all three viruses contained this base pair, whereas light color shows that one or two of the sequences did not contain the base pairs. SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.

References

    1. Zhu N, Zhang D, Wang W. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382(8), 727–733 (2020). - PMC - PubMed
    1. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species severe acute respiratory syndrome coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 5(4), 536 (2020). - PMC - PubMed
    1. Weiss SR, Leibowitz JL. Coronavirus pathogenesis. Adv. Virus Res. 81, 85–164 (2011). - PMC - PubMed
    1. Drosten C, Günther S, Preiser W. et al. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 348(20), 1967–1976 (2003). - PubMed
    1. Zaki AM, van Boheemen S, Bestebroer TM, Osterhaus AD, Fouchier RA. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 367(19), 1814–1820 (2003). - PubMed

LinkOut - more resources