Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct:84:104386.
doi: 10.1016/j.meegid.2020.104386. Epub 2020 May 29.

Unfolding SARS-CoV-2 viral genome to understand its gene expression regulation

Affiliations

Unfolding SARS-CoV-2 viral genome to understand its gene expression regulation

Hunduma Dinka et al. Infect Genet Evol. 2020 Oct.

Abstract

SARS-CoV-2 is a new virus responsible for an outbreak of respiratory illness known as COVID-19, which has spread to several countries around the world and a global effort is being undertaken to characterize the molecular features and evolutionary origins of this virus. In silico analysis of the transcription start sites, promoter regions, transcription factors and their binding sites, gene ontology, CpG islands for SARS-CoV-2 viral genome are a first step to understand the regulation mechanisms of gene expression and its association with genetic variations in the genomes. For this purpose, we first computationally surveyed all SARS-CoV-2 virus genes with the open reading frames from NCBI database and found eleven sequences to accomplish the mentioned features by using bioinformatics tools. Our analysis revealed that all (100%) of the SARS-CoV-2 virus genes have more than one TSS. By taking all TSSs with the highest predictive score we determined promoter regions and identified five common candidate motifs (MVI, MVII, MVIII, MVIV and MVV) of which MVI was found to be shared by all promoter regions of SARS-CoV-2 virus genes with the least E-value (3.8e-056, statistically highly significant). In our further analysis of MVI we showed MVI serve as binding sites for a single transcription factor (TF) family, EXPREG, involved in the regulatory mode of these genes. From EXPREG family four TFs that belongs to Cyclic AMP (cAMP) receptor protein (CRP) and Catabolite control protein A (CcpA) group mostly serve as transcriptional activator whereas two TFs that belong to LexA group always serve as transcriptional repressor in different kinds of cellular processes and molecular functions. Therefore, we unfolded SARS-CoV-2 viral genome to shed light on its gene expression regulation that could help to design and evaluate diagnostic tests, to track and trace the ongoing outbreak and to identify potential intervention options.

Keywords: COVID-19; CpG Island; Motif; Promoter; SARS-CoV-2; Transcription factor.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Block diagrams showing the relative positions of candidate motifs in different SARS-CoV-2 virus gene promoter sequences relative to TSSs. The nucleotide positions are indicated at the bottom ofthe graph from +1 (beginning of TSSs) to the upstream 1000 (−1000) bp.
Fig. 2
Fig. 2
Sequence logos for the identified best motif (MVI) for SARS-CoV-2 virus genes promoter regions. The analysis was carried out using MEME Suite.
Fig. 3
Fig. 3
GO-term associated with MVI motif. No gene ontology was identified.

Similar articles

Cited by

References

    1. Akeo S., Satoshi K., Noriko N., Aiko K., Seiki K., Shigeyuki Y. Transcription activation mediated by a cyclic AMP receptor protein from Thermus thermophilus HB8. J. Microbiol. 2007;189(10):3891–3901. - PMC - PubMed
    1. Ambrosini G., Vorontsov I., Penzar D., Groux R., Fornes O., Nikolaeva D.D., Ballester B., Grau J., Grosse I., Makeev V., Kulakovskiy I., Bucher P. Insights gained from a comprehensive all-against-all transcription factor binding motif benchmarking study. Genome Biol. 2020;21 - PMC - PubMed
    1. Bailey L.T., Elkan C. Proceedings of the Second International Conference on Intelligent Systems for MolecularBiology. AAAI Press; 1994. Fitting a mixture model by expectation maximization to discover motifs in biopolymers; pp. 28–36. - PubMed
    1. Bailey L.T., Nadya W., Chris M., Wilfred W.L. MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006;34:369–373. - PMC - PubMed
    1. Bailey L.T., Mikael B., Fabian A.B., Martin F., Charles E.G., Luca C., Jingyuan R., Wilfred W.L., William S.N. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37:W202–W208. - PMC - PubMed

MeSH terms