Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 21:22:8.
doi: 10.1186/s12575-020-00121-9. eCollection 2020.

Combination of Biodata Mining and Computational Modelling in Identification and Characterization of ORF1ab Polyprotein of SARS-CoV-2 Isolated from Oronasopharynx of an Iranian Patient

Affiliations

Combination of Biodata Mining and Computational Modelling in Identification and Characterization of ORF1ab Polyprotein of SARS-CoV-2 Isolated from Oronasopharynx of an Iranian Patient

Reza Zolfaghari Emameh et al. Biol Proced Online. .

Abstract

Background: Coronavirus disease 2019 (COVID-19) is an emerging zoonotic viral infection, which was started in Wuhan, China, in December 2019 and transmitted to other countries worldwide as a pandemic outbreak. Iran is one of the top ranked countries in the tables of COVID-19-infected and -mortality cases that make the Iranian patients as the potential targets for diversity of studies including epidemiology, biomedical, biodata, and viral proteins computational modelling studies.

Results: In this study, we applied bioinformatic biodata mining methods to detect CDS and protein sequences of ORF1ab polyprotein of SARS-CoV-2 isolated from oronasopharynx of an Iranian patient. Then through the computational modelling and antigenicity prediction approaches, the identified polyprotein sequence was analyzed. The results revealed that the identified ORF1ab polyprotein belongs to a part of nonstructural protein 1 (nsp1) with the high antigenicity residues in a glycine-proline or hydrophobic amino acid rich domain.

Conclusions: The results revealed that nsp1 as a virulence factor and crucial agent in spreading of the COVID-19 among the society can be a potential target for the future epidemiology, drug, and vaccine studies.

Keywords: Biodata mining; COVID-19; ORF1ab; Protein Modelling; SARS-CoV-2; nsp1.

PubMed Disclaimer

Conflict of interest statement

Competing InterestsThe authors declare that they have no conflict of interests.

Figures

Fig. 1
Fig. 1
Multiple sequence alignment (MSA) analysis of ORF1ab polyprotein sequence. a MSA analysis of the Iranian CDS with the CDS query from Wuhan, China (yellow highlight); b MSA analysis of the Iranian partial ORF1ab polyprotein sequence with the query protein sequence from Wuhan, China (grey highlight). Both MSA evaluations show 100% identity
Fig. 2
Fig. 2
BLAST homology search analysis of the partial ORF1ab polyprotein sequence. The homology analysis defined that 39 out of 47 amino acids of the Iranian subject protein sequence are similar to the query protein sequence (NMR entry ID: 2GDT), which belonged to nsp1 from SARS-CoV
Fig. 3
Fig. 3
Protein Modelling of the partial ORF1ab polyprotein sequence. NGL (WebGL) viewer visualized the NMR structure of entry ID: 2GDT to (a) cartoon-rainbow style and (b) spacefill-hydrophobicity style. Both models show that partial ORF1ab polyprotein from the Iranian patient and nsp1 from SARS-CoV have highly similar structures. The black amino acids are from the query and the red amino acids are from the subject protein sequences
Fig. 4
Fig. 4
Amino acids locations of the most antigenic domain of the partial ORF1ab polyprotein sequence. a Cartoon-rainbow style and b spacefill-hydrophobicity style show the location of Gly94, Pro98, and Gly101 on the surface of the subject protein, which are accessible to the human immune system. The black amino acids are from the query and the red amino acids are from the subject protein sequences. The arrows show the buried amino acids

References

    1. Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nat Rev Microbiol. 2009;7(6):439–450. doi: 10.1038/nrmicro2147. - DOI - PMC - PubMed
    1. Zheng J, Yamada Y, Fung TS, Huang M, Chia R, Liu DX. Identification of N-linked glycosylation sites in the spike protein and their functional impact on the replication and infectivity of coronavirus infectious bronchitis virus in cell culture. Virology. 2018;513:65–74. doi: 10.1016/j.virol.2017.10.003. - DOI - PMC - PubMed
    1. Belouzard S, Millet JK, Licitra BN, Whittaker GR. Mechanisms of coronavirus cell entry mediated by the viral spike protein. Viruses. 2012;4(6):1011–1033. doi: 10.3390/v4061011. - DOI - PMC - PubMed
    1. Plant EP, Sims AC, Baric RS, Dinman JD, Taylor DR. Altering SARS coronavirus frameshift efficiency affects genomic and subgenomic RNA production. Viruses. 2013;5(1):279–294. doi: 10.3390/v5010279. - DOI - PMC - PubMed
    1. Plant EP, Dinman JD. The role of programmed-1 ribosomal frameshifting in coronavirus propagation. Front Biosci. 2008;13:4873–4881. doi: 10.2741/3046. - DOI - PMC - PubMed