Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb:91:107276.
doi: 10.1016/j.intimp.2020.107276. Epub 2020 Dec 16.

Genome-wide analysis of Indian SARS-CoV-2 genomes to identify T-cell and B-cell epitopes from conserved regions based on immunogenicity and antigenicity

Affiliations

Genome-wide analysis of Indian SARS-CoV-2 genomes to identify T-cell and B-cell epitopes from conserved regions based on immunogenicity and antigenicity

Nimisha Ghosh et al. Int Immunopharmacol. 2021 Feb.

Abstract

SARS-CoV-2 has a high transmission rate and shows frequent mutations, thus making vaccine development an arduous task. However, researchers around the globe are working hard to find a solution e.g. synthetic vaccine. Here, we have performed genome-wide analysis of 566 Indian SARS-CoV-2 genomes to extract the potential conserved regions for identifying peptide based synthetic vaccines, viz. epitopes with high immunogenicity and antigenicity. In this regard, different multiple sequence alignment techniques are used to align the SARS-CoV-2 genomes separately. Subsequently, consensus conserved regions are identified after finding the conserved regions from each aligned result of alignment techniques. Further, the consensus conserved regions are refined considering that their lengths are greater than or equal to 60nt and their corresponding proteins are devoid of any stop codons. Subsequently, their specificity as query coverage are verified using Nucleotide BLAST. Finally, with these consensus conserved regions, T-cell and B-cell epitopes are identified based on their immunogenic and antigenic scores which are then used to rank the conserved regions. As a result, we have ranked 23 consensus conserved regions that are associated with different proteins. This ranking also resulted in 34 MHC-I and 37 MHC-II restricted T-cell epitopes with 16 and 19 unique HLA alleles and 29 B-cell epitopes. After ranking, the consensus conserved region from NSP3 gene is obtained that is highly immunogenic and antigenic. In order to judge the relevance of the identified epitopes, the physico-chemical properties and binding conformation of the MHC-I and MHC-II restricted T-cell epitopes are shown with respect to HLA alleles.

Keywords: B-cell epitopes; Conserved regions; Peptide based vaccine; Physico-chemical properties; SARS-CoV-2; T-cell epitopes.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
Pipeline of the Workflow.
Fig. 2
Fig. 2
125 Consensus Conserved Regions (CCnRs) from the four alignment techniques.
Fig. 3
Fig. 3
Graphical representation of B-cell epitopes for TLVSDIDITFLKKDAP and LHPDSATLVSDIDITF with the threshold marked by red line.
Fig. 4
Fig. 4
MHC-I, MHC-II restricted T-cell and B-cell epitopes underlined in the protein sequences of top 5 CCnRs for (a) NSP3 (b) 3CL-Proteinase (c) NSP10 (d) NSP3 and (e) NSP4.
Fig. 5
Fig. 5
Structural analysis for the highly immunogenic MHC-I restricted T-cell epitope “FLKKDAPYI” for NSP3 coded protein (a) 2D pose representation between the epitope and HLA allele showing the different non-covalent bonds (b) Docking structure of MHC-I restricted T-cell epitope (c) The surface interaction between the allele and epitopes showing the fitting sites in binding grooves (d) Ramachandran plot of the epitope allele structure showing lower energy sites of the residues in different frame (e) Z-score plot and (f) all residue energy.
Fig. 6
Fig. 6
Structural analysis for the highly antigenic MHC-I restricted T-cell epitope “TAVVIPTKK” for NSP3 coded protein (a) 2D pose representation between the epitope and HLA allele showing the different non-covalent bonds (b) Docking structure of MHC-I restricted T-cell epitope (c) The surface interaction between the allele and epitopes showing the fitting sites in binding grooves (d) Ramachandran plot of the epitope allele structure showing lower energy sites of the residues in different frame (e) Z-score plot and (f) all residue energy.
Fig. 7
Fig. 7
Structural analysis for the highly immunogenic MHC-II restricted T-cell epitope “ITFLKKDAPYIVGDV” for NSP3 coded protein (a) 2D pose representation between the epitope and HLA allele showing the different non-covalent bonds (b) Docking structure of MHC-II restricted T-cell epitope (c) The surface interaction between the allele and epitopes showing the fitting sites in binding grooves (d) Ramachandran plot of the epitope allele structure showing lower energy sites of the residues in different frame (e) Z-score plot and (f) all residue energy.
Fig. 8
Fig. 8
Structural analysis for the highly antigenic MHC-II restricted T-cell epitope “IDITFLKKDAPYIVG” for NSP3 coded protein (a) 2D pose representation between the epitope and HLA allele showing the different non-covalent bonds (b) Docking structure of MHC-II restricted T-cell epitope (c) The surface interaction between the allele and epitopes showing the fitting sites in binding grooves (d) Ramachandran plot of the epitope allele structure showing lower energy sites of the residues in different frame (e) Z-score plot and (f) all residue energy.

Similar articles

Cited by

References

    1. Zhou P., Yang X.L., Wang X.G., Hu B., Zhang L., Zhang W., Si H.R., Zhu Y., Li B., Huang C., Chen H., Chen J., Luo Y., Guo H., Jiang R., Liu M., Chen Y., Shen X., Wang X., Zheng X., Zhao K., Chen Q., Deng L.L.F., Yan B., Zhan F., Wang Y., Xiao G., Shi Z. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. - DOI - PMC - PubMed
    1. Ksiazek T., Erdman D., Goldsmith C., Zaki S., Peret T., Emery S., Tong S., Urbani C., Comer J., Lim W., Rollin P., Dowell S., Ling A., Humphrey C., Shieh W.J., Guarner J., Paddock C., Rota P., Fields B., Anderson L. A novel coronavirus associated with severe acute respiratory syndrome. New Engl. J. Med. 2003;348:1953–1966. doi: 10.1056/NEJMoa030781. - DOI - PubMed
    1. Holmes K., Enjuanes L. The sars coronavirus: A postgenomic era. Science (New York, N.Y.) 2003;300:1377–1378. doi: 10.1126/science.1086418. - DOI - PubMed
    1. Groot R.D., Baker S., Baric R., Brown C., Drosten C., Enjuanes L., Fouchier R., Galiano M., Gorbalenya A., Memish Z., Perlman S., Poon L., Snijder E., Stephens G., Woo P., Zaki A., Zambon M., Ziebuhr J. Middle east respiratory syndrome coronavirus (mers-cov): Announcement of the coronavirus study group. J. Virol. 2013;87 doi: 10.1128/JVI.01244-13. - DOI - PMC - PubMed
    1. Xu J., Zhao S., Teng T., Abdalla A., Zhu W., Xie L., Wang Y., Guo X. Systematic comparison of two animal-to-human transmitted human coronaviruses: Sars-cov-2 and sars-cov. Viruses. 2020;12:244. doi: 10.3390/v12020244. - DOI - PMC - PubMed

MeSH terms