Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 24;3(3):101460.
doi: 10.1016/j.xpro.2022.101460. eCollection 2022 Sep 16.

SPIKES: Identification of physicochemical properties of spike proteins across diverse host species of SARS-CoV-2

Affiliations

SPIKES: Identification of physicochemical properties of spike proteins across diverse host species of SARS-CoV-2

Srinivasulu Yerukala Sathipati et al. STAR Protoc. .

Abstract

We describe a protocol to identify physicochemical properties using amino acid sequences of spike (S) proteins of SARS-CoV-2. We present an S protein prediction technique named SPIKES, incorporating an inheritable bi-objective combinatorial genetic algorithm to determine the host species specificity. This protocol addresses the S protein amino acid sequence data collection, preprocessing, methodology, and analysis. For complete details on the use and execution of this protocol, please refer to Yerukala Sathipati et al. (2022).

Keywords: Bioinformatics; Microbiology; Proteomics; Systems biology.

PubMed Disclaimer

Conflict of interest statement

The authors declare competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Screenshots showing spike protein data acquisition from databases (A and B) (A) Displaying SARS-CoV-2 data from GISAID and (B) NCBI databases. An example of amino acid sequence in FASTA format.
Figure 2
Figure 2
The steps involved in feature selection algorithm
Figure 3
Figure 3
The comparison of physicochemical property (AAindex ID: RACS820104) between spike proteins of human and animal host coronaviruses The ID RACS820104 represents the average relative fractional occurrence in EL(i).
Figure 4
Figure 4
The comparison of amino acid compositions between spike proteins of human and animal host coronaviruses
Figure 5
Figure 5
Spike glycoprotein complex Spike glycoprotein (PDB: 6ACJ, EM 4.2 Angstrom) in complex with ACE2 (green ribbon) showing the amino acid changes that occurred between Rousettus bat coronavirus (GenBank: AOG30822.1) and hCoV/Wuhan/WIV05/2019. The mutations in different strains are shown as colored balls.
Figure 6
Figure 6
Secondary structure and surface hydrophobicity of spike protein 6VXX

References

    1. Chang C.-C., Lin C.-J. LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2011;2:1–27. doi: 10.1145/1961189.1961199. - DOI
    1. Charton M., Charton B.I. The dependence of the Chou-Fasman parameters on amino acid side chain structure. J. Theor. Biol. 1983;102:121–134. doi: 10.1016/0022-5193(83)90265-5. - DOI - PubMed
    1. Geisow M.J., Roberts R.D.B. Amino acid preferences for secondary structure vary with protein class. Int. J. Biol. Macromol. 1980;2:387–389. doi: 10.1016/0141-8130(80)90023-9. - DOI
    1. Ho S.Y., Chen J.H., Huang M.H. Inheritable genetic algorithm for biobjective 0/1 combinatorial optimization problems and its applications. IEEE Trans. Syst. Man Cybern. B Cybern. 2004;34:609–620. doi: 10.1109/tsmcb.2003.817090. - DOI - PubMed
    1. Huang Y., Niu B., Gao Y., Fu L., Li W. CD-HIT suite: a web server for clustering and comparing biological sequences. Bioinformatics. 2010;26:680–682. doi: 10.1093/bioinformatics/btq003. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources