Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 3;15(11):641.
doi: 10.3390/toxins15110641.

Conotoxin Prediction: New Features to Increase Prediction Accuracy

Affiliations

Conotoxin Prediction: New Features to Increase Prediction Accuracy

Lyman K Monroe et al. Toxins (Basel). .

Abstract

Conotoxins are toxic, disulfide-bond-rich peptides from cone snail venom that target a wide range of receptors and ion channels with multiple pathophysiological effects. Conotoxins have extraordinary potential for medical therapeutics that include cancer, microbial infections, epilepsy, autoimmune diseases, neurological conditions, and cardiovascular disorders. Despite the potential for these compounds in novel therapeutic treatment development, the process of identifying and characterizing the toxicities of conotoxins is difficult, costly, and time-consuming. This challenge requires a series of diverse, complex, and labor-intensive biological, toxicological, and analytical techniques for effective characterization. While recent attempts, using machine learning based solely on primary amino acid sequences to predict biological toxins (e.g., conotoxins and animal venoms), have improved toxin identification, these methods are limited due to peptide conformational flexibility and the high frequency of cysteines present in toxin sequences. This results in an enumerable set of disulfide-bridged foldamers with different conformations of the same primary amino acid sequence that affect function and toxicity levels. Consequently, a given peptide may be toxic when its cysteine residues form a particular disulfide-bond pattern, while alternative bonding patterns (isoforms) or its reduced form (free cysteines with no disulfide bridges) may have little or no toxicological effects. Similarly, the same disulfide-bond pattern may be possible for other peptide sequences and result in different conformations that all exhibit varying toxicities to the same receptor or to different receptors. We present here new features, when combined with primary sequence features to train machine learning algorithms to predict conotoxins, that significantly increase prediction accuracy.

Keywords: collisional cross section; conotoxins; ion mobility–mass spectrometry; machine learning; post-translational modifications; prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest associated with this work.

Figures

Figure 1
Figure 1
Example of conotoxins adopting different structures while sharing the same primary amino acid sequence (ad) or conotoxins sharing the same disulfide bond pattern but different primary amino acid sequences (e,f). (a) Alpha conotoxin AuIB in its native conformation with a disulfide bond pattern of Cys2-Cys8 and Cys3-Cys15 (PDB: 1MXN [18]). (b) Alpha conotoxin AuIB in its ribbon (isoform) conformation with a disulfide bond pattern of Cys2-Cys15 and Cys3-Cys8 (PDB: 1MXP [18]). (c) Mu conotoxin KIIIA with a disulfide bond pattern of Cys1-Cys9, Cys2-Cys15, and Cys4-Cys16 (PDB: 7SAV [19]). (d) Mu conotoxin KIIIA with a disulfide bond pattern of Cys1-Cys16, Cys2-Cys9, and Cys4-Cys15 (PDB: 7SAW [19]). (e) Kappa conotoxin PVIIA with a disulfide bond pattern of Cys1-Cys16, Cys8-Cys20, and Cys15-Cys26 (PDB: 1AV3 [20]). (f) Omega conotoxin MVIIA with a disulfide bond pattern of Cys1-Cys16, Cys15-Cys25, and Cys8-Cys20 (PDB: 1DW4 [21]).
Figure 2
Figure 2
Features were divided into four groups (P, P2, SS, and CCS), and the effect of each feature group was evaluated with regard to conotoxin prediction accuracy.
Figure 3
Figure 3
Workflow for feature extraction.
Figure 4
Figure 4
Overall ML pipeline describing the process of using a dataset to train and cross-validate a classifier.

References

    1. Becker S., Terlau H. Toxins from Cone Snails: Properties, Applications and Biotechnological Production. Appl. Microbiol. Biotechnol. 2008;79:1–9. - PMC - PubMed
    1. Verdes A., Anand P., Gorson J., Jannetti S., Kelly P., Leffler A., Simpson D., Ramrattan G., Holford M. From Mollusks to Medicine: A Venomics Approach for the Discovery and Characterization of Therapeutics from Terebridae Peptide Toxins. Toxins. 2016;8:117. - PMC - PubMed
    1. Zouari-Kessentini R., Srairi-Abid N., Bazaa A., El Ayeb M., Luis J., Marrakchi N. Antitumoral Potential of Tunisian Snake Venoms Secreted Phospholipases A2. Biomed Res. Int. 2013;2013:391389. - PMC - PubMed
    1. Wulff H., Castle N.A., Pardo L.A. Voltage-Gated Potassium Channels as Therapeutic Targets. Nat. Rev. Drug Discov. 2009;8:982–1001. - PMC - PubMed
    1. de Oliveira Junior N.G., e Silva Cardoso M.H., Franco O.L. Snake Venoms: Attractive Antimicrobial Proteinaceous Compounds for Therapeutic Purposes. Cell Mol. Life Sci. 2013;70:4645–4658. - PMC - PubMed