Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan;78(1-2):441-463.
doi: 10.1007/s00285-018-1279-x. Epub 2018 Oct 5.

Identifying anticancer peptides by using a generalized chaos game representation

Affiliations

Identifying anticancer peptides by using a generalized chaos game representation

Li Ge et al. J Math Biol. 2019 Jan.

Abstract

We generalize chaos game representation (CGR) to higher dimensional spaces while maintaining its bijection, keeping such method sufficiently representative and mathematically rigorous compare to previous attempts. We first state and prove the asymptotic property of CGR and our generalized chaos game representation (GCGR) method. The prediction follows that the dissimilarity of sequences which possess identical subsequences but distinct positions would be lowered exponentially by the length of the identical subsequence; this effect was taking place unbeknownst to researchers. By shining a spotlight on it now, we show the effect fundamentally supports (G)CGR as a similarity measure or feature extraction technique. We develop two feature extraction techniques: GCGR-Centroid and GCGR-Variance. We use the GCGR-Centroid to analyze the similarity between protein sequences by using the datasets 9 ND5, 24 TF and 50 beta-globin proteins. We obtain consistent results compared with previous studies which proves the significance thereof. Finally, by utilizing support vector machines, we train the anticancer peptide prediction model by using both GCGR-Centroid and GCGR-Variance, and achieve a significantly higher prediction performance by employing the 3 well-studied anticancer peptide datasets.

Keywords: Anticancer peptides; Chaos game representation; Similarity analysis; Support vector machine.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Nucleic Acids Res. 1990 Apr 25;18(8):2163-70 - PubMed
    1. J Comput Chem. 2008 Jul 30;29(10):1596-604 - PubMed
    1. J Comput Chem. 2010 Aug;31(11):2136-42 - PubMed
    1. Bioinformatics. 2001 May;17(5):429-37 - PubMed
    1. J Theor Biol. 2016 Oct 7;406:105-15 - PubMed

Publication types

MeSH terms

LinkOut - more resources