Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Nov 24:1:19-47.
doi: 10.4137/bbi.s315.

Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines

Affiliations

Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines

Zhi Qun Tang et al. Bioinform Biol Insights. .

Abstract

Various computational methods have been used for the prediction of protein and peptide function based on their sequences. A particular challenge is to derive functional properties from sequences that show low or no homology to proteins of known function. Recently, a machine learning method, support vector machines (SVM), have been explored for predicting functional class of proteins and peptides from amino acid sequence derived properties independent of sequence similarity, which have shown promising potential for a wide spectrum of protein and peptide classes including some of the low- and non-homologous proteins. This method can thus be explored as a potential tool to complement alignment-based, clustering-based, and structure-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using SVM for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented.

Keywords: machine learning method; peptide function; protein family; protein function; protein function prediction; support vector machines.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic diagram illustrating the process of the training and prediction of the functional class of proteins and peptides by using support vector machine (SVM) method. A,B: feature vectors of proteins belong to a functional class; E,F: feature vectors of proteins not belong to a functional class. Sequence-derived feature hj, vj, pj, … represents such structural and physicochemical properties as hydrophobicity, polarizability, and volume; or such properties as domain information, subcellular localization, and post-translational (PT) modification profiles etc.
Figure 2.
Figure 2.
Support vector machines. (a) Definition of hyper-plane and margin. The circular dots and square dots represent samples of class −1 and class +1, respectively. (b) The available hyper-planes H, H’, H”, …, corresponding to a set of training data. (c) Unique optimal separating hyper-plane of a set of training data. (d) Basic idea of support vector machines: Projection of the training data nonlinearly into a higher-dimensional feature space via φ, and subsequent construction of a separating hyper-plane with maximum margin in that space.

Similar articles

Cited by

References

    1. Abbas AK, Lichtman AH.2005. Cellular and Molecular Immunology, Updated Edition. Saunders5th ed
    1. Aguilar D, Oliva B, Aviles FX, et al. TranScout: prediction of gene expression regulatory proteins from their sequences. Bioinformatics. 2002;18:597–607. - PubMed
    1. Al-Shahib A, Breitling R, Gilbert D. Feature selection and the class imbalance problem in predicting protein function from sequence. Appl Bioinformatics. 2005a;4:195–203. - PubMed
    1. Al-Shahib A, Breitling R, Gilbert D. FrankSum: new feature selection method for protein function prediction. Int. J. Neural Syst. 2005b;15:259–75. - PubMed
    1. Alexander S, Peters J, Mead A, et al. TiPS receptor and ion channel nomenclature supplement. Trends Pharmacol. Sci. 1999;19:5–85.

LinkOut - more resources