Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Jul 1;31(13):3692-7.
doi: 10.1093/nar/gkg600.

SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence

Affiliations

SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence

C Z Cai et al. Nucleic Acids Res. .

Abstract

Prediction of protein function is of significance in studying biological processes. One approach for function prediction is to classify a protein into functional family. Support vector machine (SVM) is a useful method for such classification, which may involve proteins with diverse sequence distribution. We have developed a web-based software, SVMProt, for SVM classification of a protein into functional family from its primary sequence. SVMProt classification system is trained from representative proteins of a number of functional families and seed proteins of Pfam curated protein families. It currently covers 54 functional families and additional families will be added in the near future. The computed accuracy for protein family classification is found to be in the range of 69.1-99.6%. SVMProt shows a certain degree of capability for the classification of distantly related proteins and homologous proteins of different function and thus may be used as a protein function prediction tool that complements sequence alignment methods. SVMProt can be accessed at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi.

PubMed Disclaimer

Figures

Figure 1
Figure 1
SVMProt web page.
Figure 2
Figure 2
Example of the SVMProt output returned to the user.
Figure 3
Figure 3
Hypothetical sequence for illustration of derivation of the feature vector of a protein.
Figure 4
Figure 4
Statistical relationship between the R-value and P-value (probability of correct classification) derived from analysis of 9932 positive and 45 999 negative samples of proteins.

References

    1. Eisenberg D., Marcotte,C.A., Xenarios,I. and Yeates,T.O. (2000) Protein function in the post-genomic era. Nature, 405, 823–826. - PubMed
    1. Bork P., Dandekar,T., Diaz-Lazcoz,Y., Eisenhaber,F., Huynen,M. and Yuan,Y. (1998) Predicting function: from genomes and back. J. Mol. Biol., 283, 707–725. - PubMed
    1. Pellegrini M. (2001) Computational methods for protein function analysis. Curr. Opin. Chem. Biol., 5, 46–50. - PubMed
    1. Teichman S.A. and Mitchison,G. (2000) Computing protein function. Nat. Biotechnol., 18, 27. - PubMed
    1. Huynen M., Snel,B., Lathe,W. and Bork,P. (2000) Predicting protein function by genomic context: quantitative evaluation and qualitative inferences. Genome Res., 10, 1204–1210. - PMC - PubMed

Publication types