Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jun;16(2):264-78.
doi: 10.2478/s11658-011-0008-x. Epub 2011 Mar 20.

PPI_SVM: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables

Affiliations

PPI_SVM: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables

Piyali Chatterjee et al. Cell Mol Biol Lett. 2011 Jun.

Abstract

Protein-protein interactions (PPI) control most of the biological processes in a living cell. In order to fully understand protein functions, a knowledge of protein-protein interactions is necessary. Prediction of PPI is challenging, especially when the three-dimensional structure of interacting partners is not known. Recently, a novel prediction method was proposed by exploiting physical interactions of constituent domains. We propose here a novel knowledge-based prediction method, namely PPI_SVM, which predicts interactions between two protein sequences by exploiting their domain information. We trained a two-class support vector machine on the benchmarking set of pairs of interacting proteins extracted from the Database of Interacting Proteins (DIP). The method considers all possible combinations of constituent domains between two protein sequences, unlike most of the existing approaches. Moreover, it deals with both single-domain proteins and multi domain proteins; therefore it can be applied to the whole proteome in high-throughput studies. Our machine learning classifier, following a brainstorming approach, achieves accuracy of 86%, with specificity of 95%, and sensitivity of 75%, which are better results than most previous methods that sacrifice recall values in order to boost the overall precision. Our method has on average better sensitivity combined with good selectivity on the benchmarking dataset. The PPI_SVM source code, train/test datasets and supplementary files are available freely in the public domain at: http://code.google.com/p/cmater-bioinfo/.

PubMed Disclaimer

References

    1. Ito T., Tashiro K., Muta S., Ozawa R., Chiba T., Nishizawa M., Yamamoto K., Kuhara S., Sakaki Y. Toward a protein-protein interaction map of the budding yeast: a comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc. Natl. Acad. Sci. USA. 2000;97:1143–1147. doi: 10.1073/pnas.97.3.1143. - DOI - PMC - PubMed
    1. Plewczynski D., Basu S. AMS 3.0: prediction of post-translational modifications. BMC Bioinformatics. 2010;11:210. doi: 10.1186/1471-2105-11-210. - DOI - PMC - PubMed
    1. Gharakhanian E., Takahashi J., Clever J., Kasamatsu H. In vitro assay for protein-protein interaction: carboxyl-terminal 40 residues of simian virus 40 structural protein VP3 contain a determinant for interaction with VP1. Proc. Natl. Acad. Sci. USA. 1998;85:6607–6611. doi: 10.1073/pnas.85.18.6607. - DOI - PMC - PubMed
    1. Hu C.D., Chinenov Y., Kerppola T.K. Visualization of interactions among bZIP and Rel family proteins in living cells using bimolecular fluorescence complementation. Mol. Cell. 2002;9:789–798. doi: 10.1016/S1097-2765(02)00496-3. - DOI - PubMed
    1. Rigaut G., Shevchenko A., Rutz B., Wilm M., Mann M., Seraphin B. A generic protein purification method for protein complex characterization and proteome exploration. Nat. Biotechnol. 1999;17:1030–1032. doi: 10.1038/13732. - DOI - PubMed

Publication types

MeSH terms