Inter-species validation for domain combination based protein-protein interaction prediction method
- PMID: 16901097
Inter-species validation for domain combination based protein-protein interaction prediction method
Abstract
Domain Combination based Protein-Protein Interaction Prediction (DCPPIP) method is revealed to show outstanding prediction accuracy in Yeast proteins. However, it is not yet apparent whether the method is still valid and can achieve comparable prediction accuracy for the proteins in other species. In this paper, we report the validation results of applying the DCPPIP method for Fly and Human proteins. We also report the results of inter-species validation, in which protein interaction and domain data of other species are used as learning set. 10,351 interacting protein pairs are used for the validation for Fly, 2,345 protein pairs for Human. 80% of the data are used as learning sets and 20% are reserved as test sets. High prediction accuracies (Fly: sensitivity approximately 77%, specificity approximately 92%, Human: sensitivity approximately 96%, specificity approximately 95%) are achieved in both Fly and Human cases. Interactions of proteins in Human, Mouse, H. pylori, E. coli, and C. elegans are predicted and validated using the protein interaction and domain data in Yeast, Fly, and the combination of Yeast and Fly respectively. Again, good prediction accuracy is achieved when the test protein pair has common domains with the proteins in a learning set of proteins. A notion of Domain Overlapping Rate (DOR) among species is newly developed in this paper and the correlation between DOR and prediction accuracy is examined. According to out test results, there exists fairly obvious correlation between DOR and prediction accuracy.
Similar articles
-
PreSPI: design and implementation of protein-protein interaction prediction service system.Genome Inform. 2004;15(2):171-80. Genome Inform. 2004. PMID: 15706503 Review.
-
A domain combination based probabilistic framework for protein-protein interaction prediction.Genome Inform. 2003;14:250-9. Genome Inform. 2003. PMID: 15706539
-
PreSPI: a domain combination based prediction system for protein-protein interaction.Nucleic Acids Res. 2004 Dec 1;32(21):6312-20. doi: 10.1093/nar/gkh972. Print 2004. Nucleic Acids Res. 2004. PMID: 15576357 Free PMC article.
-
Message-passing algorithms for the prediction of protein domain interactions from protein-protein interaction data.Bioinformatics. 2008 Sep 15;24(18):2064-70. doi: 10.1093/bioinformatics/btn366. Epub 2008 Jul 17. Bioinformatics. 2008. PMID: 18641010
-
Inferring protein-protein interactions from multiple protein domain combinations.Methods Mol Biol. 2009;541:43-59. doi: 10.1007/978-1-59745-243-4_3. Methods Mol Biol. 2009. PMID: 19381530 Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Molecular Biology Databases