Determining confidence of predicted interactions between HIV-1 and human proteins using conformal method
- PMID: 22174286
- PMCID: PMC3249613
Abstract
Identifying protein-protein interactions (PPIs) is critical for understanding virtually all cellular molecular mechanisms. Previously, predicting PPIs was treated as a binary classification task and was commonly solved in a supervised setting, which requires a positively labeled set of known PPIs and a negatively labeled set of non-interacting protein pairs. In those methods, the learner provides the likelihood of the predicted interaction, but without a confidence level associated with each prediction. Here, we apply a conformal prediction (CP) framework to make predictions and to estimate the confidence of those predictions. The conformal predictor uses a function measuring the relative 'strangeness' of interacting pairs to check whether a new example, added to the sequence of already known PPIs, would conform to the 'exchangeability' assumption, namely that the distribution of interacting pairs is invariant under any permutation of the pairs. In fact, this is the only assumption we make about the data. Another advantage is that the user can control the number of errors by specifying a desired confidence level. This feature of CP is very useful for producing a ranked list of possible interacting pairs. In this paper, the conformal method has been developed to deal with just one class, the class of interacting proteins, because there is no clearly defined class of 'non-interacting' pairs. The confidence level helps the biologist interpret the results and better guides the choice of pairs for experimental validation. We apply the proposed conformal framework to improve the identification of interacting pairs between HIV-1 and human proteins.
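The one-class conformal test described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the strangeness function, so the code below assumes one for illustration: the mean Euclidean distance from a pair to its k nearest neighbours among the known interacting pairs. The feature vectors, the function names (`knn_strangeness`, `conformal_p_value`, `predict_interaction`) and the parameter `k` are all hypothetical and are not taken from the paper.

```python
import numpy as np


def knn_strangeness(points, k):
    """Strangeness of each row: mean distance to its k nearest other rows.

    This nonconformity measure is an assumption made for illustration only.
    """
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))
    np.fill_diagonal(dists, np.inf)          # a point is not its own neighbour
    nearest = np.sort(dists, axis=1)[:, :k]  # k smallest distances per row
    return nearest.mean(axis=1)


def conformal_p_value(known_pairs, candidate, k=3):
    """p-value of a candidate pair against the bag of known interacting pairs.

    The candidate is appended to the known pairs, strangeness is computed for
    every member of the extended bag, and the p-value is the fraction of
    members that are at least as strange as the candidate.
    """
    bag = np.vstack([known_pairs, candidate[None, :]])
    alphas = knn_strangeness(bag, k)
    return float(np.mean(alphas >= alphas[-1]))


def predict_interaction(known_pairs, candidate, confidence=0.95, k=3):
    """Accept the candidate as interacting if its p-value exceeds 1 - confidence."""
    return conformal_p_value(known_pairs, candidate, k) > 1.0 - confidence


# Synthetic stand-in for feature vectors of known interacting pairs.
rng = np.random.default_rng(0)
known = rng.normal(0.0, 1.0, size=(50, 4))
print(conformal_p_value(known, np.zeros(4)))       # candidate near the known pairs
print(conformal_p_value(known, np.full(4, 10.0)))  # candidate far from them
```

Under exchangeability, a true interacting pair receives a p-value at or below 1 - confidence with probability at most 1 - confidence, which is how the chosen confidence level bounds the error rate. Sorting candidate pairs by p-value gives the kind of ranked list the abstract mentions.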