Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012:311-22.

Determining confidence of predicted interactions between HIV-1 and human proteins using conformal method

Affiliations

Determining confidence of predicted interactions between HIV-1 and human proteins using conformal method

Ilia Nouretdinov et al. Pac Symp Biocomput. 2012.

Abstract

Identifying protein-protein interactions (PPI's) is critical for understanding virtually all cellular molecular mechanisms. Previously, predicting PPI's was treated as a binary classification task and has commonly been solved in a supervised setting which requires a positive labeled set of known PPI's and a negative labeled set of non-interacting protein pairs. In those methods, the learner provides the likelihood of the predicted interaction, but without a confidence level associated with each prediction. Here, we apply a conformal prediction framework to make predictions and estimate confidence of the predictions. The conformal predictor uses a function measuring relative 'strangeness' interacting pairs to check whether prediction of a new example added to the sequence of already known PPI's would conform to the 'exchangeability' assumption: distribution of interacting pairs is invariant with any permutations of the pairs. In fact, this is the only assumption we make about the data. Another advantage is that the user can control a number of errors by providing a desirable confidence level. This feature of CP is very useful for a ranking list of possible interactive pairs. In this paper, the conformal method has been developed to deal with just one class - class interactive proteins - while there is not clearly defined of 'non-interactive'pairs. The confidence level helps the biologist in the interpretation of the results, and better assists the choices of pairs for experimental validation. We apply the proposed conformal framework to improve the identification of interacting pairs between HIV-1 and human proteins.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Random Forest (RF) prediction scores (where they are positive) and p-values obtained by the conformal predictor for the same pairs. Crosses (interactions) are known interactions, other examples are represented as points. Examples with RF prediction score less than zero are not included. The horizontal line corresponds to 10% cut-off for p-value (significance level), or confidence level 90% (for a true interaction, we are 90% sure that it is above this line).
Fig. 2
Fig. 2
Dependence of the size of conformal prediction list on the confidence level. Lower line corresponds to the percentage of pairs predicted as interactions both by conformal prediction and by positive random forest score.

Similar articles

Cited by

References

    1. Espadaler J, Romero-Isart O, Jackson RM, Oliva B. Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships. Bioinformatics. 2005 Aug;21(16):3360–3368. - PubMed
    1. Fu W, Sanders-Beer BE, Katz KS, Maglott DR, Pruitt KD, Ptak RG. Human immunodeficiency virus type 1, human protein interaction database at ncbi. Nucleic Acids Res. 2009 Jan;37(Database issue):D417–D422. - PMC - PubMed
    1. Gammerman A, Vovk V. Hedging predictions in machine learning. Comput. J. 2007;50:164–172.
    1. Shoemaker BA, Panchenko AR. Deciphering protein-protein interactions. part i. experimental techniques and databases. PLoS Comput Biol. 2007;3(3):e42. - PMC - PubMed
    1. Shoemaker BA, Panchenko AR. Deciphering protein-protein interactions. part ii. Computational methods to predict protein and domain interaction partners. PLoS Comput Biol. 2007;3(4):e43. - PMC - PubMed

Publication types

MeSH terms

Substances