Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar 10;106(10):3752-7.
doi: 10.1073/pnas.0806427106. Epub 2009 Feb 23.

Uncovering the rules for protein-protein interactions from yeast genomic data

Affiliations

Uncovering the rules for protein-protein interactions from yeast genomic data

Jin Wang et al. Proc Natl Acad Sci U S A. .

Abstract

Identifying protein-protein interactions is crucial for understanding cellular functions. Genomic data provides opportunities and challenges in identifying these interactions. We uncover the rules for predicting protein-protein interactions using a frequent pattern tree (FPT) approach modified to generate a minimum set of rules (mFPT), with rule attributes constructed from the interaction features of the yeast genomic data. The mFPT prediction accuracy is benchmarked against other commonly used methods such as Bayesian networks and logistic regressions under various statistical measures. Our study indicates that mFPT outranks other methods in predicting the protein-protein interactions for the database used. We predict a new protein-protein interaction complex whose biological function is related to premRNA splicing and new protein-protein interactions within existing complexes based on the rules generated. Our method is general and can be used to discover the underlying rules for protein-protein interactions, genomic interactions, structure-function relationships, and other fields of research.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
FPT KS curve of training (A) and testing (B) samples.
Fig. 2.
Fig. 2.
FPT ROC curves (A and B) for different scales. Red curves represent training samples, and green curves represent testing samples.
Fig. 3.
Fig. 3.
ROC curve and KS value comparisons for 4 methods. (A and B) ROC curve comparisons of the training (A) and testing (B) samples for 4 methods. (C and D) KS value comparisons of the training (C) and testing (D) samples for 4 methods.
Fig. 4.
Fig. 4.
Comparisons of correct prediction rate for 4 methods for both training and testing samples
Fig. 5.
Fig. 5.
Comparisons of correct prediction rate for 5 features and 13 features with both training and testing data samples
Fig. 6.
Fig. 6.
Predicted complex.
Fig. 7.
Fig. 7.
Mitochondrial ribosome complex.
Fig. 8.
Fig. 8.
New complex.

References

    1. Jansen R, et al. A Bayesian networks approach for predicting protein–protein interactions from genomic data. Science. 2003;302:449–453. - PubMed
    1. Lin N, Wu BL, Jansen R, Gerstein M, Zhao HY. Information assessment on predicting protein–protein interactions. BMC Bioinformatics. 2004;5:154. - PMC - PubMed
    1. Cao JP MA YC, Li YX, Shi TL. The application of the computational methods in protein–protein interaction study. Chinese Bulletin of Life Sci. 2005;17:82–87.
    1. Uetz P, et al. A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature. 2000;403:623–627. - PubMed
    1. Ito T, et al. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci USA. 2001;98:4569–4574. - PMC - PubMed

Publication types

MeSH terms

Substances