Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct 16:8:391.
doi: 10.1186/1471-2105-8-391.

Analysis on multi-domain cooperation for predicting protein-protein interactions

Affiliations

Analysis on multi-domain cooperation for predicting protein-protein interactions

Rui-Sheng Wang et al. BMC Bioinformatics. .

Abstract

Background: Domains are the basic functional units of proteins. It is believed that protein-protein interactions are realized through domain interactions. Revealing multi-domain cooperation can provide deep insights into the essential mechanism of protein-protein interactions at the domain level and be further exploited to improve the accuracy of protein interaction prediction.

Results: In this paper, we aim to identify cooperative domains for protein interactions by extending two-domain interactions to multi-domain interactions. Based on the high-throughput experimental data from multiple organisms with different reliabilities, the interactions of domains were inferred by a Linear Programming algorithm with Multi-domain pairs (LPM) and an Association Probabilistic Method with Multi-domain pairs (APMM). Experimental results demonstrate that our approach not only can find cooperative domains effectively but also has a higher accuracy for predicting protein interaction than the existing methods. Cooperative domains, including strongly cooperative domains and superdomains, were detected from major interaction databases MIPS and DIP, and many of them were verified by physical interactions from the crystal structures of protein complexes in PDB which provide intuitive evidences for such cooperation. Comparison experiments in terms of protein/domain interaction prediction justified the benefit of considering multi-domain cooperation.

Conclusion: From the computational viewpoint, this paper gives a general framework to predict protein interactions in a more accurate manner by considering the information of both multi-domains and multiple organisms, which can also be applied to identify cooperative domains, to reconstruct large complexes and further to annotate functions of domains. Supplementary information and software are provided in http://intelligent.eic.osaka-sandai.ac.jp/chenen/MDCinfer.htm and http://zhangroup.aporc.org/bioinfo/MDCinfer.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An illustrative example for multi-domain interactions. (a) All multi-domain pairs are listed for two proteins P1 and P2. Proteins: P1 = {Da, Db, Dc }, P2 = {De, Df} ; Domains: Da, Db, Dc, De, Df; (b) The illustration of domain interactions by considering multi-domain pairs in the proposed model. There are one pair of interacting proteins and three pairs of non-interacting proteins. The bold line (red) represents interacting domain pair, while the dotted lines (green) are the deleted non-interacting domain pairs.
Figure 2
Figure 2
Cooperative domains in the complex crystal structure formed by proteins P02994 (with ORFs: YBR118W, YPR080W) and P32471 (with ORF: YAL003W). Protein sequences are shown using thick gray lines, and Pfam domain annotations are shown using colored rectangular boxes and drawn to scale (based on the Pfam database). The names of the protein sequences in this protein complex are listed to the upper left of the domain architecture. The identified cooperative domain pairs are listed to the upper right of the domain architecture. The domain names are labeled by the same color as in the Pfam domain annotation. The cartoon of PDB crystal structure (PDB ID: 1f60, Crystal structure of the yeast elongation factor complex) demonstrates the cooperative domain interactions (where domain colors are consistent with the domain annotation), i.e. domain PF00736 in protein P32471 interacts physically with domains of protein P02994. Other complexes in PDB containing these cooperative domains are also listed by their matched PDB IDs and chain IDs.
Figure 3
Figure 3
Comparisons of RMSE on two-domain pairs and on multiple-domain pairs for Krogan's yeast extended datasets. (a) The results of LPM on training. (b) The results of LPM on testing. (c) The results of APMM on training. (d) The results of APMM on testing.
Figure 4
Figure 4
(a) ROC curve comparison of APMM and the extended EM on multiple-organism data. (b) ROC curve comparison of APMM based on two-domain pairs and multi-domain pairs.
Figure 5
Figure 5
Mean confidence score of the predicted domain interactions (by APMM) at different domain interaction thresholds respectively based on single-organism data and multiple-organism data.
Figure 6
Figure 6
The overlaps of domain-domain interactions predicted by APMM, DPEA and PE with iPfam.
Figure 7
Figure 7
The distribution of the predicted DDI overlaps with iPfam by DPEA, PE and APMM.
Figure 8
Figure 8
Reconstruction of DNA-directed RNA polymerase complex. (a) The RNA Polymerase II-TFIIS complex (PDB ID 1y1v) with 13 subunits (from chain A to chain S). Every chain is one protein (shown by their UniProtKB accessions) and their complex interactions form the large polymer. (b) The PfamA domain architecture for every protein. (c) The cooperative domains identified by our method with protein interaction pairs containing them. The red or blue colors of proteins and domains indicate their memberships.

Similar articles

Cited by

References

    1. Eisbacher M, Holmes M, Newton A, Hogg P, Khachigian L, Crossley M, Chong B. Protein-protein interaction between Fli-1 and GATA-1 mediates synergistic expression of megakaryocyte-specific genes through cooperative DNA binding. Mol Cell Biol. 2003;23:3427–3441. doi: 10.1128/MCB.23.10.3427-3441.2003. - DOI - PMC - PubMed
    1. Labalette C, Renard C, Neuveut C, Buendia M, Wei Y. Interaction and functional cooperation between the LIM protein FHL2, CBP/p300, and β-Catenin. Mol Cell Biol. 2004;24:10689–10702. doi: 10.1128/MCB.24.24.10689-10702.2004. - DOI - PMC - PubMed
    1. Salwinski L, Miller C, Smith A, Pettit F, Bowie J, Eisenberg D. The database of interacting proteins: 2004 update. Nucl Acids Res. 2004;32:D449–451. doi: 10.1093/nar/gkh086. - DOI - PMC - PubMed
    1. Mewes H, Frishman D, Mayer K, Munsterkotter M, Noubibou O, Pagel P, Rattei T, Oesterheld M, Ruepp A, Stumpflen V. MIPS: analysis and annotation of proteins from whole genomes in 2005. Nucl Acids Res. 2006;34:D169–172. doi: 10.1093/nar/gkj148. - DOI - PMC - PubMed
    1. Deng M, Sun F, Chen T. Assessment of the reliability of protein-protein interactions and protein function prediction. Pac Symp Biocomput. 2003:140–51. - PubMed

Publication types

Substances

LinkOut - more resources