BMC Bioinformatics. 2010 Jun 16;11:326.
doi: 10.1186/1471-2105-11-326.

PPLook: an automated data mining tool for protein-protein interaction

Shao-Wu Zhang et al. BMC Bioinformatics.

Abstract

Background: Extracting and visualizing protein-protein interactions (PPIs) from the text literature is a meaningful topic in protein science, because it assists the identification of interactions among proteins. There is a lack of tools that extract PPIs and also visualize and classify the results.

Results: We developed a PPI search system, termed PPLook, which automatically extracts and visualizes protein-protein interactions (PPIs) from text. Given a query protein name, PPLook searches a dataset for other proteins that interact with it, using a keyword-dictionary pattern-matching algorithm, and displays topological parameters such as the number of nodes, edges, and connected components. The visualization component of PPLook shows the interaction relationships among the proteins in three-dimensional space, based on the OpenGL graphics interface. PPLook can also select a protein semantic class, count the proteins of that semantic class that interact with the query protein, and count the articles in which an interaction involving the query protein appears. Moreover, PPLook provides heterogeneous search and a user-friendly graphical interface.
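
The topological parameters named in the Results (nodes, edges, connected components) can be computed directly from a list of extracted interaction pairs. The following is a minimal sketch and not PPLook's implementation: the function name `topology_summary` and the placeholder proteins `protA` to `protD` are hypothetical, and interactions are assumed to be undirected, with duplicate pairs counted once.

```python
def topology_summary(pairs):
    """Count nodes, edges, and connected components of an undirected graph.

    `pairs` is an iterable of (protein_a, protein_b) tuples.
    Illustrative only; names and data are assumptions, not PPLook output.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving; registers unseen nodes.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    edges = set()
    for a, b in pairs:
        # frozenset makes (a, b) and (b, a) the same undirected edge.
        edges.add(frozenset((a, b)))
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two components

    components = {find(node) for node in list(parent)}
    return {
        "nodes": len(parent),
        "edges": len(edges),
        "components": len(components),
    }


# Placeholder interaction pairs (not real search results).
example_pairs = [
    ("IL-2", "protA"),
    ("protA", "protB"),
    ("protC", "protD"),
    ("protA", "IL-2"),  # duplicate of the first pair, reversed
]
summary = topology_summary(example_pairs)
print(summary)
```

Union-find is used here only because it yields the component count in a single pass over the pairs; a breadth-first search over an adjacency list would serve equally well.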

Conclusions: PPLook is an effective tool for biologists and biosystem developers who need to access PPI information from the literature. PPLook is freely available for non-commercial users at http://meta.usc.edu/softs/PPLook.

Figures

Figure 1
Architecture of PPLook. PPLook contains four modules: (i) the submission authentication module checks whether the query word is a protein name; (ii) the article parser module collects the sentences containing the query protein name; (iii) the full-sentence parser module determines the PPIs; and (iv) the PPI visualization module displays the PPIs as a 3-D graph.
Figure 2
An example of PPLook search results for the protein IL-2. (A) is the protein semantic class selection window, (B) is the IL-2 protein input window, (C) is the text results output window, and (D) is the 3-D display output window.
Figure 3
Heterogeneous search results: structure information and MEDLINE results for the protein IL-2. The top left window shows the IL-2 structure information from the PDB database. The bottom right window shows the Google search results.
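
The four-module architecture described in the Figure 1 caption can be sketched as a small text pipeline. This is an assumption-laden illustration, not the authors' code: the dictionaries `PROTEIN_NAMES` and `INTERACTION_KEYWORDS`, all function names, and the sample text are hypothetical, and the keyword test below is a simple stand-in for the keyword-dictionary pattern matching mentioned in the abstract. Module (iv), the 3-D display, is omitted; the returned pairs are what such a renderer would consume.

```python
import re

# Hypothetical dictionaries; the real PPLook dictionaries are not given here.
PROTEIN_NAMES = {"IL-2", "protA", "protB"}
INTERACTION_KEYWORDS = {"binds", "interacts", "activates", "phosphorylates"}


def tokenize(sentence):
    """Split a sentence into word tokens, keeping hyphenated names like IL-2."""
    return re.findall(r"[A-Za-z0-9][A-Za-z0-9\-]*", sentence)


def is_protein_name(query):
    """(i) Submission authentication: accept only known protein names."""
    return query in PROTEIN_NAMES


def collect_sentences(text, query):
    """(ii) Article parser: keep the sentences that mention the query protein."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return [s for s in sentences if query in tokenize(s)]


def extract_interactions(sentences, query):
    """(iii) Full-sentence parser: report a pair when a sentence contains an
    interaction keyword and another known protein name."""
    pairs = set()
    for sentence in sentences:
        tokens = tokenize(sentence)
        if not any(t.lower() in INTERACTION_KEYWORDS for t in tokens):
            continue
        for t in tokens:
            if t in PROTEIN_NAMES and t != query:
                pairs.add((query, t))
    return sorted(pairs)


def search(text, query):
    """Run modules (i) to (iii); return sorted (query, partner) pairs."""
    if not is_protein_name(query):
        return []
    return extract_interactions(collect_sentences(text, query), query)


# Made-up sample text for illustration only.
sample_text = (
    "IL-2 binds protA. protB was measured in serum. "
    "IL-2 and protB were both detected. IL-2 activates protB in vitro."
)
result = search(sample_text, "IL-2")
print(result)
```

In this sketch a sentence that mentions two proteins without an interaction keyword produces no pair, which is the main point of having a separate full-sentence parsing step after sentence collection.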
