Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jan 15:2013:bas052.
doi: 10.1093/database/bas052. Print 2013.

PPInterFinder--a mining tool for extracting causal relations on human proteins from literature

Affiliations

PPInterFinder--a mining tool for extracting causal relations on human proteins from literature

Kalpana Raja et al. Database (Oxford). .

Abstract

One of the most common and challenging problem in biomedical text mining is to mine protein-protein interactions (PPIs) from MEDLINE abstracts and full-text research articles because PPIs play a major role in understanding the various biological processes and the impact of proteins in diseases. We implemented, PPInterFinder--a web-based text mining tool to extract human PPIs from biomedical literature. PPInterFinder uses relation keyword co-occurrences with protein names to extract information on PPIs from MEDLINE abstracts and consists of three phases. First, it identifies the relation keyword using a parser with Tregex and a relation keyword dictionary. Next, it automatically identifies the candidate PPI pairs with a set of rules related to PPI recognition. Finally, it extracts the relations by matching the sentence with a set of 11 specific patterns based on the syntactic nature of PPI pair. We find that PPInterFinder is capable of predicting PPIs with the accuracy of 66.05% on AIMED corpus and outperforms most of the existing systems. DATABASE URL: http://www.biomining-bu.in/ppinterfinder/

PubMed Disclaimer

Figures

Figure 1
Figure 1
Work flow of PPInterFinder.
Figure 2
Figure 2
Tregex-based algorithm for extracting the relation keyword.
Figure 3
Figure 3
Algorithm to extract PPI triplets from complex sentences with more than two proteins.
Figure 4
Figure 4
PPI extraction—methodology.
Figure 5
Figure 5
Screenshot of PPInterFinder showing input and extracted PPI pairs.

Similar articles

Cited by

References

    1. Kann MG. Protein interactions and disease: computational approaches to uncover the etiology of diseases. Brief. Bioinform. 2007;8:333–346. - PubMed
    1. Huang M, Ding S, Wang H, Zhu X. Mining physical protein-protein interactions from the literature. Genome Biol. 2008;9(Suppl 2):S12. - PMC - PubMed
    1. Kerrien S, Aranda B, Breuza L, et al. IntAct – open source resource for molecular interaction data. Nucleic Acids Res. 2007;35:d561–d565. - PMC - PubMed
    1. Zanzoni A, Montecchi-Palazzi L, Quondam M, et al. MINT: a molecular INTeraction database. FEBS Lett. 2002;513:135–140. - PubMed
    1. Bader GD, Donaldson I, Wolting C, et al. BIND – the biomolecular interaction network database. Nucleic Acids Res. 2001;29:242–245. - PMC - PubMed

Publication types

LinkOut - more resources