Using unsupervised patterns to extract gene regulation relationships for network construction
- PMID: 21573008
- PMCID: PMC3091867
- DOI: 10.1371/journal.pone.0019633
Using unsupervised patterns to extract gene regulation relationships for network construction
Abstract
Background: The gene expression is usually described in the literature as a transcription factor X that regulates the target gene Y. Previously, some studies discovered gene regulations by using information from the biomedical literature and most of them require effort of human annotators to build the training dataset. Moreover, the large amount of textual knowledge recorded in the biomedical literature grows very rapidly, and the creation of manual patterns from literatures becomes more difficult. There is an increasing need to automate the process of establishing patterns.
Methodology/principal findings: In this article, we describe an unsupervised pattern generation method called AutoPat. It is a gene expression mining system that can generate unsupervised patterns automatically from a given set of seed patterns. The high scalability and low maintenance cost of the unsupervised patterns could help our system to extract gene expression from PubMed abstracts more precisely and effectively.
Conclusions/significance: Experiments on several regulators show reasonable precision and recall rates which validate AutoPat's practical applicability. The conducted regulation networks could also be built precisely and effectively. The system in this study is available at http://ikmbio.csie.ncku.edu.tw/AutoPat/.
Conflict of interest statement
Figures
References
-
- Blaschke C, Valencia A. The Potential Use of SUISEKI as a Protein Interaction Discovery Tool. Genome Informatics. 2001;12:123–134. - PubMed
-
- Huang M, Zhu X, Hao Y, Payan DG, Qu K, et al. Discovering patterns to extract protein-protein interactions from full texts. Bioinformatics. 2004;20:3604–3612. - PubMed
-
- Hoffmann R, Valencia A. A gene network for navigating the literature. Nat Genet. 2004;36:664. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
