PRIME: automatically extracted PRotein Interactions and Molecular Information databasE
- PMID: 15972002
PRIME: automatically extracted PRotein Interactions and Molecular Information databasE
Abstract
With the exponentially increasing amount of information in the biomedical field, the significance of advanced information retrieval and information extraction, as well as the role of databases, has been increasing. PRIME is an integrated gene/protein informatics database based on natural language processing. It provides automatically extracted protein/family/gene/compound interaction information including both physical and genetic interactions, gene ontology based functions, and graphic pathway viewers. Gene/protein/family names and functional terms are recognized based on dictionaries developed in our laboratory. The interaction and functional information are extracted by syntactic dependencies and various phrase patterns. We have included about 920,000 (non-redundant) protein interactions and 360,000 annotated gene-function relationships for major eukaryotes. By combining the sequence and text information, the pathway comparison between two organisms and simple pathway deduction based on other organism interaction data, and pathway filtering using tissue expression data, are also available. This database is accessible at http://prime.ontology.ims.u-tokyo.ac.jp:8081.
Similar articles
-
Automatic extraction of gene/protein biological functions from biomedical text.Bioinformatics. 2005 Apr 1;21(7):1227-36. doi: 10.1093/bioinformatics/bti084. Epub 2004 Oct 27. Bioinformatics. 2005. PMID: 15509601
-
Text mining and protein annotations: the construction and use of protein description sentences.Genome Inform. 2006;17(2):121-30. Genome Inform. 2006. PMID: 17503385
-
Extracting human protein interactions from MEDLINE using a full-sentence parser.Bioinformatics. 2004 Mar 22;20(5):604-11. doi: 10.1093/bioinformatics/btg452. Epub 2004 Jan 22. Bioinformatics. 2004. PMID: 15033866
-
Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies?Brief Bioinform. 2008 Nov;9(6):466-78. doi: 10.1093/bib/bbn043. Epub 2008 Dec 6. Brief Bioinform. 2008. PMID: 19060303 Review.
-
Status of text-mining techniques applied to biomedical text.Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011. Drug Discov Today. 2006. PMID: 16580973 Review.
Cited by
-
Identification of transcription factor contexts in literature using machine learning approaches.BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S11. doi: 10.1186/1471-2105-9-S3-S11. BMC Bioinformatics. 2008. PMID: 18426546 Free PMC article.
-
The role of positive selection in determining the molecular cause of species differences in disease.BMC Evol Biol. 2008 Oct 6;8:273. doi: 10.1186/1471-2148-8-273. BMC Evol Biol. 2008. PMID: 18837980 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources