BioContrasts: extracting and exploiting protein-protein contrastive relations from biomedical literature
- PMID: 16368768
- DOI: 10.1093/bioinformatics/btk016
BioContrasts: extracting and exploiting protein-protein contrastive relations from biomedical literature
Abstract
Motivation: Contrasts are useful conceptual vehicles for learning processes and exploratory research of the unknown. For example, contrastive information between proteins can reveal what similarities, divergences and relations there are of the two proteins, leading to invaluable insights for better understanding about the proteins. Such contrastive information are found to be reported in the biomedical literature. However, there have been no reported attempts in current biomedical text mining work that systematically extract and present such useful contrastive information from the literature for exploitation.
Results: Our BioContrasts system extracts protein-protein contrastive information from MEDLINE abstracts and presents the information to biologists in a web-application for exploitation. Contrastive information are identified in the text abstracts with contrastive negation patterns such as 'A but not B'. A total of 799 169 pairs of contrastive expressions were successfully extracted from 2.5 million MEDLINE abstracts. Using grounding of contrastive protein names to Swiss-Prot entries, we were able to produce 41 471 pieces of contrasts between Swiss-Prot protein entries. These contrastive pieces of information are then presented via a user-friendly interactive web portal that can be exploited for applications such as the refinement of biological pathways.
Availability: BioContrasts can be accessed at http://biocontrasts.i2r.a-star.edu.sg. It is also mirrored at http://biocontrasts.biopathway.org.
Supplementary information: Supplementary materials are available at Bioinformatics online.
Similar articles
-
Discovering patterns to extract protein-protein interactions from full texts.Bioinformatics. 2004 Dec 12;20(18):3604-12. doi: 10.1093/bioinformatics/bth451. Epub 2004 Jul 29. Bioinformatics. 2004. PMID: 15284092
-
Extracting human protein interactions from MEDLINE using a full-sentence parser.Bioinformatics. 2004 Mar 22;20(5):604-11. doi: 10.1093/bioinformatics/btg452. Epub 2004 Jan 22. Bioinformatics. 2004. PMID: 15033866
-
GeneInfoMiner--a web server for exploring biomedical literature using batch sequence ID.Bioinformatics. 2005 Aug 15;21(16):3452-3. doi: 10.1093/bioinformatics/bti559. Epub 2005 Jun 30. Bioinformatics. 2005. PMID: 15994195
-
Extracting interactions between proteins from the literature.J Biomed Inform. 2008 Apr;41(2):393-407. doi: 10.1016/j.jbi.2007.11.008. Epub 2007 Dec 15. J Biomed Inform. 2008. PMID: 18207462 Review.
-
Text-mining approaches in molecular biology and biomedicine.Drug Discov Today. 2005 Mar 15;10(6):439-45. doi: 10.1016/S1359-6446(05)03376-3. Drug Discov Today. 2005. PMID: 15808823 Review.
Cited by
-
K-RET: knowledgeable biomedical relation extraction system.Bioinformatics. 2023 Apr 3;39(4):btad174. doi: 10.1093/bioinformatics/btad174. Bioinformatics. 2023. PMID: 37018156 Free PMC article.
-
Text-mining solutions for biomedical research: enabling integrative biology.Nat Rev Genet. 2012 Dec;13(12):829-39. doi: 10.1038/nrg3337. Epub 2012 Nov 14. Nat Rev Genet. 2012. PMID: 23150036 Review.
-
The biomedical discourse relation bank.BMC Bioinformatics. 2011 May 23;12:188. doi: 10.1186/1471-2105-12-188. BMC Bioinformatics. 2011. PMID: 21605399 Free PMC article.
-
BioNØT: a searchable database of biomedical negated sentences.BMC Bioinformatics. 2011 Oct 27;12:420. doi: 10.1186/1471-2105-12-420. BMC Bioinformatics. 2011. PMID: 22032181 Free PMC article.
-
A Relation Extraction Framework for Biomedical Text Using Hybrid Feature Set.Comput Math Methods Med. 2015;2015:910423. doi: 10.1155/2015/910423. Epub 2015 Aug 10. Comput Math Methods Med. 2015. PMID: 26347797 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources