A context-based ABC model for literature-based discovery
- PMID: 31017923
- PMCID: PMC6481912
- DOI: 10.1371/journal.pone.0215313
Abstract
Background: In literature-based discovery, considerable research has built on the ABC model developed by Swanson. The ABC model hypothesizes that there is a meaningful relation between entity A, extracted from document set 1, and entity C, extracted from document set 2, through B entities that appear in both document sets. The results of the ABC model are relations among entities A, B, and C, which are referred to as paths. A path allows for hypothesizing a relationship between entity A and entity C, or helps discover entity B as new evidence for that relationship. The co-occurrence-based ABC model is a well-known approach to automatic hypothesis generation that creates many such paths. However, it has a limitation: biological context is not considered. It focuses only on matching the B entity shared by the two relations. As a result, the paths it extracts tend to include many irrelevant ones, so expert verification is essential.
Methods: To overcome this limitation of the co-occurrence-based ABC model, we propose a context-based approach to connecting one entity relation to another, modifying the ABC model using biological contexts. In this study, we defined four biological context elements: cell, drug, disease, and organism. Based on these biological contexts, we propose two extended ABC models: a context-based ABC model and a context-assignment-based ABC model. To measure the performance of both proposed models, we examined the relevance of the B entities between the well-known relations "APOE-MAPT" and "FUS-TARDBP". Each relation represents an interaction between proteins associated with neurodegenerative disease. The interaction between APOE and MAPT is known to play a crucial role in Alzheimer's disease, as APOE affects tau-mediated neurodegeneration. Mutations in FUS and TARDBP have been shown to be associated with amyotrophic lateral sclerosis (ALS), a motor neuron disease, by leading to neuronal cell death. Using these two relations, we compared both proposed models with the co-occurrence-based ABC model.
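One way to picture context-based linking is the sketch below. The abstract does not state the exact matching rule, so this sketch assumes that an A-B relation and a B-C relation are connected only when they share at least one value in one of the four context elements. The data layout, function names, and toy values are all hypothetical.

```python
CONTEXT_ELEMENTS = ("cell", "drug", "disease", "organism")


def shared_context(ctx_ab, ctx_bc):
    """Return the context values common to both relations, per element."""
    shared = {}
    for element in CONTEXT_ELEMENTS:
        common = ctx_ab.get(element, set()) & ctx_bc.get(element, set())
        if common:
            shared[element] = common
    return shared


def context_filtered_b(relations_ab, relations_bc):
    """Keep a B entity only if its A-B and B-C relations overlap in context.

    Each argument maps a B entity to a dict of context element -> set of values.
    The overlap rule is an assumption made for illustration.
    """
    kept = {}
    for b in relations_ab.keys() & relations_bc.keys():
        shared = shared_context(relations_ab[b], relations_bc[b])
        if shared:
            kept[b] = shared
    return kept


# Toy contexts (placeholders, not data from the study).
relations_ab = {
    "B1": {"disease": {"Alzheimer's disease"}, "organism": {"human"}},
    "B2": {"organism": {"mouse"}},
}
relations_bc = {
    "B1": {"disease": {"Alzheimer's disease"}, "cell": {"neuron"}},
    "B2": {"organism": {"human"}},
}

kept = context_filtered_b(relations_ab, relations_bc)
print(kept)
```

In this toy input, "B2" would pass a pure co-occurrence match but is dropped here because its two relations were observed in different organisms.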
Results: The precision of B entities extracted by the co-occurrence-based ABC model was 27.1% for "APOE-MAPT" and 22.1% for "FUS-TARDBP". With the context-based ABC model, the precision of extracted B entities was 71.4% for "APOE-MAPT" and 77.9% for "FUS-TARDBP". The context-assignment-based ABC model achieved 89% and 97.5% precision for the two relations, respectively. Both proposed models achieved higher precision than the co-occurrence-based ABC model.
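For readers who want to compare the figures directly, the short sketch below restates the reported precisions and computes each proposed model's gain over the co-occurrence baseline in percentage points. The `precision` helper shows the usual definition (relevant extracted B entities divided by all extracted B entities); the underlying counts are not given in the abstract, so the helper is only demonstrated on made-up numbers.

```python
def precision(relevant, extracted):
    """Percentage of extracted B entities judged relevant."""
    return 100.0 * relevant / extracted if extracted else 0.0


# Precision values (%) as reported in the abstract.
reported = {
    "co-occurrence": {"APOE-MAPT": 27.1, "FUS-TARDBP": 22.1},
    "context-based": {"APOE-MAPT": 71.4, "FUS-TARDBP": 77.9},
    "context-assignment": {"APOE-MAPT": 89.0, "FUS-TARDBP": 97.5},
}

# Gain over the co-occurrence baseline, in percentage points.
baseline = reported["co-occurrence"]
gains = {
    model: {rel: round(value - baseline[rel], 1) for rel, value in scores.items()}
    for model, scores in reported.items()
    if model != "co-occurrence"
}

for model, per_relation in gains.items():
    print(model, per_relation)

# Hypothetical counts, only to show how the helper is used.
example = precision(relevant=3, extracted=4)
```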
Conflict of interest statement
The authors have declared that no competing interests exist.
References
- Swanson DR. Undiscovered public knowledge. The Library Quarterly. 1986;56(2):103–118.
- Leroy G, Chen H. Genescene: An ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts. Journal of the American Society for Information Science and Technology. 2005;56(5):457–468.