Bio-AnswerFinder: a system to find answers to questions from biomedical texts

Ibrahim Burak Ozyurt¹, Anita Bandrowski¹, Jeffrey S Grethe¹

Affiliations

PMID: 31925435
PMCID: PMC7053013
DOI: 10.1093/database/baz137

Bio-AnswerFinder: a system to find answers to questions from biomedical texts

Ibrahim Burak Ozyurt et al. Database (Oxford). 2020.

. 2020 Jan 1:2020:baz137.

doi: 10.1093/database/baz137.

Authors

Ibrahim Burak Ozyurt¹, Anita Bandrowski¹, Jeffrey S Grethe¹

Affiliation

¹ Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, M/C 0608, La Jolla, CA 92093-0608.

PMID: 31925435
PMCID: PMC7053013
DOI: 10.1093/database/baz137

Abstract

The ever accelerating pace of biomedical research results in corresponding acceleration in the volume of biomedical literature created. Since new research builds upon existing knowledge, the rate of increase in the available knowledge encoded in biomedical literature makes the easy access to that implicit knowledge more vital over time. Toward the goal of making implicit knowledge in the biomedical literature easily accessible to biomedical researchers, we introduce a question answering system called Bio-AnswerFinder. Bio-AnswerFinder uses a weighted-relaxed word mover's distance based similarity on word/phrase embeddings learned from PubMed abstracts to rank answers after question focus entity type filtering. Our approach retrieves relevant documents iteratively via enhanced keyword queries from a traditional search engine. To improve document retrieval performance, we introduced a supervised long short term memory neural network to select keywords from the question to facilitate iterative keyword search. Our unsupervised baseline system achieves a mean reciprocal rank score of 0.46 and Precision@1 of 0.32 on 936 questions from BioASQ. The answer sentences are further ranked by a fine-tuned bidirectional encoder representation from transformers (BERT) classifier trained using 100 answer candidate sentences per question for 492 BioASQ questions. To test ranking performance, we report a blind test on 100 questions that three independent annotators scored. These experts preferred BERT based reranking with 7% improvement on MRR and 13% improvement on Precision@1 scores on average.

PubMed Disclaimer

Figures

**Figure 1**
Overview of the Bio-AnswerFinder.

**Figure 2**
Illustration of iterative weighted Elasticsearch keyword querying process on the question “Is alemtuzumab effective for remission induction in patients diagnosed with T-cell prolymphocytic leukemia?”

See this image and copyright information in PMC

References

1. Athenikos S.J. and Han H. (2010) Biomedical question answering: a survey. Comput. Methods Prog. Biomed., 99, 1–24. - PubMed
1. Tsatsaronis G., Georgios B., Prodromos M. et al. (2015) An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition. BMC Bioinformatics, 16. - PMC - PubMed
1. Kusner M., Sun Y., Kolkin N., Weinberger K.Q. (2015) From word embeddings to document distances Procedings of the 32nd International Conference on Machine Learning (ICML). Lille, France, pp. 957–966. doi: 10.1186/s12859-015-0564-6. - DOI
1. Pennington J., Socher R., Manning C (2014) Glove: global vectors for word representation Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Qatar. ACL. doi: 10.3115/v1/d14-1162. - DOI
1. Devlin J., Chang M., Lee K., Toutanova K. (2018) BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR; http://arxiv.org/abs/1810.04805.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

U24 DK097771/DK/NIDDK NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Bio-AnswerFinder: a system to find answers to questions from biomedical texts

Affiliation

Bio-AnswerFinder: a system to find answers to questions from biomedical texts

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources