Information retrieval and question answering: A case study on COVID-19 scientific literature
- PMID: 35002094
- PMCID: PMC8719365
- DOI: 10.1016/j.knosys.2021.108072
Information retrieval and question answering: A case study on COVID-19 scientific literature
Abstract
Biosanitary experts around the world are directing their efforts towards the study of COVID-19. This effort generates a large volume of scientific publications at a speed that makes the effective acquisition of new knowledge difficult. Therefore, Information Systems are needed to assist biosanitary experts in accessing, consulting and analyzing these publications. In this work we develop a study of the variables involved in the development of a Question Answering system that receives a set of questions asked by experts about the disease COVID-19 and its causal virus SARS-CoV-2, and provides a ranked list of expert-level answers to each question. In particular, we address the interrelation of the Information Retrieval and the Answer Extraction steps. We found that a recall based document retrieval that leaves to a neural answer extraction module the scanning of the whole documents to find the best answer is a better strategy than relying in a precise passage retrieval before extracting the answer span.
Keywords: 00-01; 99-00; COVID-19; Question answering.
© 2021 Published by Elsevier B.V.
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures
References
-
- L.L. Wang, K. Lo, Y. Chandrasekhar, R. Reas, J. Yang, D. Burdick, D. Eide, K. Funk, Y. Katsis, R.M. Kinney, et al. CORD-19: The COVID-19 Open Research Dataset, in: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020, 2020.
-
- E.M. Voorhees, et al. The TREC-8 Question Answering Track Report, in: Proceedings of the 8th Text REtrieval Conference, TREC-8, 1999, pp. 77–82.
-
- Brill E., Dumais S., Banko M. An analysis of the AskMSR question-answering system. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing; EMNLP 2002; 2002. pp. 257–264. - DOI
-
- Ferrucci D.A. Introduction to this is Watson. IBM J. Res. Dev. 2012;56(3.4):235–249. doi: 10.1147/JRD.2012.2184356. - DOI
-
- Chen D., Fisch A., Weston J., Bordes A. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers. 2017. Reading wikipedia to answer open-domain questions; pp. 1870–1879. - DOI
LinkOut - more resources
Full Text Sources
Miscellaneous