Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 15:240:108072.
doi: 10.1016/j.knosys.2021.108072. Epub 2021 Dec 31.

Information retrieval and question answering: A case study on COVID-19 scientific literature

Affiliations

Information retrieval and question answering: A case study on COVID-19 scientific literature

Arantxa Otegi et al. Knowl Based Syst. .

Abstract

Biosanitary experts around the world are directing their efforts towards the study of COVID-19. This effort generates a large volume of scientific publications at a speed that makes the effective acquisition of new knowledge difficult. Therefore, Information Systems are needed to assist biosanitary experts in accessing, consulting and analyzing these publications. In this work we develop a study of the variables involved in the development of a Question Answering system that receives a set of questions asked by experts about the disease COVID-19 and its causal virus SARS-CoV-2, and provides a ranked list of expert-level answers to each question. In particular, we address the interrelation of the Information Retrieval and the Answer Extraction steps. We found that a recall based document retrieval that leaves to a neural answer extraction module the scanning of the whole documents to find the best answer is a better strategy than relying in a precise passage retrieval before extracting the answer span.

Keywords: 00-01; 99-00; COVID-19; Question answering.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1
Fig. 1
NDNS-Relaxed results (y-axis) of the exploration for each of the values of the hyperparameters.
Fig. 2
Fig. 2
NDNS-Relaxed results for different values of k in linear combination.

References

    1. L.L. Wang, K. Lo, Y. Chandrasekhar, R. Reas, J. Yang, D. Burdick, D. Eide, K. Funk, Y. Katsis, R.M. Kinney, et al. CORD-19: The COVID-19 Open Research Dataset, in: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020, 2020.
    1. E.M. Voorhees, et al. The TREC-8 Question Answering Track Report, in: Proceedings of the 8th Text REtrieval Conference, TREC-8, 1999, pp. 77–82.
    1. Brill E., Dumais S., Banko M. An analysis of the AskMSR question-answering system. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing; EMNLP 2002; 2002. pp. 257–264. - DOI
    1. Ferrucci D.A. Introduction to this is Watson. IBM J. Res. Dev. 2012;56(3.4):235–249. doi: 10.1147/JRD.2012.2184356. - DOI
    1. Chen D., Fisch A., Weston J., Bordes A. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers. 2017. Reading wikipedia to answer open-domain questions; pp. 1870–1879. - DOI

LinkOut - more resources