J Biomed Semantics. 2022 Oct 27;13(1):26. doi: 10.1186/s13326-022-00280-6.

We are not ready yet: limitations of state-of-the-art disease named entity recognizers

Lisa Kühnel et al. J Biomed Semantics.

Abstract

Background: Biomedical natural language processing has been the subject of intense research. Since the breakthrough of transfer learning-based methods, BERT models have been used in a variety of biomedical and clinical applications. On the available data sets, these models show excellent results, partly exceeding the inter-annotator agreement. However, biomedical named entity recognition applied to COVID-19 preprints shows a performance drop compared to the results on standard test data. This raises the question of how well trained models predict on completely new data, i.e. how well they generalize.

Results: Using disease named entity recognition as an example, we investigate the robustness of different machine learning-based methods, including transfer learning, and show that current state-of-the-art methods work well for a given training set and its corresponding test set but show a significant lack of generalization when applied to new data.

Conclusions: We argue that larger annotated data sets are needed for both training and testing. We therefore foresee the curation of further data sets and the investigation of continual learning processes for machine learning-based models.
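The generalization gap described above is typically measured with strict entity-level precision, recall, and F1, the standard metric for disease NER benchmarks such as NCBI-Disease and BC5CDR: a prediction counts only if its span boundaries and type match a gold annotation exactly. A minimal sketch of this metric follows; all span data below is invented for illustration and is not taken from the paper.

```python
def entity_f1(gold, pred):
    """Strict-match precision, recall, F1 over (start, end, type) spans.

    A predicted span is a true positive only if an identical
    (start, end, type) tuple appears in the gold annotations.
    """
    gold_set, pred_set = set(gold), set(pred)
    tp = len(gold_set & pred_set)
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(gold_set) if gold_set else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Hypothetical gold disease annotations for one document.
gold = [(0, 12, "Disease"), (40, 55, "Disease"),
        (70, 82, "Disease"), (90, 99, "Disease")]

# In-corpus setting: predictions closely match the gold spans.
pred_in = [(0, 12, "Disease"), (40, 55, "Disease"), (70, 82, "Disease")]

# Cross-corpus setting: the same model misses more spans, and one
# boundary is off by a single character (41 vs. 40), so it scores zero.
pred_cross = [(0, 12, "Disease"), (41, 55, "Disease")]

p_in, r_in, f_in = entity_f1(gold, pred_in)
p_x, r_x, f_cross = entity_f1(gold, pred_cross)
print(f"in-corpus F1:    {f_in:.2f}")     # 0.86
print(f"cross-corpus F1: {f_cross:.2f}")  # 0.33
```

The off-by-one boundary error in the cross-corpus predictions shows why strict matching makes the generalization drop so visible: a nearly correct span contributes nothing to the score.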

Keywords: BERT; bioNLP; manual curation; text mining.


Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1: Semantic comparison of the NCBI and BC5CDR corpora on the disease mention and concept level. The training sets are compared to their corresponding test sets. Additionally, the two training sets are compared to the test sets of the respective other corpus.
Fig. 2: Comparison of the data sets with scattertext. Each axis shows the frequency of a term in the given documents. In Fig. 2a, the BC5CDR training set is compared to its test set, whereas in Fig. 2b, the BC5CDR training set is compared to the NCBI training set. In Figs. 2c and 2d, the BC5CDR and NCBI training sets are compared against a randomly chosen PubMed corpus of similar size.
Fig. 3: NER results for all tested ML algorithms. The F1-score is shown both for the test set belonging to the training set (corresponding test set) and for the test set of the respective other data set.

