Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training
- PMID: 31323311
- DOI: 10.1016/j.jbi.2019.103252
Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training
Abstract
Background: The Adverse Drug Event Reports (ADERs) from the spontaneous reporting system are important data sources for studying Adverse Drug Reactions (ADRs) as well as post-marketing pharmacovigilance. Apart from the conventional ADR information contained in the structured section of ADERs, more detailed information such as pre- and post- ADR symptoms, multi-drug usages and ADR-relief treatments are described in the free-text section, which can be mined through Natural Language Processing (NLP) tools.
Objective: The goal of this study was to extract ADR-related entities from free-text section of Chinese ADERs, which can act as supplements for the information contained in structured section, so as to further assist in ADR evaluation.
Methods: Three models of Conditional Random Field (CRF), Bidirectional Long Short-Term Memory-CRF (BiLSTM-CRF) and Lexical Feature based BiLSTM-CRF (LF-BiLSTM-CRF) were constructed to conduct Named Entity Recognition (NER) tasks in free-text section of Chinese ADERs. A semi-supervised learning method of tri-training was applied on the basis of the three established models to give un-annotated raw data with reliable tags.
Results: Among the three basic models, the LF-BiLSTM-CRF achieved the highest average F1 score of 94.35%. After the process of tri-training, almost half of the un-annotated cases were tagged with labels, and the performances of all the three models improved after iterative training.
Conclusions: The LF-BiLSTM-CRF model that we constructed could achieve a comparatively high F1 score, and the fusion of CRF, while BiLSTM-CRF and LF-BiLSTM-CRF in tri-training might further strengthen the reliability of predicted tags. The results suggested the usefulness of our methods in developing the specialized NER tools for identifying ADR-related information from Chinese ADERs.
Keywords: Adverse drug reaction; Chinese natural language processing; Lexical feature based bidirectional long short-term memory; Named entity recognition; Tri-training.
Copyright © 2019 Elsevier Inc. All rights reserved.
Similar articles
-
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z. BMC Med Inform Decis Mak. 2022. PMID: 35321705 Free PMC article.
-
Chinese-Named Entity Recognition From Adverse Drug Event Records: Radical Embedding-Combined Dynamic Embedding-Based BERT in a Bidirectional Long Short-term Conditional Random Field (Bi-LSTM-CRF) Model.JMIR Med Inform. 2021 Dec 1;9(12):e26407. doi: 10.2196/26407. JMIR Med Inform. 2021. PMID: 34855616 Free PMC article.
-
Chinese clinical named entity recognition with radical-level feature and self-attention mechanism.J Biomed Inform. 2019 Oct;98:103289. doi: 10.1016/j.jbi.2019.103289. Epub 2019 Sep 18. J Biomed Inform. 2019. PMID: 31541715
-
Adverse drug event detection using natural language processing: A scoping review of supervised learning methods.PLoS One. 2023 Jan 3;18(1):e0279842. doi: 10.1371/journal.pone.0279842. eCollection 2023. PLoS One. 2023. PMID: 36595517 Free PMC article.
-
Leveraging Natural Language Processing and Machine Learning Methods for Adverse Drug Event Detection in Electronic Health/Medical Records: A Scoping Review.Drug Saf. 2025 Apr;48(4):321-337. doi: 10.1007/s40264-024-01505-6. Epub 2025 Jan 9. Drug Saf. 2025. PMID: 39786481 Free PMC article.
Cited by
-
MedLexSp - a medical lexicon for Spanish medical natural language processing.J Biomed Semantics. 2023 Feb 2;14(1):2. doi: 10.1186/s13326-022-00281-5. J Biomed Semantics. 2023. PMID: 36732862 Free PMC article.
-
A Year of Papers Using Biomedical Texts.Yearb Med Inform. 2020 Aug;29(1):221-225. doi: 10.1055/s-0040-1701997. Epub 2020 Aug 21. Yearb Med Inform. 2020. PMID: 32823319 Free PMC article. Review.
-
Chemical named entity recognition in the texts of scientific publications using the naïve Bayes classifier approach.J Cheminform. 2022 Aug 13;14(1):55. doi: 10.1186/s13321-022-00633-4. J Cheminform. 2022. PMID: 35964150 Free PMC article.
-
Adoption of Dexmedetomidine in Different Doses at Different Timing in Perioperative Patients.Biomed Res Int. 2022 Jul 15;2022:4008941. doi: 10.1155/2022/4008941. eCollection 2022. Biomed Res Int. 2022. Retraction in: Biomed Res Int. 2024 Jan 9;2024:9862705. doi: 10.1155/2024/9862705. PMID: 35872874 Free PMC article. Retracted.
-
Application of knowledge graph in smart irrigation district management decision making.Heliyon. 2024 Sep 24;10(19):e38398. doi: 10.1016/j.heliyon.2024.e38398. eCollection 2024 Oct 15. Heliyon. 2024. PMID: 39391511 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials