Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training

Yao Chen¹, Changjiang Zhou¹, Tianxin Li¹, Hong Wu¹, Xia Zhao², Kai Ye³, Jun Liao⁴

Affiliations

¹ School of Science, China Pharmaceutical University, Nanjing, China.
² Adverse Drug Reaction Monitoring Center of Wuxi, Wuxi, China.
³ MandalaT Software Corporation, Wuxi, China.
⁴ School of Science, China Pharmaceutical University, Nanjing, China. Electronic address: liaojun@cpu.edu.cn.

PMID: 31323311
DOI: 10.1016/j.jbi.2019.103252

Free article

Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training

Yao Chen et al. J Biomed Inform. 2019 Aug.

Free article

. 2019 Aug:96:103252.

doi: 10.1016/j.jbi.2019.103252. Epub 2019 Jul 16.

Authors

Yao Chen¹, Changjiang Zhou¹, Tianxin Li¹, Hong Wu¹, Xia Zhao², Kai Ye³, Jun Liao⁴

Affiliations

¹ School of Science, China Pharmaceutical University, Nanjing, China.
² Adverse Drug Reaction Monitoring Center of Wuxi, Wuxi, China.
³ MandalaT Software Corporation, Wuxi, China.
⁴ School of Science, China Pharmaceutical University, Nanjing, China. Electronic address: liaojun@cpu.edu.cn.

PMID: 31323311
DOI: 10.1016/j.jbi.2019.103252

Abstract

Background: The Adverse Drug Event Reports (ADERs) from the spontaneous reporting system are important data sources for studying Adverse Drug Reactions (ADRs) as well as post-marketing pharmacovigilance. Apart from the conventional ADR information contained in the structured section of ADERs, more detailed information such as pre- and post- ADR symptoms, multi-drug usages and ADR-relief treatments are described in the free-text section, which can be mined through Natural Language Processing (NLP) tools.

Objective: The goal of this study was to extract ADR-related entities from free-text section of Chinese ADERs, which can act as supplements for the information contained in structured section, so as to further assist in ADR evaluation.

Methods: Three models of Conditional Random Field (CRF), Bidirectional Long Short-Term Memory-CRF (BiLSTM-CRF) and Lexical Feature based BiLSTM-CRF (LF-BiLSTM-CRF) were constructed to conduct Named Entity Recognition (NER) tasks in free-text section of Chinese ADERs. A semi-supervised learning method of tri-training was applied on the basis of the three established models to give un-annotated raw data with reliable tags.

Results: Among the three basic models, the LF-BiLSTM-CRF achieved the highest average F1 score of 94.35%. After the process of tri-training, almost half of the un-annotated cases were tagged with labels, and the performances of all the three models improved after iterative training.

Conclusions: The LF-BiLSTM-CRF model that we constructed could achieve a comparatively high F1 score, and the fusion of CRF, while BiLSTM-CRF and LF-BiLSTM-CRF in tri-training might further strengthen the reliability of predicted tags. The results suggested the usefulness of our methods in developing the specialized NER tools for identifying ADR-related information from Chinese ADERs.

Keywords: Adverse drug reaction; Chinese natural language processing; Lexical feature based bidirectional long short-term memory; Named entity recognition; Tri-training.

PubMed Disclaimer

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Other Literature Sources
- The Lens - Patent Citations Database
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training

Affiliations

Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training

Authors

Affiliations

Abstract

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials