Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review

Elias Hossain¹, Rajib Rana², Niall Higgins³, Jeffrey Soar⁴, Prabal Datta Barua⁴, Anthony R Pisani⁵, Kathryn Turner⁶

Affiliations

¹ School of Engineering & Physical Sciences, North South University, Dhaka 1229, Bangladesh. Electronic address: elias.hossain191@gmail.com.
² School of Mathematics, Physics and Computing, University of Southern Queensland, Springfield Central QLD 4300, Australia.
³ School of Management and Enterprise, University of Southern Queensland, Darling Heights QLD 4350, Australia; School of Nursing, Queensland University of Technology, Kelvin Grove, Brisbane, QLD 4000, Australia; Metro North Mental Health, Herston QLD 4029, Australia.
⁴ School of Business, University of Southern Queensland, Springfield Central QLD 4300, Australia.
⁵ Center for the Study and Prevention of Suicide, University of Rochester, Rochester, NY, United States.
⁶ School of Nursing, Queensland University of Technology, Kelvin Grove, Brisbane, QLD 4000, Australia.

PMID: 36805219
DOI: 10.1016/j.compbiomed.2023.106649

Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review

Elias Hossain et al. Comput Biol Med. 2023 Mar.

. 2023 Mar:155:106649.

doi: 10.1016/j.compbiomed.2023.106649. Epub 2023 Feb 10.

Authors

Elias Hossain¹, Rajib Rana², Niall Higgins³, Jeffrey Soar⁴, Prabal Datta Barua⁴, Anthony R Pisani⁵, Kathryn Turner⁶

Affiliations

¹ School of Engineering & Physical Sciences, North South University, Dhaka 1229, Bangladesh. Electronic address: elias.hossain191@gmail.com.
² School of Mathematics, Physics and Computing, University of Southern Queensland, Springfield Central QLD 4300, Australia.
³ School of Management and Enterprise, University of Southern Queensland, Darling Heights QLD 4350, Australia; School of Nursing, Queensland University of Technology, Kelvin Grove, Brisbane, QLD 4000, Australia; Metro North Mental Health, Herston QLD 4029, Australia.
⁴ School of Business, University of Southern Queensland, Springfield Central QLD 4300, Australia.
⁵ Center for the Study and Prevention of Suicide, University of Rochester, Rochester, NY, United States.
⁶ School of Nursing, Queensland University of Technology, Kelvin Grove, Brisbane, QLD 4000, Australia.

PMID: 36805219
DOI: 10.1016/j.compbiomed.2023.106649

Abstract

Background: Natural Language Processing (NLP) is widely used to extract clinical insights from Electronic Health Records (EHRs). However, the lack of annotated data, automated tools, and other challenges hinder the full utilisation of NLP for EHRs. Various Machine Learning (ML), Deep Learning (DL) and NLP techniques are studied and compared to understand the limitations and opportunities in this space comprehensively.

Methodology: After screening 261 articles from 11 databases, we included 127 papers for full-text review covering seven categories of articles: (1) medical note classification, (2) clinical entity recognition, (3) text summarisation, (4) deep learning (DL) and transfer learning architecture, (5) information extraction, (6) Medical language translation and (7) other NLP applications. This study follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.

Result and discussion: EHR was the most commonly used data type among the selected articles, and the datasets were primarily unstructured. Various ML and DL methods were used, with prediction or classification being the most common application of ML or DL. The most common use cases were: the International Classification of Diseases, Ninth Revision (ICD-9) classification, clinical note analysis, and named entity recognition (NER) for clinical descriptions and research on psychiatric disorders.

Conclusion: We find that the adopted ML models were not adequately assessed. In addition, the data imbalance problem is quite important, yet we must find techniques to address this underlining problem. Future studies should address key limitations in studies, primarily identifying Lupus Nephritis, Suicide Attempts, perinatal self-harmed and ICD-9 classification.

Keywords: Artificial intelligence in medicine; Automated tools; Electronic Health Records; Machine learning; Medical natural language processing; State-of-the-art deep learning.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review

Affiliations

Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review

Authors

Affiliations

Abstract

Conflict of interest statement

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources