Evaluating resampling methods and structured features to improve fall incident report identification by the severity level
- PMID: 34010385
- PMCID: PMC8324236
- DOI: 10.1093/jamia/ocab048
Evaluating resampling methods and structured features to improve fall incident report identification by the severity level
Abstract
Objective: This study aims to improve the classification of the fall incident severity level by considering data imbalance issues and structured features through machine learning.
Materials and methods: We present an incident report classification (IRC) framework to classify the in-hospital fall incident severity level by addressing the imbalanced class problem and incorporating structured attributes. After text preprocessing, bag-of-words features, structured text features, and structured clinical features were extracted from the reports. Next, resampling techniques were incorporated into the training process. Machine learning algorithms were used to build classification models. IRC systems were trained, validated, and tested using a repeated and randomly stratified shuffle-split cross-validation method. Finally, we evaluated the system performance using the F1-measure, precision, and recall over 15 stratified test sets.
Results: The experimental results demonstrated that the classification system setting considering both data imbalance issues and structured features outperformed the other system settings (with a mean macro-averaged F1-measure of 0.733). Considering the structured features and resampling techniques, this classification system setting significantly improved the mean F1-measure for the rare class by 30.88% (P value < .001) and the mean macro-averaged F1-measure by 8.26% from the baseline system setting (P value < .001). In general, the classification system employing the random forest algorithm and random oversampling method outperformed the others.
Conclusions: Structured features provide essential information for categorizing the fall incident severity level. Resampling methods help rebalance the class distribution of the original incident report data, which improves the performance of machine learning models. The IRC framework presented in this study effectively automates the identification of fall incident reports by the severity level.
Keywords: clinical incident reports; clinical text classification; falls; imbalanced learning; patient safety.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Figures
Similar articles
-
Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133. J Med Internet Res. 2020. PMID: 32866108 Free PMC article.
-
Using convolutional neural networks to identify patient safety incident reports by type and severity.J Am Med Inform Assoc. 2019 Dec 1;26(12):1600-1608. doi: 10.1093/jamia/ocz146. J Am Med Inform Assoc. 2019. PMID: 31730700 Free PMC article.
-
Automatic Patient Fall Outcome Extraction Using Narrative Incident Reports.Stud Health Technol Inform. 2022 Jun 6;290:724-728. doi: 10.3233/SHTI220173. Stud Health Technol Inform. 2022. PMID: 35673112
-
Comparative Studies on Resampling Techniques in Machine Learning and Deep Learning Models for Drug-Target Interaction Prediction.Molecules. 2023 Feb 9;28(4):1663. doi: 10.3390/molecules28041663. Molecules. 2023. PMID: 36838652 Free PMC article. Review.
-
A systematic review of natural language processing for classification tasks in the field of incident reporting and adverse event analysis.Int J Med Inform. 2019 Dec;132:103971. doi: 10.1016/j.ijmedinf.2019.103971. Epub 2019 Oct 5. Int J Med Inform. 2019. PMID: 31630063
Cited by
-
Artificial intelligence in healthcare: transforming patient safety with intelligent systems-A systematic review.Front Med (Lausanne). 2025 Jan 8;11:1522554. doi: 10.3389/fmed.2024.1522554. eCollection 2024. Front Med (Lausanne). 2025. PMID: 39845830 Free PMC article.
-
Performance of Natural Language Processing versus International Classification of Diseases Codes in Building Registries for Patients With Fall Injury: Retrospective Analysis.JMIR Med Inform. 2025 Jul 14;13:e66973. doi: 10.2196/66973. JMIR Med Inform. 2025. PMID: 40658984 Free PMC article.
-
The use of natural language processing in detecting and predicting falls within the healthcare setting: a systematic review.Int J Qual Health Care. 2023 Oct 17;35(4):mzad077. doi: 10.1093/intqhc/mzad077. Int J Qual Health Care. 2023. PMID: 37758209 Free PMC article.
-
Development and validation of an interpretable longitudinal preeclampsia risk prediction using machine learning.PLoS One. 2025 Jun 10;20(6):e0323873. doi: 10.1371/journal.pone.0323873. eCollection 2025. PLoS One. 2025. PMID: 40493626 Free PMC article.
-
A large dataset of annotated incident reports on medication errors.Sci Data. 2024 Feb 29;11(1):260. doi: 10.1038/s41597-024-03036-2. Sci Data. 2024. PMID: 38424103 Free PMC article.
References
-
- Currie L. Fall and Injury Prevention. In: Hughes RG, ed. Patient Safety and Quality: An Evidence-Based Handbook for Nurses. Rockville, MD: Agency for Healthcare Research and Quality; 2008. - PubMed
-
- Healey F, Scobie S, Oliver D, et al.Falls in English and Welsh hospitals: a national observational study based on retrospective analysis of 12 months of patient safety incident reports. Qual Saf Health Care 2008; 17 (6): 424–30. - PubMed
-
- Dunne TJ, Gaboury I, Ashe MC.. Falls in hospital increase length of stay regardless of degree of harm. J Eval Clin Pract 2014; 20 (4): 396–400. - PubMed
-
- Hill KD, Vu M, Walsh W.. Falls in the acute hospital setting–impact on resource utilisation. Aust Health Review 2007; 31 (3): 471–7. - PubMed
-
- Brand CA, Sundararajan V.. A 10-year cohort study of the burden and risk of in-hospital falls and fractures using routinely collected hospital data. Quality Saf Health Care 2010; 19 (6): e51–e51. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources