Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach

Affiliations

¹ Department of Environmental and Preventive Sciences, University of Ferrara, Ferrara, Italy.
² Department of Women's and Children's Health, University of Padova, Padua, Italy.
³ Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac, Thoracic, Vascular Sciences, and Public Health, University of Padova, Padua, Italy.

^# Contributed equally.

PMID: 37436799
PMCID: PMC10372563
DOI: 10.2196/44467

Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach

Danila Azzolina et al. JMIR Public Health Surveill. 2023.

. 2023 Jul 12:9:e44467.

doi: 10.2196/44467.

Affiliations

¹ Department of Environmental and Preventive Sciences, University of Ferrara, Ferrara, Italy.
² Department of Women's and Children's Health, University of Padova, Padua, Italy.
³ Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac, Thoracic, Vascular Sciences, and Public Health, University of Padova, Padua, Italy.

^# Contributed equally.

PMID: 37436799
PMCID: PMC10372563
DOI: 10.2196/44467

Abstract

Background: Unintentional injury is the leading cause of death in young children. Emergency department (ED) diagnoses are a useful source of information for injury epidemiological surveillance purposes. However, ED data collection systems often use free-text fields to report patient diagnoses. Machine learning techniques (MLTs) are powerful tools for automatic text classification. The MLT system is useful to improve injury surveillance by speeding up the manual free-text coding tasks of ED diagnoses.

Objective: This research aims to develop a tool for automatic free-text classification of ED diagnoses to automatically identify injury cases. The automatic classification system also serves for epidemiological purposes to identify the burden of pediatric injuries in Padua, a large province in the Veneto region in the Northeast Italy.

Methods: The study includes 283,468 pediatric admissions between 2007 and 2018 to the Padova University Hospital ED, a large referral center in Northern Italy. Each record reports a diagnosis by free text. The records are standard tools for reporting patient diagnoses. An expert pediatrician manually classified a randomly extracted sample of approximately 40,000 diagnoses. This study sample served as the gold standard to train an MLT classifier. After preprocessing, a document-term matrix was created. The machine learning classifiers, including decision tree, random forest, gradient boosting method (GBM), and support vector machine (SVM), were tuned by 4-fold cross-validation. The injury diagnoses were classified into 3 hierarchical classification tasks, as follows: injury versus noninjury (task A), intentional versus unintentional injury (task B), and type of unintentional injury (task C), according to the World Health Organization classification of injuries.

Results: The SVM classifier achieved the highest performance accuracy (94.14%) in classifying injury versus noninjury cases (task A). The GBM method produced the best results (92% accuracy) for the unintentional and intentional injury classification task (task B). The highest accuracy for the unintentional injury subclassification (task C) was achieved by the SVM classifier. The SVM, random forest, and GBM algorithms performed similarly against the gold standard across different tasks.

Conclusions: This study shows that MLTs are promising techniques for improving epidemiological surveillance, allowing for the automatic classification of pediatric ED free-text diagnoses. The MLTs revealed a suitable classification performance, especially for general injuries and intentional injury classification. This automatic classification could facilitate the epidemiological surveillance of pediatric injuries by also reducing the health professionals' efforts in manually classifying diagnoses for research purposes.

Keywords: child and adolescent health; death; emergency; emergency department; epidemiological surveillance; hospitalization; injury; machine learning; patient record; pediatric admission; pediatrics; surveillance; text mining; unintentional injury.

©Danila Azzolina, Silvia Bressan, Giulia Lorenzoni, Giulia Andrea Baldan, Patrizia Bartolotta, Federico Scognamiglio, Andrea Francavilla, Corrado Lanera, Liviana Da Dalt, Dario Gregori. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 12.07.2023.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
Study flowchart—emergency department (ED) selection and gold standard identification together with manual injury classification procedure. Machine learning technique (MLT) cross-validation and prediction procedures for tasks A, B, and C are represented by the dark grey box.

**Figure 2**
Flowchart of learning algorithm development for injury epidemiological surveillance. ED: emergency department; MLT: machine learning technique.

See this image and copyright information in PMC

References

1. Alonge O, Khan UR, Hyder AA. Our shrinking globe: implications for child unintentional injuries. Pediatr Clin North Am. 2016 Feb;63(1):167–81. doi: 10.1016/j.pcl.2015.08.009.S0031-3955(15)00149-2 - DOI - PubMed
1. Deal LW, Gomby DS, Zippiroli L, Behrman RE. Unintentional injuries in childhood: analysis and recommendations. Future Child. 2000;10(1):4. doi: 10.2307/1602823. - DOI - PubMed
1. Judy K. Unintentional injuries in pediatrics. Pediatr Rev. 2011 Oct;32(10):431–8; quiz 439. doi: 10.1542/pir.32-10-431.32/10/431 - DOI - PubMed
1. Sethi D, Towner E, Vincenten J, et al. European report on child injury prevention. Copenhagen: World Health Organization. Regional Office for Europe; 2008.
1. WHO. World report on child injury prevention. [2019-02-04]. https://www.who.int/publications/i/item/9789241563574 .

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach

Affiliations

Pediatric Injury Surveillance From Uncoded Emergency Department Admission Records in Italy: Machine Learning-Based Text-Mining Approach

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources