Detecting hospital-acquired infections: A document classification approach using support vector machines and gradient tree boosting
- PMID: 27496862
- PMCID: PMC5802538
- DOI: 10.1177/1460458216656471
Detecting hospital-acquired infections: A document classification approach using support vector machines and gradient tree boosting
Abstract
Hospital-acquired infections pose a significant risk to patient health, while their surveillance is an additional workload for hospital staff. Our overall aim is to build a surveillance system that reliably detects all patient records that potentially include hospital-acquired infections. This is to reduce the burden of having the hospital staff manually check patient records. This study focuses on the application of text classification using support vector machines and gradient tree boosting to the problem. Support vector machines and gradient tree boosting have never been applied to the problem of detecting hospital-acquired infections in Swedish patient records, and according to our experiments, they lead to encouraging results. The best result is yielded by gradient tree boosting, at 93.7 percent recall, 79.7 percent precision and 85.7 percent F1 score when using stemming. We can show that simple preprocessing techniques and parameter tuning can lead to high recall (which we aim for in screening patient records) with appropriate precision for this task.
Keywords: clinical decision-making; databases and data mining; ehealth; electronic health records; secondary care.
Conflict of interest statement
Figures




References
-
- Ducel G, Fabry J, Nicolle L. Prevention of hospital-acquired infections: a practical guide. 2nd ed. Geneva: World Health Organization, 2002, p. 1.
-
- Ehrentraut C, Tiedemann J, Dalianis H, et al. Detection of hospital acquired infections in sparse and noisy Swedish patient records. In: Proceedings of the sixth workshop on analytics for noisy unstructured text data (AND 2012), Mumbai, India, 9 December 2012, pp. 1–8. New York: ACM.
-
- Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: data mining, inference and prediction. 2nd ed. New York: Springer, 2008, p. 758.
-
- Klompas M, Yokoe DS. Automated surveillance of health care-associated infections. Clin Infect Dis 2009; 48(9): 1268–1275. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources