Forecasting daily emergency department arrivals using high-dimensional multivariate data: a feature selection approach
- PMID: 35581648
- PMCID: PMC9112570
- DOI: 10.1186/s12911-022-01878-7
Abstract
Background and objective: Emergency Department (ED) overcrowding is a chronic international issue that is associated with adverse treatment outcomes. Accurate forecasts of future service demand would enable intelligent resource allocation that could alleviate the problem. There has been continued academic interest in ED forecasting, but the number of explanatory variables used has been low, limited mainly to calendar and weather variables. In this study we investigate whether the predictive accuracy of next day arrivals could be enhanced using a large number of potentially relevant explanatory variables, and we document two feature selection processes that aim to identify which subset of variables is associated with the number of next day arrivals. Performance of such predictions over longer horizons is also shown.
Methods: We extracted numbers of total daily arrivals at the Tampere University Hospital ED between June 1, 2015 and June 19, 2019. 158 potential explanatory variables were collected from multiple data sources, consisting not only of weather and calendar variables but also of an extensive list of local public events, numbers of website visits to two hospital domains, numbers of available hospital beds in 33 local hospitals or health centres, and Google Trends searches for the ED. We used two feature selection processes: Simulated Annealing (SA) and Floating Search (FS), each with Recursive Least Squares (RLS) and Least Mean Squares (LMS). Performance of these approaches was compared against autoregressive integrated moving average (ARIMA), regression with ARIMA errors (ARIMAX) and Random Forest (RF). Mean Absolute Percentage Error (MAPE) was used as the main error metric.
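To make the feature selection idea concrete, the following is a minimal sketch of simulated annealing over feature subsets scored by MAPE. It is an illustration under stated assumptions, not the authors' implementation: batch ordinary least squares stands in for RLS, the score is MAPE on a single chronological holdout, and the function names, cooling schedule, iteration count and synthetic data are all hypothetical.

```python
import math
import random

import numpy as np


def mape(actual, predicted):
    """Mean Absolute Percentage Error, in percent (actual values must be nonzero)."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs((actual - predicted) / actual)) * 100.0)


def holdout_mape(X, y, mask, split):
    """Fit least squares on rows [:split] using only the masked columns,
    then score MAPE on rows [split:]. Assumption: batch OLS replaces RLS."""
    cols = np.flatnonzero(mask)
    A = np.column_stack([np.ones(len(y)), X[:, cols]])  # intercept + selected features
    coef, *_ = np.linalg.lstsq(A[:split], y[:split], rcond=None)
    return mape(y[split:], A[split:] @ coef)


def anneal_features(X, y, split, n_iter=300, t0=1.0, cooling=0.98, seed=0):
    """Simulated annealing over boolean feature masks (hypothetical settings)."""
    rng = random.Random(seed)
    n_features = X.shape[1]
    mask = np.zeros(n_features, dtype=bool)  # start from the empty subset
    current = holdout_mape(X, y, mask, split)
    best_mask, best = mask.copy(), current
    temp = t0
    for _ in range(n_iter):
        candidate = mask.copy()
        j = rng.randrange(n_features)
        candidate[j] = not candidate[j]  # flip one feature in or out
        score = holdout_mape(X, y, candidate, split)
        # Always accept improvements; accept worse moves with a probability
        # that shrinks as the temperature cools.
        if score < current or rng.random() < math.exp((current - score) / temp):
            mask, current = candidate, score
            if current < best:
                best_mask, best = mask.copy(), current
        temp *= cooling
    return best_mask, best


# Synthetic demonstration data (not the study data): only column 0 is informative.
data_rng = np.random.default_rng(0)
X_demo = data_rng.normal(size=(200, 6))
y_demo = 100.0 + 10.0 * X_demo[:, 0] + data_rng.normal(scale=1.0, size=200)

baseline = holdout_mape(X_demo, y_demo, np.zeros(6, dtype=bool), split=150)
selected, selected_mape = anneal_features(X_demo, y_demo, split=150)
print("selected columns:", np.flatnonzero(selected))
print("holdout MAPE, intercept only vs selected:", baseline, selected_mape)
```

Scoring on a holdout rather than on the training rows is a deliberate choice in this sketch: with many candidate variables, in-sample error would favour ever larger subsets regardless of their predictive value.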
Results: Calendar variables, load of secondary care facilities and local public events were dominant in the identified predictive features. RLS-SA and RLS-FS provided slightly better accuracy compared to ARIMA. ARIMAX was the most accurate model but the difference between RLS-SA and RLS-FS was not statistically significant.
Conclusions: Our study provides new insight into potential underlying factors associated with the number of next day presentations. It also suggests that the predictive accuracy of next day arrivals can be increased using a high-dimensional feature selection approach when compared to both univariate and nonfiltered high-dimensional approaches. Performance over multiple horizons was similar, with a gradual decline for longer horizons. However, outperforming ARIMAX remains a challenge when working with daily data. Future work should focus on enhancing the feature selection mechanism, investigating its applicability to other domains and identifying other potentially relevant explanatory variables.
Keywords: Crowding; Emergency department; Feature selection; Machine learning; Statistical learning; Time series forecasting.
© 2022. The Author(s).
Conflict of interest statement
NO is a shareholder of Unitary Healthcare Ltd., which has developed a patient logistics system currently used in the study emergency department. JT, FL and AR are shareholders of Aika Analytics Ltd., a company specialized in time series forecasting. The other authors do not have competing interests.
Similar articles
- Performance evaluation of Emergency Department patient arrivals forecasting models by including meteorological and calendar information: A comparative study. Comput Biol Med. 2021 Aug;135:104541. doi: 10.1016/j.compbiomed.2021.104541. Epub 2021 Jun 3. PMID: 34166880
- Accurate Forecasting of Emergency Department Arrivals With Internet Search Index and Machine Learning Models: Model Development and Performance Evaluation. JMIR Med Inform. 2022 Jul 20;10(7):e34504. doi: 10.2196/34504. PMID: 35857360 Free PMC article.
- Internet search query data improve forecasts of daily emergency department volume. J Am Med Inform Assoc. 2019 Dec 1;26(12):1574-1583. doi: 10.1093/jamia/ocz154. PMID: 31730701 Free PMC article.
- A systematic review of the modelling of patient arrivals in emergency departments. Quant Imaging Med Surg. 2023 Mar 1;13(3):1957-1971. doi: 10.21037/qims-22-268. Epub 2022 Oct 9. PMID: 36915315 Free PMC article. Review.
- Urgency classification methods for emergency department visits: do they measure up? Pediatr Emerg Care. 2008 Dec;24(12):870-4. doi: 10.1097/PEC.0b013e31818fa79d. PMID: 19092571 Review.
Cited by
- Evaluation of different machine learning algorithms for predicting the length of stay in the emergency departments: a single-centre study. Front Digit Health. 2024 Jan 8;5:1323849. doi: 10.3389/fdgth.2023.1323849. eCollection 2023. PMID: 38259256 Free PMC article.
- Enhanced forecasting of emergency department patient arrivals using feature engineering approach and machine learning. BMC Med Inform Decis Mak. 2024 Dec 18;24(1):377. doi: 10.1186/s12911-024-02788-6. PMID: 39696224 Free PMC article.
- Prognostic models for predicting patient arrivals in emergency departments: an updated systematic review and research agenda. BMC Emerg Med. 2025 Jul 1;25(1):106. doi: 10.1186/s12873-025-01250-8. PMID: 40596904 Free PMC article.
- A multi-granular stacked regression for forecasting long-term demand in Emergency Departments. BMC Med Inform Decis Mak. 2023 Feb 7;23(1):29. doi: 10.1186/s12911-023-02109-3. PMID: 36750952 Free PMC article.