Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov 11;5(4):ooac086.
doi: 10.1093/jamiaopen/ooac086. eCollection 2022 Dec.

The benefit of augmenting open data with clinical data-warehouse EHR for forecasting SARS-CoV-2 hospitalizations in Bordeaux area, France

Collaborators, Affiliations

The benefit of augmenting open data with clinical data-warehouse EHR for forecasting SARS-CoV-2 hospitalizations in Bordeaux area, France

Thomas Ferté et al. JAMIA Open. .

Abstract

Objective: The aim of this study was to develop an accurate regional forecast algorithm to predict the number of hospitalized patients and to assess the benefit of the Electronic Health Records (EHR) information to perform those predictions.

Materials and methods: Aggregated data from SARS-CoV-2 and weather public database and data warehouse of the Bordeaux hospital were extracted from May 16, 2020 to January 17, 2022. The outcomes were the number of hospitalized patients in the Bordeaux Hospital at 7 and 14 days. We compared the performance of different data sources, feature engineering, and machine learning models.

Results: During the period of 88 weeks, 2561 hospitalizations due to COVID-19 were recorded at the Bordeaux Hospital. The model achieving the best performance was an elastic-net penalized linear regression using all available data with a median relative error at 7 and 14 days of 0.136 [0.063; 0.223] and 0.198 [0.105; 0.302] hospitalizations, respectively. Electronic health records (EHRs) from the hospital data warehouse improved median relative error at 7 and 14 days by 10.9% and 19.8%, respectively. Graphical evaluation showed remaining forecast error was mainly due to delay in slope shift detection.

Discussion: Forecast model showed overall good performance both at 7 and 14 days which were improved by the addition of the data from Bordeaux Hospital data warehouse.

Conclusions: The development of hospital data warehouse might help to get more specific and faster information than traditional surveillance system, which in turn will help to improve epidemic forecasting at a larger and finer scale.

Keywords: SARS-CoV-2; data warehouse; electronic health records; forecasting; machine learning.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Top 20 features importance with 95% confidence interval of the best model. As model is retrained every day and is elastic-net penalized, each dot represents the proportion of days for which the feature is selected. SAMU—French ambulance service.
Figure 2.
Figure 2.
Forecast of best model at 7 and 14 days. (A) The best model forecast from 1 to 14 days. Each string corresponds to the forecast from 1 (yellow) to 14 days (purple) at each day. (B) The 20% prediction intervals at 7 days and 40% at 14 days. Image (C) compares the forecast without EHR data (ie, hospitalizations+RT-PCR+Gironde hospitalizations+Gironde RT-PCR+weather+variant+vaccine).
Figure 3.
Figure 3.
Evolution of MAE and MRE over time at 7 and 14 days. Each dot corresponds to the median absolute error of the corresponding month with the 95% bootstrapped confidence interval.

References

    1. Carrat F, Figoni J, Henny J, et al.Evidence of early circulation of SARS-CoV-2 in France: findings from the population-based “CONSTANCES” cohort. Eur J Epidemiol 2021; 36 (2): 219–22. - PMC - PubMed
    1. World Health Organisation. WHO Coronavirus (COVID-19) Dashboard. 2020. https://covid19.who.int. Accessed September 27, 2021.
    1. COVID-19 Cumulative Infection Collaborators. Estimating global, regional, and national daily and cumulative infections with SARS-CoV-2 through Nov 14, 2021: a statistical analysis. Lancet (London, England) 2022; 399 (10344): 2351–80. - PMC - PubMed
    1. Kim G, Wang M, Pan H, et al.A health system response to COVID-19 in long-term care and post-acute care: a three-phase approach. J Am Geriatr Soc 2020; 68 (6): 1155–61. - PMC - PubMed
    1. Simões J, Magalhães JPM, Biscaia A, et al.Organisation of the State, model of health system and COVID‐19 health outcomes in six European countries, during the first months of the COVID‐19 epidemic in 2020. Int J Health Plann Manage 2021; 36 (5): 1874–86. - PMC - PubMed

LinkOut - more resources