Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Nov 5:2015:406-15.
eCollection 2015.

Cloud-based Predictive Modeling System and its Application to Asthma Readmission Prediction

Affiliations

Cloud-based Predictive Modeling System and its Application to Asthma Readmission Prediction

Robert Chen et al. AMIA Annu Symp Proc. .

Abstract

The predictive modeling process is time consuming and requires clinical researchers to handle complex electronic health record (EHR) data in restricted computational environments. To address this problem, we implemented a cloud-based predictive modeling system via a hybrid setup combining a secure private server with the Amazon Web Services (AWS) Elastic MapReduce platform. EHR data is preprocessed on a private server and the resulting de-identified event sequences are hosted on AWS. Based on user-specified modeling configurations, an on-demand web service launches a cluster of Elastic Compute 2 (EC2) instances on AWS to perform feature selection and classification algorithms in a distributed fashion. Afterwards, the secure private server aggregates results and displays them via interactive visualization. We tested the system on a pediatric asthma readmission task on a de-identified EHR dataset of 2,967 patients. We conduct a larger scale experiment on the CMS Linkable 2008-2010 Medicare Data Entrepreneurs' Synthetic Public Use File dataset of 2 million patients, which achieves over 25-fold speedup compared to sequential execution.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An overview of our predictive modeling system. We extract EHR information from the database from the hospital and store the information in these event sequence files through persistent web services running on a private dedicated server. These event sequence files are uploaded to the web service on the cloud.
Figure 2.
Figure 2.
Screenshots of the cohort construction module (A), feature construction module (B), predictive modeling module (C) and the performance analysis module (D). The cohort construction module allows users to specify criteria for selection of cases and for identifying matching controls. In the predictive modeling module, the user may specify parameters for particular feature selection and classification algorithms. The performance analysis module allows users to visualize key performance metrics as well as top predictive features selected in feature selection.
Figure 3:
Figure 3:
Timeline of modules run and elapsed time. The data-splitting, training and testing times refer to the run times for each respective step of cross validation. Times are shown in seconds (s).

Similar articles

Cited by

References

    1. Braunstein ML. Health Informatics in the Cloud. Springer; New York: 2013. pp. 1–8. at < http://link.springer.com/chapter/10.1007/978-1-4614-5629-2_1>. - DOI
    1. Bodenheimer T, Fernandez A. High and Rising Health Care Costs. Part 4: Can Costs Be Controlled While Preserving Quality? Ann Intern Med. 2005;143:26–31. - PubMed
    1. Gorelick M, Scribano PV, Stevens MW, Schultz T, Shults J. Predicting Need for Hospitalization in Acute Pediatric Asthma. Pediatr Emerg Care. 2008;24:735–744. - PubMed
    1. Reznik M, Hailpern SM, Ozuah PO. Predictors of Early Hospital Readmission for Asthma Among Inner-City Children. J Asthma. 2006;43:37–40. - PubMed
    1. Feudtner C, et al. How Well Can Hospital Readmission Be Predicted in a Cohort of Hospitalized Children? A Retrospective, Multicenter Study. PEDIATRICS. 2009;123:286–293. - PMC - PubMed