Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 1:4:248.
doi: 10.3389/fpubh.2016.00248. eCollection 2016.

The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Affiliations

The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap

Richard P Mann et al. Front Public Health. .

Abstract

Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public's perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit "big data."

Keywords: big data; health economics; length of stay; small data; surgery.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Predicting patient stay duration. The results from the model are presented using line plots (the median prediction is represented by a solid black line and the gray region represents the 95% quantile). (A) These data show a steady downwards trend in stay duration over the nine relevant years; (B) the duration of stay is longer for operations on Saturdays – most likely due to weekday discharge; (C) for patients above 55 years of age, stay duration rapidly increases with age; and finally, (D) stay duration also increases with operation time – presumably an indicator of complications in surgery or intrinsically more difficult cases. Interestingly, stay duration reaches a plateau for operations over ~3 h, though there are relatively few data points for surgeries of this length – and as such, this relationship should be treated with caution.
Figure 2
Figure 2
Upper 95% confidence interval of predicted stay duration. This figure shows that while, for younger patients, the effect of surgery length on length of stay is relatively weak up to the 3-h mark, for older patients (>55 years), stay duration increases strongly with duration length across all time scales.

References

    1. NHS England. High Quality Datasets: Our Vision. (2014). Available from: http://www.nhs.uk/NHSEngland/thenhs/records/healthrecords/Pages/care-dat...
    1. Carter P. Operational Productivity and Performance in English NHS Acute Hospitals: Unwarranted Variations: An Independent Report for the Department of Health by Lord Carter. London: Department of Health; (2016). Available from: https://www.gov.uk/government/uploads/system/uploads/attachment_data/fil... (accessed 1 April 2016).
    1. Raghupathi W, Raghupathi V. Big data analytics in healthcare: promise and potential. Health Inf Sci Syst (2014) 2:1–10.10.1186/2047-2501-2-3 - DOI - PMC - PubMed
    1. Ipsos MORI Social Research Institute. The One-Way Mirror: Public Attitudes to Commercial Access to Health Data. (2016). Available from: https://www.ipsos-mori.com/Assets/Docs/Publications/sri-wellcome-trust-c... (accessed 1 April 2016).
    1. Parliamentary Office of Science & Technology. Big data and public health. POSTnote. London: The Parliamentary Office of Science and Technology; (2014). p. 1–4.