Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 2:4:8.
doi: 10.1186/s41512-020-00077-0. eCollection 2020.

Informative missingness in electronic health record systems: the curse of knowing

Affiliations

Informative missingness in electronic health record systems: the curse of knowing

Rolf H H Groenwold. Diagn Progn Res. .

Abstract

Electronic health records provide a potentially valuable data source of information for developing clinical prediction models. However, missing data are common in routinely collected health data and often missingness is informative. Informative missingness can be incorporated in a clinical prediction model, for example by including a separate category of a predictor variable that has missing values. The predictive performance of such a model depends on the transportability of the missing data mechanism, which may be compromised once the model is deployed in practice and the predictive value of certain variables becomes known. Using synthetic data, this phenomenon is explained and illustrated.

Keywords: Missing data; Prediction modelling; Routine care data.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe author declares that he has no competing interests.

Figures

Fig. 1
Fig. 1
Impact of differences in missing data mechanisms when applying a prediction model that was derived under informative missingness using different approaches to handle missing data. Plots are based on 500 randomly sampled observations out of a dataset of 20000 observations. Predicted probabilities are jittered for visual clarity. See main text for a description of the scenarios and details about the methods

References

    1. Raghupathi W, Raghupathi V. Big data analytics in healthcare: promise and potential. Health Inf Sci Syst. 2014;2:3. doi: 10.1186/2047-2501-2-3. - DOI - PMC - PubMed
    1. Beam AL, Kohane IS. Big Data and Machine Learning in Health Care. JAMA. 2018;319(13):1317–1318. doi: 10.1001/jama.2017.18391. - DOI - PubMed
    1. Ludvigsson JF, Adami HO. The urgency to embrace Big Data opportunities in medicine. J Intern Med. 2018;283(5):479–480. doi: 10.1111/joim.12749. - DOI - PubMed
    1. McKinstry B. All watched over by machines of loving grace: an optimistic view of big data. BMJ. 2017;358:j3967. doi: 10.1136/bmj.j3967. - DOI - PubMed
    1. Hemingway H, Asselbergs FW, Danesh J, Dobson R, Maniadakis N, Maggioni A, van Thiel GJM, Cronin M, Brobert G, Vardas P, Anker SD, Grobbee DE, Denaxas S. Big data from electronic health records for early and late translational cardiovascular research: challenges and potential. Eur Heart J. 2018;39(16):1481–1495. doi: 10.1093/eurheartj/ehx487. - DOI - PMC - PubMed

LinkOut - more resources