Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Mar;65(3):343-349.e2.
doi: 10.1016/j.jclinepi.2011.09.002. Epub 2011 Dec 23.

Tradeoffs between accuracy measures for electronic health care data algorithms

Affiliations

Tradeoffs between accuracy measures for electronic health care data algorithms

Jessica Chubak et al. J Clin Epidemiol. 2012 Mar.

Abstract

Objective: We review the uses of electronic health care data algorithms, measures of their accuracy, and reasons for prioritizing one measure of accuracy over another.

Study design and setting: We use real studies to illustrate the variety of uses of automated health care data in epidemiologic and health services research. Hypothetical examples show the impact of different types of misclassification when algorithms are used to ascertain exposure and outcome.

Results: High algorithm sensitivity is important for reducing the costs and burdens associated with the use of a more accurate measurement tool, for enhancing study inclusiveness, and for ascertaining common exposures. High specificity is important for classifying outcomes. High positive predictive value is important for identifying a cohort of persons with a condition of interest but that need not be representative of or include everyone with that condition. Finally, a high negative predictive value is important for reducing the likelihood that study subjects have an exclusionary condition.

Conclusion: Epidemiologists must often prioritize one measure of accuracy over another when generating an algorithm for use in their study. We recommend researchers publish all tested algorithms-including those without acceptable accuracy levels-to help future studies refine and apply algorithms that are well suited to their objectives.

PubMed Disclaimer

References

    1. Mullooly JP. Misclassification Model for Person-Time Analysis of Automated Medical Care Databases. Am J Epidemiol. 1996 October 15;144(8):782–92. - PubMed
    1. Ray WA, Griffin MR. Use of Medicaid data for pharmacoepidemiology. Am J Epidemiol. 1989 Apr;129(4):837–49. - PubMed
    1. Schneeweiss S, Avorn J. A review of uses of health care utilization databases for epidemiologic research on therapeutics. J Clin Epidemiol. 2005;58(4):323–37. - PubMed
    1. Hornbrook MC, Goodman MJ, Fishman PA, Meenan RT, O’Keeffe-Rosetti M, Bachman DJ. Building health plan databases to risk adjust outcomes and payments. Int J Qual Health Care. 1998 Dec;10(6):531–8. - PubMed
    1. Brookhart MAP, Sturmer TMDMPH, Glynn RJPS, Rassen JS, Schneeweiss SMDS. Confounding Control in Healthcare Database Research: Challenges and Potential Approaches. Medical Care. 2010;48(6 Supplement 1):S114–S20. - PMC - PubMed

Publication types