Tradeoffs between accuracy measures for electronic health care data algorithms

Jessica Chubak¹, Gaia Pocobelli, Noel S Weiss

Affiliations

PMID: 22197520
PMCID: PMC3264740
DOI: 10.1016/j.jclinepi.2011.09.002

Tradeoffs between accuracy measures for electronic health care data algorithms

Jessica Chubak et al. J Clin Epidemiol. 2012 Mar.

. 2012 Mar;65(3):343-349.e2.

doi: 10.1016/j.jclinepi.2011.09.002. Epub 2011 Dec 23.

Authors

Jessica Chubak¹, Gaia Pocobelli, Noel S Weiss

Affiliation

¹ Group Health Research Institute, Group Health, Seattle, WA 98101, USA. chubak.j@ghc.org

PMID: 22197520
PMCID: PMC3264740
DOI: 10.1016/j.jclinepi.2011.09.002

Abstract

Objective: We review the uses of electronic health care data algorithms, measures of their accuracy, and reasons for prioritizing one measure of accuracy over another.

Study design and setting: We use real studies to illustrate the variety of uses of automated health care data in epidemiologic and health services research. Hypothetical examples show the impact of different types of misclassification when algorithms are used to ascertain exposure and outcome.

Results: High algorithm sensitivity is important for reducing the costs and burdens associated with the use of a more accurate measurement tool, for enhancing study inclusiveness, and for ascertaining common exposures. High specificity is important for classifying outcomes. High positive predictive value is important for identifying a cohort of persons with a condition of interest but that need not be representative of or include everyone with that condition. Finally, a high negative predictive value is important for reducing the likelihood that study subjects have an exclusionary condition.

Conclusion: Epidemiologists must often prioritize one measure of accuracy over another when generating an algorithm for use in their study. We recommend researchers publish all tested algorithms-including those without acceptable accuracy levels-to help future studies refine and apply algorithms that are well suited to their objectives.

PubMed Disclaimer

References

1. Mullooly JP. Misclassification Model for Person-Time Analysis of Automated Medical Care Databases. Am J Epidemiol. 1996 October 15;144(8):782–92. - PubMed
1. Ray WA, Griffin MR. Use of Medicaid data for pharmacoepidemiology. Am J Epidemiol. 1989 Apr;129(4):837–49. - PubMed
1. Schneeweiss S, Avorn J. A review of uses of health care utilization databases for epidemiologic research on therapeutics. J Clin Epidemiol. 2005;58(4):323–37. - PubMed
1. Hornbrook MC, Goodman MJ, Fishman PA, Meenan RT, O’Keeffe-Rosetti M, Bachman DJ. Building health plan databases to risk adjust outcomes and payments. Int J Qual Health Care. 1998 Dec;10(6):531–8. - PubMed
1. Brookhart MAP, Sturmer TMDMPH, Glynn RJPS, Rassen JS, Schneeweiss SMDS. Confounding Control in Healthcare Database Research: Challenges and Potential Approaches. Medical Care. 2010;48(6 Supplement 1):S114–S20. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Tradeoffs between accuracy measures for electronic health care data algorithms

Affiliation

Tradeoffs between accuracy measures for electronic health care data algorithms

Authors

Affiliation

Abstract

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources