Leveraging Linkage of Cohort Studies With Administrative Claims Data to Identify Individuals With Cancer
- PMID: 29334524
- PMCID: PMC6043405
- DOI: 10.1097/MLR.0000000000000875
Leveraging Linkage of Cohort Studies With Administrative Claims Data to Identify Individuals With Cancer
Abstract
Background: In an effort to overcome quality and cost constraints inherent in population-based research, diverse data sources are increasingly being combined. In this paper, we describe the performance of a Medicare claims-based incident cancer identification algorithm in comparison with observational cohort data from the Nurses' Health Study (NHS).
Methods: NHS-Medicare linked participants' claims data were analyzed using 4 versions of a cancer identification algorithm across 3 cancer sites (breast, colorectal, and lung). The algorithms evaluated included an update of the original Setoguchi algorithm, and 3 other versions that differed in the data used for prevalent cancer exclusions.
Results: The algorithm that yielded the highest positive predictive value (PPV) (0.52-0.82) and κ statistic (0.62-0.87) in identifying incident cancer cases utilized both Medicare claims and observational cohort data (NHS) to remove prevalent cases. The algorithm that only used NHS data to inform the removal of prevalent cancer cases performed nearly equivalently in statistical performance (PPV, 0.50-0.79; κ, 0.61-0.85), whereas the version that used only claims to inform the removal of prevalent cancer cases performed substantially worse (PPV, 0.42-0.60; κ, 0.54-0.70), in comparison with the dual data source-informed algorithm.
Conclusions: Our findings suggest claims-based algorithms identify incident cancer with variable reliability when measured against an observational cohort study reference standard. Self-reported baseline information available in cohort studies is more effective in removing prevalent cancer cases than are claims data algorithms. Use of claims-based algorithms should be tailored to the research question at hand and the nature of available observational cohort data.
Figures
References
-
- Setoguchi S, Solomon DH, Glynn RJ, et al. Agreement of diagnosis and its date for hematologic malignancies and solid tumors between Medicare claims and cancer registry data. Cancer Causes Control. 2007;18(5):561–9. - PubMed
-
- Baldi I, Vicari P, Di Cuonzo D, et al. A high positive predictive value algorithm using hospital administrative data identified incident cancer cases. J Clin Epidemiol. 2008;61(4):373–9. - PubMed
-
- Couris CM, Polazzi S, Olive F, et al. Breast cancer incidence using administrative data: correction with sensitivity and specificity. J Clin Epidemiol. 2009;62(6):660–6. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources