An algorithm for the use of Medicare claims data to identify women with incident breast cancer
- PMID: 15533184
- PMCID: PMC1361095
- DOI: 10.1111/j.1475-6773.2004.00315.x
Erratum in
- Health Serv Res. 41:302.
Abstract
Objective: To develop and validate a clinically informed algorithm that uses solely Medicare claims to identify, with a high positive predictive value, incident breast cancer cases.
Data source: Population-based Surveillance, Epidemiology, and End Results (SEER) Tumor Registry data linked to Medicare claims, and Medicare claims from a 5 percent random sample of beneficiaries in SEER areas.
Study design: An algorithm was developed using claims from SEER-Medicare breast cancer patients diagnosed in 1995, together with 1995 claims from Medicare control subjects. It was then validated on claims from breast cancer subjects and controls from 1994. Development combined clinical insight with logistic regression methods.
Data extraction: The training set comprised claims from 7,700 SEER-Medicare breast cancer subjects diagnosed in 1995 and from 124,884 controls. The validation set comprised claims from 7,607 SEER-Medicare breast cancer subjects diagnosed in 1994 and from 120,317 controls.
Principal findings: A four-step prediction algorithm was developed and validated. It has a positive predictive value of 89 to 93 percent and a sensitivity of 80 percent for identifying incident breast cancer. The sensitivity is 82 to 87 percent for stage I or II disease and lower for other stages. The sensitivity is 82 to 83 percent for women who underwent either breast-conserving surgery or mastectomy, and is similar across geographic sites. A cohort identified with this algorithm will consist of 89 to 93 percent incident breast cancer cases, 1.5 to 6 percent cancer-free cases, and 4 to 5 percent prevalent breast cancer cases.
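As a reminder of how the two headline measures are defined, the following minimal sketch computes positive predictive value and sensitivity from classification counts. The counts are hypothetical round numbers chosen only to illustrate the arithmetic; they are not the study's data, and the class and function names are assumptions introduced for this illustration.

```python
from dataclasses import dataclass


@dataclass
class ScreenCounts:
    """Hypothetical counts from comparing algorithm output with registry status."""
    true_pos: int   # flagged by the algorithm and a registry-confirmed incident case
    false_pos: int  # flagged, but cancer-free or a prevalent (not incident) case
    false_neg: int  # registry-confirmed incident case the algorithm did not flag


def positive_predictive_value(c: ScreenCounts) -> float:
    """Share of flagged subjects who are truly incident cases."""
    flagged = c.true_pos + c.false_pos
    if flagged == 0:
        raise ValueError("no subjects were flagged")
    return c.true_pos / flagged


def sensitivity(c: ScreenCounts) -> float:
    """Share of truly incident cases that the algorithm flags."""
    incident = c.true_pos + c.false_neg
    if incident == 0:
        raise ValueError("no incident cases in the sample")
    return c.true_pos / incident


# Illustrative counts only (NOT taken from the paper).
counts = ScreenCounts(true_pos=800, false_pos=80, false_neg=200)
ppv = positive_predictive_value(counts)
sens = sensitivity(counts)
print(f"PPV = {ppv:.1%}, sensitivity = {sens:.1%}")
```

Under these definitions, the false positives are what make up the cancer-free and prevalent shares of an algorithm-identified cohort, so the three cohort percentages together account for every flagged subject.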
Conclusions: This algorithm has better performance characteristics than previously proposed algorithms. The ability to examine national patterns of breast cancer care using Medicare claims data would open new avenues for the assessment of quality of care.