Classifying Lung Cancer Severity with Ensemble Machine Learning in Health Care Claims Data
- PMID: 30542673
- PMCID: PMC6287925
Classifying Lung Cancer Severity with Ensemble Machine Learning in Health Care Claims Data
Abstract
Research in oncology quality of care and health outcomes has been limited by the difficulty of identifying cancer stage in health care claims data. Using linked cancer registry and Medicare claims data, we develop a tool for classifying lung cancer patients receiving chemotherapy into early vs. late stage cancer by (i) deploying ensemble machine learning for prediction, (ii) establishing a set of classification rules for the predicted probabilities, and (iii) considering an augmented set of administrative claims data. We find our ensemble machine learning algorithm with a classification rule defined by the median substantially outperforms an existing clinical decision tree for this problem, yielding full sample performance of 93% sensitivity, 92% specificity, and 93% accuracy. This work has the potential for broad applicability as provider organizations, payers, and policy makers seek to measure quality and outcomes of cancer care and improve on risk adjustment methods.
Figures
References
-
- Brooks GA, Landrum MB, and Keating NL. Inferring cancer stage from administrative data, March 2017. Report submitted to the Centers for Medicare and Medicaid Innovation.
-
- Cooper GS, Yuan Z, Stange KC, Amini SB, Dennis LK, and Rimm AA. The utility of Medicare claims data for measuring cancer stage. Med Care, 37(7):706–711, 1999. - PubMed
-
- Hassett MJ, Ritzwoller DP, Taback N, Carroll N, Cronin AM, Ting GV, Schrag D, Warren JL, Hornbrook MC, and Weeks JC. Validating billing/encounter codes as indicators of lung, colorectal, breast, and prostate cancer recurrence using 2 large contemporary cohorts. Med Care, 52(10):65–73, 2014. - PMC - PubMed
-
- Howlader N, Noone AM, Krapcho M, Miller D, Bishop K, Altekruse SF, Kosary CL, Yu M, Ruhl J, Tatalovich Z, Mariotto A, Lewis DR, Chen HS, Feuer EJ, and Cronin KA (eds.). SEER cancer statistics review 1975–2013. Report, National Cancer Institute, 2016.
Grants and funding
LinkOut - more resources
Full Text Sources