JAMA Health Forum. 2024 Apr 5;5(4):e240625.

doi: 10.1001/jamahealthforum.2024.0625.

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas

Corinne Andriola¹, Randall P Ellis², Jeffrey J Siracuse³, Alex Hoagland⁴, Tzu-Chun Kuo⁵, Heather E Hsu⁶, Allan Walkey⁷, Karen E Lasser⁸,⁹,¹⁰,¹¹, Arlene S Ash¹²

Affiliations

¹ Center for Innovation in Population Health, College of Public Health, University of Kentucky, Lexington.
² Department of Economics, Boston University, Boston, Massachusetts.
³ Division of Vascular and Endovascular Surgery, Boston Medical Center, Boston University Chobanian and Avedisian School of Medicine, Boston, Massachusetts.
⁴ Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada.
⁵ BMC HealthNet Plan, Boston, Massachusetts.
⁶ Department of Pediatrics, Boston University Chobanian and Avedisian School of Medicine, Boston, Massachusetts.
⁷ Department of Medicine, University of Massachusetts Chan Medical School, Worcester.
⁸ Section of General Internal Medicine, Department of Medicine, Boston University Chobanian and Avedisian School of Medicine, Boston, Massachusetts.
⁹ Community Health Sciences, Boston University School of Public Health, Boston, Massachusetts.
¹⁰ Boston Medical Center, Boston, Massachusetts.
¹¹ Senior Editor, JAMA.
¹² Department of Population and Quantitative Health Sciences, University of Massachusetts Chan Medical School, Worcester.

PMID: 38639980
PMCID: PMC11065160
DOI: 10.1001/jamahealthforum.2024.0625

Corinne Andriola et al. JAMA Health Forum. 2024.

Abstract

Importance: Models predicting health care spending and other outcomes from administrative records are widely used to manage and pay for health care, despite well-documented deficiencies. New methods are needed that can incorporate more than 70 000 diagnoses without creating undesirable coding incentives.

Objective: To develop a machine learning (ML) algorithm, building on Diagnostic Item (DXI) categories and Diagnostic Cost Group (DCG) methods, that automates development of clinically credible and transparent predictive models for policymakers and clinicians.

Design, setting, and participants: DXIs were organized into disease hierarchies and assigned an Appropriateness to Include (ATI) score to reflect vagueness and gameability concerns. A novel automated DCG algorithm iteratively assigned DXIs in 1 or more disease hierarchies to DCGs, identifying sets of DXIs with the largest regression coefficient as dominant; presence of a previously identified dominating DXI removed lower-ranked ones before the next iteration. The Merative MarketScan Commercial Claims and Encounters Database for commercial health insurance enrollees 64 years and younger was used. Data from January 2016 through December 2018 were randomly split 90% to 10% for model development and validation, respectively. Deidentified claims and enrollment data were delivered by Merative the following November in each calendar year and analyzed from November 2020 to January 2024.
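The dominance rule described above (a higher-ranked group of DXIs suppresses lower-ranked ones in the same hierarchy) can be illustrated with a minimal sketch. The function name, the DXI labels, and the example hierarchy are hypothetical, and the ranking is taken as given rather than estimated by regression as in the study.

```python
from typing import List, Set


def apply_hierarchy(person_dxis: Set[str], ranked_groups: List[Set[str]]) -> List[int]:
    """Return one 0/1 flag per ranked group for a single person.

    ranked_groups is ordered from most dominant (identified first) to least.
    Only the highest-ranked group the person qualifies for is flagged;
    lower-ranked groups in the same hierarchy are suppressed.
    """
    flags = [0] * len(ranked_groups)
    for rank, group in enumerate(ranked_groups):
        if person_dxis & group:  # person carries at least one DXI in this group
            flags[rank] = 1
            break  # dominance: stop before any lower-ranked group
    return flags


# Hypothetical hierarchy with three ranked groups of made-up DXI labels.
example_groups = [{"DXI_A"}, {"DXI_B", "DXI_C"}, {"DXI_D"}]
example_flags = apply_hierarchy({"DXI_B", "DXI_D"}, example_groups)
```

Here a person carrying `DXI_B` and `DXI_D` is flagged only for the second group, because it outranks the group containing `DXI_D`.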

Main outcome and measures: Concurrent top-coded total health care cost. Model performance was assessed using validation sample weighted least-squares regression, mean absolute errors, and mean errors for rare and common diagnoses.
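As a rough illustration of these measures, the sketch below top-codes spending and computes a mean error and a mean absolute error. The $250 000 cap is the one reported in the Figure 4 caption; the function names, the sign convention (predicted minus actual, so a negative value means underprediction), and the sample numbers are assumptions made for the example.

```python
from typing import List


def top_code(costs: List[float], cap: float = 250_000.0) -> List[float]:
    """Replace any cost above the cap with the cap itself."""
    return [min(c, cap) for c in costs]


def mean_error(actual: List[float], predicted: List[float]) -> float:
    """Average of (predicted - actual); negative means underprediction."""
    return sum(p - a for a, p in zip(actual, predicted)) / len(actual)


def mean_absolute_error(actual: List[float], predicted: List[float]) -> float:
    """Average of |predicted - actual|."""
    return sum(abs(p - a) for a, p in zip(actual, predicted)) / len(actual)


# Made-up spending values for three enrollee-years.
actual = top_code([1_000.0, 400_000.0, 5_000.0])
predicted = [1_500.0, 200_000.0, 5_000.0]
me = mean_error(actual, predicted)
mae = mean_absolute_error(actual, predicted)
```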

Results: This study included 35 245 586 commercial health insurance enrollees 64 years and younger (65 901 460 person-years) and relied on 19 clinicians who provided reviews in the base model. The algorithm implemented 218 clinician-specified hierarchies compared with the US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model's 64 hierarchies. The base model that dropped vague and gameable DXIs reduced the number of parameters by 80% (1624 of 3150), achieved an R² of 0.535, and kept mean predicted spending within 12% ($3843 of $31 313) of actual spending for the 3% of people with rare diseases. In contrast, the HHS HCC model had an R² of 0.428 and underpaid this group by 33% ($10 354 of $31 313).
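The two rare-disease percentages follow directly from the dollar figures quoted above. A short check, with variable names chosen only for this example:

```python
# Mean actual spending for people with rare diseases, as reported.
actual_mean = 31_313

# Reported gap between mean predicted and mean actual spending, by model.
gaps = {"DXI/DCG base model": 3_843, "HHS HCC model": 10_354}

# Gap as a percentage of actual spending, rounded to one decimal place.
shares = {name: round(100 * gap / actual_mean, 1) for name, gap in gaps.items()}
```

The results are about 12.3% and 33.1%, matching the rounded 12% and 33% in the text.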

Conclusions and relevance: In this study, by automating DXI clustering within clinically specified hierarchies, this algorithm built clinically interpretable risk models in large datasets while addressing diagnostic vagueness and gameability concerns.

Conflict of interest statement

Conflict of Interest Disclosures: Dr Hsu reported grants from the National Institute on Drug Abuse and Agency for Healthcare Research and Quality during the conduct of the study. No other disclosures were reported.

Figures

**Figure 1. Overview of Diagnostic Item (DXI) and Diagnostic Cost Group (DCG) Clinical and Machine Learning Algorithm Steps**
ATI indicates Appropriateness to Include; CCSR, Clinical Classifications Software Refined; WLS, weighted least squares.

**Figure 2. Distribution of Appropriateness to Include (ATI) Scores in Diagnostic Item (DXI) Main Effects and Clinical Classifications Software Refined (CCSR) Classifications**
Percentages are calculated as the fraction of all main-effect DXIs and CCSRs.

**Figure 3. Model Parameter Counts and R² across Diagnostic Cost Group (DCG) Iterations for the Base Model**
The US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model used the combined set of HHS HCCs included in the adult, child, and infant models in a single regression. The Clinical Classifications Software Refined (CCSR) model used weighted least squares on all 538 observed CCSR categories, while the base Diagnostic Item (DXI) model used main-effect DXIs and CCSRs. As DCGs were created, DXIs assigned to them were dropped from the model. After all DCGs were found, the DCG stepwise iteration estimated a stepwise regression that omitted all remaining DXI variables not assigned to DCGs and included only statistically significant and nonnegative DCGs. The final run constrained coefficients to be monotonically decreasing within disease hierarchies. All models included 30 age-sex dummy variables.
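The caption does not say how the monotonicity constraint was imposed. One generic way to force a sequence of coefficients to be non-increasing is pool-adjacent-violators averaging. The sketch below uses that technique purely as an illustration, not as the authors' procedure; the function name, weights, and coefficient values are made up.

```python
from typing import List


def pool_decreasing(coefs: List[float], weights: List[float]) -> List[float]:
    """Weighted pool-adjacent-violators fit of a non-increasing sequence.

    coefs are ordered from the highest-ranked group to the lowest.
    Adjacent groups that violate the ordering are pooled to their
    weighted mean until the whole sequence is non-increasing.
    """
    blocks = []  # each block: [weighted sum, total weight, number of items]
    for c, w in zip(coefs, weights):
        blocks.append([c * w, w, 1])
        # Merge while the previous block's mean is below the current one.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] < blocks[-1][0] / blocks[-1][1]
        ):
            s, bw, n = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += bw
            blocks[-1][2] += n
    fitted: List[float] = []
    for s, bw, n in blocks:
        fitted.extend([s / bw] * n)
    return fitted


# Made-up coefficients for four ranked groups with equal weights.
smoothed = pool_decreasing([50_000, 20_000, 30_000, 5_000], [1, 1, 1, 1])
```

In this example the second and third groups violate the ordering and are pooled to a common value of 25 000, while the first and last coefficients are unchanged.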

**Figure 4. Mean Residuals of Total Spending in the Validation Sample Top-Coded at $250 000 for 5 Models by Frequency of Enrollee-Year Rarest Diagnosis**
All models include age-sex dummy variables. We calculated enrollee-weighted mean residuals in the validation sample using the binned frequencies of diagnoses in the full sample, with frequency intervals determined by powers of 10 per million. Plot whiskers indicate 95% CIs, corrected for clustering at the patient level. CCI indicates Charlson Comorbidity Index; CCSR, Clinical Classifications Software Refined; DCG, Diagnostic Cost Group; DXI, Diagnostic Item; HCC, hierarchical condition category; HHS, US Department of Health and Human Services; *ICD*-10-CM, *International Statistical Classification of Diseases, Tenth Revision, Clinical Modification*.
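The binning by powers of 10 per million can be sketched as follows. The bin labels, function names, and diagnosis counts are assumptions for illustration; the caption specifies only that intervals are powers of 10 per million and that each enrollee-year is classified by its rarest diagnosis.

```python
from typing import Dict, Set


def frequency_bin(count: int, population: int) -> str:
    """Label the power-of-10 interval (per million people) containing a rate."""
    per_million = count * 1_000_000 / population
    if per_million < 1:
        return "<1"
    lower = 1
    while per_million >= lower * 10:
        lower *= 10
    return f"{lower}-{lower * 10}"


def rarest_bin(person_dxs: Set[str], counts: Dict[str, int], population: int) -> str:
    """Bin an enrollee-year by the least frequent diagnosis it carries."""
    rarest_count = min(counts[d] for d in person_dxs)
    return frequency_bin(rarest_count, population)


# Made-up full-sample counts for two hypothetical diagnoses.
counts = {"dx_common": 50_000, "dx_rare": 5}
label = rarest_bin({"dx_common", "dx_rare"}, counts, 1_000_000)
```

An enrollee-year with both diagnoses falls in the `1-10` per million bin, because the rare diagnosis determines the classification.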

Cited by

Artificial Intelligence in Relation to Accurate Information and Tasks in Gynecologic Oncology and Clinical Medicine-Dunning-Kruger Effects and Ultracrepidarianism.
Pavlik EJ, Land Woodward J, Lawton F, Swiecki-Sikora AL, Ramaiah DD, Rives TA. Diagnostics (Basel). 2025 Mar 15;15(6):735. doi: 10.3390/diagnostics15060735. PMID: 40150078. Review.
Algorithms to Improve Fairness in Medicare Risk Adjustment.
Reitsma MB, McGuire TG, Rose S. JAMA Health Forum. 2025 Aug 1;6(8):e252640. doi: 10.1001/jamahealthforum.2025.2640. PMID: 40880105.
Algorithms to Improve Fairness in Medicare Risk Adjustment.
Reitsma MB, McGuire TG, Rose S. medRxiv [Preprint]. 2025 Jan 27:2025.01.25.25321057. doi: 10.1101/2025.01.25.25321057. Update in: JAMA Health Forum. 2025 Aug 1;6(8):e252640. doi: 10.1001/jamahealthforum.2025.2640. PMID: 39974004. Preprint.
