Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 5;5(4):e240625.
doi: 10.1001/jamahealthforum.2024.0625.

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas

Affiliations

A Novel Machine Learning Algorithm for Creating Risk-Adjusted Payment Formulas

Corinne Andriola et al. JAMA Health Forum. .

Abstract

Importance: Models predicting health care spending and other outcomes from administrative records are widely used to manage and pay for health care, despite well-documented deficiencies. New methods are needed that can incorporate more than 70 000 diagnoses without creating undesirable coding incentives.

Objective: To develop a machine learning (ML) algorithm, building on Diagnostic Item (DXI) categories and Diagnostic Cost Group (DCG) methods, that automates development of clinically credible and transparent predictive models for policymakers and clinicians.

Design, setting, and participants: DXIs were organized into disease hierarchies and assigned an Appropriateness to Include (ATI) score to reflect vagueness and gameability concerns. A novel automated DCG algorithm iteratively assigned DXIs in 1 or more disease hierarchies to DCGs, identifying sets of DXIs with the largest regression coefficient as dominant; presence of a previously identified dominating DXI removed lower-ranked ones before the next iteration. The Merative MarketScan Commercial Claims and Encounters Database for commercial health insurance enrollees 64 years and younger was used. Data from January 2016 through December 2018 were randomly split 90% to 10% for model development and validation, respectively. Deidentified claims and enrollment data were delivered by Merative the following November in each calendar year and analyzed from November 2020 to January 2024.

Main outcome and measures: Concurrent top-coded total health care cost. Model performance was assessed using validation sample weighted least-squares regression, mean absolute errors, and mean errors for rare and common diagnoses.

Results: This study included 35 245 586 commercial health insurance enrollees 64 years and younger (65 901 460 person-years) and relied on 19 clinicians who provided reviews in the base model. The algorithm implemented 218 clinician-specified hierarchies compared with the US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model's 64 hierarchies. The base model that dropped vague and gameable DXIs reduced the number of parameters by 80% (1624 of 3150), achieved an R2 of 0.535, and kept mean predicted spending within 12% ($3843 of $31 313) of actual spending for the 3% of people with rare diseases. In contrast, the HHS HCC model had an R2 of 0.428 and underpaid this group by 33% ($10 354 of $31 313).

Conclusions and relevance: In this study, by automating DXI clustering within clinically specified hierarchies, this algorithm built clinically interpretable risk models in large datasets while addressing diagnostic vagueness and gameability concerns.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Hsu reported grants from the National Institute on Drug Abuse and Agency for Healthcare Research and Quality during the conduct of the study. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Overview of Diagnostic Item (DXI) and Diagnostic Cost Group (DCG) Clinical and Machine Learning Algorithm Steps
ATI indicates Appropriateness to Include; CCSR, Clinical Classification Software Revised; WLS, weighted least squares.
Figure 2.
Figure 2.. Distribution of Appropriateness to Include (ATI) Scores in Diagnostic Item (DXI) Main Effects and Clinical Classification Software Revised (CCSR) Classifications
Percentages are calculated as the fraction of all main-effect DXIs and CCSRs.
Figure 3.
Figure 3.. Model Parameter Counts and R2 across Diagnostic Cost Group (DCG) Iterations for the Base Model
The US Department of Health and Human Services (HHS) hierarchical condition category (HCC) model used the combined set of HHS HCCs included in the adult, child, and infant models in a single regression. The Clinical Classification Software Revised (CCSR) model used weighted least squares on all 538 observed CCSR categories, while the base Diagnostic Item (DXI) model used main-effect DXIs and CCSRs. As DCGs were created, DXIs assigned to them were dropped from the model. After all DCGs were found, the DCG stepwise iteration estimated a stepwise regression that omitted all remaining DXI variables not assigned to DCGs and included only statistically significant and nonnegative DCGs. The final run constrained coefficients to be monotonically decreasing within disease hierarchies. All models included 30 age-sex dummy variables.
Figure 4.
Figure 4.. Mean Residuals of Total Spending in the Validation Sample Top-Coded at $250 000 for 5 Models by Frequency of Enrollee-Year Rarest Diagnosis
All models include age-sex dummy variables. We calculated enrollee-weighted mean residuals in the validation sample using the binned frequencies of diagnoses in the full sample, with frequency intervals determined by powers of 10 per million. Plot whiskers indicate 95% CIs, corrected for clustering at the patient level. CCI indicates Charlson Comorbidity Index; CCSR, Clinical Classifications Software Refined; DCG, Diagnostic Cost Group; DXI, Diagnostic Item; HCG, hierarchical condition category; HHS, US Department of Health and Human Services; ICD-10-CM, International Statistical Classification of Diseases, Tenth Revision, Clinical Modification.

Similar articles

Cited by

References

    1. Ash AS, Ellis RP, Pope GC, et al. . Using diagnoses to describe populations and predict costs. Health Care Financ Rev. 2000;21(3):7-28. - PMC - PubMed
    1. Pope GC, Ellis RP, Ash AS, et al. . Diagnostic Cost Group hierarchical condition category models for Medicare risk adjustment: Final Report. Accessed January 15, 2024. https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Tren...
    1. Kautter J, Pope GC, Ingber M, et al. . The HHS-HCC risk adjustment model for individual and small group markets under the Affordable Care Act. Medicare Medicaid Res Rev. 2014;4(3):mmrr2014-004-03-a03. doi:10.5600/mmrr.004.03.a03 - DOI - PMC - PubMed
    1. US Centers for Medicare & Medicaid Services . Patient protection and Affordable Care Act; HHS notice of benefit and payment parameters for 2018. Accessed January 15, 2024. https://www.gpo.gov/fdsys/pkg/FR-2016-09-06/pdf/2016-20896.pdf
    1. Ellis RP, Hsu HE, Siracuse JJ, et al. . Development and assessment of a new framework for disease surveillance, prediction, and risk adjustment: the Diagnostic Items Classification System. JAMA Health Forum. 2022;3(3):e220276. doi:10.1001/jamahealthforum.2022.0276 - DOI - PMC - PubMed