Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Oct;27(10):858-870.
doi: 10.1089/dia.2025.0074. Epub 2025 Apr 29.

Blood Glucose Prediction Algorithms Require Clinically Relevant Performance Criteria Beyond Accuracy

Affiliations

Blood Glucose Prediction Algorithms Require Clinically Relevant Performance Criteria Beyond Accuracy

Miriam K Wolff et al. Diabetes Technol Ther. 2025 Oct.

Abstract

Background: The root mean squared error (RMSE) is commonly used to evaluate blood glucose prediction algorithms. However, it primarily measures how well predictions align with the most likely future values, rather than supporting optimal and proactive treatment decisions. Since diabetes management data predominantly features blood glucose values within the target range, RMSE tends to favor models that consistently predict target-range values, often at the expense of detecting clinically critical events such as rapid fluctuations, hypoglycemia, or hyperglycemia. This study examines how and why RMSE biases evaluations toward trivial models, highlighting the need for alternative performance criteria that better reflect clinical priorities. Methods: We developed the composite glucose prediction metric (CGPM) to integrate three components: RMSE, temporal gain and geometric mean (glycemic event prediction). A custom loss function was designed to emphasize clinically critical predictions during model training. Pareto frontier analysis was used to assess trade-offs among models with comparable performance. Results: CGPM was computed for five blood glucose prediction techniques (zero-order hold, naïve linear regression, ridge regression, ridge regression trained with a custom loss function, and a physiology-based model) applied to the OhioT1DM dataset. The data-driven model with the lowest RMSE performed poorly on glycemic event prediction, highlighting RMSE's bias toward target-range predictions. In contrast, the ridge regressor trained with the custom loss function improved event prediction, showing that clinically weighted optimization mitigates biases. Conclusions: Blood glucose prediction algorithms require evaluation and optimization criteria beyond accuracy to better support optimal treatment decisions. This study introduced the CGPM as an alternative evaluation framework, along with a loss function designed for model optimization that emphasizes clinically critical but rare events. Further clinical validation is needed to refine these criteria and ensure they align more closely with the needs of diabetes management.

Keywords: Pareto frontier analysis; blood glucose prediction; evaluation metric; loss function; machine learning; predictive modeling.

PubMed Disclaimer

LinkOut - more resources