A Graphical Toolkit for Longitudinal Dataset Maintenance and Predictive Model Training in Health Care
- PMID: 35172371
- PMCID: PMC8850007
- DOI: 10.1055/s-0041-1740923
A Graphical Toolkit for Longitudinal Dataset Maintenance and Predictive Model Training in Health Care
Abstract
Background: Predictive analytic models, including machine learning (ML) models, are increasingly integrated into electronic health record (EHR)-based decision support tools for clinicians. These models have the potential to improve care, but are challenging to internally validate, implement, and maintain over the long term. Principles of ML operations (MLOps) may inform development of infrastructure to support the entire ML lifecycle, from feature selection to long-term model deployment and retraining.
Objectives: This study aimed to present the conceptual prototypes for a novel predictive model management system and to evaluate the acceptability of the system among three groups of end users.
Methods: Based on principles of user-centered software design, human-computer interaction, and ethical design, we created graphical prototypes of a web-based MLOps interface to support the construction, deployment, and maintenance of models using EHR data. To assess the acceptability of the interface, we conducted semistructured user interviews with three groups of users (health informaticians, clinical and data stakeholders, chief information officers) and evaluated preliminary usability using the System Usability Scale (SUS). We subsequently revised prototypes based on user input and developed user case studies.
Results: Our prototypes include design frameworks for feature selection, model training, deployment, long-term maintenance, visualization over time, and cross-functional collaboration. Users were able to complete 71% of prompted tasks without assistance. The average SUS score of the initial prototype was 75.8 out of 100, translating to a percentile range of 70 to 79, a letter grade of B, and an adjective rating of "good." We reviewed persona-based case studies that illustrate functionalities of this novel prototype.
Conclusion: The initial graphical prototypes of this MLOps system are preliminarily usable and demonstrate an unmet need within the clinical informatics landscape.
Thieme. All rights reserved.
Conflict of interest statement
M.L.R. reports all support from NIGMS (grant no.: U54GM115677) for early phases of this work. S.L.S. reports all support from Brown University Summer Assistantship Fund.
Figures
References
-
- Roski J, Bo-Linn G W, Andrews T A. Creating value in health care through big data: opportunities and policy implications. Health Aff (Millwood) 2014;33(07):1115–1122. - PubMed
-
- Ngiam K Y, Khor I W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 2019;20(05):e262–e273. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
