Using an optimized generative model to infer the progression of complications in type 2 diabetes patients
- PMID: 35778708
- PMCID: PMC9250218
- DOI: 10.1186/s12911-022-01915-5
Using an optimized generative model to infer the progression of complications in type 2 diabetes patients
Abstract
Background: People live a long time in pre-diabetes/early diabetes without a formal diagnosis or management. Heterogeneity of progression coupled with deficiencies in electronic health records related to incomplete data, discrete events, and irregular event intervals make identification of pre-diabetes and critical points of diabetes progression challenging.
Methods: We utilized longitudinal electronic health records of 9298 patients with type 2 diabetes or prediabetes from 2005 to 2016 from a large regional healthcare delivery network in China. We optimized a generative Markov-Bayesian-based model to generate 5000 synthetic illness trajectories. The synthetic data were manually reviewed by endocrinologists.
Results: We build an optimized generative progression model for type 2 diabetes using anchor information to reduce the number of parameters learning in the third layer of the model from [Formula: see text] to [Formula: see text], where [Formula: see text] is the number of clinical findings, [Formula: see text] is the number of complications, [Formula: see text] is the number of anchors. Based on this model, we infer the relationships between progression stages, the onset of complication categories, and the associated diagnoses during the whole progression of type 2 diabetes using electronic health records.
Discussion: Our findings indicate that 55.3% of single complications and 31.8% of complication patterns could be predicted early and managed appropriately to potentially delay (as it is a progressive disease) or prevented (by lifestyle modifications that keep patient from developing/triggering diabetes in the first place).
Conclusions: The full type 2 diabetes patient trajectories generated by the chronic disease progression model can counter a lack of real-world evidence of desired longitudinal timeframe while facilitating population health management.
Keywords: Computer simulation; Diabetes mellitus, type 2; Disease progression model; Electronic health records; Probabilistic generative model.
© 2022. The Author(s).
Conflict of interest statement
JP reports receiving personal fees from Summary Medical Inc and DispatchHealth and equity from Summary Medical Inc outside the submitted work. DB reports receiving grants and personal fees from EarlySense, personal fees from CDI Negev, equity from Valera Health, equity from CLEW Medical, equity from MDClone, personal fees and equity from AESOP, personal fees and equity from FeelBetter, and grants from IBM Watson Health, outside the submitted work.
Figures
References
-
- Prediabetes—your chance to prevent type II diabetes. US Centers for Disease Control and Prevention. 11 June 2020. https://www.cdc.gov/diabetes/basics/prediabetes.html#:~:text=Approximate.... Accessed Aug 2020.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
