Deep Phenotyping of Obesity: Electronic Health Record-Based Temporal Modeling Study
- PMID: 40834423
- PMCID: PMC12373304
- DOI: 10.2196/70140
Deep Phenotyping of Obesity: Electronic Health Record-Based Temporal Modeling Study
Abstract
Background: Obesity affects approximately 40% of adults and 15%-20% of children and adolescents in the United States, and poses significant economic and psychosocial burdens. Currently, patient responses to any single antiobesity medication (AOM) vary significantly, making obesity deep phenotyping and associated precision medicine important targets of investigation.
Objective: This study aimed to evaluate the potential of electronic health records (EHR) as a primary data source for obesity deep phenotyping. We conducted an in-depth analysis of the data elements and quality available from obesity patients prior to pharmacotherapy and applied a multimodal longitudinal deep autoencoder to investigate the feasibility, data requirements, clustering patterns, and challenges associated with EHR-based obesity deep phenotyping.
Methods: We analyzed 53,688 pre-AOM periods from 32,969 patients with obesity or overweight who underwent medium- to long-term AOM treatment. A total of 92 laboratory and vital measurements, along with 79 ICD (International Classification of Diseases)-derived clinical classifications software (CCS) codes recorded within one year prior to AOM treatment, were used to train a gated recurrent unit with decay-based longitudinal autoencoder (GRU-D-AE) to generate dense embeddings for each pre-AOM record. Principal component analysis and Gaussian mixture modeling (GMM) were applied to identify clusters.
Results: Our analysis identified at least 9 clusters, with 5 exhibiting distinct and explainable clinical relevance. Certain clusters show characteristics overlapping with phenotypes from traditional phenotyping strategy. Results from multiple training folds demonstrated stable clustering patterns in 2D space and reproducible clinical significance. However, challenges persist regarding the stability of missing data imputation across folds, maintaining consistency in input features, and effectively visualizing complex diseases in low-dimensional spaces.
Conclusions: In this proof-of-concept study, we demonstrated longitudinal EHR as a valuable resource for deep phenotyping the pre-AOM period at per patient visit level. Our analysis revealed the presence of clusters with distinct clinical significance, which could have implications in AOM treatment options. Further research using larger, independent cohorts is necessary to validate the reproducibility and clinical relevance of these clusters, uncover more detailed substructures and corresponding AOM treatment responses.
Keywords: EHR; anti-obesity medication; obesity; phenotyping; precision medicine.
©Xiaoyang Ruan, Shuyu Lu, Liwei Wang, Andrew Wen, Sameer Murali, Hongfang Liu. Originally published in the Journal of Medical Internet Research (https://www.jmir.org).
Conflict of interest statement
Figures







Similar articles
-
Deep phenotyping obesity using EHR data: Promise, Challenges, and Future Directions.medRxiv [Preprint]. 2024 Dec 16:2024.12.06.24318608. doi: 10.1101/2024.12.06.24318608. medRxiv. 2024. PMID: 39677469 Free PMC article. Preprint.
-
Prescription of Controlled Substances: Benefits and Risks.2025 Jul 6. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. 2025 Jul 6. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2025 Jan–. PMID: 30726003 Free Books & Documents.
-
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2. Cochrane Database Syst Rev. 2022. PMID: 36194890 Free PMC article.
-
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2. Cochrane Database Syst Rev. 2020. PMID: 33075160 Free PMC article.
-
Validation of administrative health data for the identification of endometriosis diagnosis.Hum Reprod. 2025 Feb 1;40(2):289-295. doi: 10.1093/humrep/deae281. Hum Reprod. 2025. PMID: 39704741 Free PMC article.
References
-
- Centers for disease control and prevention (CDC) Obesity and Severe Obesity Prevalence in Adults: United States, August 2021–August 2023. [15-08-2025]. https://www.cdc.gov/nchs/products/databriefs/db508.htm URL. Accessed.
-
- Stunkard AJ, Foch TT, Hrubec Z. A twin study of human obesity. JAMA. 1986 Jul 4;256(1):51–54. Medline. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical