Development and validation of identification algorithms for five autoimmune diseases using electronic health records: a retrospective cohort study in China
- PMID: 40276516
- PMCID: PMC12018398
- DOI: 10.3389/fimmu.2025.1541203
Abstract
Objective: This study aims to assess identification algorithms, built on the Yinzhou Regional Health Information Platform (YRHIP) in China, for five autoimmune diseases: Hashimoto's thyroiditis, inflammatory bowel disease (IBD), primary immune thrombocytopenia (ITP), rheumatoid arthritis (RA), and type 1 diabetes (T1D).
Methods: Diagnostic data were extracted from YRHIP's population registry (2010-2021), combining ICD-10 codes and Chinese medical terminology from outpatient, inpatient, and discharge records. Algorithms were validated through chart reviews that adhered to global clinical guidelines, and cases were adjudicated using electronic case report forms. We evaluated algorithm performance by sensitivity and positive predictive value (PPV), using a 70% PPV threshold for optimization.
Results: Among all reviewed cases, we identified 136 cases of Hashimoto's thyroiditis, 65 of IBD, 76 of ITP, 130 of RA, and 43 of T1D. Algorithm performance varied across diseases: the final algorithm for Hashimoto's thyroiditis achieved the highest accuracy (sensitivity 97.44%, PPV 98.28%), followed by RA (sensitivity 100.00%, PPV 76.92%). Algorithms for IBD and ITP required synthesis of multiple data sources to achieve acceptable performance (IBD: sensitivity 79.66%, PPV 70.15%; ITP: sensitivity 62.50%, PPV 70.00%). For T1D, the final algorithm, which used both admission and outpatient records, yielded satisfactory results (sensitivity 84.09%, PPV 74.00%).
Conclusions: This study presents the first validated algorithms for identifying autoimmune diseases using EHR data in China, with satisfactory performance (PPV ≥70%) across all five diseases. Our findings indicate that combining data sources is crucial for accurate case identification in complex autoimmune conditions, and they provide a methodological foundation for future real-world studies in Chinese populations.
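The two performance measures named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' code: the class and function names are hypothetical, and the counts in the usage example are invented for illustration rather than taken from the study. It assumes the standard definitions, sensitivity = TP / (TP + FN) and PPV = TP / (TP + FP), with chart review as the reference standard, and it treats the 70% PPV threshold as inclusive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ValidationCounts:
    """Chart-review outcome counts for one candidate algorithm (hypothetical container)."""
    true_positives: int   # flagged by the algorithm and confirmed on chart review
    false_positives: int  # flagged by the algorithm but not confirmed
    false_negatives: int  # confirmed cases the algorithm failed to flag


def sensitivity(counts: ValidationCounts) -> float:
    """Share of confirmed cases that the algorithm flagged."""
    denominator = counts.true_positives + counts.false_negatives
    if denominator == 0:
        raise ValueError("no confirmed cases; sensitivity is undefined")
    return counts.true_positives / denominator


def positive_predictive_value(counts: ValidationCounts) -> float:
    """Share of flagged cases that chart review confirmed."""
    denominator = counts.true_positives + counts.false_positives
    if denominator == 0:
        raise ValueError("no flagged cases; PPV is undefined")
    return counts.true_positives / denominator


def meets_ppv_threshold(counts: ValidationCounts, threshold: float = 0.70) -> bool:
    """Check the PPV against an inclusive threshold (0.70 mirrors the abstract)."""
    return positive_predictive_value(counts) >= threshold


# Illustrative counts only; these are NOT figures from the study.
example = ValidationCounts(true_positives=45, false_positives=15, false_negatives=5)
print(f"sensitivity = {sensitivity(example):.2%}")
print(f"PPV         = {positive_predictive_value(example):.2%}")
print(f"meets 70% PPV threshold: {meets_ppv_threshold(example)}")
```

Keeping the counts in one immutable record makes it straightforward to compare several candidate algorithms for the same disease and retain the one that clears the PPV threshold with the highest sensitivity, which is one plausible reading of "optimization" in the Methods.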
Keywords: Hashimoto's thyroiditis; algorithms; computable phenotype; electronic health records (EHR); inflammatory bowel disease (IBD); primary immune thrombocytopenia; rheumatoid arthritis (RA); type 1 diabetes (T1D).
Copyright © 2025 Yang, Wu, Guo, Wang, Gao, Chen, Zhang, Yang, Liu, Liu, Liu and Zhan.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.