Machine learning-assisted diagnosis classification of primary immune dysregulation using IDDA2.1 phenotype profiling
- PMID: 41202990
- DOI: 10.1016/j.jaci.2025.10.022
Machine learning-assisted diagnosis classification of primary immune dysregulation using IDDA2.1 phenotype profiling
Abstract
Background: Immune dysregulation, including autoimmunity, autoinflammation, allergy, and malignancy predisposition, adds significant disease burden in primary immune disorders (PID) and inborn errors of immunity (IEIs).
Objective: We evaluated whether the 5-graded immune deficiency and dysregulation activity (IDDA2.1) score, encompassing 21 organ involvement and disease burden parameters, supports diagnosis across a wide spectrum of IEIs.
Methods: From April 2022 to November 2024, collaborators from 84 centers collected 1,043 IDDA score datasets from 825 patients across 89 IEIs (17 disorders with ≥10 patients each; range, 1-196 per IEI), including 177 scores from 141 treated patients. Supervised machine learning models (k-nearest neighbors, support vector machine, logistic regression, random forest) classified patients into disease groups and ranked corresponding predictive features, while unsupervised uniform manifold approximation and projection (UMAP) visualized disease-specific clustering.
Results: Feature analysis reflected clinicians' recognition of IEI patterns and confirmed internal IDDA score consistency. Phenotype profiles in treated patients remained informative, inversely reflecting anticipated treatment-dependent phenotype amelioration. UMAP effectively distinguished IEIs by IDDA2.1 profiles. Genetic disorder prediction achieved 73% overall accuracy, 70% for the correct monogenic IEI, and 93% within the top 3 predictions; classification reached 43% for IEI-International Union of Immunological Society categories and 59% for 12 "cardinal" IEIs (25 genes).
Conclusions: Random forest feature importance analysis can inform targeted clinical screening for key disease manifestations. The top 3 prediction approach demonstrates diagnostic potential, but improved accuracy will require larger, globally shared datasets. Small sample sizes for rare diseases highlight the necessity of broader collaboration to enhance AI-assisted clinical decision-making in the future.
Keywords: Inborn error of immunity (IEI); artificial intelligence (AI); immune deficiency and dysregulation activity (IDDA) score; interoperable patient data; phenotype-driven disease classification; primary immune disorder (PID); primary immune regulatory disorder (PIRD); primary immunodeficiency (PID); unsupervised and supervised machine learning (ML).
Copyright © 2025 The Authors. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Disclosure statement The programming of the IDDA2.1 score module in the ESID registry was financially supported by a grant from Takeda, Austria, who had no role in the study design, study conduct, data collection, data management, data analysis, data interpretation, or writing of this report. M.G.S. was in part funded by the Styrian Children’s Cancer Aid Foundation (Steirische Kinderkrebshilfe). Declaration of generative AI in scientific writing: The authors used ChatGPT for partial text editing to improve readability; they then reviewed and edited the text and take full responsibility for the content. Disclosure of potential conflict of interest: M. G. Seidel received advisory board honoraria from Pharming and Takeda unrelated to this work. The rest of the authors declare that they have no relevant conflicts of interest.
LinkOut - more resources
Full Text Sources
Miscellaneous
