Establishment and analysis of a novel diagnostic model for systemic juvenile idiopathic arthritis based on machine learning
- PMID: 38243323
- PMCID: PMC10797915
- DOI: 10.1186/s12969-023-00949-x
Abstract
Background: Systemic juvenile idiopathic arthritis (SJIA) is a form of childhood arthritis with clinical features such as fever, lymphadenopathy, arthritis, rash, and serositis. It seriously affects the growth and development of children and carries high rates of disability and mortality. The precise cause of the disease is unknown; genetic, infectious, or autoimmune factors may all contribute. Our study aims to develop a gene-based diagnostic model for identifying SJIA at the genetic level.
Methods: Gene expression datasets of peripheral blood mononuclear cell (PBMC) samples from patients with SJIA were collected from the Gene Expression Omnibus (GEO) database. Three GEO datasets (GSE11907-GPL96, GSE8650-GPL96 and GSE13501) were merged and used as the training dataset, which included 125 SJIA samples and 92 healthy samples. GSE7753 was used as the validation dataset. The limma method was used to screen differentially expressed genes (DEGs). Feature selection was performed using Lasso, random forest (RF)-recursive feature elimination (RFE) and an RF classifier.
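The feature selection step described in the Methods can be sketched roughly as follows. This is a minimal illustration in Python with scikit-learn, not the authors' pipeline: the abstract does not state the software used or how the Lasso and RF-RFE results were combined, so the intersection rule, the `alpha` and `n_keep` values, the helper name `select_candidate_genes`, and the synthetic expression matrix are all assumptions made for the example. Only the sample counts (125 SJIA, 92 healthy) come from the abstract.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.linear_model import Lasso
from sklearn.preprocessing import StandardScaler


def select_candidate_genes(X, y, gene_names, alpha=0.05, n_keep=8):
    """Hypothetical helper: keep genes chosen by both Lasso and RF-RFE."""
    X_std = StandardScaler().fit_transform(X)

    # Lasso: genes with a non-zero coefficient survive the L1 penalty.
    lasso = Lasso(alpha=alpha).fit(X_std, y)
    lasso_genes = {str(g) for g in gene_names[lasso.coef_ != 0]}

    # RF-RFE: repeatedly drop the least important genes until n_keep remain.
    forest = RandomForestClassifier(n_estimators=100, random_state=0)
    rfe = RFE(forest, n_features_to_select=n_keep, step=2).fit(X_std, y)
    rfe_genes = {str(g) for g in gene_names[rfe.support_]}

    # Assumption: combine the two selections by intersection.
    return sorted(lasso_genes & rfe_genes)


# Synthetic stand-in for a DEG expression matrix (values are random, not real data).
rng = np.random.default_rng(0)
y = np.array([1] * 125 + [0] * 92)       # 1 = SJIA, 0 = healthy
X = rng.normal(size=(y.size, 30))        # 30 hypothetical DEGs
X[y == 1, :4] += 1.5                     # make the first 4 genes informative
gene_names = np.array([f"gene_{i}" for i in range(30)])

candidates = select_candidate_genes(X, y, gene_names)
print(candidates)
```

On real data, `X` would be the merged, batch-corrected expression matrix restricted to the DEGs returned by limma, and the surviving candidates would then be ranked with an RF classifier.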
Results: We identified 4 key genes (ALDH1A1, CEACAM1, YBX3 and SLC6A8) that were essential for distinguishing SJIA from healthy samples. We combined the 4 key genes and performed a grid search with 10-fold cross-validation repeated 5 times to identify the RF model with the optimal mtry. The mean area under the curve (AUC) value for 5-fold cross-validation was greater than 0.95. The model's performance was then assessed again on the validation dataset, yielding an AUC value of 0.990. These AUC values demonstrate the strong robustness of the SJIA diagnostic model.
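The tuning procedure in the Results (a grid search over mtry, scored by AUC under repeated 10-fold cross-validation) might look like the sketch below. It is an illustration under stated assumptions rather than the authors' code: scikit-learn's `max_features` is used as the analogue of mtry, the number of trees and the candidate grid are arbitrary, and the 4-column matrix is synthetic data standing in for the expression of the 4 key genes.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RepeatedStratifiedKFold

# Synthetic stand-in for the expression of 4 genes in 125 SJIA and 92 healthy samples.
rng = np.random.default_rng(0)
y = np.array([1] * 125 + [0] * 92)
X = rng.normal(size=(y.size, 4))
X[y == 1] += 1.0  # shift the SJIA class so the classes are separable

# 10-fold cross-validation repeated 5 times gives 50 train/test splits.
cv = RepeatedStratifiedKFold(n_splits=10, n_repeats=5, random_state=0)

# max_features plays the role of mtry: genes considered at each tree split.
search = GridSearchCV(
    RandomForestClassifier(n_estimators=50, random_state=0),
    param_grid={"max_features": [1, 2, 3, 4]},
    scoring="roc_auc",
    cv=cv,
)
search.fit(X, y)

best_mtry = search.best_params_["max_features"]
mean_auc = search.best_score_  # mean AUC over the 50 splits for the best mtry
print(best_mtry, round(mean_auc, 3))
```

The refitted `search.best_estimator_` would then be scored once on an independent dataset (GSE7753 in the study) to obtain the external validation AUC.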
Conclusions: We successfully developed a new SJIA diagnostic model that can serve as a novel aid in the identification of SJIA. In addition, the 4 key genes identified may serve as potential biomarkers for SJIA and provide new insights into its mechanisms.
Keywords: Diagnostic model; GEO; Machine learning; Random forest; Systemic juvenile idiopathic arthritis.
© 2024. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.