Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May;6(5):e279-e290.
doi: 10.1016/S2665-9913(24)00059-6.

The Florida Scoring System for stratifying children with suspected Sjögren's disease: a cross-sectional machine learning study

Affiliations

The Florida Scoring System for stratifying children with suspected Sjögren's disease: a cross-sectional machine learning study

Wenjie Zeng et al. Lancet Rheumatol. 2024 May.

Abstract

Background: Childhood Sjögren's disease is a rare, underdiagnosed, and poorly-understood condition. By integrating machine learning models on a paediatric cohort in the USA, we aimed to develop a novel system (the Florida Scoring System) for stratifying symptomatic paediatric patients with suspected Sjögren's disease.

Methods: This cross-sectional study was done in symptomatic patients who visited the Department of Pediatric Rheumatology at the University of Florida, FL, USA. Eligible patients were younger than 18 years or had symptom onset before 18 years of age. Patients with confirmed diagnosis of another autoimmune condition or infection with a clear aetiological microorganism were excluded. Eligible patients underwent comprehensive examinations to rule out or diagnose childhood Sjögren's disease. We used latent class analysis with clinical and laboratory variables to detect heterogeneous patient classes. Machine learning models, including random forest, gradient-boosted decision tree, partial least square discriminatory analysis, least absolute shrinkage and selection operator-penalised ordinal regression, artificial neural network, and super learner were used to predict patient classes and rank the importance of variables. Causal graph learning selected key features to build the final Florida Scoring System. The predictors for all models were the clinical and laboratory variables and the outcome was the definition of patient classes.

Findings: Between Jan 16, 2018, and April 28, 2022, we screened 448 patients for inclusion. After excluding 205 patients due to symptom onset later than 18 years of age, we recruited 243 patients into our cohort. 26 patients were excluded because of confirmed diagnosis of a disorder other than Sjögren's disease, and 217 patients were included in the final analysis. Median age at diagnosis was 15 years (IQR 11-17). 155 (72%) of 216 patients were female and 61 (28%) were male, 167 (79%) of 212 were White, and 20 (9%) of 213 were Hispanic, Latino, or Spanish. The latent class analysis identified three distinct patient classes: class I (dryness dominant with positive tests, n=27), class II (high symptoms with negative tests, n=98), and class III (low symptoms with negative tests, n=92). Machine learning models accurately predicted patient class and ranked variable importance consistently. The causal graphical model discovered key features for constructing the Florida Scoring System.

Interpretation: The Florida Scoring System is a paediatrician-friendly tool that can be used to assist classification and long-term monitoring of suspected childhood Sjögren's disease. The resulting stratification has important implications for clinical management, trial design, and pathobiological research. We found a highly symptomatic patient group with negative serology and diagnostic profiles, which warrants clinical attention. We further revealed that salivary gland ultrasonography can be a non-invasive alternative to minor salivary gland biopsy in children. The Florida Scoring System requires validation in larger prospective paediatric cohorts.

Funding: National Institute of Dental and Craniofacial Research, National Institute of Arthritis, Musculoskeletal and Skin Diseases, National Heart, Lung, and Blood Institute, and Sjögren's Foundation.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests We declare no competing interests.

Figures

Figure 1.
Figure 1.. Schematic presentation of the analytical pipeline for the development of FSS.
We first performed latent class analysis (LCA) for the identification of clinically meaningful patient groups, validated the LCA-derived patient classes using 10-fold cross-validation, and compared them with k-means clusters. Supervised machine learning models were applied to predict LCA-derived patient classes and rank variable importance. The causal graph algorithm was applied to discover variables retaining critical information for predicting LCA-derived patient classes, which were used for the construction of the weighted FSS scoring system. RF: random forest, GBDT: gradient-boosted decision tree, PLSDA: partial least square discriminatory analysis. ANN: artificial neural network. LASSO-OR: Least Absolute Shrinkage and Selection Operator penalized ordinal regression.
Figure 2.
Figure 2.. Learned causal graph demonstrating the relationship between the 33 clinical and laboratory variables and LCA-derived patient classes.
Variables in yellow represent variables in the Markov Blanket, which retained essential information for inferring patient classes. Four different causal relationships were identified, including directional causal relationship: A is the cause of B; bidirectional causal relationship: unobserved latent variables caused both; conditional causal relationship: B is not a cause of A; conditional association: inconclusive causal relationship. Note that besides the edges represented by a direct arrow (AB), all other edges do not exclude the possibility of a latent confounder. AIHA: autoimmune hemolytic anemia; ANA: antinuclear antibody; eSjA: Early Sjögren’s disease autoantibody; MSGBx: minor salivary gland biopsy; RF: rheumatoid factor; RP/SG swelling: recurrent parotitis/salivary gland swelling; SGUS: salivary gland ultrasound; USFR: unstimulated salivary flow rate. This graph was generated by Cytoscape.

References

    1. Kassan SS, Moutsopoulos HM. Clinical manifestations and early diagnosis of Sjögren syndrome. Arch Intern Med 2004; 164(12): 1275–84. - PubMed
    1. Ramos-Casals M, Acar-Denizli N, Vissink A, et al. Childhood-onset of primary Sjögren’s syndrome: phenotypic characterization at diagnosis of 158 children. Rheumatology (Oxford) 2021; 60(10): 4558–67. - PubMed
    1. Ciurtin C, Cho Y, Al-Obaidi M, Jury EC, Price EJ. Barriers to translational research in Sjögren’s syndrome with childhood onset: challenges of recognising and diagnosing an orphan rheumatic disease. The Lancet Rheumatology 2021; 3(2): e138–e48. - PubMed
    1. Yokogawa N, Lieberman SM, Sherry DD, Vivino FB. Features of childhood Sjögren’s syndrome in comparison to adult Sjögren’s syndrome: considerations in establishing child-specific diagnostic criteria. Clin Exp Rheumatol 2016; 34(2): 343–51. - PubMed
    1. Virdee S, Greenan-Barrett J, Ciurtin C. A systematic review of primary Sjögren’s syndrome in male and paediatric populations. Clin Rheumatol 2017; 36(10): 2225–36. - PMC - PubMed

LinkOut - more resources