Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Observational Study
. 2021 Apr;73(4):471-480.
doi: 10.1002/acr.24471. Epub 2021 Mar 13.

Data-Driven Patient Clustering and Differential Clinical Outcomes in the Brigham and Women's Rheumatoid Arthritis Sequential Study Registry

Affiliations
Observational Study

Data-Driven Patient Clustering and Differential Clinical Outcomes in the Brigham and Women's Rheumatoid Arthritis Sequential Study Registry

Jeffrey R Curtis et al. Arthritis Care Res (Hoboken). 2021 Apr.

Abstract

Objective: To use unbiased, data-driven, principal component (PC) and cluster analysis to identify patient phenotypes of rheumatoid arthritis (RA) that might exhibit distinct trajectories of disease progression, response to treatment, and risk for adverse events.

Methods: Patient demographic, socioeconomic, health, and disease characteristics recorded at entry into a large, single-center, prospective observational registry cohort, the Brigham and Women's Rheumatoid Arthritis Sequential Study (BRASS), were harmonized using PC analysis to reduce dimensionality and collinearity. The number of PCs was established by eigenvalue >1, cumulative variance, and interpretability. The resulting PCs were used to cluster patients using a K-means approach. Longitudinal clinical outcomes were compared between the clusters over 2 years.

Results: Analysis of 142 variables from 1,443 patients identified 41 PCs that accounted for 77% of the cumulative variance in the data set. Cluster analysis distinguished 5 patient clusters: 1) less RA disease activity/multimorbidity, shorter RA duration, lower incidence of comorbidities; 2) less RA disease activity/multimorbidity, longer RA duration, more infections, psychiatric comorbidities, health care utilization; 3) moderate RA disease activity/multimorbidity, more neurologic comorbidity; 4) more RA disease activity/multimorbidity, shorter RA duration, more metabolic comorbidity, higher body mass index; 5) more RA disease activity/multimorbidity, longer RA duration, more hepatic, orthopedic comorbidity and RA-related surgeries. The clusters exhibited differences in clinical outcomes over 2 years of follow-up.

Conclusion: Data-driven analysis of the BRASS registry identified 5 distinct phenotypes of RA. These results illustrate the potential of data-driven patient profiling as a tool to support personalized medicine in RA. Validation in an independent data set is ongoing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Heatmap of principal component (PC) scores across the clusters, showing variation relative to overall Brigham and Women’s Rheumatoid Arthritis Sequential Study average. PCs are ranked by variance explained. Positive scores appear in red, negative scores in blue. csDMARD = conventional synthetic disease‐modifying antirheumatic drug; curr = current; GI = gastrointestinal; NSAID = nonsteroidal antiinflammatory drug; RA = rheumatoid arthritis; TNFi = tumor necrosis factor inhibitor.
Figure 2
Figure 2
Clinical outcomes over 2 years. Mean Clinical Disease Activity Index (CDAI) score (A), Multidimensional Health Assessment Questionnaire (MDHAQ) score (B), 5‐item Mental Health Index (MHI‐5) score (C), and tender joint count (D). Observed cases with no imputation for missing data. 95% CI = 95% confidence interval; BMI = body mass index; RA = rheumatoid arthritis. Error bars indicate the 95% CI.
Figure 3
Figure 3
Time to first infection (A), change of conventional synthetic or biologic disease‐modifying antirheumatic drug (DMARD) (B), and change of biologic DMARD (C). BMI = body mass index; RA = rheumatoid arthritis.

Similar articles

Cited by

References

    1. Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham CO III, et al. 2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Arthritis Rheum 2010;62:2569–81. - PubMed
    1. Singh JA, Saag KG, Bridges SL Jr, Akl EA, Bannuru RR, Sullivan MC, et al. 2015 American College of Rheumatology guideline for the treatment of rheumatoid arthritis. Arthritis Rheumatol 2016;68:1–26. - PubMed
    1. Cirillo D, Valencia A. Big data analytics for personalized medicine. Curr Opin Biotechnol 2019;58:161–7. - PubMed
    1. Basile AO, Ritchie MD. Informatics and machine learning to define the phenotype. Expert Rev Mol Diagn 2018;18:219–26. - PMC - PubMed
    1. Eng SW, Duong TT, Rosenberg AM, Morris Q, Yeung RS, on behalf of the REACCH OUT and BBOP Research Consortia . The biologic basis of clinical heterogeneity in juvenile idiopathic arthritis. Arthritis Rheumatol 2014;66:3463–75. - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources