Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 27;26(1):153.
doi: 10.1186/s13075-024-03376-9.

Machine learning prediction and explanatory models of serious infections in patients with rheumatoid arthritis treated with tofacitinib

Affiliations

Machine learning prediction and explanatory models of serious infections in patients with rheumatoid arthritis treated with tofacitinib

Merete Lund Hetland et al. Arthritis Res Ther. .

Abstract

Background: Patients with rheumatoid arthritis (RA) have an increased risk of developing serious infections (SIs) vs. individuals without RA; efforts to predict SIs in this patient group are ongoing. We assessed the ability of different machine learning modeling approaches to predict SIs using baseline data from the tofacitinib RA clinical trials program.

Methods: This analysis included data from 19 clinical trials (phase 2, n = 10; phase 3, n = 6; phase 3b/4, n = 3). Patients with RA receiving tofacitinib 5 or 10 mg twice daily (BID) were included in the analysis; patients receiving tofacitinib 11 mg once daily were considered as tofacitinib 5 mg BID. All available patient-level baseline variables were extracted. Statistical and machine learning methods (logistic regression, support vector machines with linear kernel, random forest, extreme gradient boosting trees, and boosted trees) were implemented to assess the association of baseline variables with SI (logistic regression only), and to predict SI using selected baseline variables using 5-fold cross-validation. Missing values were handled individually per prediction model.

Results: A total of 8404 patients with RA treated with tofacitinib were eligible for inclusion (15,310 patient-years of total follow-up) of which 473 patients reported SIs. Amongst other baseline factors, age, previous infection, and corticosteroid use were significantly associated with SI. When applying prediction modeling for SI across data from all studies, the area under the receiver operating characteristic (AUROC) curve ranged from 0.656 to 0.739. AUROC values ranged from 0.599 to 0.730 in data from phase 3 and 3b/4 studies, and from 0.563 to 0.643 in data from ORAL Surveillance only.

Conclusions: Baseline factors associated with SIs in the tofacitinib RA clinical trial program were similar to established SI risk factors associated with advanced treatments for RA. Furthermore, while model performance in predicting SI was similar to other published models, this did not meet the threshold for accurate prediction (AUROC > 0.85). Thus, predicting the occurrence of SIs at baseline remains challenging and may be complicated by the changing disease course of RA over time. Inclusion of other patient-associated and healthcare delivery-related factors and harmonization of the duration of studies included in the models may be required to improve prediction.

Trial registration: ClinicalTrials.gov: NCT00147498; NCT00413660; NCT00550446; NCT00603512; NCT00687193; NCT01164579; NCT00976599; NCT01059864; NCT01359150; NCT02147587; NCT00960440; NCT00847613; NCT00814307; NCT00856544; NCT00853385; NCT01039688; NCT02187055; NCT02831855; NCT02092467.

Keywords: Extreme gradient boosted trees; Infectious diseases; Janus kinase inhibitor; Machine learning; Prediction models; Random forest; Rheumatic diseases; Risk stratification; Support vector machines with linear kernel; Treatment safety.

PubMed Disclaimer

Conflict of interest statement

MLH has received grants from AbbVie, Biogen, Bristol Myers Squibb, Celltrion, Eli Lilly, Janssen Biologics B.V., Lundbeck Foundation, MSD, Pfizer Inc, Roche, Samsung Bioepis, Sandoz, and Novartis; honoraria from Medac, Pfizer Inc, and Sandoz; is a member of the advisory board for AbbVie; is a co-chair of EuroSpA; and has chaired for DANBIO DRQ. AS is a principal investigator of RABBIT, which is jointly sponsored by a consortium of pharmaceutical manufacturers as follows: AbbVie, Amgen, Bristol Myers Squibb, Celltrion, Eli Lilly, Fresenius-Kabi, Galapagos, Hexal, MSD, Pfizer Inc, Samsung Bioepis, Sanofi-Aventis, UCB, and Viatris; and has received lecture fees from AbbVie, Bristol Myers Squibb, Celltrion, Eli Lilly, MSD, Pfizer Inc, Roche, Sanofi-Aventis, and UCB. GB is a stockholder of Engineering Ingegneria Informatica, which acquired and merged with Fair Dynamics Consulting; and a paid consultant contracted by Health Services Consulting Corporation in connection with this study. Health Services Consulting Corporation was a paid contractor to Pfizer Inc in connection with the formal data analysis. DS and JJD are employees and stockholders of Pfizer Inc. RAE is the owner of Health Services Consulting Corporation and was a paid consultant to Pfizer Inc in connection with data generation.

Figures

Fig. 1
Fig. 1
Baseline variables associated with serious infection (stepwise multivariate logistic regressiona) a Stepwise multivariate logistic regression was performed in all patients (N = 8404); ORs shown are those for patients with serious infection (n = 473). For continuous variables, ORs > 1 indicate a higher risk of serious infection with higher values. ALT, alanine aminotransferase; CDAI, Clinical Disease Activity Index; CGA, Clinician Global Assessment; CI, confidence interval; COPD, chronic obstructive pulmonary disease; CRP, C-reactive protein; csDMARD, conventional synthetic disease-modifying antirheumatic drug; DAS28-4(CRP), Disease Activity Score in 28 joints, C-reactive protein-4; OR, odds ratio; SDAI, Simplified Disease Activity Index; VAS, visual analog scale

References

    1. Schooling CM, Jones HE. Clarifying questions about risk factors: predictors versus explanation. Emerg Themes Epidemiol. 2018;15:10. 10.1186/s12982-018-0080-z - DOI - PMC - PubMed
    1. Sainani KL. Explanatory versus predictive modeling. PM R. 2014;6:841–4. 10.1016/j.pmrj.2014.08.941 - DOI - PubMed
    1. Varga TV, Niss K, Estampador AC, Collin CB, Moseley PL. Association is not prediction: a landscape of confused reporting in diabetes - a systematic review. Diabetes Res Clin Pract. 2020;170:108497. 10.1016/j.diabres.2020.108497 - DOI - PubMed
    1. Tian Z, McLaughlin J, Verma A, Chinoy H, Heald AH. The relationship between rheumatoid arthritis and diabetes mellitus: a systematic review and meta-analysis. Cardiovasc Endocrinol Metab. 2021;10:125–31. 10.1097/XCE.0000000000000244 - DOI - PMC - PubMed
    1. Smolen JS, Aletaha D, Barton A, Burmester GR, Emery P, Firestein GS, et al. Rheumatoid arthritis. Nat Rev Dis Primers. 2018;4:18001. 10.1038/nrdp.2018.1 - DOI - PubMed

MeSH terms

Associated data