Diagnostic Accuracy of a Mobile AI-Based Symptom Checker and a Web-Based Self-Referral Tool in Rheumatology: Multicenter Randomized Controlled Trial
- PMID: 39042425
- PMCID: PMC11303907
- DOI: 10.2196/55542
Diagnostic Accuracy of a Mobile AI-Based Symptom Checker and a Web-Based Self-Referral Tool in Rheumatology: Multicenter Randomized Controlled Trial
Abstract
Background: The diagnosis of inflammatory rheumatic diseases (IRDs) is often delayed due to unspecific symptoms and a shortage of rheumatologists. Digital diagnostic decision support systems (DDSSs) have the potential to expedite diagnosis and help patients navigate the health care system more efficiently.
Objective: The aim of this study was to assess the diagnostic accuracy of a mobile artificial intelligence (AI)-based symptom checker (Ada) and a web-based self-referral tool (Rheport) regarding IRDs.
Methods: A prospective, multicenter, open-label, crossover randomized controlled trial was conducted with patients newly presenting to 3 rheumatology centers. Participants were randomly assigned to complete a symptom assessment using either Ada or Rheport. The primary outcome was the correct identification of IRDs by the DDSSs, defined as the presence of any IRD in the list of suggested diagnoses by Ada or achieving a prespecified threshold score with Rheport. The gold standard was the diagnosis made by rheumatologists.
Results: A total of 600 patients were included, among whom 214 (35.7%) were diagnosed with an IRD. Most frequent IRD was rheumatoid arthritis with 69 (11.5%) patients. Rheport's disease suggestion and Ada's top 1 (D1) and top 5 (D5) disease suggestions demonstrated overall diagnostic accuracies of 52%, 63%, and 58%, respectively, for IRDs. Rheport showed a sensitivity of 62% and a specificity of 47% for IRDs. Ada's D1 and D5 disease suggestions showed a sensitivity of 52% and 66%, respectively, and a specificity of 68% and 54%, respectively, concerning IRDs. Ada's diagnostic accuracy regarding individual diagnoses was heterogenous, and Ada performed considerably better in identifying rheumatoid arthritis in comparison to other diagnoses (D1: 42%; D5: 64%). The Cohen κ statistic of Rheport for agreement on any rheumatic disease diagnosis with Ada D1 was 0.15 (95% CI 0.08-0.18) and with Ada D5 was 0.08 (95% CI 0.00-0.16), indicating poor agreement for the presence of any rheumatic disease between the 2 DDSSs.
Conclusions: To our knowledge, this is the largest comparative DDSS trial with actual use of DDSSs by patients. The diagnostic accuracies of both DDSSs for IRDs were not promising in this high-prevalence patient population. DDSSs may lead to a misuse of scarce health care resources. Our results underscore the need for stringent regulation and drastic improvements to ensure the safety and efficacy of DDSSs.
Trial registration: German Register of Clinical Trials DRKS00017642; https://drks.de/search/en/trial/DRKS00017642.
Keywords: artificial intelligence; decision; decision support; decision support system; diagnosis; diagnostic; diagnostic decision support system; eHealth; randomized controlled trial; resources; rheumatologists; rheumatology; support; support system; symptom assessment; symptom checker; tool.
©Johannes Knitza, Koray Tascilar, Franziska Fuchs, Jacob Mohn, Sebastian Kuhn, Daniela Bohr, Felix Muehlensiepen, Christina Bergmann, Hannah Labinsky, Harriet Morf, Elizabeth Araujo, Matthias Englbrecht, Wolfgang Vorbrüggen, Cay-Benedict von der Decken, Stefan Kleinert, Andreas Ramming, Jörg H W Distler, Peter Bartz-Bazzanella, Nicolas Vuillerme, Georg Schett, Martin Welcker, Axel Hueber. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 23.07.2024.
Conflict of interest statement
Conflicts of Interest: JK and MW have received research support from Novartis Pharma GmbH. Qinum and RheumaDatenRhePort developed and hold the rights for Rheport. WV, CBvdD, S Kleinert, PB-B, and MW are members of RheumaDatenRhePort. WV, CBvdD, and PB-B were involved in the development of Rheport. JK is a member of the scientific board of RheumaDatenRhePort. S Kuhn is a founder and shareholder of MED.digital. PB-B has received speaker honoraria from AbbVie, Boehringer Ingelheim, Chugai/Roche, Janssen-Cilag, Novartis, and Pfizer.
Figures



Similar articles
-
Accuracy, patient-perceived usability, and acceptance of two symptom checkers (Ada and Rheport) in rheumatology: interim results from a randomized controlled crossover trial.Arthritis Res Ther. 2021 Apr 13;23(1):112. doi: 10.1186/s13075-021-02498-8. Arthritis Res Ther. 2021. PMID: 33849654 Free PMC article. Clinical Trial.
-
Patient's Perception of Digital Symptom Assessment Technologies in Rheumatology: Results From a Multicentre Study.Front Public Health. 2022 Feb 22;10:844669. doi: 10.3389/fpubh.2022.844669. eCollection 2022. Front Public Health. 2022. PMID: 35273944 Free PMC article. Clinical Trial.
-
Comparison of physician and artificial intelligence-based symptom checker diagnostic accuracy.Rheumatol Int. 2022 Dec;42(12):2167-2176. doi: 10.1007/s00296-022-05202-4. Epub 2022 Sep 10. Rheumatol Int. 2022. PMID: 36087130 Free PMC article.
-
A systematic literature review of AI-based digital decision support systems for post-traumatic stress disorder.Front Psychiatry. 2022 Aug 9;13:923613. doi: 10.3389/fpsyt.2022.923613. eCollection 2022. Front Psychiatry. 2022. PMID: 36016975 Free PMC article.
-
[Improved early diagnostics of rheumatic diseases : Monocentric experiences with an open rheumatological specialist consultation].Z Rheumatol. 2018 Nov;77(9):844-849. doi: 10.1007/s00393-018-0540-4. Z Rheumatol. 2018. PMID: 30255411 Review. German.
Cited by
-
RhePort 1.3 enhances early identification of inflammatory rheumatic diseases: a prospective study in German rheumatology settings.Rheumatol Int. 2025 Apr 28;45(5):129. doi: 10.1007/s00296-025-05861-z. Rheumatol Int. 2025. PMID: 40293492 Free PMC article. Clinical Trial.
-
Rheumatology in the digital health era: status quo and quo vadis?Nat Rev Rheumatol. 2024 Dec;20(12):747-759. doi: 10.1038/s41584-024-01177-7. Epub 2024 Oct 31. Nat Rev Rheumatol. 2024. PMID: 39482466 Review.
-
Impact of a Symptom Checker App on Patient-Physician Interaction Among Self-Referred Walk-In Patients in the Emergency Department: Multicenter, Parallel-Group, Randomized, Controlled Trial.J Med Internet Res. 2025 Apr 2;27:e64028. doi: 10.2196/64028. J Med Internet Res. 2025. PMID: 40173434 Free PMC article. Clinical Trial.
-
Current application, possibilities, and challenges of artificial intelligence in the management of rheumatoid arthritis, axial spondyloarthritis, and psoriatic arthritis.Ther Adv Musculoskelet Dis. 2025 Jun 21;17:1759720X251343579. doi: 10.1177/1759720X251343579. eCollection 2025. Ther Adv Musculoskelet Dis. 2025. PMID: 40547599 Free PMC article. Review.
-
Emerging Artificial Intelligence Innovations in Rheumatoid Arthritis and Challenges to Clinical Adoption.Curr Rheumatol Rep. 2025 Jun 28;27(1):28. doi: 10.1007/s11926-025-01193-w. Curr Rheumatol Rep. 2025. PMID: 40580343 Review.
References
-
- Simons G, Belcher J, Morton C, Kumar K, Falahee M, Mallen CD, Stack RJ, Raza K. Symptom recognition and perceived urgency of help-seeking for rheumatoid arthritis and other diseases in the general public: a mixed method approach. Arthritis Care Res (Hoboken) 2017 May;69(5):633–641. doi: 10.1002/acr.22979. https://europepmc.org/abstract/MED/27389847 - DOI - PMC - PubMed
-
- Sørensen J, Hetland ML, All Departments of Rheumatology in Denmark Diagnostic delay in patients with rheumatoid arthritis, psoriatic arthritis and ankylosing spondylitis: results from the Danish nationwide DANBIO registry. Ann Rheum Dis. 2015 Mar;74(3):e12. doi: 10.1136/annrheumdis-2013-204867. http://ard.bmj.com/lookup/pmidlookup?view=long&pmid=24534758 annrheumdis-2013-204867 - DOI - PMC - PubMed
-
- Burgers LE, Raza K, van der Helm-van Mil AH. Window of opportunity in rheumatoid arthritis - definitions and supporting evidence: from old to new perspectives. RMD Open. 2019;5(1):e000870. doi: 10.1136/rmdopen-2018-000870. https://rmdopen.bmj.com/lookup/pmidlookup?view=long&pmid=31168406 rmdopen-2018-000870 - DOI - PMC - PubMed
-
- Ceney A, Tolond S, Glowinski A, Marks B, Swift S, Palser T. Accuracy of online symptom checkers and the potential impact on service utilisation. PLoS One. 2021 Jul 15;16(7):e0254088. doi: 10.1371/journal.pone.0254088. https://dx.plos.org/10.1371/journal.pone.0254088 PONE-D-20-21627 - DOI - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials