Comparative Accuracy of Diagnosis by Collective Intelligence of Multiple Physicians vs Individual Physicians
- PMID: 30821822
- PMCID: PMC6484633
- DOI: 10.1001/jamanetworkopen.2019.0096
Comparative Accuracy of Diagnosis by Collective Intelligence of Multiple Physicians vs Individual Physicians
Abstract
Importance: The traditional approach of diagnosis by individual physicians has a high rate of misdiagnosis. Pooling multiple physicians' diagnoses (collective intelligence) is a promising approach to reducing misdiagnoses, but its accuracy in clinical cases is unknown to date.
Objective: To assess how the diagnostic accuracy of groups of physicians and trainees compares with the diagnostic accuracy of individual physicians.
Design, setting, and participants: Cross-sectional study using data from the Human Diagnosis Project (Human Dx), a multicountry data set of ranked differential diagnoses by individual physicians, graduate trainees, and medical students (users) solving user-submitted, structured clinical cases. From May 7, 2014, to October 5, 2016, groups of 2 to 9 randomly selected physicians solved individual cases. Data analysis was performed from March 16, 2017, to July 30, 2018.
Main outcomes and measures: The primary outcome was diagnostic accuracy, assessed as a correct diagnosis in the top 3 ranked diagnoses for an individual; for groups, the top 3 diagnoses were a collective differential generated using a weighted combination of user diagnoses with a variety of approaches. A version of the McNemar test was used to account for clustering across repeated solvers to compare diagnostic accuracy.
Results: Of the 2069 users solving 1572 cases from the Human Dx data set, 1228 (59.4%) were residents or fellows, 431 (20.8%) were attending physicians, and 410 (19.8%) were medical students. Collective intelligence was associated with increasing diagnostic accuracy, from 62.5% (95% CI, 60.1%-64.9%) for individual physicians up to 85.6% (95% CI, 83.9%-87.4%) for groups of 9 (23.0% difference; 95% CI, 14.9%-31.2%; P < .001). The range of improvement varied by the specifications used for combining groups' diagnoses, but groups consistently outperformed individuals regardless of approach. Absolute improvement in accuracy from individuals to groups of 9 varied by presenting symptom from an increase of 17.3% (95% CI, 6.4%-28.2%; P = .002) for abdominal pain to 29.8% (95% CI, 3.7%-55.8%; P = .02) for fever. Groups from 2 users (77.7% accuracy; 95% CI, 70.1%-84.6%) to 9 users (85.5% accuracy; 95% CI, 75.1%-95.9%) outperformed individual specialists in their subspecialty (66.3% accuracy; 95% CI, 59.1%-73.5%; P < .001 vs groups of 2 and 9).
Conclusions and relevance: A collective intelligence approach was associated with higher diagnostic accuracy compared with individuals, including individual specialists whose expertise matched the case diagnosis, across a range of medical cases. Given the few proven strategies to address misdiagnosis, this technique merits further study in clinical settings.
Conflict of interest statement
Figures




Comment in
-
Collective Intelligence for Clinical Diagnosis-Are 2 (or 3) Heads Better Than 1?JAMA Netw Open. 2019 Mar 1;2(3):e191071. doi: 10.1001/jamanetworkopen.2019.1071. JAMA Netw Open. 2019. PMID: 30821821 No abstract available.
Similar articles
-
Assessment of a Simulated Case-Based Measurement of Physician Diagnostic Performance.JAMA Netw Open. 2019 Jan 4;2(1):e187006. doi: 10.1001/jamanetworkopen.2018.7006. JAMA Netw Open. 2019. PMID: 30646211 Free PMC article.
-
The Potential of Collective Intelligence in Emergency Medicine: Pooling Medical Students' Independent Decisions Improves Diagnostic Performance.Med Decis Making. 2017 Aug;37(6):715-724. doi: 10.1177/0272989X17696998. Epub 2017 Mar 29. Med Decis Making. 2017. PMID: 28355975
-
Efficacy of Artificial-Intelligence-Driven Differential-Diagnosis List on the Diagnostic Accuracy of Physicians: An Open-Label Randomized Controlled Study.Int J Environ Res Public Health. 2021 Feb 21;18(4):2086. doi: 10.3390/ijerph18042086. Int J Environ Res Public Health. 2021. PMID: 33669930 Free PMC article. Clinical Trial.
-
The cost-effectiveness of magnetic resonance imaging for investigation of the knee joint.Health Technol Assess. 2001;5(27):1-95. doi: 10.3310/hta5270. Health Technol Assess. 2001. PMID: 11532240 Review.
-
[Do artificial intelligence systems reason in the same way as clinicians when making diagnoses?].Rev Med Interne. 2020 Mar;41(3):192-195. doi: 10.1016/j.revmed.2019.12.014. Epub 2020 Jan 25. Rev Med Interne. 2020. PMID: 31987671 Review. French.
Cited by
-
Development and Assessment of an Artificial Intelligence-Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners in Teledermatology Practices.JAMA Netw Open. 2021 Apr 1;4(4):e217249. doi: 10.1001/jamanetworkopen.2021.7249. JAMA Netw Open. 2021. PMID: 33909055 Free PMC article.
-
Organization of Pediatric Echocardiography Laboratories: Impact of Sonographers on Clinical, Academic, and Financial Performance.Front Pediatr. 2022 May 30;10:891360. doi: 10.3389/fped.2022.891360. eCollection 2022. Front Pediatr. 2022. PMID: 35712633 Free PMC article. Review.
-
Assigning a different endoscopist for each annual follow-up may contribute to improved gastric cancer detection rates.Endosc Int Open. 2022 Oct 17;10(10):E1333-E1342. doi: 10.1055/a-1922-6429. eCollection 2022 Oct. Endosc Int Open. 2022. PMID: 36262509 Free PMC article.
-
Impact of digitally acquired peer diagnostic input on diagnostic confidence in outpatient cases: A pragmatic randomized trial.J Am Med Inform Assoc. 2021 Mar 1;28(3):632-637. doi: 10.1093/jamia/ocaa278. J Am Med Inform Assoc. 2021. PMID: 33260212 Free PMC article. Clinical Trial.
-
Comparison of Heart Team vs Interventional Cardiologist Recommendations for the Treatment of Patients With Multivessel Coronary Artery Disease.JAMA Netw Open. 2020 Aug 3;3(8):e2012749. doi: 10.1001/jamanetworkopen.2020.12749. JAMA Netw Open. 2020. PMID: 32777060 Free PMC article.
References
-
- Shojania KG, Burton EC, McDonald KM, Goldman L Autopsy as an outcome and performance measure: summary. Agency for Healthcare Research and Quality. https://www.ncbi.nlm.nih.gov/books/NBK11951/ Published 2002. Accessed January 28, 2019.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources