Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons?

Affiliations

¹ Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA. Electronic address: kumar.rohitp098@gmail.com.
² Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA.
³ Department of Neurology, HMH-Jersey Shore University Medical Center, Neptune, New Jersey, USA.
⁴ Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA; Department of Neurosurgery, HMH-Jersey Shore University Medical Center, Neptune, New Jersey, USA.

PMID: 38759788
DOI: 10.1016/j.wneu.2024.05.052

Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons?

Rohit Prem Kumar et al. World Neurosurg. 2024 Jul.

. 2024 Jul:187:e1083-e1088.

doi: 10.1016/j.wneu.2024.05.052. Epub 2024 May 16.

Affiliations

¹ Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA. Electronic address: kumar.rohitp098@gmail.com.
² Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA.
³ Department of Neurology, HMH-Jersey Shore University Medical Center, Neptune, New Jersey, USA.
⁴ Department of Neurosurgery, Hackensack Meridian School of Medicine, Nutley, New Jersey, USA; Department of Neurosurgery, HMH-Jersey Shore University Medical Center, Neptune, New Jersey, USA.

PMID: 38759788
DOI: 10.1016/j.wneu.2024.05.052

Abstract

Background/objective: Neurosurgery emphasizes the criticality of accurate differential diagnoses, with diagnostic delays posing significant health and economic challenges. As large language models (LLMs) emerge as transformative tools in healthcare, this study seeks to elucidate their role in assisting neurosurgeons with the differential diagnosis process, especially during preliminary consultations.

Methods: This study employed 3 chat-based LLMs, ChatGPT (versions 3.5 and 4.0), Perplexity AI, and Bard AI, to evaluate their diagnostic accuracy. Each LLM was prompted using clinical vignettes, and their responses were recorded to generate differential diagnoses for 20 common and uncommon neurosurgical disorders. Disease-specific prompts were crafted using Dynamed, a clinical reference tool. The accuracy of the LLMs was determined based on their ability to identify the target disease within their top differential diagnoses correctly.

Results: For the initial differential, ChatGPT 3.5 achieved an accuracy of 52.63%, while ChatGPT 4.0 performed slightly better at 53.68%. Perplexity AI and Bard AI demonstrated 40.00% and 29.47% accuracy, respectively. As the number of considered differentials increased from 2 to 5, ChatGPT 3.5 reached its peak accuracy of 77.89% for the top 5 differentials. Bard AI and Perplexity AI had varied performances, with Bard AI improving in the top 5 differentials at 62.11%. On a disease-specific note, the LLMs excelled in diagnosing conditions like epilepsy and cervical spine stenosis but faced challenges with more complex diseases such as Moyamoya disease and amyotrophic lateral sclerosis.

Conclusions: LLMs showcase the potential to enhance diagnostic accuracy and decrease the incidence of missed diagnoses in neurosurgery.

Keywords: Artificial intelligence; Differential diagnoses; Large language models; Missed diagnoses; Neurosurgery.

PubMed Disclaimer

References

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- ClinicalKey
- Elsevier Science
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons?

Affiliations

Can Artificial Intelligence Mitigate Missed Diagnoses by Generating Differential Diagnoses for Neurosurgeons?

Authors

Affiliations

Abstract

References

MeSH terms

LinkOut - more resources

Full Text Sources

Research Materials