[The spring of artificial intelligence: AI vs. expert for internal medicine cases]

[Article in French]

A Albaladejo¹, A Lorleac'h², J-S Allain³

Affiliations

¹ Médecine interne et immunologie clinique, CHU de Rennes, 2, rue Henri-le-Guilloux, 35000 Rennes, France. Electronic address: adrien.albaladejo@chu-rennes.fr.
² Groupement hospitalier Bretagne Sud, 5, avenue Choiseul, 56100 Lorient, France. Electronic address: a.lorleach@ghbs.bzh.
³ Groupement hospitalier Bretagne Sud, 5, avenue Choiseul, 56100 Lorient, France. Electronic address: js.allain@ghbs.bzh.

PMID: 38331591
DOI: 10.1016/j.revmed.2024.01.012

Free article

Comparative Study

[The spring of artificial intelligence: AI vs. expert for internal medicine cases]

[Article in French]

A Albaladejo et al. Rev Med Interne. 2024 Jul.

Free article

. 2024 Jul;45(7):409-414.

doi: 10.1016/j.revmed.2024.01.012. Epub 2024 Feb 7.

Authors

A Albaladejo¹, A Lorleac'h², J-S Allain³

Affiliations

¹ Médecine interne et immunologie clinique, CHU de Rennes, 2, rue Henri-le-Guilloux, 35000 Rennes, France. Electronic address: adrien.albaladejo@chu-rennes.fr.
² Groupement hospitalier Bretagne Sud, 5, avenue Choiseul, 56100 Lorient, France. Electronic address: a.lorleach@ghbs.bzh.
³ Groupement hospitalier Bretagne Sud, 5, avenue Choiseul, 56100 Lorient, France. Electronic address: js.allain@ghbs.bzh.

PMID: 38331591
DOI: 10.1016/j.revmed.2024.01.012

Abstract

Introduction: The "Printemps de la Médecine Interne" are training days for Francophone internists. The clinical cases presented during these days are complex. This study aims to evaluate the diagnostic capabilities of non-specialized artificial intelligence (language models) ChatGPT-4 and Bard by confronting them with the puzzles of the "Printemps de la Médecine Interne".

Method: Clinical cases from the "Printemps de la Médecine Interne" 2021 and 2022 were submitted to two language models: ChatGPT-4 and Bard. In case of a wrong answer, a second attempt was offered. We then compared the responses of human internist experts to those of artificial intelligence.

Results: Of the 12 clinical cases submitted, human internist experts diagnosed nine, ChatGPT-4 diagnosed three, and Bard diagnosed one. One of the cases solved by ChatGPT-4 was not solved by the internist expert. The artificial intelligence had a response time of a few seconds.

Conclusions: Currently, the diagnostic skills of ChatGPT-4 and Bard are inferior to those of human experts in solving complex clinical cases but are very promising. Recently made available to the general public, they already have impressive capabilities, questioning the role of the diagnostic physician. It would be advisable to adapt the rules or subjects of future "Printemps de la Médecine Interne" so that they are not solved by a public language model.

Keywords: Artificial intelligence; Bard; Case report; ChatGPT; Diagnostic; Intelligence artificielle.

PubMed Disclaimer

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

[The spring of artificial intelligence: AI vs. expert for internal medicine cases]

Affiliations

[The spring of artificial intelligence: AI vs. expert for internal medicine cases]

Authors

Affiliations

Abstract

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources