Neurological history both twinned and queried by generative artificial intelligence
- PMID: 39895821
- PMCID: PMC11782252
- DOI: 10.3389/fmed.2024.1496866
Neurological history both twinned and queried by generative artificial intelligence
Erratum in
-
Corrigendum: Neurological history both twinned and queried by generative artificial intelligence.Front Med (Lausanne). 2025 May 27;12:1619686. doi: 10.3389/fmed.2025.1619686. eCollection 2025. Front Med (Lausanne). 2025. PMID: 40495960 Free PMC article.
Abstract
Background and objectives: We propose the use of GPT-4 to facilitate initial history-taking in neurology and other medical specialties. A large language model (LLM) could be utilized as a digital twin which could enhance queryable electronic medical record (EMR) systems and provide healthcare conversational agents (HCAs) to replace waiting-room questionnaires.
Methods: In this observational pilot study, we presented verbatim history of present illness (HPI) narratives from published case reports of headache, stroke, and neurodegenerative diseases. Three standard GPT-4 models were designated Models P: patient digital twin; N: neurologist to query Model P; and S: supervisor to synthesize the N-P dialogue into a derived HPI and formulate the differential diagnosis. Given the random variability of GPT-4 output, each case was presented five separate times to check consistency and reliability.
Results: The study achieved an overall HPI content retrieval accuracy of 81%, with accuracies of 84% for headache, 82% for stroke, and 77% for neurodegenerative diseases. Retrieval accuracies for individual HPI components were as follows: 93% for chief complaints, 47% for associated symptoms and review of systems, 76% for relevant symptom details, and 94% for histories of past medical, surgical, allergies, social, and family factors. The ranking of case diagnoses in the differential diagnosis list averaged in the 89th percentile.
Discussion: Our tripartite LLM model demonstrated accuracy in extracting essential information from published case reports. Further validation with EMR HPIs, and then with direct patient care will be needed to move toward adaptation of enhanced diagnostic digital twins that incorporate real-time data from health-monitoring devices and self-monitoring assessments.
Keywords: headache; history taking; large language model (LLM); neurodegenerative disease; neurology–clinical; stroke.
Copyright © 2025 Lee, Choi, Angulo, McDougal and Lytton.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
References
-
- Albrink K, Joos C, Schröder D, Müller F, Hummers E, Noack EM. Obtaining patients’ medical history using a digital device prior to consultation in primary care: study protocol for a usability and validity study. BMC Med Inform Decis Mak. (2022) 22:189. doi: 10.1186/s12911-022-01928-0, PMID: - DOI - PMC - PubMed
-
- Shucard H, Muller E, Johnson J, Walker J, Elmore JG, Payne TH, et al. Clinical use of an electronic pre-visit questionnaire soliciting patient visit goals and interim history: a retrospective comparison between safety-net and non-safety-net clinics. Health Serv Res Manag Epidemiol. (2022) 9:23333928221080336. doi: 10.1177/23333928221080336, PMID: - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
