Inductive reasoning with large language models: A simulated randomized controlled trial for epilepsy
- PMID: 40020525
- PMCID: PMC11908886
- DOI: 10.1016/j.eplepsyres.2025.107532
Inductive reasoning with large language models: A simulated randomized controlled trial for epilepsy
Abstract
Introduction: To investigate the potential of using artificial intelligence (AI), specifically large language models (LLMs), for synthesizing information in a simulated randomized clinical trial (RCT) for an anti-seizure medication, cenobamate, demonstrating the feasibility of inductive reasoning via medical chart review.
Methods: An LLM-generated simulated RCT was conducted, featuring a placebo arm and a full-strength drug arm with a cohort of 240 patients divided 1:1. Seizure counts were simulated using a realistic seizure diary simulator. The study utilized LLMs to generate clinical notes with four neurologist writing styles and random extraneous details. A secondary LLM pipeline synthesized data from these notes. The efficacy and safety of cenobamate in seizure control were evaluated by both an LLM-based pipeline and a human reader.
Results: The AI analysis closely mirrored human analysis, demonstrating the drug's efficacy with marginal differences (<3 %) in identifying both drug efficacy and reported symptoms. The AI successfully identified the number of seizures, symptom reports, and treatment efficacy, with statistical analysis comparing the 50 %-responder rate and median percentage change between the placebo and drug arms, as well as side effect rates in each arm.
Discussion: This study highlights the potential of AI to accurately analyze noisy clinical notes to inductively produce clinical knowledge. Here, treatment effect sizes and symptom frequencies derived from unstructured simulated notes were inferred despite many distractors. The findings emphasize the relevance of AI in future clinical research, offering a scalable and efficient alternative to traditional labor-intensive data mining.
Keywords: Artificial intelligence; Epilepsy; Large language models; Randomized clinical trials.
Copyright © 2025 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest None of the authors have any conflicts of interest to declare.
Update of
-
Inductive reasoning with large language models: a simulated randomized controlled trial for epilepsy.medRxiv [Preprint]. 2024 Mar 19:2024.03.18.24304493. doi: 10.1101/2024.03.18.24304493. medRxiv. 2024. Update in: Epilepsy Res. 2025 Mar;211:107532. doi: 10.1016/j.eplepsyres.2025.107532. PMID: 38562831 Free PMC article. Updated. Preprint.
Similar articles
-
Inductive reasoning with large language models: a simulated randomized controlled trial for epilepsy.medRxiv [Preprint]. 2024 Mar 19:2024.03.18.24304493. doi: 10.1101/2024.03.18.24304493. medRxiv. 2024. Update in: Epilepsy Res. 2025 Mar;211:107532. doi: 10.1016/j.eplepsyres.2025.107532. PMID: 38562831 Free PMC article. Updated. Preprint.
-
Safety and efficacy of adjunctive cenobamate (YKP3089) in patients with uncontrolled focal seizures: a multicentre, double-blind, randomised, placebo-controlled, dose-response trial.Lancet Neurol. 2020 Jan;19(1):38-48. doi: 10.1016/S1474-4422(19)30399-0. Epub 2019 Nov 14. Lancet Neurol. 2020. PMID: 31734103 Clinical Trial.
-
Efficacy of adjunctive cenobamate based on number of concomitant antiseizure medications, seizure frequency, and epilepsy duration at baseline: A post-hoc analysis of a randomized clinical study.Epilepsy Res. 2021 May;172:106592. doi: 10.1016/j.eplepsyres.2021.106592. Epub 2021 Feb 18. Epilepsy Res. 2021. PMID: 33662894 Clinical Trial.
-
Adjunctive Cenobamate for Focal-Onset Seizures in Adults: A Systematic Review and Meta-Analysis.CNS Drugs. 2020 Nov;34(11):1105-1120. doi: 10.1007/s40263-020-00759-9. CNS Drugs. 2020. PMID: 32851590 Free PMC article.
-
Clonazepam monotherapy for treating people with newly diagnosed epilepsy.Cochrane Database Syst Rev. 2019 Nov 19;2019(11):CD013028. doi: 10.1002/14651858.CD013028.pub2. Cochrane Database Syst Rev. 2019. Update in: Cochrane Database Syst Rev. 2022 Feb 21;2:CD013028. doi: 10.1002/14651858.CD013028.pub3. PMID: 31742671 Free PMC article. Updated.
References
-
- Agrawal M, Hegselmann S, Lang H, Kim Y, Sontag D, 2022. Large Language Models are Few-Shot Clinical Information Extractors.
-
- Benner P, 2004. Using the dreyfus model of skill acquisition to describe and interpret skill acquisition and clinical judgment in nursing practice and education. Bull Sci Technol Soc 24, 188–199. 10.1177/0270467604265061 - DOI
-
- Bubeck S, Chandrasekaran V, Eldan R, Gehrke J, Horvitz E, Kamar E, Lee P, Lee YT, Li Y, Lundberg S, Nori H, Palangi H, Ribeiro MT, Zhang Y, 2023. Sparks of Artificial General Intelligence: Early experiments with GPT-4.
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical