Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar:211:107532.
doi: 10.1016/j.eplepsyres.2025.107532. Epub 2025 Feb 24.

Inductive reasoning with large language models: A simulated randomized controlled trial for epilepsy

Affiliations

Inductive reasoning with large language models: A simulated randomized controlled trial for epilepsy

Daniel M Goldenholz et al. Epilepsy Res. 2025 Mar.

Abstract

Introduction: To investigate the potential of using artificial intelligence (AI), specifically large language models (LLMs), for synthesizing information in a simulated randomized clinical trial (RCT) for an anti-seizure medication, cenobamate, demonstrating the feasibility of inductive reasoning via medical chart review.

Methods: An LLM-generated simulated RCT was conducted, featuring a placebo arm and a full-strength drug arm with a cohort of 240 patients divided 1:1. Seizure counts were simulated using a realistic seizure diary simulator. The study utilized LLMs to generate clinical notes with four neurologist writing styles and random extraneous details. A secondary LLM pipeline synthesized data from these notes. The efficacy and safety of cenobamate in seizure control were evaluated by both an LLM-based pipeline and a human reader.

Results: The AI analysis closely mirrored human analysis, demonstrating the drug's efficacy with marginal differences (<3 %) in identifying both drug efficacy and reported symptoms. The AI successfully identified the number of seizures, symptom reports, and treatment efficacy, with statistical analysis comparing the 50 %-responder rate and median percentage change between the placebo and drug arms, as well as side effect rates in each arm.

Discussion: This study highlights the potential of AI to accurately analyze noisy clinical notes to inductively produce clinical knowledge. Here, treatment effect sizes and symptom frequencies derived from unstructured simulated notes were inferred despite many distractors. The findings emphasize the relevance of AI in future clinical research, offering a scalable and efficient alternative to traditional labor-intensive data mining.

Keywords: Artificial intelligence; Epilepsy; Large language models; Randomized clinical trials.

PubMed Disclaimer

Conflict of interest statement

Declaration of Competing Interest None of the authors have any conflicts of interest to declare.

Update of

Similar articles

References

    1. Agrawal M, Hegselmann S, Lang H, Kim Y, Sontag D, 2022. Large Language Models are Few-Shot Clinical Information Extractors.
    1. Baud MO, Kleen JK, Mirro EA, Andrechak JC, King-Stephens D, Chang EF, Rao VR, 2018. Multi-day rhythms modulate seizure risk in epilepsy. Nat Commun 9, 1–10. 10.1038/s41467-017-02577-y - DOI - PMC - PubMed
    1. Benner P, 2004. Using the dreyfus model of skill acquisition to describe and interpret skill acquisition and clinical judgment in nursing practice and education. Bull Sci Technol Soc 24, 188–199. 10.1177/0270467604265061 - DOI
    1. Bubeck S, Chandrasekaran V, Eldan R, Gehrke J, Horvitz E, Kamar E, Lee P, Lee YT, Li Y, Lundberg S, Nori H, Palangi H, Ribeiro MT, Zhang Y, 2023. Sparks of Artificial General Intelligence: Early experiments with GPT-4.
    1. Chiang S, Haut SR, Ferastraoaru V, Rao VR, Baud MO, Theodore WH, Moss R, Goldenholz DM, 2020. Individualizing the definition of seizure clusters based on temporal clustering analysis. Epilepsy Res 163. 10.1016/j.eplepsyres.2020.106330 - DOI - PubMed

Substances