Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep:177:2110792.
doi: 10.1016/j.clinph.2025.2110792. Epub 2025 Jun 15.

Agent-guided AI-powered interpretation and reporting of nerve conduction studies and EMG (INSPIRE)

Affiliations

Agent-guided AI-powered interpretation and reporting of nerve conduction studies and EMG (INSPIRE)

Alon Gorenshtein et al. Clin Neurophysiol. 2025 Sep.

Abstract

Objective: We aimed to create a tool for electrophysiologist enhancing and standardizing interpretation of neuromuscular electrodiagnostic tests (EDX) using state of the art generative AI technology.

Methods: We developed three model frameworks for interpreting and reporting EDX: (1) Base-LLM (large language model), employing one-shot inference; (2) INSPIRE (Agent-Guided AI-Powered Interpretation and Reporting of Nerve Conduction Studies and EMG), a multi-agent AI framework; and (3) INSPIRE-Lite, a cost-efficient version of INSPIRE. INSPIRE uses three agents integrating tools to read reference tables and long-context clinical neuromuscular textbook. Performance was evaluated using the AI-Generated EMG Report Score (AIGERS), a scoring system we developed.

Results: INSPIRE achieved an accuracy of 92.2 % for detecting normal versus abnormal tests, significantly outperforming the Base-LLM model, which achieved 62.6 % (p < 0.001). INSPIRE demonstrated significantly higher AIGERS scores overall and across the domains of finding, clinical diagnosis, and semantic concordance (p < 0.001). INSPIRE-Lite scored lower than INSPIRE in finding and clinical diagnosis (p = 0.001 and p = 0.004).

Conclusion: Our model integrates variables like patient medical history, current complaints, and EDX findings to manage and interpret EMG. Demonstrating superior performance while addressing hallucinations, data overload, and aiding prioritization and standardization.

Significance: This model enables comprehensive analysis by integrating diverse clinical variables, enhancing diagnostic accuracy and efficiency of EDX reports.

Keywords: Artificial intelligence; Electrodiagnostic study; Electromyogram; Large language models; Multi AI agents; Nerve conduction study; Neurology; Neuromuscular.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

LinkOut - more resources