A public benchmark for human performance in the detection of focal cortical dysplasia
- PMID: 40167314
- PMCID: PMC12163524
- DOI: 10.1002/epi4.70028
Abstract
Objective: This study aims to report human performance in the detection of Focal Cortical Dysplasias (FCDs) using an openly available dataset. Additionally, it defines a subset of this data as a "difficult" test set to establish a public baseline benchmark against which new methods for automated FCD detection can be evaluated.
Methods: The performance of 28 human readers with varying levels of expertise in detecting FCDs was originally analyzed using 146 subjects, not all of whom are openly available; we analyzed the openly available subset of 85 cases. Performance was measured based on the overlap between predicted regions of interest (ROIs) and ground-truth lesion masks, using the Dice-Soerensen coefficient (DSC). The benchmark test set was chosen to consist of the 15 subjects most predictive of human performance and 13 subjects identified by at most 3 of the 28 readers.
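As an illustration of the overlap measure named in the Methods, the following is a minimal sketch of the Dice-Soerensen coefficient, DSC = 2|A ∩ B| / (|A| + |B|). It assumes each mask is represented as a set of voxel coordinates, which is a choice made here for brevity and not a description of the study's pipeline. The function name dice_coefficient and the toy coordinates are hypothetical.

```python
def dice_coefficient(predicted, truth):
    """Dice-Soerensen coefficient between two sets of voxel coordinates."""
    predicted, truth = set(predicted), set(truth)
    if not predicted and not truth:
        # Assumption: two empty masks are treated as perfect agreement.
        return 1.0
    overlap = len(predicted & truth)
    return 2.0 * overlap / (len(predicted) + len(truth))


# Toy example: a reader's ROI and a lesion mask sharing two of four voxels each.
roi = {(10, 12, 5), (10, 13, 5), (11, 12, 5), (11, 13, 5)}
lesion = {(11, 12, 5), (11, 13, 5), (12, 12, 5), (12, 13, 5)}
dsc = dice_coefficient(roi, lesion)
print(dsc)
```

A DSC of 0 means the ROI and the lesion mask do not overlap at all, and a DSC of 1 means they coincide exactly.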
Results: Expert readers achieved an average detection rate of 68%, compared to 45% for non-experts and 27% for laypersons. Neuroradiologists detected the highest percentage of lesions (64%), while psychiatrists detected the least (34%). Neurosurgeons had the highest ROI sensitivity (0.70), and psychiatrists had the highest ROI precision (0.78). The benchmark test set revealed an expert detection rate of 49%.
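The abstract does not define ROI sensitivity and ROI precision. One common voxel-wise reading, assumed here purely for illustration, takes sensitivity as the fraction of lesion voxels covered by the ROI and precision as the fraction of ROI voxels that lie inside the lesion. The function name roi_sensitivity_precision and the toy masks below are hypothetical.

```python
def roi_sensitivity_precision(predicted, truth):
    """Voxel-wise sensitivity and precision of an ROI against a lesion mask.

    Assumed definitions (not taken from the study):
      sensitivity = |ROI ∩ lesion| / |lesion|
      precision   = |ROI ∩ lesion| / |ROI|
    """
    predicted, truth = set(predicted), set(truth)
    overlap = len(predicted & truth)
    sensitivity = overlap / len(truth) if truth else 0.0
    precision = overlap / len(predicted) if predicted else 0.0
    return sensitivity, precision


# Toy example: a small ROI that sits mostly inside a larger lesion.
lesion_mask = {(x, 0, 0) for x in range(8)}        # 8 lesion voxels
reader_roi = {(0, 0, 0), (1, 0, 0), (2, 0, 0), (9, 0, 0)}  # 3 of 4 inside
sens, prec = roi_sensitivity_precision(reader_roi, lesion_mask)
print(sens, prec)
```

Under these assumed definitions, a small, well-placed ROI yields high precision but low sensitivity, while a large ROI that covers the whole lesion yields the opposite.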
Significance: Reporting human performance in FCD detection provides a critical baseline for assessing the effectiveness of automated detection methods in a clinically relevant context. The defined benchmark test set serves as a useful indicator for evaluating advancements in computer-aided FCD detection approaches.
Plain language summary: Focal cortical dysplasias (FCDs) are malformations of cortical development and one of the most common causes of drug-resistant focal epilepsy. Once found, FCDs can be neurosurgically resected, which leads to seizure freedom in many cases. However, FCDs are difficult to detect in the visual assessment of magnetic resonance imaging. A myriad of algorithms for automated FCD detection have been developed, but their true clinical value remains unclear since there is no benchmark dataset for evaluation and comparison to human performance. Here, we use human FCD detection performance to define a benchmark dataset with which new methods for automated detection can be evaluated.
Keywords: artificial intelligence; computer‐aided detection; human performance; reader study.
© 2025 The Author(s). Epilepsia Open published by Wiley Periodicals LLC on behalf of International League Against Epilepsy.
Conflict of interest statement
AR has received fees as a speaker from UCB Pharma and travel support from the Elisabeth und Helmut Uhl Stiftung. UA has received fees as a speaker for Siemens Healthineers and as a clinical consultant for Bayer. AR lectures for Guerbet and Bayer and is part of the Advisory Board for GE, Bracco, and Guerbet. RS has received personal fees as a speaker or for serving on advisory boards from Angelini, Arvelle, Bial, Desitin, Eisai, Jazz Pharmaceuticals Germany GmbH, Janssen‐Cilag GmbH, LivaNova, LivAssured BV, Novartis, Precisis GmbH, Rapport Therapeutics, Tabuk Pharmaceuticals, UCB Pharma, UNEEG, and Zogenix. TR has received fees as a speaker from Eisai. None of the previously mentioned activities were related to the content of this manuscript. The remaining authors have nothing to declare. We confirm that we have read the Journal's position on issues involved in ethical publication and affirm that this report is consistent with those guidelines.
