. 2023 Aug;64(8):2014-2026.

doi: 10.1111/epi.17637. Epub 2023 Jun 16.

Predicting seizure outcome after epilepsy surgery: Do we need more complex models, larger samples, or better data?

Maria H Eriksson^{1

2

3

4}, Mathilde Ripart¹, Rory J Piper^{1

5}, Friederike Moeller⁶, Krishna B Das^{3

6}, Christin Eltze⁶, Gerald Cooray^{6

7}, John Booth⁸, Kirstie J Whitaker⁴, Aswin Chari^{1

5}, Patricia Martin Sanfilippo^{1

2}, Ana Perez Caballero⁹, Lara Menzies¹⁰, Amy McTague^{1

3}, Martin M Tisdall^{1

5}, J Helen Cross^{1

3

5

11}, Torsten Baldeweg^{1

2}, Sophie Adler¹, Konrad Wagstyl¹²

Affiliations

¹ Developmental Neurosciences Research & Teaching Department, UCL Great Ormond Street Institute of Child Health, London, UK.
² Department of Neuropsychology, Great Ormond Street Hospital, London, UK.
³ Department of Neurology, Great Ormond Street Hospital, London, UK.
⁴ The Alan Turing Institute, London, UK.
⁵ Department of Neurosurgery, Great Ormond Street Hospital, London, UK.
⁶ Department of Neurophysiology, Great Ormond Street Hospital, London, UK.
⁷ Clinical Neuroscience, Karolinska Institute, Solna, Sweden.
⁸ Digital Research Environment, Great Ormond Street Hospital, London, UK.
⁹ North Thames Genomic Laboratory Hub, Great Ormond Street Hospital, London, UK.
¹⁰ Department of Clinical Genetics, Great Ormond Street Hospital, London, UK.
¹¹ Young Epilepsy, Lingfield, UK.
¹² Imaging Neuroscience, UCL Queen Square Institute of Neurology, London, UK.

PMID: 37129087
PMCID: PMC10952307
DOI: 10.1111/epi.17637

Predicting seizure outcome after epilepsy surgery: Do we need more complex models, larger samples, or better data?

Maria H Eriksson et al. Epilepsia. 2023 Aug.

. 2023 Aug;64(8):2014-2026.

doi: 10.1111/epi.17637. Epub 2023 Jun 16.

Authors

Affiliations

¹ Developmental Neurosciences Research & Teaching Department, UCL Great Ormond Street Institute of Child Health, London, UK.
² Department of Neuropsychology, Great Ormond Street Hospital, London, UK.
³ Department of Neurology, Great Ormond Street Hospital, London, UK.
⁴ The Alan Turing Institute, London, UK.
⁵ Department of Neurosurgery, Great Ormond Street Hospital, London, UK.
⁶ Department of Neurophysiology, Great Ormond Street Hospital, London, UK.
⁷ Clinical Neuroscience, Karolinska Institute, Solna, Sweden.
⁸ Digital Research Environment, Great Ormond Street Hospital, London, UK.
⁹ North Thames Genomic Laboratory Hub, Great Ormond Street Hospital, London, UK.
¹⁰ Department of Clinical Genetics, Great Ormond Street Hospital, London, UK.
¹¹ Young Epilepsy, Lingfield, UK.
¹² Imaging Neuroscience, UCL Queen Square Institute of Neurology, London, UK.

PMID: 37129087
PMCID: PMC10952307
DOI: 10.1111/epi.17637

Abstract

Objective: The accurate prediction of seizure freedom after epilepsy surgery remains challenging. We investigated if (1) training more complex models, (2) recruiting larger sample sizes, or (3) using data-driven selection of clinical predictors would improve our ability to predict postoperative seizure outcome using clinical features. We also conducted the first substantial external validation of a machine learning model trained to predict postoperative seizure outcome.

Methods: We performed a retrospective cohort study of 797 children who had undergone resective or disconnective epilepsy surgery at a tertiary center. We extracted patient information from medical records and trained three models-a logistic regression, a multilayer perceptron, and an XGBoost model-to predict 1-year postoperative seizure outcome on our data set. We evaluated the performance of a recently published XGBoost model on the same patients. We further investigated the impact of sample size on model performance, using learning curve analysis to estimate performance at samples up to N = 2000. Finally, we examined the impact of predictor selection on model performance.

Results: Our logistic regression achieved an accuracy of 72% (95% confidence interval [CI] = 68%-75%, area under the curve [AUC] = .72), whereas our multilayer perceptron and XGBoost both achieved accuracies of 71% (95% CI_MLP = 67%-74%, AUC_MLP = .70; 95% CI_{XGBoost own} = 68%-75%, AUC_{XGBoost own} = .70). There was no significant difference in performance between our three models (all p > .4) and they all performed better than the external XGBoost, which achieved an accuracy of 63% (95% CI = 59%-67%, AUC = .62; p_LR = .005, p_MLP = .01, p_{XGBoost own} = .01) on our data. All models showed improved performance with increasing sample size, but limited improvements beyond our current sample. The best model performance was achieved with data-driven feature selection.

Significance: We show that neither the deployment of complex machine learning models nor the assembly of thousands of patients alone is likely to generate significant improvements in our ability to predict postoperative seizure freedom. We instead propose that improved feature selection alongside collaboration, data standardization, and model sharing is required to advance the field.

Keywords: epilepsy surgery; machine learning; pediatric; prediction.

PubMed Disclaimer

Conflict of interest statement

JHC has acted as an investigator for studies with GW Pharmaceuticals, Zogenix, Vitaflo, Ovid, Marinius, and Stoke Therapeutics. She has been a speaker and on advisory boards for GW Pharmaceuticals, Zogenix, Biocodex, Stoke Therapeutics, and Nutricia; all remuneration has been paid to her department. She is president of the International League Against Epilepsy (2021–2025), and chair of the medical boards for Dravet UK, Hope 4 Hypothalamic Hamartoma, and Matthew's friends. MT has received grants from Royal Academy of Engineers and LifeArc. He has received honoraria from Medtronic. LM has received personal consultancy fees from Mendelian Ltd, outside the submitted work. AM has received honoraria from Biocodex and Nutricia, and provided consultancy to Biogen, outside the submitted work. All other authors report no disclosures relevant to the manuscript.

Figures

**FIGURE 1**
Study overview. We investigated the impact of model type, sample size, and feature selection on our ability to accurately predict postoperative seizure outcome.

**FIGURE 2**
Relationships between demographic, clinical, and surgical variables. Relationships are shown both before and after correction for multiple comparison using the Holm method. We have highlighted relationships with seizure outcome using a yellow box. ASM, antiseizure medication; Num. ASM pre‐op, number of antiseizure medications at time of preoperative evaluation; Num. ASM trialed, total number of different antiseizure medications trialed from epilepsy onset to preoperative evaluation.

**FIGURE 3**
Impact of model type and sample size on model performance. (A) Receiver‐ operating characteristic (ROC) curves showing model performances. There was no significant difference in performance between our LR (purple), MLP (pink), and XGBoost (teal) models. All of our models performed significantly better than the XGBoost model recently developed by Yossofzai et al. (light blue). (B) The effect of sample size on model performance (accuracy). There was an improvement in model performance with increasing sample size for our LR, MPL, and XGBoost models, but only up until a certain point. After this, the models showed only marginal gains in performance. Extrapolating performance for sample sizes up to N = 2000 did not predict substantial improvement in model performance for any of our models. AUC, area under the (ROC) curve; LR, logistic regression; MLP, multilayer perceptron; ROC, receiver‐operating characteristic.

**FIGURE 4**
Impact of feature selection on model performance. (A) Receiver‐operating characteristic (ROC) curves showing model performance for our LR models containing (1) only MRI diagnosis (red), (2) all predictors (orange), and (3) predictors identified through data‐driven feature selection (green). Data‐driven selection involved including only predictors that were significantly predictive of 1‐year postoperative seizure outcome as identified in univariable logistic regression analyses. Corresponding ROC curves showing model performances for our MLP and XGBoost models are displayed in Figures S2 and S3. (B) Effect of data‐driven feature selection on model performance (AUC). Variables found to be significantly predictive of seizure outcome from univariable logistic regression analyses were added to the LR, from most information to least informative according to their coefficients. Model performance was best when all significantly predictive features were included in the model. Adding the remaining predictors collected for the study, that is, those that were not significantly predictive of seizure outcome, worsened model performance (far right). Points circled in black represent mean AUC obtained across all 10 folds. Noncircled points represent the AUCs obtained from each of the individual 10 folds. ASM, antiseizure medication; AUC, area under the (ROC) curve; LR, logistic regression; NS. predictors, non‐significant predictors; Num. ASM trialed, total number of different antiseizure medication trialed from epilepsy onset to preoperative evaluation; Num. seiz. types, number of seizure types at time of preoperative evaluation; ROC, receiver‐operating characteristic; Spasms hist., history of spasms; Spasms pre‐op, spasms at time of preoperative evaluation.

See this image and copyright information in PMC

Comment in

Back to the Basics in Predictive Modeling-Predicting Surgical Success.
Terman SW. Terman SW. Epilepsy Curr. 2023 Nov 6;24(1):19-21. doi: 10.1177/15357597231205437. eCollection 2024 Jan-Feb. Epilepsy Curr. 2023. PMID: 38327535 Free PMC article.

References

1. Widjaja E, Jain P, Demoe L, Guttmann A, Tomlinson G, Sander B. Seizure outcome of pediatric epilepsy surgery: systematic review and meta‐analyses. Neurology. 2020;94(7):311–21. 10.1212/WNL.0000000000008966 - DOI - PubMed
1. Gracia CG, Chagin K, Kattan MW, Ji X, Kattan MG, Crotty L, et al. Predicting seizure freedom after epilepsy surgery, a challenge in clinical practice. Epilepsy Behav. 2019;95:124–30. 10.1016/j.yebeh.2019.03.047 - DOI - PMC - PubMed
1. Jehi L, Yardi R, Chagin K, Tassi L, Russo GL, Worrell G, et al. Development and validation of nomograms to provide individualised predictions of seizure outcomes after epilepsy surgery: a retrospective analysis. Lancet Neurol. 2015;14(3):283–90. 10.1016/S1474-4422(14)70325-4 - DOI - PubMed
1. Garcia GC, Yardi R, Kattan MW, Nair D, Gupta A, Najm I, et al. Seizure freedom score: a new simple method to predict success of epilepsy surgery. Epilepsia. 2015;56(3):359–65. 10.1111/epi.12892 - DOI - PubMed
1. Dugan P, Carlson C, Jetté N, Wiebe S, Bunch M, Kuzniecky R, et al. Derivation and initial validation of a surgical grading scale for the preliminary evaluation of adult patients with drug‐resistant focal epilepsy. Epilepsia. 2017;58(5):792–800. 10.1111/epi.13730 - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Consumer Health Information
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Predicting seizure outcome after epilepsy surgery: Do we need more complex models, larger samples, or better data?

Affiliations

Predicting seizure outcome after epilepsy surgery: Do we need more complex models, larger samples, or better data?

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Comment in

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical