This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2025 Apr 17:rs.3.rs-6254029.

doi: 10.21203/rs.3.rs-6254029/v1.

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Anna Matsulevits¹, Pedro Alves², Manfredo Atzori, Ahmad Beyh³, Maurizio Corbetta⁴, Federico Del Pup, Lilit Dulyan, Chris Foulon⁵, Thomas Hope⁶, Stefano Ioannucci, Gaël Jobard, Hervé Lemaître⁷, Douglas Neville, Victor Nozais, Christopher Rorden, Orionas-Vasilis Saprikis, Igor Sibon, Christoph Sperber⁸, Alex Teghipco⁹, Bertrand Thirion¹⁰, Louis Fabrice Tshimanga, Roza Umarova¹¹, Ema Birute Vaidelyte, Emiel van den Hoven, Esteban Villar Rodriguez, Andrea Zanola, Thomas Tourdias, Michel Thiebaut de Schotten¹²

Affiliations

¹ Groupe d'Imagerie Neurofonctionnelle Institut des Maladies Neurodégénératives-UMR 5293, CNRS, CEA, University of Bordeaux, Bordeaux, France ; Brain Connectivity and Behaviour La.
² Centro de Estudos Egas Moniz, Faculdade de Medicina, Universidade de Lisboa.
³ Rutgers University.
⁴ University of Padua, Department of Neuroscience.
⁵ UCL Queen Square Institute of Neurology, University College London.
⁶ University College London.
⁷ Groupe d'Imagerie Neurofonctionnelle.
⁸ University of Bern.
⁹ University of South Carolina.
¹⁰ Inria.
¹¹ University Hospital of Bern.
¹² Institut des Maladies Neurodégénératives-UMR 5293.

PMID: 40321754
PMCID: PMC12047981
DOI: 10.21203/rs.3.rs-6254029/v1

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Anna Matsulevits et al. Res Sq. 2025.

[Preprint]. 2025 Apr 17:rs.3.rs-6254029.

doi: 10.21203/rs.3.rs-6254029/v1.

Authors

Affiliations

¹ Groupe d'Imagerie Neurofonctionnelle Institut des Maladies Neurodégénératives-UMR 5293, CNRS, CEA, University of Bordeaux, Bordeaux, France ; Brain Connectivity and Behaviour La.
² Centro de Estudos Egas Moniz, Faculdade de Medicina, Universidade de Lisboa.
³ Rutgers University.
⁴ University of Padua, Department of Neuroscience.
⁵ UCL Queen Square Institute of Neurology, University College London.
⁶ University College London.
⁷ Groupe d'Imagerie Neurofonctionnelle.
⁸ University of Bern.
⁹ University of South Carolina.
¹⁰ Inria.
¹¹ University Hospital of Bern.
¹² Institut des Maladies Neurodégénératives-UMR 5293.

PMID: 40321754
PMCID: PMC12047981
DOI: 10.21203/rs.3.rs-6254029/v1

Abstract

Stroke remains a leading cause of mortality and long-term disability worldwide, with variable recovery trajectories posing substantial challenges in anticipating post-event care and rehabilitation planning. To address these challenges, we established the NeuralCup consortium to benchmark predictive models of stroke outcome through a collaborative, data-driven approach. This study presents findings from 15 international teams who used a comprehensive dataset including clinical and imaging data, to identify and compare predictors of motor, cognitive, and emotional outcomes one year post-stroke. Our analyses integrated traditional statistical approaches and novel machine learning algorithms to uncover 'optimal recipes' for predicting each domain. The differences in these 'optimal recipes' reflect distinct brain mechanisms in response to different tasks. Key predictors across all domains included infarct characteristics, T1-weighted MRI sequences, and demographic factors. Additionally, integrating FLAIR imaging and white matter tract analysis significantly improved the prediction of cognitive and motor outcomes, respectively. These findings support a multifaceted approach to stroke outcome prediction, underscoring the potential of collaborative data science to develop personalized care strategies that enhance recovery and quality of life for stroke survivors. To encourage further model development and validation, we provide access to the training dataset at http://neuralcup.bcblab.com.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Visualization of the process for obtaining the theoretically optimal feature combination for predictions: inversing the *randomise* analysis yields heatmaps for each analyzed feature. After binarizing these maps, we investigated the overlap of the local maxima of the clinical tests with the binarized feature maps (1 representing the presence of the feature, and 0 representing the absence of a feature in the final combination ‘optimal recipe’).

**Figure 2**
Summary of participating teams and the approaches taken for all predictions. a Locations of the teams’ affiliated labs. b Summary of different inputs (A: age, B: gender, C: DWI, D: T1, E: lesion, F: FLAIR, G: tracts, H: atlases, I: disconnectome) and methods (A: clustering, B: artificial neural networks, C: regression, D: feature selection, E: dimensionality reduction, F: parcellation, G: cross-validation, H: bootstrap) used for each prediction. (Figure modified from).

**Figure 3**
a Mean R² comparison for all submitted predictions (motor, cognitive, and psychological outcomes) of five neuropsychological scores (FM total, MoCA, IST, HAD-A, HAD-D) sorted ascendingly from the highest score to the lowest score across all teams whose number is indicated on the x-axis. The stars indicate the prediction number, the whiskers indicate the standard error of the mean (SEM). b Median R² comparison for all submitted predictions of the same scores, sorted ascendingly with whiskers indicating the interquartile range (IRQ). c T-statistic maps for each clinical outcome test, obtained from the FSL *randomise* analysis. The yellow regions indicate a significant t-value, the purple regions indicate a non-significant t-value. The UMAP distribution of all teams is plotted on the t-statistic maps. d Local maximum of the t-statistic map for each analyzed outcome score.

See this image and copyright information in PMC

References

1. GBD 2019 Stroke Collaborators: Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 20, 795–820 (2021) - PMC - PubMed
1. McMeekin P., et al.: Updating estimates of the number of UK stroke patients eligible for endovascular thrombectomy: incorporating recent evidence to facilitate service planning. Eur. Stroke J. 6, 349–356 (2021) - PMC - PubMed
1. Raha O., et al.: Advances in mechanical thrombectomy for acute ischaemic stroke. bmjmed 2, e000407 (2023) - PMC - PubMed
1. Pollatsek A., Well A.D.: On the use of counterbalanced designs in cognitive research: A suggestion for a better and more powerful analysis. J. Experimental Psychology: Learn. Memory Cognition. 21, 785–794 (1995) - PubMed
1. Oxbury J.M., Greenhall R.C., Grainger K.M.: Predicting the outcome of stroke: acute stage after cerebral infarction. BMJ. 3, 125–127 (1975) - PMC - PubMed

Publication types

Actions

Grants and funding

RF1 MH133701/MH/NIMH NIH HHS/United States

LinkOut - more resources

Full Text Sources
- PubMed Central
- Research Square

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Affiliations

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources