Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Apr 17:rs.3.rs-6254029.
doi: 10.21203/rs.3.rs-6254029/v1.

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Affiliations

A global effort to benchmark predictive models and reveal mechanistic diversity in long-term stroke outcomes

Anna Matsulevits et al. Res Sq. .

Abstract

Stroke remains a leading cause of mortality and long-term disability worldwide, with variable recovery trajectories posing substantial challenges in anticipating post-event care and rehabilitation planning. To address these challenges, we established the NeuralCup consortium to benchmark predictive models of stroke outcome through a collaborative, data-driven approach. This study presents findings from 15 international teams who used a comprehensive dataset including clinical and imaging data, to identify and compare predictors of motor, cognitive, and emotional outcomes one year post-stroke. Our analyses integrated traditional statistical approaches and novel machine learning algorithms to uncover 'optimal recipes' for predicting each domain. The differences in these 'optimal recipes' reflect distinct brain mechanisms in response to different tasks. Key predictors across all domains included infarct characteristics, T1-weighted MRI sequences, and demographic factors. Additionally, integrating FLAIR imaging and white matter tract analysis significantly improved the prediction of cognitive and motor outcomes, respectively. These findings support a multifaceted approach to stroke outcome prediction, underscoring the potential of collaborative data science to develop personalized care strategies that enhance recovery and quality of life for stroke survivors. To encourage further model development and validation, we provide access to the training dataset at http://neuralcup.bcblab.com.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Visualization of the process for obtaining the theoretically optimal feature combination for predictions: inversing the randomise analysis yields heatmaps for each analyzed feature. After binarizing these maps, we investigated the overlap of the local maxima of the clinical tests with the binarized feature maps (1 representing the presence of the feature, and 0 representing the absence of a feature in the final combination ‘optimal recipe’).
Figure 2
Figure 2
Summary of participating teams and the approaches taken for all predictions. a Locations of the teams’ affiliated labs. b Summary of different inputs (A: age, B: gender, C: DWI, D: T1, E: lesion, F: FLAIR, G: tracts, H: atlases, I: disconnectome) and methods (A: clustering, B: artificial neural networks, C: regression, D: feature selection, E: dimensionality reduction, F: parcellation, G: cross-validation, H: bootstrap) used for each prediction. (Figure modified from).
Figure 3
Figure 3
a Mean R2 comparison for all submitted predictions (motor, cognitive, and psychological outcomes) of five neuropsychological scores (FM total, MoCA, IST, HAD-A, HAD-D) sorted ascendingly from the highest score to the lowest score across all teams whose number is indicated on the x-axis. The stars indicate the prediction number, the whiskers indicate the standard error of the mean (SEM). b Median R2 comparison for all submitted predictions of the same scores, sorted ascendingly with whiskers indicating the interquartile range (IRQ). c T-statistic maps for each clinical outcome test, obtained from the FSL randomise analysis. The yellow regions indicate a significant t-value, the purple regions indicate a non-significant t-value. The UMAP distribution of all teams is plotted on the t-statistic maps. d Local maximum of the t-statistic map for each analyzed outcome score.

References

    1. GBD 2019 Stroke Collaborators: Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 20, 795–820 (2021) - PMC - PubMed
    1. McMeekin P., et al.: Updating estimates of the number of UK stroke patients eligible for endovascular thrombectomy: incorporating recent evidence to facilitate service planning. Eur. Stroke J. 6, 349–356 (2021) - PMC - PubMed
    1. Raha O., et al.: Advances in mechanical thrombectomy for acute ischaemic stroke. bmjmed 2, e000407 (2023) - PMC - PubMed
    1. Pollatsek A., Well A.D.: On the use of counterbalanced designs in cognitive research: A suggestion for a better and more powerful analysis. J. Experimental Psychology: Learn. Memory Cognition. 21, 785–794 (1995) - PubMed
    1. Oxbury J.M., Greenhall R.C., Grainger K.M.: Predicting the outcome of stroke: acute stage after cerebral infarction. BMJ. 3, 125–127 (1975) - PMC - PubMed

Publication types

LinkOut - more resources