Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul;53(7):2299-2306.
doi: 10.1161/STROKEAHA.121.036557. Epub 2022 Apr 1.

Prediction of Recurrent Ischemic Stroke Using Registry Data and Machine Learning Methods: The Erlangen Stroke Registry

Affiliations

Prediction of Recurrent Ischemic Stroke Using Registry Data and Machine Learning Methods: The Erlangen Stroke Registry

Asmir Vodencarevic et al. Stroke. 2022 Jul.

Abstract

Background: There have been multiple efforts toward individual prediction of recurrent strokes based on structured clinical and imaging data using machine learning algorithms. Some of these efforts resulted in relatively accurate prediction models. However, acquiring clinical and imaging data is typically possible at provider sites only and is associated with additional costs. Therefore, we developed recurrent stroke prediction models based solely on data easily obtained from the patient at home.

Methods: Data from 384 patients with ischemic stroke were obtained from the Erlangen Stroke Registry. Patients were followed at 3 and 12 months after first stroke and then annually, for about 2 years on average. Multiple machine learning algorithms were applied to train predictive models for estimating individual risk of recurrent stroke within 1 year. Double nested cross-validation was utilized for conservative performance estimation and models' learning capabilities were assessed by learning curves. Predicted probabilities were calibrated, and relative variable importance was assessed using explainable artificial intelligence techniques.

Results: The best model achieved the area under the curve of 0.70 (95% CI, 0.64-0.76) and relatively good probability calibration. The most predictive factors included patient's family and housing circumstances, rehabilitative measures, age, high calorie diet, systolic and diastolic blood pressures, percutaneous endoscopic gastrotomy, number of family doctor's home visits, and patient's mental state.

Conclusions: Developing fairly accurate models for individual risk prediction of recurrent ischemic stroke within 1 year solely based on registry data is feasible. Such models could be applied in a home setting to provide an initial risk assessment and identify high-risk patients early.

Keywords: ischemic stroke; machine learning; probability; recurrence; registries.

PubMed Disclaimer

Publication types