Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 29;16(1):e0245874.
doi: 10.1371/journal.pone.0245874. eCollection 2021.

Accurate prediction of clinical stroke scales and improved biomarkers of motor impairment from robotic measurements

Affiliations

Accurate prediction of clinical stroke scales and improved biomarkers of motor impairment from robotic measurements

Dimitris K Agrafiotis et al. PLoS One. .

Abstract

Objective: One of the greatest challenges in clinical trial design is dealing with the subjectivity and variability introduced by human raters when measuring clinical end-points. We hypothesized that robotic measures that capture the kinematics of human movements collected longitudinally in patients after stroke would bear a significant relationship to the ordinal clinical scales and potentially lead to the development of more sensitive motor biomarkers that could improve the efficiency and cost of clinical trials.

Materials and methods: We used clinical scales and a robotic assay to measure arm movement in 208 patients 7, 14, 21, 30 and 90 days after acute ischemic stroke at two separate clinical sites. The robots are low impedance and low friction interactive devices that precisely measure speed, position and force, so that even a hemiparetic patient can generate a complete measurement profile. These profiles were used to develop predictive models of the clinical assessments employing a combination of artificial ant colonies and neural network ensembles.

Results: The resulting models replicated commonly used clinical scales to a cross-validated R2 of 0.73, 0.75, 0.63 and 0.60 for the Fugl-Meyer, Motor Power, NIH stroke and modified Rankin scales, respectively. Moreover, when suitably scaled and combined, the robotic measures demonstrated a significant increase in effect size from day 7 to 90 over historical data (1.47 versus 0.67).

Discussion and conclusion: These results suggest that it is possible to derive surrogate biomarkers that can significantly reduce the sample size required to power future stroke clinical trials.

PubMed Disclaimer

Conflict of interest statement

The authors have read the journal’s policy and the authors of this manuscript have the following competing interests: AD and MK are employed by Janssen Research & Development, DA and EY were employed by Janssen Research & Development when this work was conducted and are currently employed by Novartis Institutes for BioMedical Research, GL is employed by GSL Statistical Consulting, GB is employed by Bioconstat Bvba, and JC is employed by Biogen-Idec. Wyeth provided grant funding for the study to HIK. This does not alter our adherence to PLOS ONE policies on sharing data and materials. HIK was also the founder of Interactive Motion Technologies and Chairman of the Board from 1998 to 2016. He sold Interactive Motion Technologies on April 2016 to Bionik Laboratories, where he served as Chief Science Officer and Board Member until July 2017. HIK holds equity positions in Interactive Motion Technologies, Watertown, MA, USA, the company that manufactures this type of technology under license from MIT. HIK was the founder of 4Motion Robotics. The authors would like to declare the following patents/patent applications associated with this research: HIK is a co-inventor in patents held by the Massachusetts Institute of Technology for the robotic technology used in this work (US Patent 7,618,381, US Patent 5,466,213, and US Patent App. 11/154,197).

Figures

Fig 1
Fig 1. Dependence of the length of the ant’s path L (log scale) on the value of R2.
As can be seen from this plot, L increases 10-fold as R2 decreases by 0.2 units up to a R2 value of ~0.2, and at a much greater rate for R2 values lower than 0.2.
Fig 2
Fig 2. Schematic illustration of the ensemble neural network models used to predict the clinical scales from the RMK variables.
Each subset of features identified by the artificial ant algorithm was used to construct 10 independent neural network models using exactly the same network topology and training parameters but a different random seed number (and thus different initial synaptic parameters and presentation sequence of the training samples). The predictions of these 10 models were averaged to produce the aggregate prediction of the ensemble.
Fig 3
Fig 3. SPE map of the correlation distances of the clinical and RMK parameters for the completers cohort.
The map was derived by computing the pairwise Pearson correlation coefficients (R) for all pairs of features, converting them to correlation distances (1-abs(R)), and embedding the resulting matrix into 2 dimensions in such a way that the distances of the points on the map approximate as closely as possible the correlation distances of the respective features. The clinical parameters are highlighted in red, the RMK parameters on the affected side in blue, and the RMK parameters on the unaffected side in green. The map also shows distinct clusters of correlated variables which are preserved on both the affected and unaffected sides (outlined by green and blue ellipses, respectively).
Fig 4
Fig 4. Cross-validated R2 of the best models derived from the completers (solid lines) and validated with the non-completers (dashed lines) for each of the four clinical scales, using 2, 4, 6, 8, 10, 12 and 14 robot-derived RMK features.
The figure shows the ability of the robot-derived RMK models to predict the clinical scales with an increasing number of features. The model performance exhibits an asymptotic behavior with respect to the number of RMK features, reaching the point of diminishing returns at approximately 8 features for all four clinical scales. Note the small variance in the prediction of the trained data as shown by the small “whiskers,” which for the most part, are not visible in the figure.
Fig 5
Fig 5. Cross-validated R2 of the best 8-feature models derived using three different approaches for feature selection and model building: 1) features selected by artificial ants with neural networks, model derived with neural networks (dark blue); 2) features selected by artificial ants with neural networks, model derived with multi-linear regression (light blue); and 3) features selected by PCA, model derived with multi-linear regression (red).
Every model was cross-validated using the same 10-fold cross-validation procedure described in the Methods session.
Fig 6
Fig 6. Importance of individual RMK features in predicting the four clinical scales, obtained by systematically removing each feature from the training sample and repeating the feature selection, aggregation and cross-validation procedure for each derived data set.
Only models with 8 input and 2 hidden units are shown. The left-most data point and the horizontal solid line on the top part of the plot represent the cross-validated R2 of the best model with all features included, averaged over all 10 cross-validation runs (standard deviations shown as error bars). Each subsequent point shows the R2 of the corresponding apo model, i.e., the model derived by omitting the feature shown on the x axis (the model still includes 8 features, just not the one shown on the x axis). The individual markers at the bottom part of the plot indicate whether there is a statistically significant difference between the R2 distributions of the all-feature and the respective apo models (the presence of a marker indicates that the difference between the two distributions is statistically significant, and the absence that it is not).
Fig 7
Fig 7. Optimization of effect size for robot-derived RMK metrics.
The horizontal lines show the day 7 to day 90 effect size for comparable patients of the historical VISTA data for the NIH, as well as the effect sizes for the NIH, FM and MP assessment scales for our completers cohort. The figure also shows the performance of the robot-derived RMK composites optimized for effect size for the trained (solid lines) and cross-validated sets (dashed lines). Note the increase of over 20% in cross-validated effect size for the RMK composites over the clinical scales with 4-features for this study (and over 70% over the historical data).
Fig 8
Fig 8. Correlation (expressed as R2) of the FM, MP and NIH original scales with the corresponding optimized 8-feature composites obtained with the greedy algorithm.

References

    1. Prevalence of disabilities and associated health conditions among adults--United States, 1999. MMWR Morb Mortal Wkly Rep. 2001. February 23;50(7):120–5. - PubMed
    1. Marler JR, Tilley BC, Lu M, Brott TG, Lyden PC, Grotta JC, et al. Early stroke treatment associated with better outcome: the NINDS rt-PA stroke study. Neurology. 2000. December 12;55(11):1649–55. 10.1212/wnl.55.11.1649 - DOI - PubMed
    1. Hogan N, Krebs HI. Physically interactive robotic technology for neuromotor rehabilitation. Prog Brain Res. 2011;192:59–68. 10.1016/B978-0-444-53355-5.00004-X - DOI - PubMed
    1. Gassert R, Dietz V. Rehabilitation robots for the treatment of sensorimotor deficits: a neurophysiological perspective. J NeuroEngineering Rehabil. 2018. December;15(1):46 10.1186/s12984-018-0383-x - DOI - PMC - PubMed
    1. Goffredo M, Mazzoleni S, Gison A, Infarinato F, Pournajaf S, Galafate D, et al. Kinematic Parameters for Tracking Patient Progress during Upper Limb Robot-Assisted Rehabilitation: An Observational Study on Subacute Stroke Subjects. Appl Bionics Biomech. 2019;2019:4251089 10.1155/2019/4251089 - DOI - PMC - PubMed

Publication types