Multi-speed transformer network for neurodegenerative disease assessment and activity recognition
- PMID: 36706617
- DOI: 10.1016/j.cmpb.2023.107344
Multi-speed transformer network for neurodegenerative disease assessment and activity recognition
Abstract
Background and objective: Neurodegenerative diseases are the most frequent age-related diseases. This type of disease, if not discovered in the initial stage, will compromise the quality of life of the affected subject. Thus, a timely diagnosis is of paramount importance. One of the most used tasks from neurologists to detect and determine the severity of the disease is analysing human gait. This work presents the dataset named "Beside Gait" containing timeseries of coordinates of extracted body joints of people with neurodegenerative diseases in various stages of the disease as well as control subjects. In addition, the novel Multi-Speed transformer technique will be presented and benchmarked against several other techniques making use of deep learning and Shallow Learning. The objective is to recognize subjects affected by some form of neurodegenerative disease in early stage using a computer vision technique making use of deep learning that can be integrated into a smartphone app for offline inference with the aim of promptly initiate investigations and treatment to improve the patient's quality of life.
Methods: The recorded videos were processed, and the skeleton of the person in the video was extracted using pose estimation. The raw time-series coordinates of the joints extracted by the pose estimation algorithm were tested against novel deep neural network architectures and Shallow Learning techniques. In this work, the proposed Multi-Speed Transformer is benchmarked against other deep neural networks such as Temporal Convolutional Neural Networks, Transformers, as well as Shallow Learning techniques making use of feature extraction and different classifiers such as Random Forests, K Nearest Neighbours, Ada Boost, Linear and RBF SVM. The proposed Multi-Speed Transformer architecture has been developed to learn short and long-term patterns to model the various pathological gaits.
Results: The Multi-Speed Transformer outperformed all other existing models reaching an accuracy of 96.9%, a sensitivity of 96.9%, a precision of 97.7%, and a specificity of 97.1% in binary classification. The accuracy in multi-class classification for detecting the presence of the disease in various stages is 71.6%, the sensitivity is 67.7%, and the specificity is 71.8%. In addition, tests have also been conducted against two other different activity recognition datasets, namely SHREC and JHMDB, in the exact same conditions. Multi-Speed Transformer has demonstrated to beat always all other tested techniques as well as the techniques reviewed in the state-of-the-art with respectively of accuracy 91.8% and 74%. Having those datasets more than two classes, specificity was not computed.
Conclusions: The Multi-Speed Transformer is a valuable technique for neurodegenerative disease assessment through computer vision. In addition, the novel dataset "Beside Gait" here presented is an important starting point for future research work on automatic recognition of neurodegenerative diseases using gait analysis.
Keywords: Attention; Beside gait dataset; Deep learning; LSTM; Multi-speed; Neurodegenerative; Temporal convolutional.
Copyright © 2023 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y. BMC Med Res Methodol. 2022. PMID: 35780100 Free PMC article.
-
Evaluation of Vertical Ground Reaction Forces Pattern Visualization in Neurodegenerative Diseases Identification Using Deep Learning and Recurrence Plot Image Feature Extraction.Sensors (Basel). 2020 Jul 10;20(14):3857. doi: 10.3390/s20143857. Sensors (Basel). 2020. PMID: 32664354 Free PMC article.
-
MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification.Int J Med Inform. 2023 Oct;178:105178. doi: 10.1016/j.ijmedinf.2023.105178. Epub 2023 Aug 21. Int J Med Inform. 2023. PMID: 37657204
-
A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition.Comput Intell Neurosci. 2022 Apr 20;2022:8323962. doi: 10.1155/2022/8323962. eCollection 2022. Comput Intell Neurosci. 2022. PMID: 35498187 Free PMC article. Review.
-
Shallow and deep learning classifiers in medical image analysis.Eur Radiol Exp. 2024 Mar 5;8(1):26. doi: 10.1186/s41747-024-00428-2. Eur Radiol Exp. 2024. PMID: 38438821 Free PMC article. Review.
Cited by
-
A Hybrid Protection Scheme for the Gait Analysis in Early Dementia Recognition.Sensors (Basel). 2023 Dec 19;24(1):24. doi: 10.3390/s24010024. Sensors (Basel). 2023. PMID: 38202886 Free PMC article.
-
Touch events and human activities for continuous authentication via smartphone.Sci Rep. 2023 Jun 29;13(1):10515. doi: 10.1038/s41598-023-36780-3. Sci Rep. 2023. PMID: 37386093 Free PMC article.
-
A Riemannian multimodal representation to classify parkinsonism-related patterns from noninvasive observations of gait and eye movements.Biomed Eng Lett. 2024 Oct 26;15(1):81-93. doi: 10.1007/s13534-024-00420-0. eCollection 2025 Jan. Biomed Eng Lett. 2024. PMID: 39781052
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Research Materials