MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification
- PMID: 37657204
- DOI: 10.1016/j.ijmedinf.2023.105178
MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification
Abstract
Background and objective: The detection of retinal diseases using optical coherence tomography (OCT) images and videos is a concrete example of a data classification problem. In recent years, Transformer architectures have been successfully applied to solve a variety of real-world classification problems. Although they have shown impressive discriminative abilities compared to other state-of-the-art models, improving their performance is essential, especially in healthcare-related problems.
Methods: This paper presents an effective technique named model-based transformer (MBT). It is based on popular pre-trained transformer models, particularly, vision transformer, swin transformer for OCT image classification, and multiscale vision transformer for OCT video classification. The proposed approach is designed to represent OCT data by taking advantage of an approximate sparse representation technique. Then, it estimates the optimal features, and performs data classification.
Results: The experiments are carried out using three real-world retinal datasets. The experimental results on OCT image and OCT video datasets show that the proposed method outperforms existing state-of-the-art deep learning approaches in terms of classification accuracy, precision, recall, and f1-score, kappa, AUC-ROC, and AUC-PR. It can also boost the performance of existing transformer models, including Vision transformer and Swin transformer for OCT image classification, and Multiscale Vision Transformers for OCT video classification.
Conclusions: This work presents an approach for the automated detection of retinal diseases. Although deep neural networks have proven great potential in ophthalmology applications, our findings demonstrate for the first time a new way to identify retinal pathologies using OCT videos instead of images. Moreover, our proposal can help researchers enhance the discriminative capacity of a variety of powerful deep learning models presented in published papers. This can be valuable for future directions in medical research and clinical practice.
Keywords: Computer-aided diagnosis; Image classification; Multiscale vision transformer; Optical coherence tomography; Retinal disease classification; Swin Transformer; Video classification; Vision Transformer.
Copyright © 2023 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
Stitched vision transformer for age-related macular degeneration detection using retinal optical coherence tomography images.PLoS One. 2024 Jun 5;19(6):e0304943. doi: 10.1371/journal.pone.0304943. eCollection 2024. PLoS One. 2024. PMID: 38837967 Free PMC article.
-
Multi-Fundus Diseases Classification Using Retinal Optical Coherence Tomography Images with Swin Transformer V2.J Imaging. 2023 Sep 29;9(10):203. doi: 10.3390/jimaging9100203. J Imaging. 2023. PMID: 37888310 Free PMC article.
-
HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9. Comput Biol Med. 2024. PMID: 38878400
-
Do it the transformer way: A comprehensive review of brain and vision transformers for autism spectrum disorder diagnosis and classification.Comput Biol Med. 2023 Dec;167:107667. doi: 10.1016/j.compbiomed.2023.107667. Epub 2023 Nov 3. Comput Biol Med. 2023. PMID: 37939407 Review.
-
Improving diagnosis and prognosis of lung cancer using vision transformers: a scoping review.BMC Med Imaging. 2023 Sep 15;23(1):129. doi: 10.1186/s12880-023-01098-z. BMC Med Imaging. 2023. PMID: 37715137 Free PMC article.
Cited by
-
Discriminative, generative artificial intelligence, and foundation models in retina imaging.Taiwan J Ophthalmol. 2024 Nov 28;14(4):473-485. doi: 10.4103/tjo.TJO-D-24-00064. eCollection 2024 Oct-Dec. Taiwan J Ophthalmol. 2024. PMID: 39803410 Free PMC article. Review.
-
Stitched vision transformer for age-related macular degeneration detection using retinal optical coherence tomography images.PLoS One. 2024 Jun 5;19(6):e0304943. doi: 10.1371/journal.pone.0304943. eCollection 2024. PLoS One. 2024. PMID: 38837967 Free PMC article.
-
Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.Front Cell Dev Biol. 2024 Oct 11;12:1484880. doi: 10.3389/fcell.2024.1484880. eCollection 2024. Front Cell Dev Biol. 2024. PMID: 39463765 Free PMC article.
LinkOut - more resources
Full Text Sources
Research Materials