MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification

Badr Ait Hammou¹, Fares Antaki², Marie-Carole Boucher³, Renaud Duval³

Affiliations

¹ Department of Ophthalmology, Université de Montréal, Montreal, Québec, Canada; Centre Universitaire d'Ophtalmologie (CUO), Hôpital Maisonneuve-Rosemont, CIUSSS de l'Est-de-l'Île-de-Montréal, Montréal, Québec, Canada. Electronic address: badr.aithamou@gmail.com.
² Department of Ophthalmology, Université de Montréal, Montreal, Québec, Canada; Centre Universitaire d'Ophtalmologie (CUO), Hôpital Maisonneuve-Rosemont, CIUSSS de l'Est-de-l'Île-de-Montréal, Montréal, Québec, Canada; Department of Ophthalmology, Centre Hospitalier de l'Université de Montréal (CHUM), Montreal, Quebec, Canada.
³ Department of Ophthalmology, Université de Montréal, Montreal, Québec, Canada; Centre Universitaire d'Ophtalmologie (CUO), Hôpital Maisonneuve-Rosemont, CIUSSS de l'Est-de-l'Île-de-Montréal, Montréal, Québec, Canada.

PMID: 37657204
DOI: 10.1016/j.ijmedinf.2023.105178

Badr Ait Hammou et al. Int J Med Inform. 2023 Oct:178:105178. doi: 10.1016/j.ijmedinf.2023.105178. Epub 2023 Aug 21.

Abstract

Background and objective: Detecting retinal diseases from optical coherence tomography (OCT) images and videos is a data classification problem. In recent years, Transformer architectures have been applied successfully to a variety of real-world classification problems. Although they have shown impressive discriminative ability compared with other state-of-the-art models, further improving their performance remains essential, especially in healthcare.

Methods: This paper presents a technique named Model-Based Transformer (MBT). It builds on popular pre-trained transformer models: the Vision Transformer and Swin Transformer for OCT image classification, and the Multiscale Vision Transformer for OCT video classification. The proposed approach represents OCT data using an approximate sparse representation technique, then estimates the optimal features and performs classification.
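
The abstract does not say how the approximate sparse representation is computed or how features are taken from the pre-trained transformers. As a minimal sketch of the general idea only, the Python below assumes feature vectors have already been extracted by a pre-trained model. It approximates a sparse code for a query with greedy matching pursuit (a stand-in technique, not necessarily the authors' choice) and assigns the class whose training samples reconstruct the query with the smallest residual. The function names, toy vectors, and class labels are all hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v] if norm > 0 else list(v)


def matching_pursuit(atoms, x, n_nonzero):
    """Greedy approximate sparse code of x over unit-norm atoms."""
    coeffs = [0.0] * len(atoms)
    residual = list(x)
    for _ in range(n_nonzero):
        scores = [dot(atom, residual) for atom in atoms]
        # Pick the atom most correlated with what is still unexplained.
        j = max(range(len(atoms)), key=lambda i: abs(scores[i]))
        if abs(scores[j]) < 1e-12:
            break
        coeffs[j] += scores[j]
        residual = [r - scores[j] * a for r, a in zip(residual, atoms[j])]
    return coeffs


def classify(train_features, train_labels, x, n_nonzero=2):
    """Assign x to the class whose atoms best reconstruct it."""
    atoms = [normalize(f) for f in train_features]
    coeffs = matching_pursuit(atoms, x, n_nonzero)
    best_label, best_error = None, math.inf
    for label in sorted(set(train_labels)):
        # Reconstruct x using only the coefficients of this class.
        recon = [0.0] * len(x)
        for c, atom, atom_label in zip(coeffs, atoms, train_labels):
            if atom_label == label:
                recon = [r + c * a for r, a in zip(recon, atom)]
        error = math.sqrt(sum((xi - ri) ** 2 for xi, ri in zip(x, recon)))
        if error < best_error:
            best_label, best_error = label, error
    return best_label


# Hypothetical 3-dimensional "features"; real transformer features are far larger.
train_features = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 0.9, 0.1]]
train_labels = ["normal", "normal", "dme", "dme"]
print(classify(train_features, train_labels, [0.95, 0.05, 0.0]))
print(classify(train_features, train_labels, [0.05, 0.9, 0.1]))
```

On this toy data the first query is assigned to "normal" and the second to "dme". A practical version would likely use a numerical library and a tuned sparsity level rather than the fixed `n_nonzero=2` assumed here.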

Results: Experiments were carried out on three real-world retinal datasets. On both the OCT image and OCT video datasets, the proposed method outperforms existing state-of-the-art deep learning approaches in terms of classification accuracy, precision, recall, F1-score, kappa, AUC-ROC, and AUC-PR. It also improves the performance of existing transformer models, including the Vision Transformer and Swin Transformer for OCT image classification and the Multiscale Vision Transformer for OCT video classification.
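
For readers less familiar with these metrics, the sketch below shows how accuracy, precision, recall, F1-score, and Cohen's kappa can be computed for a multi-class problem from predicted labels. It assumes macro averaging across classes, which the abstract does not specify, and it omits AUC-ROC and AUC-PR because those need predicted scores rather than hard labels. The function name and the toy labels are hypothetical and are not results from the paper.

```python
from collections import Counter


def multiclass_metrics(y_true, y_pred):
    """Accuracy, macro precision/recall/F1, and Cohen's kappa from hard labels."""
    labels = sorted(set(y_true) | set(y_pred))
    n = len(y_true)
    pairs = list(zip(y_true, y_pred))
    accuracy = sum(t == p for t, p in pairs) / n

    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(true_counts[l] * pred_counts[l] for l in labels) / (n * n)
    kappa = (accuracy - expected) / (1 - expected) if expected != 1 else 0.0

    k = len(labels)
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / k,
        "recall": sum(recalls) / k,
        "f1": sum(f1s) / k,
        "kappa": kappa,
    }


# Toy labels for illustration only.
y_true = ["amd", "amd", "dme", "dme", "normal", "normal"]
y_pred = ["amd", "dme", "dme", "dme", "normal", "normal"]
metrics = multiclass_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Kappa is reported alongside accuracy because it discounts the agreement a classifier would reach by chance, which matters when class frequencies are unbalanced.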

Conclusions: This work presents an approach for the automated detection of retinal diseases. Although deep neural networks have shown great potential in ophthalmology applications, our findings demonstrate, for the first time, a way to identify retinal pathologies from OCT videos rather than images. Moreover, the proposed approach can help researchers enhance the discriminative capacity of a variety of published deep learning models, which may be valuable for future medical research and clinical practice.

Keywords: Computer-aided diagnosis; Image classification; Multiscale vision transformer; Optical coherence tomography; Retinal disease classification; Swin Transformer; Video classification; Vision Transformer.

Conflict of interest statement

Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
