. 2024 Nov;62(11):3459-3469.

doi: 10.1007/s11517-024-03144-6. Epub 2024 Jun 14.

Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer

Jiaqing Zhao¹, Jianfeng Zhu², Jiangnan He², Guogang Cao³, Cuixia Dai⁴

Affiliations

¹ Shanghai Institute of Technology, Shanghai, China.
² Shanghai Eye Disease Prevention and Control Center, Shanghai, China.
³ Shanghai Institute of Technology, Shanghai, China. guogangcao@163.com.
⁴ Shanghai Institute of Technology, Shanghai, China. sdadai7412@163.com.

PMID: 38871856
DOI: 10.1007/s11517-024-03144-6

Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer

Jiaqing Zhao et al. Med Biol Eng Comput. 2024 Nov.

. 2024 Nov;62(11):3459-3469.

doi: 10.1007/s11517-024-03144-6. Epub 2024 Jun 14.

Authors

Jiaqing Zhao¹, Jianfeng Zhu², Jiangnan He², Guogang Cao³, Cuixia Dai⁴

Affiliations

¹ Shanghai Institute of Technology, Shanghai, China.
² Shanghai Eye Disease Prevention and Control Center, Shanghai, China.
³ Shanghai Institute of Technology, Shanghai, China. guogangcao@163.com.
⁴ Shanghai Institute of Technology, Shanghai, China. sdadai7412@163.com.

PMID: 38871856
DOI: 10.1007/s11517-024-03144-6

Abstract

Retinal disorders are a major cause of irreversible vision loss, which can be mitigated through accurate and early diagnosis. Conventionally, fundus images are used as the gold diagnosis standard in detecting retinal diseases. In recent years, more and more researchers have employed deep learning methods for diagnosing ophthalmic diseases using fundus photography datasets. Among the studies, most of them focus on diagnosing a single disease in fundus images, making it still challenging for the diagnosis of multiple diseases. In this paper, we propose a framework that combines ResNet and Transformer for multi-label classification of retinal disease. This model employs ResNet to extract image features, utilizes Transformer to capture global information, and enhances the relationships between categories through learnable label embedding. On the publicly available Ocular Disease Intelligent Recognition (ODIR-5 k) dataset, the proposed method achieves a mean average precision of 92.86%, an area under the curve (AUC) of 97.27%, and a recall of 90.62%, which outperforms other state-of-the-art approaches for the multi-label classification. The proposed method represents a significant advancement in the field of retinal disease diagnosis, offering a more accurate, efficient, and comprehensive model for the detection of multiple retinal conditions.

Keywords: Color fundus images; Deep CNN; Multi-label image classification; Transformer.

PubMed Disclaimer

Cited by

WaveAttention-ResNet: a deep learning-based intelligent diagnostic model for the auxiliary diagnosis of multiple retinal diseases.
Guo B, Wang D, Zhang R, Hou J, Liu W, Wu Y, Yang X, Zhang L. Guo B, et al. Front Radiol. 2025 Jul 29;5:1608052. doi: 10.3389/fradi.2025.1608052. eCollection 2025. Front Radiol. 2025. PMID: 40800169 Free PMC article.

References

1. Bourne R, Price H, Stevens G (2012) Global burden of visual impairment and blindness. Arch Ophthalmol 130(5):645–647. https://doi.org/10.1001/archophthalmol.2012.1032 - DOI - PubMed
1. He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770–8. https://doi.org/10.1109/CVPR.2016.90
1. Vaswani A, Shazeer NM, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al (2017) Attention is all you need. Advances in neural information processing systems 30. https://doi.org/10.48550/arXiv.1706.03762
1. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. https://doi.org/10.48550/arXiv.1409.1556
1. Liu Z, Lin Y, Cao Y, Hu H, Wei Y, Zhang Z, et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 9992–10002. https://doi.org/10.48550/arXiv.2103.14030

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Springer
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer

Affiliations

Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer

Authors

Affiliations

Abstract

Similar articles

Cited by

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Similar articles

Cited by

References

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical