MSLI-Net: retinal disease detection network based on multi-segment localization and multi-scale interaction
- PMID: 40546319
- PMCID: PMC12179138
- DOI: 10.3389/fcell.2025.1608325
Abstract
Background: The retina plays a critical role in visual perception, and retinal lesions can cause severe, irreversible visual impairment. Early diagnosis and precise identification of these lesions are therefore essential for slowing disease progression. Optical coherence tomography (OCT) is a pivotal imaging modality in ophthalmology because of its exceptional performance. However, the inherent complexity of retinal structures and significant noise interference pose substantial challenges for both manual interpretation and AI-assisted diagnosis.
Methods: We propose MSLI-Net, a novel framework built on a ResNet50 backbone. A multi-scale dilation fusion module (MDF) enlarges the global receptive field to better capture long-range dependencies. A multi-segmented lesion localization module (LLM) is integrated into each branch of a modified feature pyramid network (FPN), where parallel branch refinement extracts critical features while suppressing background noise. Finally, a wavelet subband spatial attention module (WSSA) improves noise suppression by jointly processing, and exchanging information between, the low- and high-frequency subbands obtained through wavelet decomposition.
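As a rough illustration of two ideas named in the Methods, the sketch below computes the effective extent of a dilated convolution kernel and performs a one-level 2D wavelet decomposition into one low-frequency and three high-frequency subbands. It is not the authors' implementation: the abstract does not state which wavelet or which dilation rates MSLI-Net uses, so the Haar wavelet, the example dilation rates, and the function names `effective_kernel` and `haar_dwt2` are assumptions made purely for illustration.

```python
def effective_kernel(kernel_size, dilation):
    """Extent covered by a 1D kernel of `kernel_size` taps with the given dilation.

    Standard relation: k_eff = k + (k - 1) * (d - 1).
    """
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def haar_dwt2(img):
    """One-level orthonormal 2D Haar decomposition (assumed wavelet).

    `img` is a list of equal-length rows with even height and width.
    Returns four half-resolution subbands:
      low        - local average (low-frequency content)
      top_bottom - difference between the upper and lower pixel rows
      left_right - difference between the left and right pixel columns
      diagonal   - diagonal difference
    Naming of the three detail subbands (LH/HL/HH) varies between
    libraries, so descriptive names are used here instead.
    """
    rows, cols = len(img), len(img[0])
    if rows % 2 or cols % 2:
        raise ValueError("image height and width must be even")

    low, top_bottom, left_right, diagonal = [], [], [], []
    for r in range(0, rows, 2):
        low_row, tb_row, lr_row, dg_row = [], [], [], []
        for c in range(0, cols, 2):
            # The 2x2 block of pixels handled in this step.
            a, b = img[r][c], img[r][c + 1]
            e, f = img[r + 1][c], img[r + 1][c + 1]
            low_row.append((a + b + e + f) / 2)
            tb_row.append((a + b - e - f) / 2)
            lr_row.append((a - b + e - f) / 2)
            dg_row.append((a - b - e + f) / 2)
        low.append(low_row)
        top_bottom.append(tb_row)
        left_right.append(lr_row)
        diagonal.append(dg_row)
    return low, top_bottom, left_right, diagonal


if __name__ == "__main__":
    # Hypothetical dilation rates; the abstract does not list the real ones.
    for d in (1, 2, 4):
        print("3-tap kernel, dilation", d, "covers", effective_kernel(3, d))

    subbands = haar_dwt2([[1, 2], [3, 4]])
    print(subbands)
```

In a flat image region all three detail subbands are zero, whereas pixel-level noise shows up mainly in the detail subbands. This is the general reason a subband-wise attention scheme can treat low- and high-frequency content differently; how WSSA actually weights and exchanges information between subbands is described only in the full paper.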
Results: Experimental evaluation on the OCT-C8 dataset demonstrates that MSLI-Net achieves 96.72% accuracy in retinopathy classification, underscoring its strong discriminative performance and promising potential for clinical application.
Conclusion: This model provides new research ideas for the early diagnosis of retinal diseases and helps drive the development of future high-precision medical imaging-assisted diagnostic systems.
Keywords: lesion localization; multi-scale feature fusion; noise suppression; retinal disease detection; wavelet transform.
Copyright © 2025 Qi, Hong, Cheng, Long, Wang, Li and Cao.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.