A Review on Multiscale-Deep-Learning Applications
- PMID: 36236483
- PMCID: PMC9573412
- DOI: 10.3390/s22197384
A Review on Multiscale-Deep-Learning Applications
Abstract
In general, most of the existing convolutional neural network (CNN)-based deep-learning models suffer from spatial-information loss and inadequate feature-representation issues. This is due to their inability to capture multiscale-context information and the exclusion of semantic information throughout the pooling operations. In the early layers of a CNN, the network encodes simple semantic representations, such as edges and corners, while, in the latter part of the CNN, the network encodes more complex semantic features, such as complex geometric shapes. Theoretically, it is better for a CNN to extract features from different levels of semantic representation because tasks such as classification and segmentation work better when both simple and complex feature maps are utilized. Hence, it is also crucial to embed multiscale capability throughout the network so that the various scales of the features can be optimally captured to represent the intended task. Multiscale representation enables the network to fuse low-level and high-level features from a restricted receptive field to enhance the deep-model performance. The main novelty of this review is the comprehensive novel taxonomy of multiscale-deep-learning methods, which includes details of several architectures and their strengths that have been implemented in the existing works. Predominantly, multiscale approaches in deep-learning networks can be classed into two categories: multiscale feature learning and multiscale feature fusion. Multiscale feature learning refers to the method of deriving feature maps by examining kernels over several sizes to collect a larger range of relevant features and predict the input images' spatial mapping. Multiscale feature fusion uses features with different resolutions to find patterns over short and long distances, without a deep network. Additionally, several examples of the techniques are also discussed according to their applications in satellite imagery, medical imaging, agriculture, and industrial and manufacturing systems.
Keywords: artificial intelligence; convolutional neural network; deep learning; machine learning; multiscale features; neural network.
Conflict of interest statement
The authors declare no conflict of interest.
Figures








Similar articles
-
A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion.Sci Rep. 2024 Jul 30;14(1):17609. doi: 10.1038/s41598-024-68183-3. Sci Rep. 2024. PMID: 39080442 Free PMC article.
-
DeSPPNet: A Multiscale Deep Learning Model for Cardiac Segmentation.Diagnostics (Basel). 2024 Dec 14;14(24):2820. doi: 10.3390/diagnostics14242820. Diagnostics (Basel). 2024. PMID: 39767181 Free PMC article.
-
Automatic segmentation of spine x-ray images based on multiscale feature enhancement network.Med Phys. 2024 Oct;51(10):7282-7294. doi: 10.1002/mp.17278. Epub 2024 Jun 30. Med Phys. 2024. PMID: 38944886
-
A comprehensive review of methods based on deep learning for diabetes-related foot ulcers.Front Endocrinol (Lausanne). 2022 Aug 8;13:945020. doi: 10.3389/fendo.2022.945020. eCollection 2022. Front Endocrinol (Lausanne). 2022. PMID: 36004341 Free PMC article. Review.
-
Automatic feature extraction and fusion recognition of motor imagery EEG using multilevel multiscale CNN.Med Biol Eng Comput. 2021 Oct;59(10):2037-2050. doi: 10.1007/s11517-021-02396-w. Epub 2021 Aug 23. Med Biol Eng Comput. 2021. PMID: 34424453 Review.
Cited by
-
Preliminary data on artificial intelligence tool in magnetic resonance imaging assessment of degenerative pathologies of lumbar spine.Radiol Med. 2024 Apr;129(4):623-630. doi: 10.1007/s11547-024-01791-1. Epub 2024 Feb 13. Radiol Med. 2024. PMID: 38349415
-
Automated Glaucoma Screening and Diagnosis Based on Retinal Fundus Images Using Deep Learning Approaches: A Comprehensive Review.Diagnostics (Basel). 2023 Jun 26;13(13):2180. doi: 10.3390/diagnostics13132180. Diagnostics (Basel). 2023. PMID: 37443574 Free PMC article. Review.
-
Differentially localized protein identification for breast cancer based on deep learning in immunohistochemical images.Commun Biol. 2024 Aug 2;7(1):935. doi: 10.1038/s42003-024-06548-0. Commun Biol. 2024. PMID: 39095659 Free PMC article.
-
Evolution of deep learning tooth segmentation from CT/CBCT images: a systematic review and meta-analysis.BMC Oral Health. 2025 May 26;25(1):800. doi: 10.1186/s12903-025-05984-6. BMC Oral Health. 2025. PMID: 40420051 Free PMC article.
-
Empowering Vision Transformer by Network Hyper-Parameter Selection for Whole Pelvis Prostate Planning Target Volume Auto-Segmentation.Cancers (Basel). 2023 Nov 21;15(23):5507. doi: 10.3390/cancers15235507. Cancers (Basel). 2023. PMID: 38067211 Free PMC article.
References
-
- Gao K., Niu S., Ji Z., Wu M., Chen Q., Xu R., Yuan S., Fan W., Chen Y., Dong J. Double-Branched and Area-Constraint Fully Convolutional Networks for Automated Serous Retinal Detachment Segmentation in SD-OCT Images. Comput. Methods Programs Biomed. 2019;176:69–80. doi: 10.1016/j.cmpb.2019.04.027. - DOI - PubMed
-
- Sermanet P., Lecun Y. Traffic Sign Recognition with Multi-Scale Convolutional Networks; Proceedings of the International Joint Conference on Neural Networks; San Jose, CA, USA. 31 July–5 August 2011; pp. 2809–2813. - DOI
-
- Buyssens P., Elmoataz A., Lézoray O. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 7725 LNCS. Springer; Berlin/Heidelberg, Germany: 2013. Multiscale Convolutional Neural Networks for Vision–Based Classification of Cells; pp. 342–352. - DOI
-
- Zamri N.F.M., Tahir N.M., Ali M.S.A.M., Ashar N.D.K., Al-misreb A.A. Mini-Review of Street Crime Prediction and Classification Methods. J. Kejuruter. 2021;33:391.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources