Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 4;8(7):525.
doi: 10.3390/biomimetics8070525.

Optimizing Image Classification: Automated Deep Learning Architecture Crafting with Network and Learning Hyperparameter Tuning

Affiliations

Optimizing Image Classification: Automated Deep Learning Architecture Crafting with Network and Learning Hyperparameter Tuning

Koon Meng Ang et al. Biomimetics (Basel). .

Abstract

This study introduces ETLBOCBL-CNN, an automated approach for optimizing convolutional neural network (CNN) architectures to address classification tasks of varying complexities. ETLBOCBL-CNN employs an effective encoding scheme to optimize network and learning hyperparameters, enabling the discovery of innovative CNN structures. To enhance the search process, it incorporates a competency-based learning concept inspired by mixed-ability classrooms during the teacher phase. This categorizes learners into competency-based groups, guiding each learner's search process by utilizing the knowledge of the predominant peers, the teacher solution, and the population mean. This approach fosters diversity within the population and promotes the discovery of innovative network architectures. During the learner phase, ETLBOCBL-CNN integrates a stochastic peer interaction scheme that encourages collaborative learning among learners, enhancing the optimization of CNN architectures. To preserve valuable network information and promote long-term population quality improvement, ETLBOCBL-CNN introduces a tri-criterion selection scheme that considers fitness, diversity, and learners' improvement rates. The performance of ETLBOCBL-CNN is evaluated on nine different image datasets and compared to state-of-the-art methods. Notably, ELTLBOCBL-CNN achieves outstanding accuracies on various datasets, including MNIST (99.72%), MNIST-RD (96.67%), MNIST-RB (98.28%), MNIST-BI (97.22%), MNST-RD + BI (83.45%), Rectangles (99.99%), Rectangles-I (97.41%), Convex (98.35%), and MNIST-Fashion (93.70%). These results highlight the remarkable classification accuracy of ETLBOCBL-CNN, underscoring its potential for advancing smart device infrastructure development.

Keywords: automatic network design; deep learning architecture; hyperparameter optimization; image classification; teaching–learning-based optimization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The workflow of the original TLBO.
Figure 2
Figure 2
Typical architecture of sequential CNN.
Figure 3
Figure 3
Workflow of the ETLBOCBL-CNN framework.
Figure 4
Figure 4
Solution encoding scheme of ETLBOCBL-CNN to represent a potential CNN.
Figure 5
Figure 5
Decoding of network and learning hyperparameters for CNN construction.
Figure 6
Figure 6
Visual representation of calculating X¯.Mean in ETLBOCBL-CNN.
Figure 7
Figure 7
Visualization of the idea of competency-based learning introduced into the ETLOCBL-CNN’s modified teacher phase. Color dots refer to learners assigned to different groups.
Figure 8
Figure 8
Sample images of the datasets: (a) MNIST, (b) MNIST-RD, (c) MNIST-RB, (d) MNIST-BI, (e) MNIST-RD + BI, (f) Rectangles, (g) Rectangles-I, (h) Convex, and (i) Fashion.
Figure 8
Figure 8
Sample images of the datasets: (a) MNIST, (b) MNIST-RD, (c) MNIST-RB, (d) MNIST-BI, (e) MNIST-RD + BI, (f) Rectangles, (g) Rectangles-I, (h) Convex, and (i) Fashion.
Figure 9
Figure 9
Test errors obtained by ETLBOCBL-CNN while solving the eight datasets: (a) MNIST, (b) MNIST-RD, (c) MNIST-RB, (d) MNIST-BI, (e) MNIST-RD + BI, (f) Rectangles, (g) Rectangles-I, and (h) Convex.
Figure 9
Figure 9
Test errors obtained by ETLBOCBL-CNN while solving the eight datasets: (a) MNIST, (b) MNIST-RD, (c) MNIST-RB, (d) MNIST-BI, (e) MNIST-RD + BI, (f) Rectangles, (g) Rectangles-I, and (h) Convex.

References

    1. Ji S., Xu W., Yang M., Yu K. 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2012;35:221–231. doi: 10.1109/TPAMI.2012.59. - DOI - PubMed
    1. Wang P., Li Z., Hou Y., Li W. Action recognition based on joint trajectory maps using convolutional neural networks; Proceedings of the 24th ACM International Conference on Multimedia; Amsterdam, The Netherlands. 15–19 October 2016; pp. 102–106.
    1. Jayanthi J., Jayasankar T., Krishnaraj N., Prakash N., Sagai Francis Britto A., Vinoth Kumar K. An intelligent particle swarm optimization with convolutional neural network for diabetic retinopathy classification model. J. Med. Imaging Health Inform. 2021;11:803–809. doi: 10.1166/jmihi.2021.3362. - DOI
    1. Goel T., Murugan R., Mirjalili S., Chakrabartty D.K. OptCoNet: An optimized convolutional neural network for an automatic diagnosis of COVID-19. Appl. Intell. 2021;51:1351–1366. doi: 10.1007/s10489-020-01904-z. - DOI - PMC - PubMed
    1. Müller A., Karathanasopoulos N., Roth C.C., Mohr D. Machine learning classifiers for surface crack detection in fracture experiments. Int. J. Mech. Sci. 2021;209:106698. doi: 10.1016/j.ijmecsci.2021.106698. - DOI

LinkOut - more resources