Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
- PMID: 34945353
- PMCID: PMC8704841
- DOI: 10.3390/mi12121504
Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
Abstract
Deep neural networks are widely used in the field of image processing for micromachines, such as in 3D shape detection in microelectronic high-speed dispensing and object detection in microrobots. It is already known that hyperparameters and their interactions impact neural network model performance. Taking advantage of the mathematical correlations between hyperparameters and the corresponding deep learning model to adjust hyperparameters intelligently is the key to obtaining an optimal solution from a deep neural network model. Leveraging these correlations is also significant for unlocking the "black box" of deep learning by revealing the mechanism of its mathematical principle. However, there is no complete system for studying the combination of mathematical derivation and experimental verification methods to quantify the impacts of hyperparameters on the performances of deep learning models. Therefore, in this paper, the authors analyzed the mathematical relationships among four hyperparameters: the learning rate, batch size, dropout rate, and convolution kernel size. A generalized multiparameter mathematical correlation model was also established, which showed that the interaction between these hyperparameters played an important role in the neural network's performance. Different experiments were verified by running convolutional neural network algorithms to validate the proposal on the MNIST dataset. Notably, this research can help establish a universal multiparameter mathematical correlation model to guide the deep learning parameter adjustment process.
Keywords: deep neural network; hyperparameters; image processing; multiparameter mathematical correlation model.
Conflict of interest statement
The authors declare that they have no conflict of interest.
Figures











References
-
- Hinton G.E. Neural Networks: Tricks of the Trade. Volume 7700. Springer; Berlin/Heidelberg, Germany: 2012. A Practical Guide to Training Restricted Boltzmann Machines; pp. 599–619.
-
- Larochelle H., Erhan D., Courville A., Bergstra J., Bengio Y. An Empirical Evaluation of Deep Architectures on Problems with Many Factors of Variation. ACM Int. Conf. Proc. Ser. 2007;227:473–480.
Grants and funding
- 2020YFB1713300/the National Key Research and Development Program of China
- 2018AAA0101803/the National Key Technologies Research and Development Program of China
- Qianjiaohe KY [2020] 005/the Guizhou Provincial Colleges and Universities Integrated Tackling Platform Project, UAV Test Flight Inte-grated Public Relations Platform
- Qiankehe Major Special Project [2019]3003/the Guizhou Province Major Science and Technology Special Project, Research and Application of Key Technologies for Intelligent Manufacturing of Lithium-ion Battery Cathode Materials
LinkOut - more resources
Full Text Sources