Automated machine learning for fabric quality prediction: a comparative analysis
- PMID: 39145237
- PMCID: PMC11323016
- DOI: 10.7717/peerj-cs.2188
Abstract
Fabric quality prediction in textile manufacturing can be improved by using data from Internet of Things (IoT) sensors embedded in textile machinery and from the Enterprise Resource Planning (ERP) systems linked to them. Integrating Industry 4.0 concepts makes it possible to harness this IoT sensor data, which in turn improves productivity and reduces lead times in textile manufacturing processes. This study addresses the problem of imbalanced fabric quality data in the textile manufacturing industry. It evaluates seven open-source automated machine learning (AutoML) technologies: FLAML (Fast Lightweight AutoML), AutoViML (Automatically Build Variant Interpretable ML models), EvalML (Evaluation Machine Learning), AutoGluon, H2OAutoML, PyCaret, and TPOT (Tree-based Pipeline Optimization Tool). The most suitable solution for a given scenario is selected with a novel approach that trades off computational efficiency against predictive accuracy. The results show that EvalML is the top-performing AutoML model for a predetermined objective function, excelling in particular on mean absolute error (MAE). AutoGluon, despite its longer inference times, outperforms the other methods on measures such as mean absolute percentage error (MAPE), root mean squared error (RMSE), and R-squared. The study also examines the feature importance rankings produced by each AutoML model, shedding light on the attributes that most strongly influence the predictions. Notably, sin/cos encoding is found to be particularly effective for representing categorical variables with a large number of unique values. The study offers practical guidance on applying AutoML in the textile industry and provides a roadmap for employing Industry 4.0 technologies to enhance fabric quality prediction. It highlights the importance of balancing predictive accuracy against computational efficiency, emphasizes the role of feature importance in model interpretability, and lays the groundwork for future investigations in this field.
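The sin/cos encoding mentioned above can be illustrated with a minimal sketch. It assumes each category is given an integer index and mapped to an angle on the unit circle. The abstract does not say how indices are assigned, so the sorted-order indexing here is an assumption, and the function name sin_cos_encode and the machine identifiers are hypothetical.

```python
import math


def sin_cos_encode(values):
    """Map each category to a (sin, cos) point on the unit circle.

    Assumption: categories are indexed by sorted order. The index
    assignment used in the study is not given in the abstract.
    """
    categories = sorted(set(values))
    n = len(categories)
    index = {cat: i for i, cat in enumerate(categories)}
    encoded = []
    for v in values:
        # Spread the n categories evenly around the circle.
        angle = 2.0 * math.pi * index[v] / n
        encoded.append((math.sin(angle), math.cos(angle)))
    return encoded


# Hypothetical machine identifiers, for illustration only.
machines = ["M03", "M01", "M02", "M04", "M01"]
features = sin_cos_encode(machines)
```

Under this scheme a variable with many unique values always yields two numeric columns, whereas one-hot encoding would yield one column per unique value.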
Keywords: AutoML; Feature importance; Hyperparameter optimization; Imbalanced data; Model interpretability; Quality control; Textile industry.
©2024 Metin and Bilgin.
Conflict of interest statement
The authors declare there are no competing interests.