Multi-Agent Deep Reinforcement Learning for Integrated Demand Forecasting and Inventory Optimization in Sensor-Enabled Retail Supply Chains
- PMID: 40285118
- PMCID: PMC12031219
- DOI: 10.3390/s25082428
Multi-Agent Deep Reinforcement Learning for Integrated Demand Forecasting and Inventory Optimization in Sensor-Enabled Retail Supply Chains
Abstract
The retail industry faces increasing challenges in matching supply with demand due to evolving consumer behaviors, market volatility, and supply chain disruptions. While existing approaches employ statistical and machine learning methods for demand forecasting, they often fail to capture complex temporal dependencies and lack the ability to simultaneously optimize inventory decisions. This paper proposes a novel multi-agent deep reinforcement learning framework that jointly optimizes demand forecasting and inventory management in retail supply chains, leveraging data from IoT sensors, RFID tracking systems, and smart shelf monitoring devices. Our approach combines transformer-based sequence modeling for demand patterns with hierarchical reinforcement learning agents that coordinate inventory decisions across distribution networks. The framework integrates both historical sales data and real-time sensor measurements, employing attention mechanisms to capture seasonal patterns, promotional effects, and environmental conditions detected through temperature and humidity sensors. Through extensive experiments on large-scale retail datasets incorporating sensor network data, we demonstrate that our method achieves 18.2% lower forecast error and 23.5% reduced stockout rates compared with state-of-the-art baselines. The results show particular improvements in handling promotional events and seasonal transitions, where traditional methods often struggle. Our work provides new insights into leveraging deep reinforcement learning for integrated retail operations optimization and offers a scalable solution for modern sensor-enabled supply chain challenges.
Keywords: demand forecasting; inventory optimization; multi-agent reinforcement learning; supply chain management.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures



References
-
- Chui M., Issler M., Roberts R., Yee L. Technology Trends Outlook 2023. McKinsey & Company; New York, NY, USA: 2023.
-
- Amed I., Balchandani A., Berg A., Hedrich S., Jensen J.E., Le Merle L., Rölkens F. State of Fashion 2022: An Uneven Recovery and New Frontiers. Mckinsey & Company; New York, NY, USA: 2021.
-
- Trapero J.R., Kourentzes N., Fildes R. On the identification of sales forecasting models in the presence of promotions. J. Oper. Res. Soc. 2015;66:299–307. doi: 10.1057/jors.2013.174. - DOI
-
- Ma S., Fildes R., Huang T. Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra-and inter-category promotional information. Eur. J. Oper. Res. 2016;249:245–257. doi: 10.1016/j.ejor.2015.08.029. - DOI
LinkOut - more resources
Full Text Sources