Free Access

A Deep Q-Network for the Beer Game: Deep Reinforcement Learning for Inventory Optimization

Afshin Oroojlooyjadid
Corresponding Author
Afshin Oroojlooyjadid
[email protected]
https://orcid.org/0000-0001-7829-6145
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author
,
MohammadReza Nazari
MohammadReza Nazari
[email protected]
https://orcid.org/0000-0002-7575-6289
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author
,
Lawrence V. Snyder
Lawrence V. Snyder
[email protected]
https://orcid.org/0000-0002-2227-7030
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author
,
Martin Takáč
Martin Takáč
[email protected]
https://orcid.org/0000-0001-7455-2025
Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015
Search for more papers by this author

Afshin Oroojlooyjadid

Corresponding Author

Afshin Oroojlooyjadid

[email protected]

https://orcid.org/0000-0001-7829-6145

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

MohammadReza Nazari

[email protected]

https://orcid.org/0000-0002-7575-6289

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

Lawrence V. Snyder

[email protected]

https://orcid.org/0000-0002-2227-7030

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

Martin Takáč

[email protected]

https://orcid.org/0000-0001-7455-2025

Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015

Search for more papers by this author

Published Online:23 Feb 2021https://doi.org/10.1287/msom.2020.0939

Abstract

Problem definition: The beer game is widely used in supply chain management classes to demonstrate the bullwhip effect and the importance of supply chain coordination. The game is a decentralized, multiagent, cooperative problem that can be modeled as a serial supply chain network in which agents choose order quantities while cooperatively attempting to minimize the network’s total cost, although each agent only observes local information. Academic/practical relevance: Under some conditions, a base-stock replenishment policy is optimal. However, in a decentralized supply chain in which some agents act irrationally, there is no known optimal policy for an agent wishing to act optimally. Methodology: We propose a deep reinforcement learning (RL) algorithm to play the beer game. Our algorithm makes no assumptions about costs or other settings. As with any deep RL algorithm, training is computationally intensive, but once trained, the algorithm executes in real time. We propose a transfer-learning approach so that training performed for one agent can be adapted quickly for other agents and settings. Results: When playing with teammates who follow a base-stock policy, our algorithm obtains near-optimal order quantities. More important, it performs significantly better than a base-stock policy when other agents use a more realistic model of human ordering behavior. We observe similar results using a real-world data set. Sensitivity analysis shows that a trained model is robust to changes in the cost coefficients. Finally, applying transfer learning reduces the training time by one order of magnitude. Managerial implications: This paper shows how artificial intelligence can be applied to inventory optimization. Our approach can be extended to other supply chain optimization problems, especially those in which supply chain partners act in irrational or unpredictable ways. Our RL agent has been integrated into a new online beer game, which has been played more than 17,000 times by more than 4,000 people.

This article appears in INFORMS Analytics Collections Vol. 16: Advances in Integrating AI & O.R.

Visit this collection for free access to more articles showcasing the depth and breadth of research and applications at the intersection of AI and operations research.

Cited by
- The optimization of vocal music teaching by integrating the STEAM concept with the intelligent recommendation system
  1 December 2025 | Scientific Reports, Vol. 16, No. 1
- Industrializing deep reinforcement learning for operational spare parts inventory management
  European Journal of Operational Research, Vol. 334, No. 1
- Reinforcement learning lens: Unlocking the power of information sharing to boost project-level supply chain resilience
  International Journal of Production Economics, Vol. 300
- Deep Q-Network-Based Optimization of Piano Playing Skills and Psychological Feedback
  13 March 2026 | Journal of Circuits, Systems and Computers, Vol. 35, No. 15
- Multi-agent deep reinforcement learning for ordering and inventory allocation in a decentralized two-echelon dual-channel supply chain
  International Journal of Production Economics, Vol. 299
- Designing flexible service strategies for urban drone Delivery: A hybrid Simulation-Optimization framework
  Transportation Research Part E: Logistics and Transportation Review, Vol. 213
- Zero-shot generalization in inventory management: Train, then Estimate and Decide
  European Journal of Operational Research, Vol. 333, No. 1
- Does artificial intelligence enhance the circular bioeconomy's decarbonization potential? A quasi-experimental evidence
  International Journal of Production Economics, Vol. 298
- A group-based mean-field deep reinforcement learning for inventory control with supplier selection in decentralized supply chain networks
  Applied Soft Computing, Vol. 199
- Aged products spillover effect and the value of holding inventory under stochastic demand: the case of Port wine
  International Journal of Production Economics, Vol. 297
- A deep reinforcement learning framework for multiperiod facility location in decentralized pharmaceutical supply chain
  18 March 2026 | AIChE Journal, Vol. 72, No. 7
- Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes
  Di Wang,
  Yao Wang,
  Shao-Bo Lin
  2 June 2026 | INFORMS Journal on Computing, Vol. 0, No. 0
- Multi-stage network structure fine-tuning for large-scale group consensus collaboration using multi-gradient trust region policy optimization
  14 May 2026 | Annals of Operations Research, Vol. 74
- Transfer Learning, Cross Learning and Co-Learning with Operational Data Analytics (ODA)
  Qi Feng,
  Lei Li,
  J. George Shanthikumar
  7 May 2026 | Management Science, Vol. 0, No. 0
- An assessment of randomized inventory policy in supply chains under multimodal demand distribution
  12 February 2026 | Journal of Modelling in Management, Vol. 21, No. 4
- The Value of Blending—Managing Ameliorating Inventory Using Deep Reinforcement Learning
  22 December 2025 | Production and Operations Management, Vol. 35, No. 5
- AI and Machine Learning Driven Supply Chain and Logistics Optimization in the US Automotive Industry
- Inventory Management with Transformer: Automated Decision Making for Order Timing and Quantity
  Mo Liu,
  Yumo Bai,
  Meng Qi,
  Zuo-Jun (Max) Shen
  7 April 2026 | Service Science, Vol. 0, No. 0
- Agentic automation driven reinforcement learning for inventory optimization
  6 April 2026 | Journal of Modelling in Management, Vol. 199
- A Data-Driven Multi-Objective optimization framework for dynamic job shop scheduling with order Acceptance, inventory and Energy-Aware decisions
  Computers & Industrial Engineering, Vol. 214
- A hierarchical online planning framework for anticipatory yard allocation optimization in dry bulk terminals
  30 March 2026 | IISE Transactions
- When AI bites back: the hidden consequences of generative AI in food delivery
  3 March 2026 | Journal of the Operational Research Society, Vol. 104
- Adaptive inventory strategies using deep reinforcement learning for dynamic agri-food supply chains
  11 February 2026 | OPSEARCH, Vol. 8
- A Primal-Dual Approach to Constrained Markov Decision Processes with Applications to Queue Scheduling and Inventory Management
  Yi Chen,
  Jing Dong,
  Zhaoran Wang,
  Chuheng Zhang
  20 May 2025 | Management Science, Vol. 72, No. 2
- Heuristics and deep reinforcement learning for the inventory problem with an all-or-nothing yield pattern and non-zero leadtimes
  Computers & Operations Research, Vol. 186
- Deep Reinforcement Learning for Online Assortment Customization: A Data-Driven Approach
  30 June 2025 | Production and Operations Management, Vol. 35, No. 2
- An Explainable Artificial Intelligence Algorithm for Optimal Decision Making from a Business Analytics Perspective
  2 February 2026 | Mathematics, Vol. 14, No. 3
- Trustworthy Agentic Supply Chains: A Governance Framework for Digital Twin Orchestrated AI Decisioning Under Compliance, Auditability, and Data Sovereignty Constraints
  23 January 2026 | International Journal of Latest Technology in Engineering Management & Applied Science, Vol. 15, No. 1
- Transportation Science and Logistics Society Best Dissertation Award Competition: Abstracts of 2025 Winners
  5 January 2026 | Transportation Science, Vol. 60, No. 1
- The Impact of AI on Sustainable Procurement: Revolutionizing Aviation Maintenance Operations
  25 February 2026
- AI in Inventory Management: The Disruptive Era of DRL and Beyond
  25 September 2025
- MARL with Automated Negotiation for Reduction of Bullwhip Effect in Serial Supply Chain
  24 November 2025
- Integrating simulation and reinforcement learning for optimized working capital management in supply chains
  Procedia Computer Science, Vol. 277
- A Deep Reinforcement Learning Framework for Dynamic Inventory Control and Cost Optimization Under Demand Uncertainty
  IEEE Access, Vol. 14
- An Integrated Deep Learning and Multi-Objective Pareto Optimization Framework for Retail Supply Chains
  IEEE Access, Vol. 14
- Optimizing dynamic cellular manufacturing system: a deep reinforcement learning approach to profit maximization and inventory management
  26 November 2025 | International Journal of Systems Science: Operations & Logistics, Vol. 12, No. 1
- Actor-critic driven deep reinforcement learning for optimising agri-food supply chain
  9 July 2025 | International Journal of Production Research, Vol. 63, No. 24
- Research on Dynamic Supply Chain Network Optimization Modeling and Decision-Making Based on Reinforcement Learning
- A multimodal deep reinforcement learning framework for multi-period inventory decision-making under demand uncertainty
  25 September 2025 | Fuzzy Optimization and Decision Making, Vol. 24, No. 4
- A systematic review of machine learning approaches in inventory control optimization
  Operations Research Perspectives, Vol. 15
- Reinforcement learning in risk management for pharmaceutical construction projects: Frontiers, challenges, and improvement strategies
  Sustainable Futures, Vol. 10
- The analysis of deep reinforcement learning for dynamic graphical games under artificial intelligence
  2 July 2025 | Scientific Reports, Vol. 15, No. 1
- Hierarchical deep Q-network-based optimization of resilient grids under multi-dimensional uncertainties from extreme weather
  10 July 2025 | Scientific Reports, Vol. 15, No. 1
- Multi-objective optimization of gamified demand response for PV-integrated microgrids: a novel NSGA-III framework with behavioral adaptation modeling
  30 September 2025 | Scientific Reports, Vol. 15, No. 1
- Localized Multi-Agent Reinforcement Learning for Cooperative Management of Supply Chains
  28 August 2025 | Asia-Pacific Journal of Operational Research, Vol. 42, No. 06
- Unveiling the path to sustainable carbon reduction: a comparative analysis of bank-led vs. firm-led carbon finance strategies
  29 October 2025 | Humanities and Social Sciences Communications, Vol. 12, No. 1
- Deep Neural Newsvendor
  Jinhui Han,
  Ming Hu,
  Guohao Shen
  3 November 2025 | Management Science, Vol. 0, No. 0
- Contextual Data-Integrated Newsvendor Solution with Operational Data Analytics (ODA)
  Qi Feng,
  J. George Shanthikumar,
  Jian Wu
  17 March 2025 | Management Science, Vol. 71, No. 11
- Collaborating in a competitive world: Heterogeneous multi-agent decision making in symbiotic supply chain environments
  Computers & Industrial Engineering, Vol. 209
- A multi-agent deep reinforcement learning approach for multi-echelon inventory optimization and its application to the beer game
  Transportation Research Part E: Logistics and Transportation Review, Vol. 203
- Supplier Selection and Material Sourcing With Multiuncertainties in Cloud Manufacturing Using Reinforcement Learning
  IEEE Transactions on Systems, Man, and Cybernetics: Systems, Vol. 55, No. 11
- Newsvendor Problems With Product Unbundling: An Approach Combining Robust Optimization With Deep Reinforcement Learning
  20 June 2025 | Production and Operations Management, Vol. 34, No. 11
- A simulation-driven machine learning framework for large-scale inventory management
  6 October 2025 | Annals of Operations Research, Vol. 56
- A Value Decomposition Multi-Agent Reinforcement Learning Framework for Multi-Echelon Inventory Management in Supply Chain Network
- Deep reinforcement learning for dynamic order picking in warehouse operations
  Computers & Operations Research, Vol. 182
- On order smoothing interpolating the order-up-to and constant order policies
  Omega, Vol. 136
- Sticky creativity: Using digital sticky notes in teaching supply chain in a synchronous class
  28 August 2025 | Decision Sciences Journal of Innovative Education, Vol. 23, No. 4
- The beer game bullwhip effect mitigation: a deep reinforcement learning approach
  24 March 2025 | International Journal of Production Research, Vol. 63, No. 18
- JD.com Improves Fulfillment Efficiency with Data-Driven Integrated Assortment Planning and Inventory Allocation
  Zuo-Jun Max Shen,
  Shuo Sun,
  Yongzhi Qi,
  Hao Hu,
  Ningxuan Kang,
  Jianshen Zhang,
  Xin Wang,
  Xiaoming Lin
  26 September 2025 | INFORMS Journal on Applied Analytics, Vol. 55, No. 5
- A dynamic and intelligent decision-making framework for a platelet inventory-distribution network
  14 June 2025 | Operational Research, Vol. 25, No. 3
- Prediction of Losses in an Agave Liquor Production and Packaging System Using a Neural Network and Fuzzy Logic
  5 September 2025 | Processes, Vol. 13, No. 9
- Research on Power System Fault Diagnosis Method Based on Intelligent Algorithm
- A Unified Framework for Multi-Stage Decision Optimization with Deep Reinforcement Learning and Foundation Models
- Multi-products production control and human-autonomous truck distribution planning with the use of collaborative multi-agent reinforcement learning
- Personalized online shopping recommendation algorithm and practice based on deep Q-network
  3 August 2025 | Journal of Computational Methods in Sciences and Engineering, Vol. 42
- An analysis on the role of artificial intelligence in green supply chains
  Technological Forecasting and Social Change, Vol. 217
- Dynamic Tariff Adjustment for Electric Vehicle Charging in Renewable-Rich Smart Grids: A Multi-Factor Optimization Approach to Load Balancing and Cost Efficiency
  12 August 2025 | Energies, Vol. 18, No. 16
- Cascading Multi-Agent Policy Optimization for Demand Forecasting
  31 July 2025
- Optimization method of electric vehicle energy system based on machine learning
  29 July 2025 | Frontiers in Mechanical Engineering, Vol. 11
- Control scheme of green building renewable energy system based on reinforcement learning
  27 July 2025 | Journal of Computational Methods in Sciences and Engineering, Vol. 8
- Deep Stacking Kernel Machines for the Data-Driven Multi-Item, One-Warehouse, Multiretailer Problems with Backlog and Lost Sales
  Zhen-Yu Chen,
  Minghe Sun
  6 September 2024 | INFORMS Journal on Computing, Vol. 37, No. 4
- Optimal outbound shipment policy for an inventory system with advance demand information
  European Journal of Operational Research, Vol. 324, No. 1
- Deep Controlled Learning for Inventory Control
  European Journal of Operational Research, Vol. 324, No. 1
- Collaborative production control and distributor selection via multi-agent reinforcement learning with differentiable communication
  Expert Systems with Applications, Vol. 282
- IoT-driven dynamic replenishment of fresh produce in the presence of seasonal variations: A deep reinforcement learning approach using reward shaping
  Omega, Vol. 134
- Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management
  29 December 2024 | Production and Operations Management, Vol. 34, No. 7
- An empirically grounded analytical approach to hog farm finishing stage management: Deep reinforcement learning as decision support and managerial learning tool
  18 December 2024 | Journal of Operations Management, Vol. 71, No. 4
- Integrated energy-water systems for community-level flexibility: A hybrid deep Q-network and multi-objective optimization framework
  Energy Reports, Vol. 13
- Intelligent inventory management: AI-driven solution for the pharmaceutical supply chain
  Societal Impacts, Vol. 5
- Can Kolmogorov-Arnold Network (KAN) Replace Multi-layer Perception (MLP) in Reinforcement Learning for Stochastic Inventory Control?
  25 November 2025
- تبني الذكاء الاصطناعي لإدارة سلسلة التجهيز باستخدام نموذج القبول التكنولوجي TAM: دراسة مسحية في شركة توزيع المنتجات النفطية/ فرع نينوى
  31 December 2024 | Tikrit Journal of Administrative and Economic Sciences, Vol. 20, No. 68, part 2
- Integrating Deep Q-Networks with Rail Transit Systems for Smarter Urban Mobility
  International Journal of Knowledge Management, Vol. 21, No. 1
- Dynamic replenishment policy for perishable goods using change point detection-based soft actor-critic reinforcement learning
  Expert Systems with Applications, Vol. 270
- Logistics Demand Forecasting and Scheduling Optimization Using K-Means Clustering and Deep Q-Network
- Optimization Control of HVAC System and Building Energy Management Based on Machine Learning
- Optimisation of recovery policies in the era of supply chain disruptions: a system dynamics and reinforcement learning approach
  6 August 2024 | International Journal of Production Research, Vol. 63, No. 5
- Deep Policy Iteration with Integer Programming for Inventory Management
  Pavithra Harsha; ,
  Ashish Jagmohan; ,
  Jayant Kalagnanam; ,
  Brian Quanz; ,
  Divya Singhvi
  6 January 2025 | Manufacturing & Service Operations Management, Vol. 27, No. 2
- An intelligent open trading system for on-demand delivery facilitated by deep Q network based reinforcement learning
  14 June 2024 | International Journal of Production Research, Vol. 63, No. 3
- Aircraft Simulation in Game Development Using Reinforcement Learning
- Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains
  1 January 2025
- Distributed computing in multi-agent systems: a survey of decentralized machine learning approaches
  19 November 2024 | Computing, Vol. 107, No. 1
- A comprehensive deep reinforcement learning concept model for omnichannel fulfillment
  IFAC-PapersOnLine, Vol. 59, No. 10
- Application of Reinforcement Learning to Improve Finished Goods Inventory Management: A Systematic Review
  IFAC-PapersOnLine, Vol. 59, No. 10
- Towards Scalable Three-Dimensional Loading Capacitated Vehicle Routing
  IFAC-PapersOnLine, Vol. 59, No. 10
- Logistics and Service Operations Under Disruptions: Recent Development Under the DT Taxonomy
  IEEE Transactions on Engineering Management, Vol. 72
- Much Ado About Nothing? An EEG Study of the Beer Game for Neurophysiological Insights on Supply Chain Decision-Making
  IEEE Transactions on Engineering Management, Vol. 72
- Smart home management based on deep learning: Optimizing device prediction and user interface interaction
  Computer Science and Information Systems, Vol. 22, No. 3
- Risk-averse supply chain management via robust reinforcement learning
  Computers & Chemical Engineering, Vol. 192
- Enhancement of Vendor-Managed Inventory Planning Through Deep Reinforcement Learning
- Carbon trading supply chain management based on constrained deep reinforcement learning
  6 August 2024 | Autonomous Agents and Multi-Agent Systems, Vol. 38, No. 2
- RETRACTED ARTICLE: Research on supply chain efficiency optimization algorithm based on reinforcement learning
  6 November 2024 | Advances in Continuous and Discrete Models, Vol. 2024, No. 1
- Optimization of the Stand Structure in Secondary Forests of Pinus yunnanensis Based on Deep Reinforcement Learning
  11 December 2024 | Forests, Vol. 15, No. 12
- Leveraging Multi-Agent Reinforcement Learning for Digital Transformation in Supply Chain Inventory Optimization
  16 November 2024 | Sustainability, Vol. 16, No. 22
- Learning-Based Optimisation for Integrated Problems in Intermodal Freight Transport: Preliminaries, Strategies, and State of the Art
  25 September 2024 | Applied Sciences, Vol. 14, No. 19
- Deep reinforcement learning‐based ordering mechanism for performance optimization in multi‐echelon supply chains
  18 October 2022 | Applied Stochastic Models in Business and Industry, Vol. 40, No. 5
- Multi-echelon inventory optimization using deep reinforcement learning
  19 July 2023 | Central European Journal of Operations Research, Vol. 32, No. 3
- A location-production-routing problem for distributed manufacturing platforms: A neural genetic algorithm solution methodology
  International Journal of Production Economics, Vol. 275
- Generative artificial intelligence in supply chain and operations management: a capability-based framework for analysis and implementation
  31 January 2024 | International Journal of Production Research, Vol. 62, No. 17
- Performance of deep reinforcement learning algorithms in two-echelon inventory control systems
  1 March 2024 | International Journal of Production Research, Vol. 62, No. 17
- Reinforcement learning for the supply chain dynamics problem with production capacity constraints and non-stationary demand
- Real-time application of grey system theory in intelligent traffic signal optimization
  Journal of Computational Methods in Sciences and Engineering, Vol. 24, No. 4-5
- Neuroevolution reinforcement learning for multi-echelon inventory optimization with delivery options and uncertain discount
  Engineering Applications of Artificial Intelligence, Vol. 134
- A hybrid deep reinforcement learning approach for a proactive transshipment of fresh food in the online–offline channel system
  Transportation Research Part E: Logistics and Transportation Review, Vol. 187
- Data-driven dynamic pricing and inventory management of an omni-channel retailer in an uncertain demand environment
  Expert Systems with Applications, Vol. 244
- User Behavior Prediction and Interface Personalization Design Combined with Deep Q-Network
  3 August 2024
- English Grammar Error Detection and Intelligent Assisted Correction Using Autoencoders
  3 August 2024
- Single-Product Assemble-to-Order Systems with Exogenous Lead Times
  Alp Muharremoglu,
  Nan Yang,
  Xin Geng
  21 February 2024 | Operations Research, Vol. 72, No. 3
- Deep reinforcement learning for demand fulfillment in online retail
  International Journal of Production Economics, Vol. 269
- Research on Performance Evaluation and Improvement of Deep Reinforcement Learning in Traffic Scene Target Detection
- Inhibitory influence of supply chain digital transformation on bullwhip effect feedback difference
  19 October 2023 | Business Process Management Journal, Vol. 30, No. 1
- Sequential auction for cloud manufacturing resource trading: A deep reinforcement learning approach to the lot-sizing problem
  Computers & Industrial Engineering, Vol. 188
- Cluster-based lateral transshipments for the Zambian health supply chain
  European Journal of Operational Research, Vol. 313, No. 1
- Online reinforcement learning-based inventory control for intelligent E-Fulfilment dealing with nonstationary demand
  26 November 2023 | Enterprise Information Systems, Vol. 18, No. 2
- Optimization of multi-echelon spare parts inventory systems using multi-agent deep reinforcement learning
  Applied Mathematical Modelling, Vol. 125
- IACPPO: A deep reinforcement learning-based model for warehouse inventory replenishment
  Computers & Industrial Engineering, Vol. 187
- Deep Reinforcement Learning for One-Warehouse Multi-Retailer inventory management
  International Journal of Production Economics, Vol. 267
- Enterprise Security Patch Management with Deep Reinforcement Learning
  1 January 2024 | SSRN Electronic Journal, Vol. 2
- Inventory Planning in Capacitated High-Tech Assembly Systems Under Non-Stationary Demand
  1 January 2024 | SSRN Electronic Journal, Vol. 41
- On the use of machine learning in supply chain management: a systematic review
  30 October 2024 | IMA Journal of Management Mathematics, Vol. 36, No. 1
- A Deep Q-Network Based on Radial Basis Functions for Multi-Echelon Inventory Management
- Joint pricing and inventory control with reference price effects and price thresholds: A deep reinforcement learning approach
  Expert Systems with Applications, Vol. 233
- Adaptive inventory replenishment using structured reinforcement learning by exploiting a policy structure
  International Journal of Production Economics, Vol. 266
- A Machine Learning Approach for Energy-Efficient Intelligent Transportation Scheduling Problem in a Real-World Dynamic Circumstances
  IEEE Transactions on Intelligent Transportation Systems, Vol. 24, No. 12
- Navigational guidance – A deep learning approach
  European Journal of Operational Research, Vol. 310, No. 3
- A Hierarchical Resource Scheduling Method for Satellite Control System Based on Deep Reinforcement Learning
  22 September 2023 | Electronics, Vol. 12, No. 19
- AI vs. Human Buyers: A Study of Alibaba’s Inventory Replenishment System
  Jiaxi Liu,
  Shuyi Lin,
  Linwei Xin,
  Yidong Zhang
  26 September 2023 | INFORMS Journal on Applied Analytics, Vol. 53, No. 5
- Deep learning-based model predictive control for real-time supply chain optimization
  Journal of Process Control, Vol. 129
- A new key performance indicator model for demand forecasting in inventory management considering supply chain reliability and seasonality
  Supply Chain Analytics, Vol. 3
- Reinforcement learning based approach for the optimization of mechanical properties of additively manufactured specimens
  9 March 2023 | International Journal on Interactive Design and Manufacturing (IJIDeM), Vol. 17, No. 4
- Forecasting and Inventory Planning: An Empirical Investigation of Classical and Machine Learning Approaches for Svanehøj’s Future Software Consolidation
  25 July 2023 | Applied Sciences, Vol. 13, No. 15
- The effects of digital innovations and sustainable supply chain management on business competitive performance post-COVID-19
  18 April 2023 | Kybernetes, Vol. 52, No. 7
- Reinforcement Learning and Automatic Control for Resilience of Maritime Container Ports
- A multipath routing algorithm for satellite networks based on service demand and traffic awareness
  5 July 2023 | Frontiers of Information Technology & Electronic Engineering, Vol. 24, No. 6
- Modeling and Analysis of Production Systems Operated in a Human-In-The-Loop Fashion
  Tetsu-to-Hagane, Vol. 109, No. 6
- Data-driven supply chain monitoring using canonical variate analysis
  Computers & Chemical Engineering, Vol. 174
- Cooperative Learning for Smart Charging of Shared Autonomous Vehicle Fleets
  Ramin Ahadi,
  Wolfgang Ketter,
  John Collins,
  Nicolò Daina
  14 December 2022 | Transportation Science, Vol. 57, No. 3
- Deep reinforcement learning with combinatorial actions spaces: An application to prescriptive maintenance
  Computers & Industrial Engineering, Vol. 179
- Proximal policy optimization algorithm for dynamic pricing with online reviews
  Expert Systems with Applications, Vol. 213
- Ovarian cysts classification using novel deep reinforcement learning with Harris Hawks Optimization method
  25 July 2022 | The Journal of Supercomputing, Vol. 79, No. 2
- Large-Scale Inventory Optimization: A Recurrent Neural Networks–Inspired Simulation Approach
  Tan Wang,
  L. Jeff Hong
  6 December 2022 | INFORMS Journal on Computing, Vol. 35, No. 1
- Industrial Revolutions and Supply Network 5.0
  31 January 2023
- Gym-DC: A Distribution Centre Reinforcement Learning Environment
  2 August 2023
- Cooperative Multi-agent Reinforcement Learning for Inventory Management
  17 September 2023
- Automated Generation of Ensemble Pipelines using Policy-Based Reinforcement Learning method
  Procedia Computer Science, Vol. 229
- Reinforcement Learning Approach to Stochastic Vehicle Routing Problem With Correlated Demands
  IEEE Access, Vol. 11
- Deep Reinforcement Learning for Online Assortment Customization: A Data-Driven Approach
  1 January 2023 | SSRN Electronic Journal, Vol. 4
- When Less Is More? Deep Reinforcement Learning-Based Optimization of Debt Collection
  1 January 2023 | SSRN Electronic Journal, Vol. 68
- Deep Reinforcement Learning for Large-Scale Inventory Management
  1 January 2023 | SSRN Electronic Journal, Vol. 56
- Deep Neural Newsvendor
  1 January 2023 | SSRN Electronic Journal, Vol. 3
- The Impact of Input Inaccuracy on Leveraging AI Tools: Evidence from Algorithmic Labor Scheduling
  1 January 2023 | SSRN Electronic Journal, Vol. 25
- An Empirically Grounding Analytics (EGA) Approach to Hog Farm Finishing Stage Management: Deep Reinforcement Learning as Decision Support and Managerial Learning Tool
  1 January 2023 | SSRN Electronic Journal, Vol. 19
- Transfer Learning, Cross Learning and Co-Learning Across Newsvendor Systems with Operational Data Analytics (ODA)
  1 January 2023 | SSRN Electronic Journal, Vol. 13
- Contextual Data-Integrated Newsvendor Solution with Operational Data Analytics (ODA)
  1 January 2023 | SSRN Electronic Journal, Vol. 23
- The role of artificial intelligence in supply chain management: mapping the territory
  9 February 2022 | International Journal of Production Research, Vol. 60, No. 24
- Policy Evaluation with Stochastic Gradient Estimation Techniques
- A Reinforcement Learning Method for Inventory Control Under State-based Stochastic Demand
- Intelligent inventory management approaches for perishable pharmaceutical products in a healthcare supply chain
  Computers & Operations Research, Vol. 147
- Supply Chain Resilience: Impact of Stakeholder Behavior and Trustworthy Information Sharing with a Case Study on Pharmaceutical Supply Chains
  Özlem Ergun,
  Jacqueline Griffin,
  Noah Chicoine,
  Min Gong,
  Omid Mohaddesi,
  Zohreh Raziei,
  Casper Harteveld,
  David Kaeli,
  Stacy Marsella
  14 October 2022
- Barriers, Drivers, and Social Considerations for AI Adoption in Supply Chain Management: A Tertiary Study
  9 September 2022 | Logistics, Vol. 6, No. 3
- Supply Chain Synchronization Through Deep Reinforcement Learning
  24 January 2022
- Proximal Policy Optimization Algorithm for Dynamic Pricing with Online Reviews
  SSRN Electronic Journal, Vol. 7
- Using Neural Networks to Guide Data-Driven Operational Decisions
  SSRN Electronic Journal, Vol. 16
- Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management
  SSRN Electronic Journal, Vol. 56
- Gamified Learning of Supply Chain Optimization Through the Beer Distribution Game
- Adaptive Supply Chain: Demand–Supply Synchronization Using Deep Reinforcement Learning
  15 August 2021 | Algorithms, Vol. 14, No. 8
- End-to-End Deep Learning for Inventory Management with Fixed Ordering Cost and its Theoretical Analysis
  SSRN Electronic Journal, Vol. 147

cover image Manufacturing & Service Operations Management

Volume 24, Issue 1

January-February 2022

Pages 1-689, C2

Article Information

Supplemental Material

Metrics

Information

Received:February 02, 2020
Accepted:June 25, 2020
Published Online:February 23, 2021

Cite as

Afshin Oroojlooyjadid, MohammadReza Nazari, Lawrence V. Snyder, Martin Takáč (2021) A Deep Q-Network for the Beer Game: Deep Reinforcement Learning for Inventory Optimization. Manufacturing & Service Operations Management 24(1):285-304.

https://doi.org/10.1287/msom.2020.0939

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

A Deep Q-Network for the Beer Game: Deep Reinforcement Learning for Inventory Optimization

Abstract

Volume 24, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News