The Linear Programming Approach to Approximate Dynamic Programming

D. P. de Farias
D. P. de Farias
[email protected]
Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Search for more papers by this author
,
B. Van Roy
B. Van Roy
[email protected]
Department of Management Science and Engineering, Stanford University, Stanford, California 94306
Search for more papers by this author

D. P. de Farias

[email protected]

Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

Search for more papers by this author

B. Van Roy

[email protected]

Department of Management Science and Engineering, Stanford University, Stanford, California 94306

Search for more papers by this author

Published Online:1 Dec 2003https://doi.org/10.1287/opre.51.6.850.24925

References

Bertsekas D.Dynamic Programming and Optimal Control (1995) (Athena Scientific, Belmont, MA) Google Scholar
Bertsekas D., Tsitsiklis J. N.Neuro-Dynamic Programming (1996) (Athena Scientific, Belmont, MA) Google Scholar
Bertsekas D., Gamarnik D., Tsitsiklis J. Performance of multiclass Markovian queueing networks via piecewise linear Lyapunov functions. Ann. Appl. Probab. (2001) 11(4):1384–1428Crossref, Google Scholar
Bishop C. M.Neural Networks for Pattern Recognition (1995) (Oxford University Press, New York) Crossref, Google Scholar
Borkar V. A convex analytic approach to Markov decision processes. Probab. Theory Related Fields (1988) 78:583–602Crossref, Google Scholar
Chen R.-R., Meyn S. Value iteration and optimization of multiclass queueing networks. Queueing Systems (1999) 32:65–97Crossref, Google Scholar
Chen V. C. P., Ruppert D., Shoemaker C. A. Applying experimental design and regression splines to high-dimensional continuous-state stochastic dynamic programming. Oper. Res. (1999) 47(1):38–53Link, Google Scholar
Crites R. H., Barto A. G. Improving elevator performance using reinforcement learning. Advances in Neural Information Processing Systems (1996) 8(MIT Press, Cambridge, MA) 1017–1023Google Scholar
Dayan P. The convergence of TD(λ) for general λ. Machine Learning (1992) 8:341–362Crossref, Google Scholar
de Farias D. P., Van Roy B. On the existence of fixed points for appproximate value iteration and temporal-difference learning. J. Optim. Theory Appl. (2000) 105(3):589–608Crossref, Google Scholar
de Farias D. P., Van Roy B. On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res. (2001) . Conditionally accepted toGoogle Scholar
De Ghellinck G. Les problèmes de décisions séquentielles. Cahiers du Centre d'Etudes de Recherche Opérationnelle (1960) 2:161–179Google Scholar
Denardo E. V. On linear programming in a Markov decision problem. Management Sci. (1970) 16(5):282–288Link, Google Scholar
D'Epenoux F. A probabilistic production and inventory problem. Management Sci. (1963) 10(1):98–108Link, Google Scholar
Gordon G. Approximate solutions to Markov decision processess. (1999) . Ph.D. thesis, Carnegie Mellon University, Pittsburgh, PAGoogle Scholar
Grötschel M., Holland O. Solution of large-scale symmetric travelling salesman problems. Math. Programming (1991) 51:141–202Crossref, Google Scholar
Guestrin C., Koller D., Parr R. Efficient solution algorithms for factored MDPs. J. Artificial Intelligence Res. (2002) . ForthcomingGoogle Scholar
Haykin S.Neural Networks: A Comprehensive Formulation (1994) (Macmillan, New York) Google Scholar
Hordijk A., Kallenberg L. C. M. Linear programming and Markov decision chains. Management Sci. (1979) 25:352–362Link, Google Scholar
Kumar P. R., Seidman T. I. Dynamic instabilities and stabilization methods in distributed real-time scheduling of manufacturing systems. IEEE Trans. Automatic Control (1990) 35(3):289–298Crossref, Google Scholar
Longstaff F., Schwartz E. S. Valuing American options by simulation: A simple least squares approach. Rev. Financial Stud. (2001) 14:113–147Crossref, Google Scholar
Manne A. S. Linear programming and sequential decisions. Management Sci. (1960) 6(3):259–267Link, Google Scholar
Morrison J. R., Kumar P. R. New linear program performance bounds for queueing networks. J. Optim. Theory Appl. (1999) 100(3):575–597Crossref, Google Scholar
Paschalidis I. C., Tsitsiklis J. N. Congestion-dependent pricing of network services. IEEE/ACM Trans. Networking (2000) 8(2):171–184Crossref, Google Scholar
Rybko A. N., Stolyar A. L. On the ergodicity of stochastic processes describing the operation of open queueing networks. Problemy Peredachi Informatsii (1992) 28:3–26Google Scholar
Schuurmans D., Patrascu R. Direct value-approximation for factored MDPs. Advances in Neural Information Processing Systems (2001) 14(MIT Press, Cambridge, MA) 1579–1586Google Scholar
Schweitzer P., Seidmann A. Generalized polynomial approximations in Markovian decision processes. J. Math. Anal. Appl. (1985) 110:568–582Crossref, Google Scholar
Sutton R. S. Learning to predict by the methods of temporal differences. Machine Learning (1988) 3:9–44Crossref, Google Scholar
Sutton R. S., Barto A. G.Reinforcement Learning: An Introduction (1998) (MIT Press, Cambridge, MA) Google Scholar
Tesauro C. J. Temporal difference learning and TD-gammon. Comm. ACM (1995) 38:58–68Crossref, Google Scholar
Trick M., Zin S. A linear programming approach to solving dynamic programs. (1993) . Unpublished manuscriptGoogle Scholar
Trick M., Zin S. Spline approximations to value functions: A linear programming approach. Macroeconomic Dynamics (1997) 1:255–277Crossref, Google Scholar
Tsitsiklis J. N., Van Roy B. An analysis of temporal-difference learning with function approximation. IEEE Trans. Auto. Control (1997) 42(5):674–690Crossref, Google Scholar
Tsitsiklis J. N., Van Roy B. Regression methods for pricing complex American-style options. IEEE Trans. Neural Networks (2001) 12(4):694–703Crossref, Google Scholar
Van Roy B. Learning and value function approximation in complex decision processes. (1998) . Ph.D. thesis, Massachusetts Institute of Technology, Cambridge, MAGoogle Scholar
Van Roy B., Feinberg E., Schwartz A. Neuro-dynamic programming: Overview and recent trends. Markov Decision Processes: Models, Methods, Directions, and Open Problems (2000) (Kluwer, Norwell, MA) Google Scholar
Zhang W., Dietterich T. G. High-performance job-shop scheduling with a time-delay TD(λ) network. Advances in Neural Information Processing Systems (1996) 8(MIT Press, Cambridge, MA) 1024–1030Google Scholar

Cited by
- OR for the classroom: the linear programming approach to approximate dynamic programming for Markov decision processes
  14 May 2026 | Mathematical Methods of Operations Research, Vol. 52
- Tightness Without Counterexamples: A New Approach and New Results for Prophet Inequalities
  Jiashuo Jiang,
  Will Ma,
  Jiawei Zhang
  29 April 2025 | Mathematics of Operations Research, Vol. 51, No. 2
- Stochastic social learning optimization: Combining social learning and bucket theory for efficient optimization
  Knowledge-Based Systems, Vol. 341
- A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
  IEEE Transactions on Artificial Intelligence, Vol. 7, No. 4
- Partial‐Outsourcing Strategy for the Vehicle Routing Problem With Stochastic Demands
  31 December 2025 | Networks, Vol. 87, No. 3
- Pricing Shared Rides
  Chiwei Yan,
  Julia Yan,
  Yifan Shen
  16 March 2026 | Operations Research, Vol. 0, No. 0
- Advance Multi-Priority, Multi-Appointment Patient Scheduling With Dependent Demand and Lead Times
  15 July 2025 | Production and Operations Management, Vol. 35, No. 3
- Constrained Pricing in Logit-Based Revenue Management
  Qian Shao,
  Tien Mai,
  Shih-Fen Cheng
  20 February 2026 | INFORMS Journal on Computing, Vol. 0, No. 0
- A Primal-Dual Approach to Constrained Markov Decision Processes with Applications to Queue Scheduling and Inventory Management
  Yi Chen,
  Jing Dong,
  Zhaoran Wang,
  Chuheng Zhang
  20 May 2025 | Management Science, Vol. 72, No. 2
- Platelet Inventory Management with Approximate Dynamic Programming
  Hossein Abouee-Mehrizi,
  Mahdi Mirjalili,
  Vahid Sarhangian
  11 March 2025 | INFORMS Journal on Computing, Vol. 38, No. 1
- Proactive Defense Strategy Design in Probabilistic Attack Graphs
  21 October 2025
- Multi-query Shortest-Path Problem in Graphs of Convex Sets
  30 October 2025
- Path Planning Using Approximate Dynamic Programming Based on Data
  17 February 2026
- Constant Approximation for Network Revenue Management with Markovian-Correlated Customer Arrivals
  1 May 2026
- A primal–dual policy iteration algorithm for constrained Markov decision processes
  European Journal of Operational Research, Vol. 328, No. 1
- Efficiency with consent: Permutable queueing in on-demand services
  Omega, Vol. 138
- Dynamic Basis Function Generation for Network Revenue Management
  Daniel Adelman,
  Christiane Barz,
  Alba V. Olivares-Nadal
  8 December 2025 | INFORMS Journal on Computing, Vol. 0, No. 0
- Cross-Process Defect Attribution Using Potential Loss Analysis
- Optimal Policy for Inventory Management with Periodic and Controlled Resets
  Yoon Lee,
  Yonatan Mintz,
  Anil Aswani,
  Zuo-Jun Max Shen,
  Cong Yang
  9 June 2025 | Manufacturing & Service Operations Management, Vol. 27, No. 5
- Computing Optimal Joint Chance Constrained Control Policies
  IEEE Transactions on Automatic Control, Vol. 70, No. 7
- ADP- and rollout-based dynamic vehicle routing for pick-up service via budgeting capacity
  1 July 2024 | Flexible Services and Manufacturing Journal, Vol. 37, No. 2
- Self-Guided Approximate Linear Programs: Randomized Multi-Shot Approximation of Discounted Cost Markov Decision Processes
  Parshan Pakiman; ,
  Selvaprabu Nadarajah; ,
  Negar Soheili; ,
  Qihang Lin
  23 July 2024 | Management Science, Vol. 71, No. 4
- Data-Driven Optimal Control via Linear Programming: Boundedness Guarantees
  IEEE Transactions on Automatic Control, Vol. 70, No. 3
- Self-Adapting Network Relaxations for Weakly Coupled Markov Decision Processes
  Selvaprabu Nadarajah; ,
  Andre A. Cire
  21 May 2024 | Management Science, Vol. 71, No. 2
- Data-driven dynamic police patrolling: An efficient Monte Carlo tree search
  European Journal of Operational Research, Vol. 321, No. 1
- Approximate linear programming for decentralized policy iteration in cooperative multi-agent Markov decision processes
  Systems & Control Letters, Vol. 196
- Probabilistic Safety Analysis for Model Predictive Control with a Case Study on Aircraft Upset Recovery
  22 August 2025
- Tree Search Reinforcement Learning for Two-Dimensional Cutting Stock Problem With Complex Constraints
  IEEE Transactions on Automation Science and Engineering, Vol. 22
- Approximated Dynamic Programming for Production and Inventory Planning Problem in Cold Rolling Process of Steel Production
  13 December 2024 | Mathematics, Vol. 12, No. 24
- Information Relaxation and a Duality-Driven Algorithm for Stochastic Dynamic Programs
  Nan Chen,
  Xiang Ma,
  Yanchu Liu,
  Wei Yu
  31 July 2024 | Operations Research, Vol. 72, No. 6
- A priori data-driven robustness guarantees on strategic deviations from generalised Nash equilibria
  Automatica, Vol. 167
- Approximate linear programming for a queueing control problem
  Computers & Operations Research, Vol. 169
- An Approximate Dynamic Programming Approach to Dynamic Stochastic Matching
  Fan You,
  Thomas Vossen
  6 February 2024 | INFORMS Journal on Computing, Vol. 36, No. 4
- Dynamic Home Care Routing and Scheduling with Uncertain Number of Visits per Referral
  Danial Khorasanian,
  Jonathan Patrick,
  Antoine Sauré
  13 May 2024 | Transportation Science, Vol. 58, No. 4
- Accelerating value function approximations for dynamic dial-a-ride problems via dimensionality reductions
  Computers & Operations Research, Vol. 167
- Data-Driven Stochastic Optimal Control With Safety Constraints Using Linear Transfer Operators
  IEEE Transactions on Automatic Control, Vol. 69, No. 4
- Real-Time Torque-Distribution for Dual-Motor Off-Road Vehicle Using Machine Learning Approach
  IEEE Transactions on Vehicular Technology, Vol. 73, No. 4
- MF-OMO: An Optimization Formulation of Mean-Field Games
  22 January 2024 | SIAM Journal on Control and Optimization, Vol. 62, No. 1
- A Concurrent Federated Reinforcement Learning for IoT Resources Allocation With Local Differential Privacy
  IEEE Internet of Things Journal, Vol. 11, No. 4
- On Solution to ASUU Strike and Consolidated University Academic Salary Structure II (CONUASS II) in the Nigerian Universities Using Optimization Method
  5 January 2024 | International Journal of Applied Mathematics, Computational Science and Systems Engineering, Vol. 6
- Strategic Control of Experience-Weighted Attraction Model in Human-AI Interactions
  IFAC-PapersOnLine, Vol. 58, No. 30
- Dynamic container slot allocation for a liner shipping service
  Transportation Research Part B: Methodological, Vol. 179
- Joint Chance Constrained Optimal Control via Linear Programming
  IEEE Control Systems Letters, Vol. 8
- Multiparametric Analysis of Multi-Task Markov Decision Processes: Structure, Invariance, and Reducibility
  IEEE Control Systems Letters, Vol. 8
- Strategizing Against Q-Learners: A Control-Theoretical Approach
  IEEE Control Systems Letters, Vol. 8
- Optimal utilization of integrated photovoltaic battery systems: An application in the residential sector
  25 January 2023 | IISE Transactions, Vol. 55, No. 12
- On Solution to ASUU Strike and Consolidated University Academic Salary Structure II (CONUASS II) in the Nigerian Universities Using Optimization Method
  2 November 2023 | International Journal of Applied Mathematics, Computational Science and Systems Engineering, Vol. 5
- Technical Note—On the Strength of Relaxations of Weakly Coupled Stochastic Dynamic Programs
  David B. Brown,
  Jingwei Zhang
  7 June 2022 | Operations Research, Vol. 71, No. 6
- Approximate Optimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares
  IEEE Robotics and Automation Letters, Vol. 8, No. 11
- Adaptive fuzzy dynamic programming (AFDP) technique for linear programming problems lps with fuzzy constraints
  20 June 2023 | Soft Computing, Vol. 27, No. 19
- Nonparametric Approximate Dynamic Programming via the Kernel Method
  Nikhil Bhat,
  Vivek F. Farias,
  Ciamac C. Moallemi,
  Andrew T. Zheng
  10 March 2023 | Stochastic Systems, Vol. 13, No. 3
- Optimal discharge of patients from intensive care via a data-driven policy learning framework
  Operations Research for Health Care, Vol. 38
- Reductions of non-separable approximate linear programs for network revenue management
  European Journal of Operational Research, Vol. 309, No. 1
- A Risk-Averse Preview-Based Q-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles
  IEEE Transactions on Control Systems Technology, Vol. 31, No. 4
- MEALPY: An open-source library for latest meta-heuristic algorithms in Python
  Journal of Systems Architecture, Vol. 139
- Synthesis of Proactive Sensor Placement In Probabilistic Attack Graphs
- SOS-based policy iteration for H ∞ control of polynomial systems with uncertain parameters
  25 January 2022 | International Journal of Control, Vol. 96, No. 4
- Manufacturing Productivity with Worker Turnover
  Ken Moon,
  Patrick Bergemann,
  Daniel Brown,
  Andrew Chen,
  James Chu,
  Ellen A. Eisen,
  Gregory M. Fischer,
  Prashant Loyalka,
  Sungmin Rho,
  Joshua Cohen
  9 August 2022 | Management Science, Vol. 69, No. 4
- A First-Order Approach to Accelerated Value Iteration
  Vineet Goyal,
  Julien Grand-Clément
  24 March 2022 | Operations Research, Vol. 71, No. 2
- Optimal Energy Management and Storage Sizing for Electric Vehicles With Dual Storage
  IEEE Transactions on Control Systems Technology, Vol. 31, No. 2
- Meeting Corporate Renewable Power Targets
  Alessio Trivella,
  Danial Mohseni-Taheri,
  Selvaprabu Nadarajah
  25 March 2022 | Management Science, Vol. 69, No. 1
- Approximate Dynamic Programming: Linear Programming-Based Approaches
  21 September 2022
- Value-gradient iteration with quadratic approximate value functions
  Annual Reviews in Control, Vol. 56
- Sampled-data Control of Probabilistic Boolean Control Networks: A Deep Reinforcement Learning Approach
  Information Sciences, Vol. 619
- Optimizing Sensor Allocation Against Attackers With Uncertain Intentions: A Worst-Case Regret Minimization Approach
  IEEE Control Systems Letters, Vol. 7
- Joint Optimization of Pricing and Personalized Recommendations in Online Retailing
  1 January 2023 | SSRN Electronic Journal, Vol. 55
- Infinite-Dimensional Sums-of-Squares for Optimal Control
- Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming
- Quantifying the impact of delivery day flexibility on last-mile delivery costs
  Digital Chemical Engineering, Vol. 5
- Dynamic node packing
  12 February 2021 | Mathematical Programming, Vol. 196, No. 1-2
- Technical Note—Product-Based Approximate Linear Programs for Network Revenue Management
  Rui Zhang,
  Saied Samiedaluie,
  Dan Zhang
  11 August 2022 | Operations Research, Vol. 70, No. 5
- Fully polynomial time $$(\Sigma ,\Pi )$$-approximation schemes for continuous nonlinear newsvendor and continuous stochastic dynamic programs
  26 July 2021 | Mathematical Programming, Vol. 195, No. 1-2
- The flying sidekick traveling salesman problem with stochastic travel time: A reinforcement learning approach
  Transportation Research Part E: Logistics and Transportation Review, Vol. 164
- A Survey of Parking Solutions for Smart Cities
  IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 8
- IRDA: Incremental Reinforcement Learning for Dynamic Resource Allocation
  IEEE Transactions on Big Data, Vol. 8, No. 3
- On the Feasibility of Learning Finger-gaiting In-hand Manipulation with Intrinsic Sensing
- Adoption of blockchain technology in a two-stage supply chain: Spillover effect on workforce
  Transportation Research Part E: Logistics and Transportation Review, Vol. 161
- Finite‐horizon approximate linear programs for capacity allocation over a rolling horizon
  1 May 2022 | Production and Operations Management, Vol. 31, No. 5
- Personalized Medicine
- Proceed with Care
- Forward ADPII : Policy Optimization
  8 April 2022
- Information Relaxations and Duality in Stochastic Dynamic Programs: A Review and Tutorial
  21 March 2022 | Foundations and Trends® in Optimization, Vol. 5, No. 3
- Dynamic programming approach for fuzzy linear programming problems FLPs and its application to optimal resource allocation problems in education system
  14 January 2022 | Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, Vol. 42, No. 4
- Interpretable Optimal Stopping
  Dragos Florin Ciocan,
  Velibor V. Mišić
  15 July 2020 | Management Science, Vol. 68, No. 3
- Queueing Network Controls via Deep Reinforcement Learning
  J. G. Dai,
  Mark Gluzman
  3 December 2021 | Stochastic Systems, Vol. 12, No. 1
- Dynamic inventory control with payment delay and credit limit
  10 June 2021 | Naval Research Logistics (NRL), Vol. 69, No. 2
- Data-driven optimal control with a relaxed linear program
  Automatica, Vol. 136
- Transmission scheduling for multi-process multi-sensor remote estimation via approximate dynamic programming
  Automatica, Vol. 136
- Economic Dispatch for EV Energy Storage-Integrated Power Systems
- Gradient-bounded dynamic programming for submodular and concave extensible value functions with probabilistic performance guarantees
  Automatica, Vol. 135
- Adaptive polyhedral meshing for approximate dynamic programming in control
  Engineering Applications of Artificial Intelligence, Vol. 107
- Transfer Learning for Constrained Stochastic Control Using Adjustable Benders Cuts
  IEEE Control Systems Letters, Vol. 6
- Data-Driven Optimal Control of Affine Systems: A Linear Programming Perspective
  IEEE Control Systems Letters, Vol. 6
- Accelerated Point-Wise Maximum Approach to Approximate Dynamic Programming
  IEEE Transactions on Automatic Control, Vol. 67, No. 1
- Método de error de Bellman con ponderación de volumen para mallado adaptativo en programación dinámica aproximada
  17 December 2021 | Revista Iberoamericana de Automática e Informática industrial, Vol. 19, No. 1
- A One-shot Convex Optimization Approach to Risk-Averse Q-Learning
- On the Synthesis of Bellman Inequalities for Data-Driven Optimal Control
- Dynamic multistage scheduling for patient-centered care plans
  10 August 2021 | Health Care Management Science, Vol. 24, No. 4
- Implied Markov transition matrices under structural price models
  15 June 2021 | Quantitative Finance, Vol. 21, No. 11
- Randomized Linear Programming for Tabular Average-Cost Multi-agent Reinforcement Learning
- Random Forest Q-Learning for Feedback Stabilization of Probabilistic Boolean Control Networks
- Managing a Hybrid RDC‐DC Inventory System
  1 October 2021 | Production and Operations Management, Vol. 30, No. 10
- Viability, viscosity, and storage functions in model-predictive control with terminal constraints
  Automatica, Vol. 131
- Reducible Markov Decision Processes and Stochastic Games
  1 August 2021 | Production and Operations Management, Vol. 30, No. 8
- A dynamic programming framework for optimal delivery time slot pricing
  European Journal of Operational Research, Vol. 292, No. 2
- Cautious Reinforcement Learning via Distributional Risk in the Dual Domain
  IEEE Journal on Selected Areas in Information Theory, Vol. 2, No. 2
- Recent Modeling and Analytical Advances in Hospital Inpatient Flow Management
  1 June 2021 | Production and Operations Management, Vol. 30, No. 6
- Beyond Cumulative Returns via Reinforcement Learning over State-Action Occupancy Measures
- Convex Q-Learning
- Revenue Management with Repeated Customer Interactions
  Andre P. Calmon,
  Florin D. Ciocan,
  Gonzalo Romero
  5 October 2020 | Management Science, Vol. 67, No. 5
- A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation
  Jalaj Bhandari,
  Daniel Russo,
  Raghav Singal
  19 March 2021 | Operations Research, Vol. 69, No. 3
- Data Interpolation by Near-Optimal Splines with Free Knots Using Linear Programming
  13 May 2021 | Mathematics, Vol. 9, No. 10
- Dynamic Capacity Allocation for Elective Surgeries: Reducing Urgency-Weighted Wait Times
  Stephanie Carew,
  Mahesh Nagarajan,
  Steven Shechter,
  Jugpal Arneja,
  Erik Skarsgard
  20 April 2020 | Manufacturing & Service Operations Management, Vol. 23, No. 2
- A column and constraint generation algorithm for the dynamic knapsack problem with stochastic item sizes
  26 June 2020 | Mathematical Programming Computation, Vol. 13, No. 1
- Bibliography
  12 March 2021
- SDDP.jl: A Julia Package for Stochastic Dual Dynamic Programming
  Oscar Dowson,
  Lea Kapelevich
  31 August 2020 | INFORMS Journal on Computing, Vol. 33, No. 1
- On the Strength of Relaxations of Weakly Coupled Stochastic Dynamic Programs
  SSRN Electronic Journal, Vol. 52
- Deep Reinforcement Learning in Linear Discrete Action Spaces
- Network-Based Approximate Linear Programming for Discrete Optimization
  Selvaprabu Nadarajah,
  Andre A. Cire
  28 October 2020 | Operations Research, Vol. 68, No. 6
- A heuristic policy for maintaining multiple multi-state systems
  Reliability Engineering & System Safety, Vol. 203
- An Approximation Approach for Response-Adaptive Clinical Trial Design
  Vishal Ahuja,
  John R. Birge
  28 May 2020 | INFORMS Journal on Computing, Vol. 32, No. 4
- Least Squares Monte Carlo and Approximate Linear Programming with an Energy Real Option Application
  1 October 2020 | Foundations and Trends® in Technology, Information and Operations Management, Vol. 14, No. 1-2
- A Least-Squares Temporal Difference based method for solving resource allocation problems
  IFAC Journal of Systems and Control, Vol. 13
- Integer Programming on the Junction Tree Polytope for Influence Diagrams
  Axel Parmentier,
  Victor Cohen,
  Vincent Leclère,
  Guillaume Obozinski,
  Joseph Salmon
  24 July 2020 | INFORMS Journal on Optimization, Vol. 2, No. 3
- The policy graph decomposition of multistage stochastic programming problems
  12 February 2020 | Networks, Vol. 76, No. 1
- Partially observable multistage stochastic programming
  Operations Research Letters, Vol. 48, No. 4
- ONLINE CAPACITY PLANNING FOR REHABILITATION TREATMENTS: AN APPROXIMATE DYNAMIC PROGRAMMING APPROACH
  11 December 2018 | Probability in the Engineering and Informational Sciences, Vol. 34, No. 3
- Randomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) Time
  Mengdi Wang
  16 October 2019 | Mathematics of Operations Research, Vol. 45, No. 2
- Revisiting Approximate Linear Programming: Constraint-Violation Learning with Applications to Inventory Control and Energy Storage
  Qihang Lin,
  Selvaprabu Nadarajah,
  Negar Soheili
  20 September 2019 | Management Science, Vol. 66, No. 4
- Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage
  19 June 2019 | INFOR: Information Systems and Operational Research, Vol. 58, No. 1
- Reinforcement Learning
  8 May 2020
- Exploiting Bounded Rationality in Risk-Based Cyber Camouflage Games
  22 December 2020
- Performance Guarantees for Model-Based Approximate Dynamic Programming in Continuous Spaces
  IEEE Transactions on Automatic Control, Vol. 65, No. 1
- A Universal Empirical Dynamic Programming Algorithm for Continuous State MDPs
  IEEE Transactions on Automatic Control, Vol. 65, No. 1
- Computing Controlled Invariant Sets from Data Using Convex Optimization
  16 September 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 5
- Robust Quadratic Programming for MDPs with uncertain observation noise
  Neurocomputing, Vol. 370
- Quasi-Stochastic Approximation and Off-Policy Reinforcement Learning
- A Data-Driven Policy Iteration Scheme based on Linear Programming
- Stochastic Primal-Dual Method for Learning Mixture Policies in Markov Decision Processes
- Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces
  IEEE Transactions on Automatic Control, Vol. 64, No. 12
- Energy systems engineering - a guided tour
  10 April 2019 | BMC Chemical Engineering, Vol. 1, No. 1
- Operational power plant scheduling with flexible carbon capture: A multistage stochastic optimization approach
  Computers & Chemical Engineering, Vol. 130
- An Approximate Dynamic Programming Approach to Dynamic Pricing for Network Revenue Management
  1 November 2019 | Production and Operations Management, Vol. 28, No. 11
- Inpatient Overflow: An Approximate Dynamic Programming Approach
  J. G. Dai,
  Pengyi Shi
  16 May 2019 | Manufacturing & Service Operations Management, Vol. 21, No. 4
- Applying unweighted least‐squares based techniques to stochastic dynamic programming: theory and application
  1 August 2019 | IET Control Theory & Applications, Vol. 13, No. 15
- A heuristic based on quadratic approximation for dual sourcing problem with general lead times and supply capacity uncertainty
  20 March 2019 | IISE Transactions, Vol. 51, No. 9
- Multiagent Mechanism Design Without Money
  Santiago R. Balseiro,
  Huseyin Gurkan,
  Peng Sun
  10 September 2019 | Operations Research, Vol. 67, No. 5
- An approximate dynamic programming approach for sequential pig marketing decisions at herd level
  European Journal of Operational Research, Vol. 276, No. 3
- Approximate Dynamic Programming with Probabilistic Temporal Logic Constraints
- Metodología de programación dinámica aproximada para control óptimo basada en datos
  12 June 2019 | Revista Iberoamericana de Automática e Informática industrial, Vol. 16, No. 3
- Efficient Computational Strategies for Dynamic Inventory Liquidation
  Mochen Yang,
  Gediminas Adomavicius,
  Alok Gupta
  30 May 2019 | Information Systems Research, Vol. 30, No. 2
- A proactive transfer policy for critical patient flow management
  17 February 2018 | Health Care Management Science, Vol. 22, No. 2
- Sequential Decision Making With Limited Observation Capability: Application to Wireless Networks
  IEEE Transactions on Cognitive Communications and Networking, Vol. 5, No. 2
- On Solving MDPs With Large State Space: Exploitation of Policy Structures and Spectral Properties
  IEEE Transactions on Communications, Vol. 67, No. 6
- A Concave Value Function Extension for the Dynamic Programming Approach to Revenue Management in Attended Home Delivery
- Learning continuous $Q$-functions using generalized Benders cuts
- Nonlinear Control of Quadcopters via Approximate Dynamic Programming
- Demand-side energy management under time-varying prices
  11 February 2019 | IISE Transactions, Vol. 51, No. 4
- Approximations to Stochastic Dynamic Programs via Information Relaxation Duality
  Santiago R. Balseiro,
  David B. Brown
  21 March 2019 | Operations Research, Vol. 67, No. 2
- Relaxation Analysis for the Dynamic Knapsack Problem with Stochastic Item Sizes
  SIAM Journal on Optimization, Vol. 29, No. 1
- Toward Breaking the Curse of Dimensionality: An FPTAS for Stochastic Dynamic Programs with Multidimensional Actions and Scalar States
  16 April 2019 | SIAM Journal on Optimization, Vol. 29, No. 2
- Linear Programming Based Near-Optimal Pricing for Laminar Bayesian Online Selection
  SSRN Electronic Journal, Vol. 62
- Managing a Hybrid RDC-DC Inventory System
  SSRN Electronic Journal, Vol. 56
- Large-scale unit commitment under uncertainty: an updated literature survey
  20 September 2018 | Annals of Operations Research, Vol. 271, No. 1
- Controlling Large, Graph-based MDPs with Global Control Capacity Constraints: An Approximate LP Solution
- A polyhedral approach to online bipartite matching
  16 December 2017 | Mathematical Programming, Vol. 172, No. 1-2
- Ambiguous partially observable Markov decision processes: Structural results and applications
  Journal of Economic Theory, Vol. 178
- Shape Constraints in Economics and Operations Research
  Statistical Science, Vol. 33, No. 4
- Approximate Value Iteration for Risk-Aware Markov Decision Processes
  IEEE Transactions on Automatic Control, Vol. 63, No. 9
- Capacitated inspection scheduling of multi-unit systems
  Computers & Industrial Engineering, Vol. 120
- Constructive Lyapunov Stabilization with Approximate Optimality for A Class of Nonlinear Systems
- Price Management in Resource Allocation Problem with Approximate Dynamic Programming
- A Linearly Relaxed Approximate Linear Program for Markov Decision Processes
  IEEE Transactions on Automatic Control, Vol. 63, No. 4
- The One-Dimensional Dynamic Dispatch Waves Problem
  Mathias A. Klapp,
  Alan L. Erera,
  Alejandro Toriello
  20 May 2016 | Transportation Science, Vol. 52, No. 2
- Weakly Coupled Dynamic Program: Information and Lagrangian Relaxations
  IEEE Transactions on Automatic Control, Vol. 63, No. 3
- Robust Design Through Probabilistic Maximization
  15 December 2018
- Introduction and Summary
  12 May 2018
- From Reinforcement Learning to Deep Reinforcement Learning: An Overview
  23 August 2018
- Stochastic model predictive control with active uncertainty learning: A Survey on dual control
  Annual Reviews in Control, Vol. 45
- Reinforcement learning for control: Performance, stability, and deep approximators
  Annual Reviews in Control, Vol. 46
- China’ energy-water nexus: Hydropower generation potential of joint operation of the Three Gorges and Qingjiang cascade reservoirs
  Energy, Vol. 142
- From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming
  SIAM Journal on Optimization, Vol. 28, No. 3
- Revenue Management with Repeated Customer Interactions
  SSRN Electronic Journal, Vol. 56
- Easy Decomposable Markov Decision Processes and Stochastic Games
  SSRN Electronic Journal, Vol. 51
- Least Squares Monte Carlo and Approximate Linear Programming: Error Bounds and Energy Real Option Application
  SSRN Electronic Journal, Vol. 5
- Manufacturing Productivity with Worker Turnover
  1 January 2018 | SSRN Electronic Journal, Vol. 59
- Meeting Corporate Renewable Power Targets
  SSRN Electronic Journal, Vol. 19
- Hedging Strategies: Electricity Investment Decisions under Policy Uncertainty
  1 January 2018 | The Energy Journal, Vol. 39, No. 1
- Point-wise maximum approach to approximate dynamic programming
- Linear and dynamic programming approaches to degenerate risk-sensitive reward processes
- Local water storage control for the developing world
- Data-driven approximate dynamic programming: A linear programming approach
- A dynamic programming framework for optimal home scheduling
- Linear programming formulation for non-stationary, finite-horizon Markov decision process models
  Operations Research Letters, Vol. 45, No. 6
- GMDPtoolbox: A Matlab library for designing spatial management policies. Application to the long-term collective management of an airborne disease
  5 October 2017 | PLOS ONE, Vol. 12, No. 10
- Information Relaxation Bounds for Infinite Horizon Markov Decision Processes
  David B. Brown,
  Martin B. Haugh
  2 August 2017 | Operations Research, Vol. 65, No. 5
- Was Angelina Jolie Right? Optimizing Cancer Prevention Strategies Among BRCA Mutation Carriers
  Eike Nohdurft,
  Elisa Long,
  Stefan Spinler
  12 July 2017 | Decision Analysis, Vol. 14, No. 3
- Relationship between least squares Monte Carlo and approximate linear programming
  Operations Research Letters, Vol. 45, No. 5
- Computing monotone policies for Markov decision processes: a nearly-isotonic penalty approach
  IFAC-PapersOnLine, Vol. 50, No. 1
- Approximate Dynamic Programming via Penalty Functions
  IFAC-PapersOnLine, Vol. 50, No. 1
- Meeting Inelastic Demand in Systems With Storage and Renewable Sources
  IEEE Transactions on Smart Grid, Vol. 8, No. 4
- Managing Patient Admissions in a Neurology Ward
  Saied Samiedaluie,
  Beste Kucukyazici,
  Vedat Verter,
  Dan Zhang
  15 March 2017 | Operations Research, Vol. 65, No. 3
- Global Adaptive Dynamic Programming for Nonlinear Polynomial Systems
  14 April 2017
- High-Speed Finite Control Set Model Predictive Control for Power Electronics
  IEEE Transactions on Power Electronics, Vol. 32, No. 5
- A Factored MDP Approach to Optimal Mechanism Design for Resilient Large-Scale Interdependent Critical Infrastructures
- A NONLINEAR PROGRAMMING METHOD FOR DYNAMIC PROGRAMMING
  18 January 2016 | Macroeconomic Dynamics, Vol. 21, No. 2
- Convergence rates of moment-sum-of-squares hierarchies for optimal control problems
  Systems & Control Letters, Vol. 100
- Value Function Approximation
  14 April 2017
- Advance Patient Appointment Scheduling
  11 March 2017
- Multi-Agent Mechanism Design without Money
  SSRN Electronic Journal, Vol. 78
- Decomposable Markov Decision Processes: A Fluid Optimization Approach
  Dimitris Bertsimas,
  Velibor V. Mišić
  13 October 2016 | Operations Research, Vol. 64, No. 6
- Automata theory meets approximate dynamic programming: Optimal control with temporal logic constraints
- Real-time FPGA implementation of direct MPC for power electronics
- An online primal-dual method for discounted Markov decision processes
- Air Cargo Network Revenue Management
  Christiane Barz,
  Daniel Gartner
  21 October 2016 | Transportation Science, Vol. 50, No. 4
- Maintenance optimisation of a parallel-series system with stochastic and economic dependence under limited maintenance capacity
  Reliability Engineering & System Safety, Vol. 155
- Markov Decision Processes With Applications in Wireless Sensor Networks: A Survey
  IEEE Communications Surveys & Tutorials, Vol. 17, No. 3
- Approximating Markov Chain Approach to Optimal Feedback Control of a Flexible Needle
  13 July 2016 | Journal of Dynamic Systems, Measurement, and Control, Vol. 138, No. 11
- Parallel Least-Squares Policy Iteration
- On the computational complexity and generalization properties of multi-stage and stage-wise coupled scenario programs
  Systems & Control Letters, Vol. 94
- Motion Planning for Continuous-Time Stochastic Processes: A Dynamic Programming Approach
  IEEE Transactions on Automatic Control, Vol. 61, No. 8
- Convex synthesis of randomized policies for controlled Markov chains with density safety upper bound constraints
- High-speed direct model predictive control for power electronics
- Alleviating tuning sensitivity in Approximate Dynamic Programming
- An iterative approach to the optimal co-design of linear control systems
  13 October 2015 | International Journal of Control, Vol. 89, No. 4
- Performance Guarantee of an Approximate Dynamic Programming Policy for Robotic Surveillance
  IEEE Transactions on Automation Science and Engineering, Vol. 13, No. 2
- An AO* Based Exact Algorithm for the Canadian Traveler Problem
  Vural Aksakalli,
  O. Furkan Sahin,
  Ibrahim Ari
  26 January 2016 | INFORMS Journal on Computing, Vol. 28, No. 1
- Smoothing and parametric rules for stochastic mean-CVaR optimal execution strategy
  23 May 2013 | Annals of Operations Research, Vol. 237, No. 1-2
- Towards Modeling the Behavior of Autonomous Systems and Humans for Trusted Operations
  8 April 2016
- Ant-Based System Analysis on the Traveling Salesman Problem Under Real-World Settings
  28 January 2016
- A Polyhedral Approach to Online Bipartite Matching
  25 May 2016
- Semi-Infinite Relaxations for the Dynamic Knapsack Problem with Stochastic Item Sizes
  SIAM Journal on Optimization, Vol. 26, No. 3
- Relaxations of Approximate Linear Programs for the Real Option Management of Commodity Storage
  Selvaprabu Nadarajah,
  François Margot,
  Nicola Secomandi
  9 July 2015 | Management Science, Vol. 61, No. 12
- Reductions of Approximate Linear Programs for Network Revenue Management
  Thomas W. M. Vossen,
  Dan Zhang
  7 December 2015 | Operations Research, Vol. 63, No. 6
- A perspective-based convex relaxation for switched-affine optimal control
  Systems & Control Letters, Vol. 86
- Likelihood of cyber data injection attacks to power systems
- Linear Programming and the Control of Diffusion Processes
  Andrew Ahn,
  Martin Haugh
  13 November 2015 | INFORMS Journal on Computing, Vol. 27, No. 4
- Approximate linear programming for networks: Average cost bounds
  Computers & Operations Research, Vol. 63
- Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems
  IEEE Transactions on Automatic Control, Vol. 60, No. 11
- The Price of Nonabandonment: HIV in Resource-Limited Settings
  Amin Khademi,
  Denis R. Saure,
  Andrew J. Schaefer,
  Ronald S. Braithwaite,
  Mark S. Roberts
  29 June 2015 | Manufacturing & Service Operations Management, Vol. 17, No. 4
- Information Relaxation and Dual Formulation of Controlled Markov Diffusions
  IEEE Transactions on Automatic Control, Vol. 60, No. 10
- Deadlock-free scheduling of knowledgeable manufacturing cell with multiple machines and products
  27 June 2014 | Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture, Vol. 229, No. 10
- Simulation-Based Approximate Policy Iteration with Generalized Logistic Functions
  Antoine Sauré,
  Jonathan Patrick,
  Martin L. Puterman
  28 September 2015 | INFORMS Journal on Computing, Vol. 27, No. 3
- Approximate dynamic programming via iterated Bellman inequalities
  19 February 2014 | International Journal of Robust and Nonlinear Control, Vol. 25, No. 10
- An iterative scheme for the approximate linear programming solution to the optimal control of a Markov Decision Process
- Stochastic Optimization for Unit Commitment—A Review
  IEEE Transactions on Power Systems, Vol. 30, No. 4
- Large-scale Unit Commitment under uncertainty
  31 January 2015 | 4OR, Vol. 13, No. 2
- Incremental constraint projection methods for variational inequalities
  17 May 2014 | Mathematical Programming, Vol. 150, No. 2
- Technical Note—Trading Off Quick versus Slow Actions in Optimal Search
  Steven M. Shechter,
  Farhad Ghassemi,
  Yasin Gocgun,
  Martin L. Puterman
  6 March 2015 | Operations Research, Vol. 63, No. 2
- Impulse controls and uncertainty in economics: Method and application
  Environmental Modelling & Software, Vol. 65
- Randomized methods for design of uncertain systems: Sample complexity and sequential algorithms
  Automatica, Vol. 52
- Approximate dynamic programming for stochastic linear control problems on compact state spaces
  European Journal of Operational Research, Vol. 241, No. 1
- Literature Review
  4 February 2014
- Optimal and Near Optimal Scheduling Policies
  4 February 2014
- Stochastic Optimization
- A Computationally Efficient FPTAS for Convex Stochastic Dynamic Programs
  SIAM Journal on Optimization, Vol. 25, No. 1
- Dynamic Pricing for Hotel Rooms When Customers Request Multiple-Day Stays
  SSRN Electronic Journal, Vol. 52
- An approximate linear programming solution to the probabilistic invariance problem for stochastic hybrid systems
- Approximate Dynamic Programming with (min; +) linear function approximation for Markov decision processes
- Linear Hamilton Jacobi Bellman Equations in high dimensions
- Approximation of constrained average cost Marks
- Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
  IEEE Transactions on Neural Networks and Learning Systems, Vol. 25, No. 12
- Traffic control model and algorithm based on decomposition of MDP
- A Dynamic Traveling Salesman Problem with Stochastic Arc Costs
  Alejandro Toriello,
  William B. Haskell,
  Michael Poremba
  16 July 2014 | Operations Research, Vol. 62, No. 5
- Optimal navigation functions for nonlinear stochastic systems
- An approximate dynamic programming model for link scheduling in WMNs with gateway design constraint
- Real Options and Merchant Operations of Energy and Other Commodities
  11 July 2014 | Foundations and Trends® in Technology, Information and Operations Management, Vol. 6, No. 3-4
- Queueing-theoretic approaches for dynamic scheduling: A survey
  Surveys in Operations Research and Management Science, Vol. 19, No. 2
- Semidefinite relaxations for stochastic optimal control policies
- Optimal toll design: a lower bound framework for the asymmetric traveling salesman problem
  19 January 2013 | Mathematical Programming, Vol. 144, No. 1-2
- Lower Bounding Linear Program for the Perimeter Patrol Optimization Problem
  Journal of Guidance, Control, and Dynamics, Vol. 37, No. 2
- Quadratic approximate dynamic programming for input‐affine systems
  23 August 2012 | International Journal of Robust and Nonlinear Control, Vol. 24, No. 3
- Computational bounds for elevator control policies by large scale linear programming
  13 October 2013 | Mathematical Methods of Operations Research, Vol. 79, No. 1
- Value Function Approximation
  7 February 2015
- Optimal Operation Strategy of Energy Storage System for Grid-Connected Wind Power Plants
  IEEE Transactions on Sustainable Energy, Vol. 5, No. 1
- Performance Bounds and Suboptimal Policies for Multi-Period Investment
  1 January 2014 | Foundations and Trends® in Optimization, Vol. 1, No. 1
- Ambiguous Partially Observable Markov Decision Processes: Structural Results and Applications
  SSRN Electronic Journal, Vol. 19
- Dual Formulation of Controlled Markov Diffusions and Its Application
  IFAC Proceedings Volumes, Vol. 47, No. 3
- Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Polynomial Systems
  IFAC Proceedings Volumes, Vol. 47, No. 3
- Optimal power allocation over multiple identical Gilbert-Elliott channels
- Markov Decision Process for Traffic Control at an Isolated Intersection
- Infinite Horizon Performance Bounds for Uncertain Constrained Systems
  IEEE Transactions on Automatic Control, Vol. 58, No. 11
- Approximate Linear Programming for Average Cost MDPs
  Michael H. Veatch,
  20 December 2012 | Mathematics of Operations Research, Vol. 38, No. 3
- Accelerated modified policy iteration algorithms for Markov decision processes
  27 February 2013 | Mathematical Methods of Operations Research, Vol. 78, No. 1
- Investments in combined cycle natural gas-fired systems: A real options analysis
  International Journal of Electrical Power & Energy Systems, Vol. 49
- Power allocation over two identical Gilbert-Elliott channels
- Optimal power allocation policy over two identical Gilbert-Elliott channels
- Algorithmic Survey of Parametric Value Function Approximation
  IEEE Transactions on Neural Networks and Learning Systems, Vol. 24, No. 6
- Control design for specifications on stochastic hybrid systems
  8 April 2013
- A Linear Programming Approach to Nonstationary Infinite-Horizon Markov Decision Processes
  Archis Ghate,
  Robert L. Smith,
  19 March 2013 | Operations Research, Vol. 61, No. 2
- Dynamic Capacity Allocation to Customers Who Remember Past Service
  Daniel Adelman,
  Adam J. Mersereau,
  19 December 2012 | Management Science, Vol. 59, No. 3
- Factored Markov Decision Processes
  7 March 2013
- Markov Decision Processes
- Population-Based Evolutionary Approaches
- Optimization in Healthcare Delivery Modeling: Methods and Applications
  10 January 2013
- Approximate Dynamic Programming Applied to UAV Perimeter Patrol
- Stochastic Dominance-Constrained Markov Decision Processes
  SIAM Journal on Control and Optimization, Vol. 51, No. 1
- Reinforcement Learning and Approximate Dynamic Programming (RLADP)—Foundations, Common Misconceptions, and the Challenges Ahead
  7 February 2013
- Bounds for Markov Decision Processes
  7 February 2013
- Feature Selection for Neuro‐Dynamic Programming
  7 February 2013
- Pathwise Optimization for Optimal Stopping Problems
  Vijay V. Desai,
  Vivek F. Farias,
  Ciamac C. Moallemi,
  15 June 2012 | Management Science, Vol. 58, No. 12
- Dynamic multi-appointment patient scheduling for radiation therapy
  European Journal of Operational Research, Vol. 223, No. 2
- A self-recovery approach to the probabilistic invariance problem for stochastic hybrid systems
- Infinite-horizon performance bounds for constrained stochastic systems
- State partitioning based linear program for stochastic dynamic programs: An invariance property
  Operations Research Letters, Vol. 40, No. 6
- Mean Field for Markov Decision Processes: From Discrete to Continuous Optimization
  IEEE Transactions on Automatic Control, Vol. 57, No. 9
- Approximate Dynamic Programming via a Smoothed Linear Program
  Vijay V. Desai,
  Vivek F. Farias,
  Ciamac C. Moallemi,
  1 June 2012 | Operations Research, Vol. 60, No. 3
- Bounding procedure for stochastic dynamic programs with application to the perimeter patrol problem
- An approximate dynamic programming approach to solving dynamic oligopoly models
  19 June 2012 | The RAND Journal of Economics, Vol. 43, No. 2
- Approximate dynamic programming for capacity allocation in the service industry
  European Journal of Operational Research, Vol. 218, No. 1
- Computing Near-Optimal Policies in Generalized Joint Replenishment
  Daniel Adelman,
  Diego Klabjan,
  2 February 2011 | INFORMS Journal on Computing, Vol. 24, No. 1
- Revenue Management
  21 December 2011
- Reinforcement Learning with a Bilinear Q Function
- Robust Bayesian Reinforcement Learning through Tight Lower Bounds
- Modeling and stochastic dynamic optimization for optimal energy resource allocation
- Optimal planning of energy management system under demand uncertainty
- Network revenue management with inventory-sensitive bid prices and customer choice
  European Journal of Operational Research, Vol. 216, No. 2
- A framework and a mean-field algorithm for the local control of spatial processes
  International Journal of Approximate Reasoning, Vol. 53, No. 1
- Power Systems Investments
- Higher-Order Nonlinear Discrete Approximate Iteration on the Continuing Dynamic Programming
  1 January 2012 | Advanced Materials Research, Vol. 459
- New Approach for the Continuing Dynamic Programming
  1 January 2012 | Advanced Materials Research, Vol. 459
- Performance bounds and suboptimal policies for linear stochastic control via LMIs
  3 December 2010 | International Journal of Robust and Nonlinear Control, Vol. 21, No. 14
- Min-max approximate dynamic programming
- Imputing a convex objective function
- Bibliography
  26 September 2011
- Network Cargo Capacity Management
  Tatsiana Levina,
  Yuri Levin,
  Jeff McGill,
  Mikhail Nediak,
  1 August 2011 | Operations Research, Vol. 59, No. 4
- Fast Evaluation of Quadratic Control-Lyapunov Policy
  IEEE Transactions on Control Systems Technology, Vol. 19, No. 4
- Stochastic control via direct comparison
  1 October 2010 | Discrete Event Dynamic Systems, Vol. 21, No. 1
- On State Aggregation to Approximate Complex Value Functions in Large-Scale Markov Decision Processes
  IEEE Transactions on Automatic Control, Vol. 56, No. 2
- An Improved Dynamic Programming Decomposition Approach for Network Revenue Management
  Dan Zhang,
  13 October 2010 | Manufacturing & Service Operations Management, Vol. 13, No. 1
- Computation and Dynamic Programming
  15 February 2011
- Dynamic Programming Via Linear Programming
  15 February 2011
- Infinite Horizon Problems
  14 January 2011
- Performance Bounds in Queueing Networks
  15 February 2011
- Reinforcement Learning Algorithms for MDPs
  15 February 2011
- Total Expected Discounted Reward MDPs : Value Iteration Algorithm
  15 February 2011
- Value Function Approximation
- A Continuous-Time Markov Decision Process for Infrastructure Surveillance
  21 June 2011
- Towards Cognitive Machines
- Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming
  5 February 2010 | Journal of Scheduling, Vol. 13, No. 6
- A variable neighborhood search based algorithm for finite-horizon Markov Decision Processes
  Applied Mathematics and Computation, Vol. 217, No. 7
- State aggregation based linear programming approach to approximate dynamic programming
- Adaptive Modulation with Smoothed Flow Utility
  21 September 2010 | EURASIP Journal on Wireless Communications and Networking, Vol. 2010, No. 1
- Combination of acceleration procedures for solving stochastic shortest-path Markov decision processes
- Approximate dynamic programming approach for process control
  Journal of Process Control, Vol. 20, No. 9
- Tutor learning using linear constraints in approximate dynamic programming
- A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
  7 October 2010 | ACM Transactions on Modeling and Computer Simulation, Vol. 20, No. 3
- Computational Methods for Oblivious Equilibrium
  Gabriel Y. Weintraub,
  C. Lanier Benkard,
  Benjamin Van Roy,
  3 June 2010 | Operations Research, Vol. 58, No. 4-part-2
- Information Relaxations and Duality in Stochastic Dynamic Programs
  David B. Brown,
  James E. Smith,
  Peng Sun,
  9 April 2010 | Operations Research, Vol. 58, No. 4-part-1
- Decomposition of large-scale stochastic optimal control problems
  23 July 2010 | RAIRO - Operations Research, Vol. 44, No. 3
- Reallocations in teams of UAVs using dynamic programming and mixed initiative interactions
- Approximate Dynamic Programming for Ambulance Redeployment
  Matthew S. Maxwell,
  Mateo Restrepo,
  Shane G. Henderson,
  Huseyin Topaloglu,
  18 August 2009 | INFORMS Journal on Computing, Vol. 22, No. 2
- Solving Continuous-State POMDPs via Density Projection
  IEEE Transactions on Automatic Control, Vol. 55, No. 5
- SP-SDP for Fuel Consumption and Tailpipe Emissions Minimization in an EVT Hybrid
  IEEE Transactions on Control Systems Technology, Vol. 18, No. 3
- Strategic capacity decision‐making in a stochastic manufacturing environment using real‐time approximate dynamic programming
  12 January 2010 | Naval Research Logistics (NRL), Vol. 57, No. 3
- A dynamic programming approach to the multiple-choice multi-period knapsack problem and the recursive APL2 code
  Journal of Information and Optimization Sciences, Vol. 31, No. 2
- State of the Art in Example‐Based Motion Synthesis for Virtual Characters in Interactive Applications
  8 February 2010 | Computer Graphics Forum, Vol. 29, No. 1
- Commentary—Perspectives on Stochastic Optimization Over Time
  John N. Tsitsiklis,
  2 October 2009 | INFORMS Journal on Computing, Vol. 22, No. 1
- Acceleration Operators in the Value Iteration Algorithms for Markov Decision Processes
  Oleksandr Shlakhter,
  Chi-Guhn Lee,
  Dmitry Khmelev,
  Nasser Jaber,
  23 September 2009 | Operations Research, Vol. 58, No. 1
- Min–max control using parametric approximate dynamic programming
  Control Engineering Practice, Vol. 18, No. 2
- Control of Diffusions via Linear Programming
  18 October 2010
- The Persistence and Effectiveness of Large-Scale Mathematical Programming Strategies: Projection, Outer Linearization, and Inner Linearization
  27 August 2010
- Algorithms for Reinforcement Learning
  11 March 2022
- APPROXIMATE DYNAMIC PROGRAMMING TECHNIQUES FOR THE CONTROL OF TIME-VARYING QUEUING SYSTEMS APPLIED TO CALL CENTERS WITH ABANDONMENTS AND RETRIALS
  21 December 2009 | Probability in the Engineering and Informational Sciences, Vol. 24, No. 1
- Approximate Dynamic Programming for Capacity Allocation in the Service Industry
  SSRN Electronic Journal, Vol. 52
- Comparing LP Bounds for Queueing Networks
  IEEE Transactions on Automatic Control, Vol. 54, No. 11
- Partially Observable Markov Decision Process Approximations for Adaptive Sensing
  28 May 2009 | Discrete Event Dynamic Systems, Vol. 19, No. 3
- A dynamic programming extension to the steady state refinery-LP
  European Journal of Operational Research, Vol. 197, No. 2
- An Approximate Dynamic Programming Approach to Network Revenue Management with Customer Choice
  Dan Zhang,
  Daniel Adelman,
  29 June 2009 | Transportation Science, Vol. 43, No. 3
- An approximate dynamic programming approach for the vehicle routing problem with stochastic demands
  European Journal of Operational Research, Vol. 196, No. 2
- Constraint relaxation in approximate linear programs
  14 June 2009
- Learning Representation and Control in Markov Decision Processes: New Frontiers
  2 June 2009 | Foundations and Trends® in Machine Learning, Vol. 1, No. 4
- Average-delay optimal policies for the point-to-point channel
- Reinforcement Learning: A Tutorial Survey and Recent Advances
  Abhijit Gosavi,
  19 December 2008 | INFORMS Journal on Computing, Vol. 21, No. 2
- Practical solution techniques for first-order MDPs
  Artificial Intelligence, Vol. 173, No. 5-6
- A New Learning Algorithm for Optimal Stopping
  1 November 2008 | Discrete Event Dynamic Systems, Vol. 19, No. 1
- A Dynamic Programming Approach for QoS-Aware Power Management in Wireless Video Sensor Networks
  IEEE Transactions on Vehicular Technology, Vol. 58, No. 2
- Combinatorial Design of a Stochastic Markov Decision Process
- Stability and Asymptotic Optimality of Generalized MaxWeight Policies
  SIAM Journal on Control and Optimization, Vol. 47, No. 6
- Approximate dynamic programming approach for process control
  IFAC Proceedings Volumes, Vol. 42, No. 11
- Dynamic Multipriority Patient Scheduling for a Diagnostic Resource
  Jonathan Patrick,
  Martin L. Puterman,
  Maurice Queyranne,
  1 December 2008 | Operations Research, Vol. 56, No. 6
- Formal models and algorithms for decentralized decision making under uncertainty
  14 February 2008 | Autonomous Agents and Multi-Agent Systems, Vol. 17, No. 2
- Reducing Computational Complexity in Markov Decision Processes Using Abstract Actions
- Structural Properties of Optimal Transmission Policies Over a Randomly Varying Channel
  IEEE Transactions on Automatic Control, Vol. 53, No. 6
- Relaxations of Weakly Coupled Stochastic Dynamic Programs
  Daniel Adelman,
  Adam J. Mersereau,
  14 January 2008 | Operations Research, Vol. 56, No. 3
- Computational Performance Bounds for Markov Chains With Applications
  IEEE Transactions on Automatic Control, Vol. 53, No. 5
- Power distribution network expansion scheduling using dynamic programming genetic algorithm
  IET Generation, Transmission & Distribution, Vol. 2, No. 3
- Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm
  Sumit Kunnumkal,
  Huseyin Topaloglu,
  25 February 2008 | INFORMS Journal on Computing, Vol. 20, No. 2
- An on-line learning algorithm for energy efficient delay constrained scheduling over a fading channel
  IEEE Journal on Selected Areas in Communications, Vol. 26, No. 4
- Dynamic asset allocation strategies using a stochastic dynamic programming aproach
- Successive Linear Approximation Solution of Infinite-Horizon Dynamic Stochastic Programs
  SIAM Journal on Optimization, Vol. 18, No. 4
- Closed Reentrant Queueing Networks Under Affine Index Policies: Throughput Bounds, Examples and Asymptotic Loss
  IFAC Proceedings Volumes, Vol. 41, No. 2
- Price-Directed Control of a Closed Logistics Queueing Network
  Daniel Adelman,
  20 August 2007 | Operations Research, Vol. 55, No. 6
- Refined experimental design and regression splines method for network revenue management
  2 October 2007 | Journal of Revenue and Pricing Management, Vol. 6, No. 3
- Mission Health Management for 24/7 Persistent Surveillance Operations
  15 June 2007
- An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces
  Diego Klabjan,
  Daniel Adelman,
  1 August 2007 | Mathematics of Operations Research, Vol. 32, No. 3
- Dynamic Bid Prices in Revenue Management
  Daniel Adelman,
  1 August 2007 | Operations Research, Vol. 55, No. 4
- An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes
  Jiaqiao Hu,
  Michael C. Fu,
  Vahid R. Ramezani,
  Steven I. Marcus,
  1 May 2007 | INFORMS Journal on Computing, Vol. 19, No. 2
- Symmetric approximate linear programming for factored MDPs with application to constrained problems
  25 January 2007 | Annals of Mathematics and Artificial Intelligence, Vol. 47, No. 3-4
- Computing and Using Lower and Upper Bounds for Action Elimination in MDP Planning
- Reinforcement Learning Algorithms Based on mGA and EA with Policy Iterations
- Chapter 22 Duality Theory and Approximate Dynamic Programming for Pricing American Options and Portfolio Optimization
- Reinforcement Learning: An On-Line Framework Using Support Vectors
- Performance Bounds in $L_p$‐norm for Approximate Value Iteration
  SIAM Journal on Control and Optimization, Vol. 46, No. 2
- Computational Methods for Oblivious Equilibrium
  SSRN Electronic Journal, Vol. 3
- ACCOUNTING RISK IN MULTISTAGE STOCHASTIC PROBLEMS USING APPROXIMATE DYNAMIC PROGRAMMING
  IFAC Proceedings Volumes, Vol. 40, No. 5
- Approximate dynamic programming methods for an inventory allocation problem under uncertainty
  13 June 2006 | Naval Research Logistics (NRL), Vol. 53, No. 8
- Multi-Agent Task Assignment in the Bandit Framework
- Approximate Solutions of a Dynamic Forecast-Inventory Model
  Tetsuo Iida,
  Paul H. Zipkin,
  1 October 2006 | Manufacturing & Service Operations Management, Vol. 8, No. 4
- Relaxed dynamic programming in switching systems
  IEE Proceedings - Control Theory and Applications, Vol. 153, No. 5
- Approximate dynamic programming based approach to process control and scheduling
  Computers & Chemical Engineering, Vol. 30, No. 10-12
- A Cost-Shaping Linear Program for Average-Cost Approximate Dynamic Programming with Performance Guarantees
  Daniela Pucci de Farias,
  Benjamin Van Roy,
  1 August 2006 | Mathematics of Operations Research, Vol. 31, No. 3
- Towards Cognitive Machines: Multiscale Measures and Analysis
- Resource allocation among agents with preferences induced by factored MDPs
  8 May 2006
- Performance Loss Bounds for Approximate Value Iteration with State Aggregation
  Benjamin Van Roy,
  1 May 2006 | Mathematics of Operations Research, Vol. 31, No. 2
- The Effects of Locality and Asymmetry in Large-Scale Multiagent MDPs
- Chapter 5 Dynamic Asset Allocation Strategies Using a Stochastic Dynamic Programming Approach
- Decentralized approximate dynamic programming for dynamic networks of agents
- An approximate dynamic programming approach for a product distribution problem
  IIE Transactions, Vol. 37, No. 8
- Response to Comments on Brandão et al. (2005)
  Luiz E. Brandão,
  James S. Dyer,
  Warren J. Hahn,
  1 June 2005 | Decision Analysis, Vol. 2, No. 2
- Robust Dynamic Programming
  Garud N. Iyengar,
  1 May 2005 | Mathematics of Operations Research, Vol. 30, No. 2
- A Distributed Decision-Making Structure for Dynamic Resource Allocation Using Nonlinear Functional Approximations
  Huseyin Topaloglu,
  Warren B. Powell,
  1 April 2005 | Operations Research, Vol. 53, No. 2
- LP Modeling for Asset-Liability Management: A Survey of Choices and Simplifications
  ManMohan S. Sodhi,
  1 April 2005 | Operations Research, Vol. 53, No. 2
- On Convergence Conditions of an Extended Projection Neural Network
  Neural Computation, Vol. 17, No. 3
- Modeling Medical Treatment Using Markov Decision Processes
- On Approximate Dynamic Programming in Switching Systems
- On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming
  Daniela Pucci de Farias,
  Benjamin Van Roy,
  1 August 2004 | Mathematics of Operations Research, Vol. 29, No. 3
- A Price-Directed Approach to Stochastic Inventory/Routing
  Daniel Adelman,
  1 August 2004 | Operations Research, Vol. 52, No. 4
- Price-Directed Replenishment of Subsets: Methodology and Its Application to Inventory Routing
  Daniel Adelman,
  1 October 2003 | Manufacturing & Service Operations Management, Vol. 5, No. 4

Volume 51, Issue 6

November-December 2003

Pages 839-1016

Article Information

Metrics

Information

Received:September 01, 2001
Accepted:July 01, 2002
Published Online:December 01, 2003

Cite as

D. P. de Farias, B. Van Roy, (2003) The Linear Programming Approach to Approximate Dynamic Programming. Operations Research 51(6):850-865.

https://doi.org/10.1287/opre.51.6.850.24925

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

The Linear Programming Approach to Approximate Dynamic Programming

References

Volume 51, Issue 6

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News