Convergence of Dynamic Programming Models

Hans-Joachim Langen
Hans-Joachim Langen
Graf-Haeseler-Str. 11, D-4600 Dortmund 1, Federal Republic of Germany
Search for more papers by this author

Graf-Haeseler-Str. 11, D-4600 Dortmund 1, Federal Republic of Germany

Published Online:1 Nov 1981https://doi.org/10.1287/moor.6.4.493

Abstract

Weak conditions are presented for approximating dynamic programming models. For a sequence of these models, continuous convergence of the sequence of associated optimal value functions is obtained under the condition that state and action space converge in the sense of Kuratowski, and that the mappings of admissible actions as well as the transition law, the discount factors and the reward functions converge continuously. Further a relation for the associated sets of optimal actions is given. The analysis is based on results about convergence preserving properties of supremum value functions and integrals. The approximation results are extended to so-called upper-semi-continuous convergent sequences and are related to discretization procedures by using projections.

Cited by
- Optimality of Symmetric Independent Policies Under Decentralized Mean-Field Information Sharing for Stochastic Teams and Equivalence with McKean−Vlasov Control of a Representative Agent
  Sina Sanjari,
  Naci Saldi,
  Serdar Yüksel
  12 May 2026 | Mathematics of Operations Research, Vol. 0, No. 0
- Robustness and Approximation of Discrete-Time Mean-Field Games Under Discounted Cost Criterion
  Uğur Aydin,
  Naci Saldi
  30 January 2025 | Mathematics of Operations Research, Vol. 51, No. 1
- Sensitivity of Filter Kernels and Robustness to Incorrect Transition and Measurement Kernel Perturbations in Partially Observable Stochastic Control
- Consistency of Sample-Based Stationary Points for Infinite-Dimensional Stochastic Optimization
  3 January 2025 | SIAM Journal on Optimization, Vol. 35, No. 1
- Another Look at Partially Observed Optimal Stochastic Control: Existence, Ergodicity, and Approximations Without Belief-Reduction
  4 January 2025 | Applied Mathematics & Optimization, Vol. 91, No. 1
- Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning
- Robustness of Stochastic Optimal Control to Approximate Diffusion Models Under Several Cost Evaluation Criteria
  Somnath Pradhan,
  Serdar Yüksel
  12 October 2023 | Mathematics of Operations Research, Vol. 49, No. 4
- Infinite Horizon Average Cost Optimality Criteria for Mean-Field Control
  7 October 2024 | SIAM Journal on Control and Optimization, Vol. 62, No. 5
- On Borkar and Young Relaxed Control Topologies and Continuous Dependence of Invariant Measures on Control Policy
  12 August 2024 | SIAM Journal on Control and Optimization, Vol. 62, No. 4
- Vitali Theorems for Varying Measures
  31 July 2024 | Symmetry, Vol. 16, No. 8
- Control Policy Topologies and Dynamic Programming Methods in Decentralized Stochastic Control
  2 February 2024
- Many-Agent Convex and Non-convex Exchangeable (Mean-Field) Teams and Optimality of Symmetric Policies
  2 February 2024
- Comparison of Information Structures and Their Blackwell Ordering
  2 February 2024
- Partially Observed Discrete-Time Risk-Sensitive Mean Field Games
  7 June 2022 | Dynamic Games and Applications, Vol. 13, No. 3
- Sequential Stochastic Control (Single or Multi-Agent) Problems Nearly Admit Change of Measures with Independent Measurement
  13 March 2023 | Applied Mathematics & Optimization, Vol. 87, No. 3
- Comparison of Two Convergence Criterion in the Optimization Process Using a Recursive Method in a Multi-Reservoir System
  21 September 2022 | Water, Vol. 14, No. 19
- Robustness to incorrect models and data-driven learning in average-cost optimal stochastic control
  Automatica, Vol. 139
- Zero-Delay Lossy Coding of Linear Vector Markov Sources: Optimality of Stationary Codes and Near Optimality of Finite Memory Codes
  IEEE Transactions on Information Theory, Vol. 68, No. 5
- A stability result for linear Markovian stochastic optimization problems
  6 October 2020 | Mathematical Programming, Vol. 191, No. 2
- Learning granger causality for non-stationary Hawkes processes
  Neurocomputing, Vol. 468
- Geometry of information structures, strategic measures and associated stochastic control topologies
  Probability Surveys, Vol. 19, No. none
- Hyperspace Neighbor Penetration Approach to Dynamic Programming for Model-Based Reinforcement Learning Problems with Slowly Changing Variables in a Continuous State Space
- Dynamic Capacity Allocation for Elective Surgeries: Reducing Urgency-Weighted Wait Times
  Stephanie Carew,
  Mahesh Nagarajan,
  Steven Shechter,
  Jugpal Arneja,
  Erik Skarsgard
  20 April 2020 | Manufacturing & Service Operations Management, Vol. 23, No. 2
- Optimal Solutions to Infinite-Player Stochastic Teams and Mean-Field Teams
  IEEE Transactions on Automatic Control, Vol. 66, No. 3
- Regularity and Stability of Feedback Relaxed Controls
  13 September 2021 | SIAM Journal on Control and Optimization, Vol. 59, No. 5
- Comparison of Information Structures for Zero-Sum Games and a Partial Converse to Blackwell Ordering in Standard Borel Spaces
  6 May 2021 | SIAM Journal on Control and Optimization, Vol. 59, No. 3
- Approximate Markov-Nash Equilibria for Discrete-Time Risk-Sensitive Mean-Field Games
  Naci Saldi,
  Tamer Başar,
  Maxim Raginsky
  12 August 2020 | Mathematics of Operations Research, Vol. 45, No. 4
- A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces
  14 September 2020 | Journal of Optimization Theory and Applications, Vol. 187, No. 1
- Asymptotic Optimality of Finite Model Approximations for Partially Observed Markov Decision Processes With Discounted Cost
  IEEE Transactions on Automatic Control, Vol. 65, No. 1
- Robustness to Incorrect System Models in Stochastic Control
  27 April 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 2
- A Universal Dynamic Program and Refined Existence Results for Decentralized Stochastic Control
  8 September 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 5
- Stochastic Subgradient Methods for Dynamic Programming in Continuous State and Action Spaces
- Robustness to Incorrect Models in Average-Cost Optimal Stochastic Control
- Approximate Nash Equilibria in Partially Observed Stochastic Games with Mean-Field Interactions
  Naci Saldi,
  Tamer Başar,
  Maxim Raginsky
  30 May 2019 | Mathematics of Operations Research, Vol. 44, No. 3
- Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes
  IEEE Transactions on Automatic Control, Vol. 64, No. 7
- Robustness to Incorrect Priors in Partially Observed Stochastic Control
  28 May 2019 | SIAM Journal on Control and Optimization, Vol. 57, No. 3
- Introduction and Summary
  12 May 2018
- Markov--Nash Equilibria in Mean-Field Games with Discounted Cost
  27 November 2018 | SIAM Journal on Control and Optimization, Vol. 56, No. 6
- On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
  Naci Saldi,
  Serdar Yüksel,
  Tamás Linder
  15 March 2017 | Mathematics of Operations Research, Vol. 42, No. 4
- A continuity question of Dubins and Savage
  22 June 2017 | Journal of Applied Probability, Vol. 54, No. 2
- Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures
  Journal of Mathematical Analysis and Applications, Vol. 443, No. 2
- Near optimality of quantized policies in stochastic control under weak continuity conditions
  Journal of Mathematical Analysis and Applications, Vol. 435, No. 1
- Finite-state approximations to constrained Markov decision processes with Borel spaces
- Jointly continuous utility functions on submetrizable kω -spaces
  Topology and its Applications, Vol. 190
- Convergence of partial maps
  Journal of Mathematical Analysis and Applications, Vol. 419, No. 2
- Stochastic approximations of constrained discounted Markov decision processes
  Journal of Mathematical Analysis and Applications, Vol. 413, No. 2
- Inflationary equilibrium in a stochastic economy with independent agents
  Journal of Mathematical Economics, Vol. 52
- The Structure of Extended Real-valued Metric Spaces
  25 September 2013 | Set-Valued and Variational Analysis, Vol. 21, No. 4
- Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes
  SIAM Journal on Control and Optimization, Vol. 51, No. 2
- Discounted Continuous-Time Controlled Markov Chains: Convergence of Control Models
  1 March 2016 | Journal of Applied Probability, Vol. 49, No. 04
- Discounted Continuous-Time Controlled Markov Chains: Convergence of Control Models
  30 January 2018 | Journal of Applied Probability, Vol. 49, No. 4
- Approximation of Markov decision processes with general state space
  Journal of Mathematical Analysis and Applications, Vol. 388, No. 2
- Approximation of Infinite Horizon Discounted Cost Markov Decision Processes
  12 July 2012
- Vietoris topology on partial maps with compact domains
  Topology and its Applications, Vol. 157, No. 8
- Čech-completeness and related properties of the generalized compact-open topology
  Journal of Applied Analysis, Vol. 16, No. 1
- Inflationary Equilibrium in a Stochastic Economy with Independent Agents
  SSRN Electronic Journal, Vol. 35
- Boundedly UC Spaces and Topologies on Function Spaces
  28 May 2008 | Set-Valued Analysis, Vol. 16, No. 4
- An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces
  Diego Klabjan,
  Daniel Adelman,
  1 August 2007 | Mathematics of Operations Research, Vol. 32, No. 3
- On Stability of Multistage Stochastic Decision Problems
- The Preservation of Continuity and Lipschitz Continuity by Optimal Reward Operators
  Rida Laraki,
  William D. Sudderth,
  1 August 2004 | Mathematics of Operations Research, Vol. 29, No. 3
- Reflections on Output Analysis for Multistage Stochastic Linear Programs
- Existence and Uniqueness of Solutions to the Bellman Equation in the Unbounded Case
  Econometrica, Vol. 71, No. 5
- Stability of multipacket slotted Aloha with selfish users and perfect information
- Completeness properties of the generalized compact-open topology on partial functions with closed domains
  Topology and its Applications, Vol. 110, No. 3
- Limiting Discounted-Cost Control of Partially Observable Stochastic Systems
  SIAM Journal on Control and Optimization, Vol. 40, No. 2
- A strategic market game with active bankruptcy
  Journal of Mathematical Economics, Vol. 34, No. 3
- Complete metrizability of generalized compact-open topology
  Topology and its Applications, Vol. 91, No. 2
- Bibliography
  27 May 2008
- A strategic market game with secured lending
  Journal of Mathematical Economics, Vol. 28, No. 2
- Rates of Convergence in Stochastic Programs with Complete Integer Recourse
  SIAM Journal on Optimization, Vol. 6, No. 4
- Stability Results for Stochastic Programs and Sensors, Allowing for Discontinuous Objective Functions
  SIAM Journal on Optimization, Vol. 4, No. 3
- Bibliography
  27 May 2008
- Approximation theory for stochastic variational and Ky Fan inequalities in finite dimensions
  Annals of Operations Research, Vol. 44, No. 1
- Some structured dynamic programs arising in economics
  Computers & Mathematics with Applications, Vol. 24, No. 8-9
- On truncations and perturbations of Markov decision problems with an application to queueing network overflow control
  Annals of Operations Research, Vol. 29, No. 1
- Markov decision processes
  European Journal of Operational Research, Vol. 39, No. 1
- Discretization procedures for adaptive Markov control processes
  Journal of Mathematical Analysis and Applications, Vol. 137, No. 2
- Weak convergence of approximate solutions of stochastic equations with applications to random differential and integral equations ∗
  Numerical Functional Analysis and Optimization, Vol. 9, No. 1-2
- Approximation and bounds in discrete event dynamic programming
  IEEE Transactions on Automatic Control, Vol. 31, No. 3
- Concepts of similarity for utility functions
  Journal of Mathematical Economics, Vol. 15, No. 2
- Optimal control of arrivals to multiserver queues in a random environment
  14 July 2016 | Journal of Applied Probability, Vol. 21, No. 03
- Optimal control of arrivals to multiserver queues in a random environment
  14 July 2016 | Journal of Applied Probability, Vol. 21, No. 3
- Applying the method of phases in the optimization of queuing systems
  1 July 2016 | Advances in Applied Probability, Vol. 14, No. 1
- Approximations of inventory models
  Zeitschrift für Operations Research, Vol. 25, No. 5

cover image Mathematics of Operations Research

Volume 6, Issue 4

November 1981

Pages 475-629

Article Information

Metrics

Information

Published Online:November 01, 1981

Cite as

Hans-Joachim Langen, (1981) Convergence of Dynamic Programming Models. Mathematics of Operations Research 6(4):493-512.

https://doi.org/10.1287/moor.6.4.493

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Convergence of Dynamic Programming Models

Abstract

Volume 6, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News