Convergence of Dynamic Programming Models

Published Online:https://doi.org/10.1287/moor.6.4.493

Weak conditions are presented for approximating dynamic programming models. For a sequence of these models, continuous convergence of the sequence of associated optimal value functions is obtained under the condition that state and action space converge in the sense of Kuratowski, and that the mappings of admissible actions as well as the transition law, the discount factors and the reward functions converge continuously. Further a relation for the associated sets of optimal actions is given. The analysis is based on results about convergence preserving properties of supremum value functions and integrals. The approximation results are extended to so-called upper-semi-continuous convergent sequences and are related to discretization procedures by using projections.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.