Finite Horizon Markov Decision Processes with Uncertain Terminal Payoffs

Published Online:https://doi.org/10.1287/opre.43.5.862

This paper deals with the problem of finding an estimate of the maximal loss of optimality which can arise when terminal payoffs are uncertain and policies are restricted in some way. The original non-convex optimization problem is converted to a sequence of sub-problems involving the maximization of a bilinear function over a convex region. The paper deals solely with the theoretical issues.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.