On Finding the Maximal Gain for Markov Decision Processes

Published Online:https://doi.org/10.1287/opre.17.5.857

The method of successive approximations for solving problems on single-chain Markovian decision processes has been investigated by White and Schweitzer. This paper shows that White's scheme not only converges, but also can be modified so that monotonic upper and lower bounds on the maximal gain can be obtained.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.