A Method for Approximate Solutions to Stochastic Dynamic Programming Problems Using Expectations

Published Online:https://doi.org/10.1287/opre.16.2.296

This note describes and illustrates a computational technique to obtain approximate solutions to stochastic dynamic programming problems. The technique is to replace probability distributions by their corresponding expectations, and to use the values of the states in the corresponding deterministic system under its optimal policy to determine an approximate policy in the stochastic system through a single application of Howard's policy improvement operation. Two examples are given.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.