Optimum Policy Regions for Markov Processes with Discounting

Richard D. Smallwood
Richard D. Smallwood
Stanford University, Stanford, California
Search for more papers by this author

Stanford University, Stanford, California

Published Online:1 Aug 1966https://doi.org/10.1287/opre.14.4.658

Abstract

In many practical situations the discount factor for future rewards and costs is not known precisely. In the modeling of such situations, this is often reflected in a dependence of the optimum policy on the discount factor. We discuss this dependence of the optimum policy on discount factor for the class of finite-state, time-invariant, Markov models. A procedure is developed for finding the value of the discount factor for which we are indifferent between two policies. This is then extended to a discussion of how we can find the complete description of the optimum policy regions over any range of the discount factor. Two examples are presented.

Volume 14, Issue 4

July-August 1966

Pages 555-756

Article Information

Metrics

Information

Published Online:August 01, 1966

Cite as

Richard D. Smallwood, (1966) Optimum Policy Regions for Markov Processes with Discounting. Operations Research 14(4):658-669.

https://doi.org/10.1287/opre.14.4.658

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Optimum Policy Regions for Markov Processes with Discounting

Abstract

Volume 14, Issue 4

Article Information

Metrics

Information

Cite as

Sign Up for INFORMS Publications Updates and News