Markov Chain Design Problems

Richard V. Evans
Richard V. Evans
University of Illinois, Urbana-Champaign, Illinois
Search for more papers by this author

University of Illinois, Urbana-Champaign, Illinois

Published Online:1 Oct 1981https://doi.org/10.1287/opre.29.5.959

Abstract

System design problems using Markov chains and denoted Markov chain design problems are introduced. It is assumed that the chains of the problem have transition matrices and one period conditional expected rewards which are differentiable functions of a vector parameter. A gradient algorithm for maximizing the sum of the discounted expected reward or the limit of the one period expected reward is presented. The algorithm uses approximate objective function values and approximate gradients at each stage. Numerical work on a simple one dimensional queueing model solved the design problem correctly and quickly even though minimal approximation accuracy was used. The convergence of the approximation is proven for an appropriately converging sequence of design parameter values.

Volume 29, Issue 5

September-October 1981

Pages 829-1034

Article Information

Metrics

Information

Published Online:October 01, 1981

Cite as

Richard V. Evans, (1981) Markov Chain Design Problems. Operations Research 29(5):959-970.

https://doi.org/10.1287/opre.29.5.959

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Markov Chain Design Problems

Abstract

Volume 29, Issue 5

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News