Technical Note—Bounds on the Gain of a Markov Decision Process

N. A. J. Hastings
University of Birmingham, Birmingham, England

Published Online: 1 Feb 1971
https://doi.org/10.1287/opre.19.1.240

Abstract

An algorithm for the steady-state solution of Markov decision problems has been proposed by Howard and modified by Hastings. This note shows, for the case of single-chain Markov decision processes, how bounds on the optimal gain can be obtained at each cycle of the foregoing algorithms. The results extend to Markov renewal programming. Related results are the bounds proposed by Odoni for use with White's value-iteration method of optimization.
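As a rough illustration of the related value-iteration bounds mentioned at the end of the abstract, the sketch below tracks the smallest and largest one-sweep change in the value function. For a single-chain process these two quantities are commonly stated to bracket the optimal gain. This is not the note's own procedure for the Howard or Hastings algorithms, whose formulas are not reproduced here. The function names, the two-state example data, and the choice of state 0 as the reference state are all assumptions made for illustration.

```python
# Illustrative sketch only: value-iteration bounds on the optimal gain
# (average reward per step) of a small, hypothetical single-chain MDP.
# rewards[i][a]        -> one-step reward in state i under action a
# transitions[i][a][j] -> probability of moving from state i to j under a


def sweep_with_bounds(rewards, transitions, values):
    """Do one value-iteration sweep; return (new_values, lower, upper)."""
    new_values = []
    for i in range(len(values)):
        best = max(
            rewards[i][a]
            + sum(p * v for p, v in zip(transitions[i][a], values))
            for a in range(len(rewards[i]))
        )
        new_values.append(best)
    # The per-state changes bracket the optimal gain (single-chain case).
    diffs = [new - old for new, old in zip(new_values, values)]
    return new_values, min(diffs), max(diffs)


def bound_gain(rewards, transitions, tol=1e-8, max_sweeps=10_000):
    """Iterate until the bracket on the optimal gain is narrower than tol."""
    values = [0.0] * len(rewards)
    lower, upper = float("-inf"), float("inf")
    for _ in range(max_sweeps):
        new_values, lo, hi = sweep_with_bounds(rewards, transitions, values)
        # Keep the tightest bracket seen so far.
        lower, upper = max(lower, lo), min(upper, hi)
        # Relative values: subtract the value of reference state 0 so the
        # numbers stay bounded (an assumed normalization).
        ref = new_values[0]
        values = [v - ref for v in new_values]
        if upper - lower < tol:
            break
    return lower, upper


# Hypothetical two-state, two-action example. Enumerating its four
# stationary policies gives gains 2, 8/3, 30/13 and 10/3, so the
# optimal gain is 10/3.
example_rewards = [[5.0, 10.0], [-1.0, -2.0]]
example_transitions = [
    [[0.5, 0.5], [0.0, 1.0]],
    [[0.5, 0.5], [0.8, 0.2]],
]

if __name__ == "__main__":
    print(bound_gain(example_rewards, example_transitions))
```

Because the bracket is available after every sweep, the iteration can stop as soon as the gap between the two bounds is small enough for the application, rather than waiting for the values themselves to settle.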

Volume 19, Issue 1

January-February 1971

Pages 1-256

Cite as

N. A. J. Hastings, (1971) Technical Note—Bounds on the Gain of a Markov Decision Process. Operations Research 19(1):240-244.

https://doi.org/10.1287/opre.19.1.240
