Finite-Memory Suboptimal Design for Partially Observed Markov Decision Processes

Chelsea C. White, III
Chelsea C. White, III
University of Michigan, Ann Arbor, Michigan
Search for more papers by this author
,
William T. Scherer
William T. Scherer
University of Virginia, Charlottesville, Virginia
Search for more papers by this author

Chelsea C. White, III

University of Michigan, Ann Arbor, Michigan

Search for more papers by this author

William T. Scherer

University of Virginia, Charlottesville, Virginia

Search for more papers by this author

Published Online:1 Jun 1994https://doi.org/10.1287/opre.42.3.439

Abstract

We develop bounds on the value function and a suboptimal design for the partially observed Markov decision process. These bounds and suboptimal design are based on the M most recent observations and actions. An a priori measure of the quality of these bounds is given. We show that larger M implies tighter bounds. An operations count analysis indicates that (^#A^#Z)^M⁺¹(^#S) multiplications and additions are required per successive approximations iteration of the suboptimal design algorithm, where A, Z, and S are the action, observation, and state spaces, respectively, suggesting the algorithm is of potential use for problems with large state spaces. A preliminary numerical study indicates that the quality of the suboptimal design can be excellent.

Volume 42, Issue 3

May-June 1994

Pages 390-575

Article Information

Metrics

Information

Published Online:June 01, 1994

Cite as

Chelsea C. White, III, William T. Scherer, (1994) Finite-Memory Suboptimal Design for Partially Observed Markov Decision Processes. Operations Research 42(3):439-455.

https://doi.org/10.1287/opre.42.3.439

Keywords

dynamic programming: Markov decision processes

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Finite-Memory Suboptimal Design for Partially Observed Markov Decision Processes

Abstract

Volume 42, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News