Robust Modified Policy Iteration

David L. Kaufman
David L. Kaufman
[email protected]
Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48109
Search for more papers by this author
,
Andrew J. Schaefer
Andrew J. Schaefer
[email protected]
Department of Industrial Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261
Search for more papers by this author

David L. Kaufman

[email protected]

Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48109

Search for more papers by this author

Andrew J. Schaefer

[email protected]

Department of Industrial Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261

Search for more papers by this author

Published Online:6 Jun 2012https://doi.org/10.1287/ijoc.1120.0509

Abstract

Robust dynamic programming (robust DP) mitigates the effects of ambiguity in transition probabilities on the solutions of Markov decision problems. We consider the computation of robust DP solutions for discrete-stage, infinite-horizon, discounted problems with finite state and action spaces. We present robust modified policy iteration (RMPI) and demonstrate its convergence. RMPI encompasses both of the previously known algorithms, robust value iteration and robust policy iteration. In addition to proposing exact RMPI, in which the “inner problem” is solved precisely, we propose inexact RMPI, in which the inner problem is solved to within a specified tolerance. We also introduce new stopping criteria based on the span seminorm. Finally, we demonstrate through some numerical studies that RMPI can significantly reduce computation time.

cover image INFORMS Journal on Computing

Volume 25, Issue 3

Summer 2013

Pages 395-598

Article Information

Metrics

Information

Received:February 01, 2010
Accepted:January 01, 2012
Published Online:June 06, 2012

Cite as

David L. Kaufman, Andrew J. Schaefer, (2012) Robust Modified Policy Iteration. INFORMS Journal on Computing 25(3):396-410.

https://doi.org/10.1287/ijoc.1120.0509

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Robust Modified Policy Iteration

Abstract

Volume 25, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News