Approximate Linear Programming for Average Cost MDPs

Michael H. Veatch
Michael H. Veatch
[email protected]
Department of Mathematics, Gordon College, Wenham, Massachusetts 01984
Search for more papers by this author

Department of Mathematics, Gordon College, Wenham, Massachusetts 01984

Published Online:20 Dec 2012https://doi.org/10.1287/moor.1120.0574

Abstract

We consider the linear programming approach to approximate dynamic programming with an average cost objective and a finite state space. Using a Lagrangian form of the linear program (LP), the average cost error is shown to be a multiple of the best fit differential cost error. This result is analogous to previous error bounds for a discounted cost objective. Second, bounds are derived for average cost error and performance of the policy generated from the LP that involve the mixing time of the Markov decision process (MDP) under this policy or the optimal policy. These results improve on a previous performance bound involving mixing times.

cover image Mathematics of Operations Research

Volume 38, Issue 3

August 2013

Pages 393-616

Article Information

Metrics

Information

Received:June 09, 2011
Published Online:December 20, 2012

Cite as

Michael H. Veatch, (2012) Approximate Linear Programming for Average Cost MDPs. Mathematics of Operations Research 38(3):535-544.

https://doi.org/10.1287/moor.1120.0574

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Approximate Linear Programming for Average Cost MDPs

Abstract

Volume 38, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News