Uniqueness and Stability of Optimal Policies of Finite State Markov Decision Processes

Arie Leizarowitz
[email protected]
Department of Mathematics, Technion, Haifa 32000, Israel

Alexander J. Zaslavski
[email protected]
Department of Mathematics, Technion, Haifa 32000, Israel

Published Online: 1 Feb 2007
https://doi.org/10.1287/moor.1060.0232

Mathematics of Operations Research

Volume 32, Issue 1

February 2007

Pages 1-256

Article Information

Received: September 14, 2004
Published Online: February 01, 2007

Cite as

Arie Leizarowitz, Alexander J. Zaslavski (2007) Uniqueness and Stability of Optimal Policies of Finite State Markov Decision Processes. Mathematics of Operations Research 32(1):156-167.

https://doi.org/10.1287/moor.1060.0232
