Monotonically Improving Limit-Optimal Strategies in Finite-State Decision Processes

Theodore P. Hill
Theodore P. Hill
School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia 30332
Search for more papers by this author
,
Jan Van der Wal
Jan Van der Wal
Eindhoven University of Technology, Eindhoven, The Netherlands
Search for more papers by this author

Theodore P. Hill

School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia 30332

Search for more papers by this author

Jan Van der Wal

Eindhoven University of Technology, Eindhoven, The Netherlands

Search for more papers by this author

Published Online:1 Aug 1987https://doi.org/10.1287/moor.12.3.463

Abstract

In every finite-state leavable gambling problem and in every finite-state Markov decision process with discounted, negative or positive reward criteria there exists a Markov strategy which is monotonically improving and optimal in the limit along every history. An example is given to show that for the positive and gambling cases such strategies cannot be constructed by simply switching to a “better” action or gamble at each successive return to a state.

cover image Mathematics of Operations Research

Volume 12, Issue 3

August 1987

Pages 377-568

Article Information

Metrics

Information

Published Online:August 01, 1987

Cite as

Theodore P. Hill, Jan Van der Wal, (1987) Monotonically Improving Limit-Optimal Strategies in Finite-State Decision Processes. Mathematics of Operations Research 12(3):463-473.

https://doi.org/10.1287/moor.12.3.463

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Monotonically Improving Limit-Optimal Strategies in Finite-State Decision Processes

Abstract

Volume 12, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News