On Stationary Strategies in Countable State Total Reward Markov Decision Processes

Jan van der Wal
Jan van der Wal
University of Technology Eindhoven, P.O. Box 513, 5600 MB Eindhoven, The Netherlands
Search for more papers by this author

University of Technology Eindhoven, P.O. Box 513, 5600 MB Eindhoven, The Netherlands

Published Online:1 May 1984https://doi.org/10.1287/moor.9.2.290

Abstract

This paper deals with total reward Markov decision processes with countable state space. Various partial results from the literature are connected and extended in the following theorem. If in each state where the value is nonpositive a conserving action exists then there exists a stationary strategy f which is uniformly nearly optimal in the following sense: v(f) ≥ v* − ϵu*, where u* is the value of the problem if only the positive rewards are counted.

Further, the following result is established: if an optimal strategy exists then also an optimal stationary strategy exists.

cover image Mathematics of Operations Research

Volume 9, Issue 2

May 1984

Pages 159-316

Article Information

Metrics

Information

Published Online:May 01, 1984

Cite as

Jan van der Wal, (1984) On Stationary Strategies in Countable State Total Reward Markov Decision Processes. Mathematics of Operations Research 9(2):290-300.

https://doi.org/10.1287/moor.9.2.290

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

On Stationary Strategies in Countable State Total Reward Markov Decision Processes

Abstract

Volume 9, Issue 2

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News