Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach

Keith W. Ross
Department of Systems, University of Pennsylvania, Philadelphia, Pennsylvania 19104
Ravi Varadarajan
Department of Computer and Information Science, University of Florida, Gainesville, Florida 32611

Published Online: 1 Feb 1991
https://doi.org/10.1287/moor.16.1.195

Abstract

We consider finite-state, finite-action Markov decision processes that accumulate both a reward and a cost at each decision epoch. We study the problem of finding a policy that maximizes the expected long-run average reward subject to the constraint that the long-run average cost be no greater than a given value with probability one. We establish that if there exists a policy that meets the constraint, then there exists an ε-optimal stationary policy. Furthermore, an algorithm is outlined to locate such an ε-optimal stationary policy. The proof of the result hinges on a decomposition of the state space into maximal recurrent classes and a set of transient states.
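
To make the sample path constraint concrete, the following is a minimal sketch for the much simpler case of one fixed stationary policy; it is not the algorithm outlined in the paper. It assumes the policy has already been reduced to a transition matrix `P` and a per-state cost vector `cost`, splits the induced Markov chain into recurrent classes and transient states, and accepts the policy only if every recurrent class reachable from the start state has stationary average cost at most `alpha`. All names (`decompose`, `stationary`, `meets_constraint`) and the four-state example are hypothetical.

```python
from fractions import Fraction

def reachable(P, start):
    """Set of states reachable from `start` (including itself)."""
    seen, stack = {start}, [start]
    while stack:
        i = stack.pop()
        for j, p in enumerate(P[i]):
            if p > 0 and j not in seen:
                seen.add(j)
                stack.append(j)
    return seen

def decompose(P):
    """Split the chain's states into recurrent classes and transient states."""
    reach = [reachable(P, i) for i in range(len(P))]
    classes, transient, assigned = [], [], set()
    for i in range(len(P)):
        if i in assigned:
            continue
        # i is recurrent iff every state it can reach can also reach i.
        if all(i in reach[j] for j in reach[i]):
            classes.append(sorted(reach[i]))
            assigned |= reach[i]
        else:
            transient.append(i)
    return classes, transient

def stationary(P, cls):
    """Stationary distribution of the chain restricted to one recurrent class."""
    k = len(cls)
    # Rows 0..k-2: balance equations pi P = pi; last row: normalization.
    A = [[Fraction(P[cls[i]][cls[j]]) - (1 if i == j else 0) for i in range(k)]
         for j in range(k - 1)]
    A.append([Fraction(1)] * k)
    b = [Fraction(0)] * (k - 1) + [Fraction(1)]
    # Gauss-Jordan elimination in exact rational arithmetic.
    for col in range(k):
        piv = next(r for r in range(col, k) if A[r][col] != 0)
        A[col], A[piv] = A[piv], A[col]
        b[col], b[piv] = b[piv], b[col]
        for r in range(k):
            if r != col and A[r][col] != 0:
                f = A[r][col] / A[col][col]
                A[r] = [x - f * y for x, y in zip(A[r], A[col])]
                b[r] -= f * b[col]
    return [b[i] / A[i][i] for i in range(k)]

def meets_constraint(P, cost, start, alpha):
    """True if the long-run average cost is <= alpha with probability one."""
    classes, _ = decompose(P)
    reach = reachable(P, start)
    for cls in classes:
        if cls[0] in reach:  # the chain can be absorbed into this class
            pi = stationary(P, cls)
            if sum(p * cost[s] for p, s in zip(pi, cls)) > alpha:
                return False
    return True

# Hypothetical example: state 0 is transient, {1} and {2, 3} are recurrent.
half = Fraction(1, 2)
P = [[0, half, half, 0],
     [0, 1, 0, 0],
     [0, 0, 0, 1],
     [0, 0, half, half]]
cost = [0, 1, 0, 3]
print(decompose(P))
print(meets_constraint(P, cost, 0, Fraction(2)))
print(meets_constraint(P, cost, 0, Fraction(3, 2)))
```

In this example the class {1} has average cost 1 and the class {2, 3} has average cost 2, each entered from state 0 with probability 1/2. The expected long-run average cost from state 0 is therefore 3/2, so a bound of 3/2 holds in expectation, yet it fails on the sample paths absorbed into {2, 3}. This is the distinction between an expected-cost constraint and the probability-one constraint studied here.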

Mathematics of Operations Research, Volume 16, Issue 1, February 1991, Pages 1-222

Cite as

Keith W. Ross, Ravi Varadarajan, (1991) Multichain Markov Decision Processes with a Sample Path Constraint: A Decomposition Approach. Mathematics of Operations Research 16(1):195-207.