Constrained Markov Decision Models with Weighted Discounted Rewards

Eugene A. Feinberg
303 Harriman Hall, SUNY at Stony Brook, Stony Brook, NY 11794-3775
Adam Shwartz
Department of Electrical Engineering, Technion-Israel Institute of Technology, Haifa 32000, Israel

Published Online: 1 May 1995
https://doi.org/10.1287/moor.20.2.302

Abstract

This paper deals with constrained optimization of Markov Decision Processes. Both the objective function and the constraints are sums of standard discounted rewards, each with a different discount factor. Such models arise, e.g., in production and in applications involving multiple time scales. We prove that if a feasible policy exists, then there exists an optimal policy which is (i) stationary (nonrandomized) from some step onward, and (ii) randomized Markov before this step, with the total number of actions added by randomization bounded by the number of constraints. Optimality of such policies for multi-criteria problems is also established.
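
For orientation, the model described above can be written roughly as follows. The notation is assumed here for illustration and is not taken from the paper: x is the initial state, \pi a policy, K the number of constraints, c_k the constraint levels, and r_{k,j}, \beta_j the reward functions and discount factors.

```latex
% Assumed notation: x_t, a_t are the state and action at step t,
% \pi is a policy, x the initial state, K the number of constraints.
% For each criterion k = 0, 1, \dots, K there are reward functions r_{k,j}
% and discount factors \beta_j \in [0,1), j = 1, \dots, J.
\[
  W_k(x,\pi) \;=\; \sum_{j=1}^{J} \mathbb{E}_x^{\pi}
      \sum_{t=0}^{\infty} \beta_j^{\,t}\, r_{k,j}(x_t, a_t),
  \qquad k = 0, 1, \dots, K,
\]
\[
  \text{maximize } W_0(x,\pi)
  \quad \text{subject to} \quad
  W_k(x,\pi) \ge c_k, \quad k = 1, \dots, K .
\]
```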

These new policies have the pleasing aesthetic property that the amount of randomization they require over any trajectory is restricted by the number of constraints. This result is new even for constrained optimization with a single discount factor, where the optimality of randomized stationary policies is known. However, a randomized stationary policy may require an infinite number of randomizations over time.
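
The policy structure described here can be illustrated with a small numerical sketch. The code below is not the paper's method: it evaluates, by truncating the infinite sum at a finite horizon, a weighted discounted criterion for a policy that randomizes only before a switching step N and is stationary and nonrandomized afterwards. The two-state model, all numbers, and the names `P`, `terms`, `early`, `late`, `weighted_discounted_value`, and `count_randomizations` are hypothetical.

```python
# Toy illustration (all numbers are made up): evaluate a weighted discounted
# criterion for a policy that randomizes only before step N and is
# stationary and nonrandomized from step N onward.

# Two states (0, 1) and two actions (0, 1).
# P[s][a] is the next-state distribution under action a in state s.
P = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.1, 0.9]],
]

# One reward function per discount factor: (beta_j, r_j[s][a]).
terms = [
    (0.5, [[1.0, 0.0], [0.0, 2.0]]),
    (0.9, [[0.0, 1.0], [1.0, 0.0]]),
]

N = 2  # switching step
# early[t][s] gives the action probabilities at step t < N in state s.
early = [
    [[0.5, 0.5], [1.0, 0.0]],  # step 0: randomize in state 0 only
    [[1.0, 0.0], [1.0, 0.0]],  # step 1: no randomization
]
late = [0, 1]  # from step N on: the single action used in each state


def action_probs(t, s):
    """Action distribution of the policy at step t in state s."""
    if t < N:
        return early[t][s]
    return [1.0 if a == late[s] else 0.0 for a in range(2)]


def weighted_discounted_value(start, horizon):
    """Sum over j of the expected beta_j-discounted reward, truncated at horizon."""
    dist = [1.0 if s == start else 0.0 for s in range(2)]
    total = 0.0
    for t in range(horizon):
        nxt = [0.0, 0.0]
        for s in range(2):
            probs = action_probs(t, s)
            for a in range(2):
                w = dist[s] * probs[a]  # probability of (state s, action a) at step t
                if w == 0.0:
                    continue
                for beta, r in terms:
                    total += w * (beta ** t) * r[s][a]
                for s2 in range(2):
                    nxt[s2] += w * P[s][a][s2]
        dist = nxt
    return total


def count_randomizations():
    """Extra actions with positive probability, summed over (step, state) pairs."""
    return sum(
        sum(1 for p in early[t][s] if p > 0) - 1
        for t in range(N)
        for s in range(2)
    )
```

Because every discount factor is below 1, the truncated value changes very little once the horizon is large, so comparing two horizons gives a quick sanity check. The count returned by `count_randomizations` is one possible reading of "the amount of randomization" and is meant only to make that notion concrete.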

We also formulate a linear programming algorithm for approximate solutions of constrained weighted discounted models.

Mathematics of Operations Research, Volume 20, Issue 2, May 1995, Pages 257-512

Cite as

Eugene A. Feinberg, Adam Shwartz, (1995) Constrained Markov Decision Models with Weighted Discounted Rewards. Mathematics of Operations Research 20(2):302-320.

https://doi.org/10.1287/moor.20.2.302
