A Weighted Markov Decision Process

Dmitry Krass
University of Toronto, Toronto, Ontario, Canada

Jerzy A. Filar
University of Maryland at Baltimore County, Baltimore, Maryland

Sagnik S. Sinha
Indian Statistical Institute, New Delhi, India

Published Online: 1 Dec 1992
https://doi.org/10.1287/opre.40.6.1180

Abstract

The two most commonly considered reward criteria for Markov decision processes are the discounted reward and the long-term average reward. The first tends to “neglect” the future, concentrating on the short-term rewards, while the second one tends to do the opposite. We consider a new reward criterion consisting of the weighted combination of these two criteria, thereby allowing the decision maker to place more or less emphasis on the short-term versus the long-term rewards by varying their weights. The mathematical implications of the new criterion include: the deterministic stationary policies can be outperformed by the randomized stationary policies, which in turn can be outperformed by the nonstationary policies; an optimal policy might not exist. We present an iterative algorithm for computing an ε-optimal nonstationary policy with a very simple structure.
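As a rough illustration of the weighted criterion, the sketch below evaluates two stationary policies on a toy three-state chain. It rests on several assumptions that the abstract does not state: the weighted value is taken to be λ·(1 − β)·(discounted reward) + (1 − λ)·(average reward), the long-run average is approximated by a finite-horizon mean, and the chain, the rewards, and the names `reward_stream` and `weighted_value` are hypothetical. It is not the iterative algorithm of the paper.

```python
def reward_stream(P, r, start, horizon):
    """Expected reward at steps 0..horizon-1 under a fixed stationary policy.

    P is the transition matrix induced by the policy, r the reward per state.
    """
    n = len(r)
    dist = [1.0 if s == start else 0.0 for s in range(n)]
    stream = []
    for _ in range(horizon):
        stream.append(sum(d * x for d, x in zip(dist, r)))
        # Push the state distribution one step forward: dist <- dist * P.
        dist = [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]
    return stream


def weighted_value(P, r, start, beta, lam, horizon=2000):
    """Assumed form: lam * (1 - beta) * discounted + (1 - lam) * average."""
    stream = reward_stream(P, r, start, horizon)
    discounted = (1 - beta) * sum(beta ** t * x for t, x in enumerate(stream))
    average = sum(stream) / horizon  # finite-horizon proxy for the long-run average
    return lam * discounted + (1 - lam) * average


# Policy a: collect 10 now, then sit in state 1 earning 0 forever.
P_a = [[0, 1, 0], [0, 1, 0], [0, 0, 1]]
r_a = [10, 0, 1]
# Policy b: collect 0 now, then sit in state 2 earning 1 forever.
P_b = [[0, 0, 1], [0, 1, 0], [0, 0, 1]]
r_b = [0, 0, 1]

for lam in (0.1, 0.5):
    va = weighted_value(P_a, r_a, 0, beta=0.5, lam=lam)
    vb = weighted_value(P_b, r_b, 0, beta=0.5, lam=lam)
    print(f"lam={lam}: policy a = {va:.4f}, policy b = {vb:.4f}")
```

Under these assumptions, a small weight on the discounted term favors the policy with the better long-run reward, while a larger weight favors the policy with the large immediate reward, which is the trade-off the abstract describes.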

Volume 40, Issue 6

November-December 1992

Pages 1028-1205

Published Online: December 01, 1992

Cite as

Dmitry Krass, Jerzy A. Filar, Sagnik S. Sinha (1992) A Weighted Markov Decision Process. Operations Research 40(6):1180-1187.

https://doi.org/10.1287/opre.40.6.1180
