Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes

Torpong Cheevaprawatdomrong
Torpong Cheevaprawatdomrong
[email protected]
Jong Stit Co., Ltd., Bangkok, Thailand
Search for more papers by this author
,
Irwin E. Schochetman
Irwin E. Schochetman
[email protected]
Mathematics and Statistics, Oakland University, Rochester, Michigan 48309
Search for more papers by this author
,
Robert L. Smith
Robert L. Smith
[email protected]
Industrial and Operations Engineering, The University of Michigan, Ann Arbor, Michigan 48109
Search for more papers by this author
,
Alfredo Garcia
Alfredo Garcia
[email protected]
Systems and Information Engineering, University of Virginia, Charlottesville, Virginia 22901
Search for more papers by this author

Torpong Cheevaprawatdomrong

[email protected]

Jong Stit Co., Ltd., Bangkok, Thailand

Search for more papers by this author

Irwin E. Schochetman

[email protected]

Mathematics and Statistics, Oakland University, Rochester, Michigan 48309

Search for more papers by this author

Robert L. Smith

[email protected]

Industrial and Operations Engineering, The University of Michigan, Ann Arbor, Michigan 48109

Search for more papers by this author

Alfredo Garcia

[email protected]

Systems and Information Engineering, University of Virginia, Charlottesville, Virginia 22901

Search for more papers by this author

Published Online:1 Feb 2007https://doi.org/10.1287/moor.1060.0224

Abstract

We consider a nonhomogeneous infinite-horizon Markov Decision Process (MDP) problem with multiple optimal first-period policies. We seek an algorithm that, given finite data, delivers an optimal first-period policy. Such an algorithm can thus recursively generate, within a rolling-horizon procedure, an infinite-horizon optimal solution to the original problem. However, it can happen that no such algorithm exists, i.e., the MDP is not well posed. Equivalently, it is impossible to solve the problem with a finite amount of data. Assuming increasing marginal returns in actions (with respect to states) and stochastically increasing state transitions (with respect to actions), we provide an algorithm that is guaranteed to solve the given MDP whenever it is well posed. This algorithm determines, in finite time, a forecast horizon for which an optimal solution delivers an optimal first-period policy. As an application, we solve all well-posed instances of the time-varying version of the classic asset-selling problem.

cover image Mathematics of Operations Research

Volume 32, Issue 1

February 2007

Pages 1-256

Article Information

Metrics

Information

Received:February 04, 2005
Published Online:February 01, 2007

Cite as

Torpong Cheevaprawatdomrong, Irwin E. Schochetman, Robert L. Smith, Alfredo Garcia, (2007) Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes. Mathematics of Operations Research 32(1):51-72.

https://doi.org/10.1287/moor.1060.0224

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes

Abstract

Volume 32, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News