Constrained Markov Decision Chains

Cyrus Derman
Cyrus Derman
Columbia University
Search for more papers by this author
,
Arthur F. Veinott, Jr.
Arthur F. Veinott, Jr.
Stanford University
Search for more papers by this author

Cyrus Derman

Columbia University

Search for more papers by this author

Arthur F. Veinott, Jr.

Stanford University

Search for more papers by this author

Published Online:1 Dec 1972https://doi.org/10.1287/mnsc.19.4.389

Abstract

We consider finite state and action discrete time parameter Markov decision chains. The objective is to provide an algorithm for finding a policy that minimizes the long-run expected average cost when there are linear side conditions on the limit points of the expected state-action frequencies. This problem has been solved previously only for the case where every deterministic stationary policy has at most one ergodic class. This note removes that restriction by applying the Dantzig-Wolfe decomposition principle.

Volume 19, Issue 4-part-1

December 1972

Pages 357-463

Article Information

Metrics

Information

Published Online:December 01, 1972

Cite as

Cyrus Derman, Arthur F. Veinott, Jr., (1972) Constrained Markov Decision Chains. Management Science 19(4-part-1):389-390.

https://doi.org/10.1287/mnsc.19.4.389

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Constrained Markov Decision Chains

Abstract

Volume 19, Issue 4-part-1

Article Information

Metrics

Information

Cite as

Sign Up for INFORMS Publications Updates and News