A Strongly Polynomial Algorithm for Controlled Queues

Alexander Zadorojniy
Alexander Zadorojniy
[email protected]
School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 69978, Israel
Search for more papers by this author
,
Guy Even
Guy Even
[email protected]
School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 69978, Israel
Search for more papers by this author
,
Adam Shwartz
Adam Shwartz
[email protected]
Department of Electrical Engineering,Technion, Haifa 32000, Israel
Search for more papers by this author

Alexander Zadorojniy

[email protected]

School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 69978, Israel

Search for more papers by this author

Guy Even

[email protected]

School of Electrical Engineering, Tel-Aviv University, Tel-Aviv 69978, Israel

Search for more papers by this author

Adam Shwartz

[email protected]

Department of Electrical Engineering,Technion, Haifa 32000, Israel

Search for more papers by this author

Published Online:7 Oct 2009https://doi.org/10.1287/moor.1090.0415

Abstract

We consider the problem of computing optimal policies of finite-state finite-action Markov decision processes (MDPs). A reduction to a continuum of constrained MDPs (CMDPs) is presented such that the optimal policies for these CMDPs constitute a path in a graph defined over the deterministic policies. This path contains, in particular, an optimal policy of the original MDP. We present an algorithm based on this new approach that finds this path, and thus an optimal policy. In the general case, this path might be exponentially long in the number of states and actions. We prove that the length of this path is polynomial if the MDP satisfies a coupling property. Thus we obtain a strongly polynomial algorithm for MDP s that satisfies the coupling property. We prove that discrete time versions of controlled M/M/1 queues induce MDP s that satisfy the coupling property. The only previously known polynomial algorithm for controlled M/M/1 queues in the expected average cost model is based on linear programming (and is not known to be strongly polynomial). Our algorithm works both for the discounted and expected average cost models, and the running time does not depend on the discount factor.

cover image Mathematics of Operations Research

Volume 34, Issue 4

November 2009

Pages 769-1024

Article Information

Metrics

Information

Received:July 16, 2008
Published Online:October 07, 2009

Cite as

Alexander Zadorojniy, Guy Even, Adam Shwartz, (2009) A Strongly Polynomial Algorithm for Controlled Queues. Mathematics of Operations Research 34(4):992-1007.

https://doi.org/10.1287/moor.1090.0415

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

A Strongly Polynomial Algorithm for Controlled Queues

Abstract

Volume 34, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News