Technical Note—Cyclic Variables and Markov Decision Processes

Avery Haviv
Corresponding Author
Avery Haviv
https://orcid.org/0000-0001-6910-077X
Simon Business School, University of Rochester, Rochester, New York 14620
Search for more papers by this author

Avery Haviv

Corresponding Author

Avery Haviv

https://orcid.org/0000-0001-6910-077X

Simon Business School, University of Rochester, Rochester, New York 14620

Search for more papers by this author

Published Online:6 May 2020https://doi.org/10.1287/opre.2019.1913

Abstract

In this paper I develop a cyclic value function iteration, which is an adjustment to the standard value function iteration. When using this algorithm, the inclusion of cyclic variables of any size into the state space of an infinite horizon Markov decision process does not increase the computational complexity of solving for the value function. This result is proven theoretically and shown to closely hold in practice using Monte Carlo simulations.

Volume 68, Issue 4

July-August 2020

Pages ii-v, 965-1284, C2-C3

Article Information

Metrics

Information

Received:July 01, 2017
Accepted:June 28, 2019
Published Online:May 06, 2020

Cite as

Avery Haviv (2020) Technical Note—Cyclic Variables and Markov Decision Processes. Operations Research 68(4):1231-1237.

https://doi.org/10.1287/opre.2019.1913

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Technical Note—Cyclic Variables and Markov Decision Processes

Abstract

Volume 68, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News