Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm

Sumit Kunnumkal
Sumit Kunnumkal
[email protected]
Indian School of Business, Gachibowli, Hyderabad 500032, India
Search for more papers by this author
,
Huseyin Topaloglu
Huseyin Topaloglu
[email protected]
School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14853
Search for more papers by this author

Sumit Kunnumkal

Sumit Kunnumkal

[email protected]

Indian School of Business, Gachibowli, Hyderabad 500032, India

Search for more papers by this author

,

Huseyin Topaloglu

Huseyin Topaloglu

[email protected]

School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14853

Search for more papers by this author

Published Online:25 Feb 2008https://doi.org/10.1287/ijoc.1070.0240

Supplemental Material

ijoc.1070.0240-sm-kunnumkal_and_topaloglu.pdf (105 KB)

cover image INFORMS Journal on Computing

Volume 20, Issue 2

Spring 2008

Pages 169-331

Article Information

Supplemental Material

Metrics

Information

Received:July 01, 2005
Accepted:September 01, 2007
Published Online:February 25, 2008

Copyright © 2008, INFORMS

Cite as

Sumit Kunnumkal, Huseyin Topaloglu, (2008) Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm. INFORMS Journal on Computing 20(2):288-301.

https://doi.org/10.1287/ijoc.1070.0240

Keywords