On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

Published Online:https://doi.org/10.1287/moor.1120.0562

References

  • Abounadi J, Bertsekas DP, Borkar V. Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms. SIAM J. Control Optim. (2002) 41(1):1–22CrossrefGoogle Scholar
  • Bertsekas DP, Tsitsiklis JN. An analysis of stochastic shortest path problems. Math. Oper. Res. (1991) 16(3):580–595LinkGoogle Scholar
  • Bertsekas DP, Tsitsiklis JN. Neuro-Dynamic Programming (1996) (Athena Scientific, Belmont, MA) Google Scholar
  • Borkar VS. Stochastic Approximation: A Dynamic Viewpoint (2008) (Hindustan Book Agency, New Delhi) Google Scholar
  • Kushner HJ, Yin GG. Stochastic Approximation and Recursive Algorithms and Applications (2003) 2nd ed.(Springer-Verlag, New York) Google Scholar
  • Puterman ML. Markov Decision Processes: Discrete Stochastic Dynamic Programming (1994) (John Wiley & Sons, New York) CrossrefGoogle Scholar
  • Seneta E. Nonnegative Matrices and Markov Chains (1981) 2nd ed.(Springer-Verlag, New York) CrossrefGoogle Scholar
  • Tsitsiklis JN. Asynchronous stochastic approximation and Q-learning. Machine Learn. (1994) 16(3):185–202CrossrefGoogle Scholar
  • Watkins CJCH. Learning from delayed rewards. (1989) . Ph.D. thesis, Cambridge University, EnglandGoogle Scholar
  • Yu H. Some proof details for asynchronous stochastic approximation algorithms. (2011) . On-line at: http://www.mit.edu/∼janey_yu/note_asaproofs.pdfGoogle Scholar
  • Yu H. Stochastic shortest path games and Q-learning. (2011) . LIDS Technical Report 2875, MITGoogle Scholar
  • Yu H, Bertsekas DP. Q-learning and policy iteration algorithms for stochastic shortest path problems. Ann. Oper. Res. (2012) . Forthcoming DOI: 10.1007/s10479-012-1128-zGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.