On the Speed of Convergence of Value Iteration on Stochastic Shortest-Path Problems

Published Online:https://doi.org/10.1287/moor.1060.0238

References

  • Barto A., Bradtke S., Singh S. Learning to act using real-time dynamic programming. Artificial Intelligence (1995) 72:81–138CrossrefGoogle Scholar
  • Bellman R.Dynamic Programming (1957) (Princeton University Press, Princeton, NJ) Google Scholar
  • Bertsekas D.Dynamic Programming and Optimal Control (1995) (Athena Scientific, Belmont, MA) Google Scholar
  • Bertsekas D., Tsitsiklis J.Parallel and Distributed Computation: Numerical Methods (1989) (Prentice Hall, Englewood Cliffs, NJ) Google Scholar
  • Bertsekas D., Tsitsiklis J. An analysis of stochastic shortest-path problems. Math. Oper. Res. (1991) 16:580–595LinkGoogle Scholar
  • Bonet B.Modeling and Solving Sequential Decision Tasks with Uncertainty and Partial Information (2003) . Ph.D. thesis, University of California, Los Angeles, CAGoogle Scholar
  • Dechter R., Pearl J. Generalized best-first search strategies and the optimality of A*. J. Assoc. Comput. Mach. (1985) 32(3):505–536CrossrefGoogle Scholar
  • Eaton J. H., Zadeh L. A. Optimal pursuit strategies in discrete state probabilistic systems. Trans. ASME, Series D, J. Basic Engrg. (1962) 84:23–29CrossrefGoogle Scholar
  • Hart P., Nilsson N., Raphael B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Systems Sci. Cybernetics (1968) 4:100–107CrossrefGoogle Scholar
  • Koenig S. Minimax real-time heuristic search. Artificial Intelligence (2001) 129:165–197CrossrefGoogle Scholar
  • Korf R. Depth-first iterative-deepening: An optimal admissible tree search. Artificial Intelligence (1985) 27(1):97–109CrossrefGoogle Scholar
  • Pearl J.Heuristics (1983) (Morgan Kaufmann, San Francisco, CA) Google Scholar
  • Puterman M.Markov Decision Processes—Discrete Stochastic Dynamic Programming (1994) (John Wiley & Sons, New York) CrossrefGoogle Scholar
  • Tseng P. Solving H-horizon, stationary Markov decision problems in time proportional to log(H). Oper. Res. Lett. (1990) 9:289–297CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.