On the Speed of Convergence of Value Iteration on Stochastic Shortest-Path Problems

Blai Bonet
Blai Bonet
[email protected]
Departamento de Computación, Universidad Simón Bolívar, Caracas 89000, Venezuela
Search for more papers by this author

Blai Bonet

[email protected]

Departamento de Computación, Universidad Simón Bolívar, Caracas 89000, Venezuela

Search for more papers by this author

Published Online:1 May 2007https://doi.org/10.1287/moor.1060.0238

References

Barto A., Bradtke S., Singh S. Learning to act using real-time dynamic programming. Artificial Intelligence (1995) 72:81–138Crossref, Google Scholar
Bellman R.Dynamic Programming (1957) (Princeton University Press, Princeton, NJ) Google Scholar
Bertsekas D.Dynamic Programming and Optimal Control (1995) (Athena Scientific, Belmont, MA) Google Scholar
Bertsekas D., Tsitsiklis J.Parallel and Distributed Computation: Numerical Methods (1989) (Prentice Hall, Englewood Cliffs, NJ) Google Scholar
Bertsekas D., Tsitsiklis J. An analysis of stochastic shortest-path problems. Math. Oper. Res. (1991) 16:580–595Link, Google Scholar
Bonet B.Modeling and Solving Sequential Decision Tasks with Uncertainty and Partial Information (2003) . Ph.D. thesis, University of California, Los Angeles, CAGoogle Scholar
Dechter R., Pearl J. Generalized best-first search strategies and the optimality of A*. J. Assoc. Comput. Mach. (1985) 32(3):505–536Crossref, Google Scholar
Eaton J. H., Zadeh L. A. Optimal pursuit strategies in discrete state probabilistic systems. Trans. ASME, Series D, J. Basic Engrg. (1962) 84:23–29Crossref, Google Scholar
Hart P., Nilsson N., Raphael B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Systems Sci. Cybernetics (1968) 4:100–107Crossref, Google Scholar
Koenig S. Minimax real-time heuristic search. Artificial Intelligence (2001) 129:165–197Crossref, Google Scholar
Korf R. Depth-first iterative-deepening: An optimal admissible tree search. Artificial Intelligence (1985) 27(1):97–109Crossref, Google Scholar
Pearl J.Heuristics (1983) (Morgan Kaufmann, San Francisco, CA) Google Scholar
Puterman M.Markov Decision Processes—Discrete Stochastic Dynamic Programming (1994) (John Wiley & Sons, New York) Crossref, Google Scholar
Tseng P. Solving H-horizon, stationary Markov decision problems in time proportional to log(H). Oper. Res. Lett. (1990) 9:289–297Crossref, Google Scholar

cover image Mathematics of Operations Research

Volume 32, Issue 2

May 2007

Pages 257-496

Article Information

Metrics

Information

Received:October 03, 2005
Published Online:May 01, 2007

Cite as

Blai Bonet, (2007) On the Speed of Convergence of Value Iteration on Stochastic Shortest-Path Problems. Mathematics of Operations Research 32(2):365-373.

https://doi.org/10.1287/moor.1060.0238

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

On the Speed of Convergence of Value Iteration on Stochastic Shortest-Path Problems

References

Volume 32, Issue 2

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News