Tuning Approximate Dynamic Programming Policies for Ambulance Redeployment via Direct Search
Published Online:15 Nov 2013https://doi.org/10.1287/10-SSY020
References
- A price-directed approach to stochastic inventory routing. Operations Research, 52(4):499–514, 2004. MR2075790Link, Google Scholar
- Dynamic bid-prices in revenue management. Operations Research, 55(4):647–661, 2007. MR2349028Link, Google Scholar
- A Markov chain model for an EMS system with repositioning, 2010.Google Scholar
- Decision support tools for dynamic fleet management. PhD thesis, Department of Science and Technology, Linköepings Universitet, Norrköeping, Sweden, 2005.Google Scholar
- Decision support tools for ambulance dispatch and relocation. Journal of the Operational Research Society, 58:195–201, 2007.Google Scholar
- Optimal dispatching strategies for emergency vehicles to increase patient survivability. To appear, International Journal of Operational Research, 2012. MR3014423Google Scholar
- Dynamic repositioning of indistinguishable service units on transportation networks. Transportation Science, 15(2), 1981. MR0639599Google Scholar
- Repositioning of distinguishable urban service units on networks. Computers and Operations Research, 8:105–118, 1981.Google Scholar
- Repositioning of two distinguishable service vehicles on networks. IEEE Transactions on Systems, Man, and Cybernetics, SMC-11(3), 1981. MR0614679Google Scholar
- Dynamic Programming and Optimal Control. Athena Scientific, Nashua, NH, 2005. MR2183196Google Scholar
- Stochastic Optimal Control: The Discrete Time Case. Academic Press, New York, 1978. MR0511544Google Scholar
- Neuro-Dynamic Programming. Athena Scientific, Belmont, Massachusetts, 1996.Google Scholar
- Technical update: Least-squares temporal difference learning. Machine Learning, 49(2):233–246, 2002.Google Scholar
- Linear least-squares algorithms for temporal difference learning. In Machine Learning, pages 22–33, 1996.Google Scholar
- The linear programming approach to approximate dynamic programming. Operations Research, 51:2003, 2001. MR2019651Google Scholar
- Adaptation of the UOBYQA algorithm for noisy functions. In WSC ’06: Proceedings of the 38th conference on Winter simulation, pages 312–319. Winter Simulation Conference, 2006.Google Scholar
- Approximate dynamic programming via a smoothed linear program. To appear in Operations Research, 2012. MR2995921Google Scholar
- Ambulance deployment for maximum survival. Naval Research Logistics, 55(1):42–58, 2007. MR2378248Google Scholar
- An approximate dynamic programming approach to network revenue management. Technical report, Stanford University, Department of Electrical Engineering, 2007.Google Scholar
- A dynamic model and parallel tabu search heuristic for real time ambulance relocation. Parallel Computing, 27:1641–1653, 2001.Google Scholar
- The maximal expected coverage relocation problem for emergency vehicles. Journal of the Operational Research Society, 57:22–28, 2006.Google Scholar
- A GSMP formalism for discrete event systems. Proceedings of the IEEE, 77(1), 1989.Google Scholar
- An algorithm for the dynamic relocation of fire companies. Operations Research, 22(2):249–274, 1974.Link, Google Scholar
- An object-oriented random-number package with many long streams and substreams. Operations Research, 50(6):1073–1075, 2002.Link, Google Scholar
- Valuing American options by simulation: A simple least-squares approach. The Review of Financial Studies, 14(1):113–147, 2001.Google Scholar
- Ambulance redeployment: An approximate dynamic programming approach. In M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin, and R. G. Ingalls, editors, Proceedings of the 2009 Winter Simulation Conference, pages 1850–1860, Piscataway, New Jersey, 2009. Institute of Electrical and Electronics Engineers, Inc.Google Scholar
- Identifying effective policies in approximate dynamic programming: Beyond regression. In B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, editors, Proceedings of the 2010 Winter Simulation Conference, Piscataway, New Jersey, 2010. Institute of Electrical and Electronics Engineers, Inc.Google Scholar
- Approximate dynamic programming for ambulance redeployment. INFORMS Journal on Computing, 22(2):266–281, 2010. MR2677214Link, Google Scholar
- Evaluation of relocation strategies for emergency medical service vehicles. Transportation Research Record: Journal of the Transportation Research Board, 2137:63–73, 2009.Google Scholar
- A simplex method for function minimization. The Computer Journal, 7(4):308–313, 1965.Google Scholar
- An adaptive dynamic programming algorithm for a stochastic multiproduct batch dispatch problem. Naval Research Logistics, 50(7):742–769, 2003. MR2001023Google Scholar
- UOBYQA: Unconstrained optimization by quadratic approximation. Mathematical Programming, 92(3):555–582, 2002. MR1905766Google Scholar
- Approximate Dynamic Programming: Solving the Curses of Dimensionality. John Wiley & Sons, Hoboken, NJ, 2007. MR2347698Google Scholar
- Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Hoboken, NJ, 2005.Google Scholar
- Optimised Ambulance Redeployment Strategies. PhD thesis, The University of Auckland, Auckland, New Zealand, 2007.Google Scholar
- Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming. European Journal of Operational Research, 219:611–621, 2012. MR2898941Google Scholar
- Generalized polynomial approximations in Markovian decision processes. Journal of Mathematical Analysis and Applications, 110(2):568–582, 1985. MR0805277Google Scholar
- Monte Carlo sampling methods. In A. Ruszczynski and A. Shapiro, editors, Stochastic Programming, volume 10 of Handbooks in Operations Research and Management Science, pages 353–425. Elsevier, 2003. MR2052758Google Scholar
- The dynamic assignment problem. Transportation Science, 38(4):399–419, 2004.Link, Google Scholar
- Learning to predict by the methods of temporal differences. In Machine Learning, pages 9–44. Kluwer Academic Publishers, 1988.Google Scholar
- Reinforcement Learning. The MIT Press, Cambridge, MA, 1998.Google Scholar
- Learning tetris using the noisy cross-entropy method. Neural Computation., 18(12):2936–2941, 2006.Google Scholar
- TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215–219, 1994.Google Scholar
- Dynamic programming approximations for stochastic, time-staged integer multicommodity flow problems. INFORMS Journal on Computing, 18(1):31–42, 2006. MR2205745Link, Google Scholar
- Regression methods for pricing complex American-style options. IEEE Transactions on Neural Networks, 12(4):694–703, 2001.Google Scholar
- A neuro dynamic programming approach to retailer inventory management. In Proceedings of the IEEE Conference on Decision and Control, 1997.Google Scholar
- Simulation Optimisation and Markov Models for Dynamic Ambulance Redeployment. PhD thesis, The University of Auckland, Auckland, New Zealand, 2012.Google Scholar
- Optimization of a single ambulance move up. Preprint, 2010.Google Scholar

