Simulation-Based Approximate Policy Iteration with Generalized Logistic Functions
Published Online: 28 Sep 2015 https://doi.org/10.1287/ijoc.2015.0645

