Simulation-Based Approximate Policy Iteration with Generalized Logistic Functions

Antoine Sauré
Sauder School of Business, University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada

Jonathan Patrick
Telfer School of Management, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada

Martin L. Puterman
Sauder School of Business, University of British Columbia, Vancouver, British Columbia V6T 1Z2, Canada

Published Online: 28 Sep 2015
https://doi.org/10.1287/ijoc.2015.0645

Abstract

We present an approximate dynamic programming method based on simulation, policy iteration, a postdecision state formulation, and a logistic value function approximation. This method was developed as part of our efforts to determine whether nonlinear value function approximations could provide cost-effective policies for advance patient scheduling problems, and as a way of identifying the main advantages and disadvantages of using simulation versus linear programming to approximately solve dynamic capacity allocation problems. We first apply the proposed method to a queueing problem and then study a more practical application based on an advance multipriority patient scheduling problem. We investigate the quality and practical implications of the resulting appointment scheduling policies using simulation, and compare their performance to that of four other policies. Patient scheduling policies obtained by the new method depend not only on the number of appointments already booked on each day but also on the overall system workload. In particular, these policies provide lower discounted cost values and shorter average wait times for higher-priority patients than policies directly obtained using linear programming and an affine value function approximation in the predecision state variables.

INFORMS Journal on Computing

Volume 27, Issue 3

Summer 2015

Pages 431-578

Information

Received: December 01, 2013
Accepted: January 01, 2015
Published Online: September 28, 2015

Cite as

Antoine Sauré, Jonathan Patrick, Martin L. Puterman (2015) Simulation-Based Approximate Policy Iteration with Generalized Logistic Functions. INFORMS Journal on Computing 27(3):579-595.

https://doi.org/10.1287/ijoc.2015.0645
