A Restless Bandit Model for Resource Allocation, Competition, and Reservation

Jing Fu (Corresponding Author)
https://orcid.org/0000-0003-4615-8391
School of Mathematics and Statistics, University of Melbourne, Melbourne, Victoria 3010, Australia

Bill Moran
Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria 3010, Australia

Peter G. Taylor
School of Mathematics and Statistics, University of Melbourne, Melbourne, Victoria 3010, Australia

Published Online: 12 Mar 2021
https://doi.org/10.1287/opre.2020.2066

Abstract

We study a resource allocation problem with varying requests and with resources of limited capacity shared by multiple requests. It is modeled as a set of heterogeneous restless multiarmed bandit problems (RMABPs) connected by constraints imposed by resource capacity. Following Whittle’s relaxation idea and Weber and Weiss’ asymptotic optimality proof, we propose a simple policy and prove it to be asymptotically optimal in a regime where both arrival rates and capacities increase. We provide a simple sufficient condition for asymptotic optimality of the policy and, in complete generality, propose a method that generates a set of candidate policies for which asymptotic optimality can be checked. The effectiveness of these results is demonstrated by numerical experiments. To the best of our knowledge, this is the first work providing asymptotic optimality results for such a resource allocation problem and such a combination of multiple RMABPs.

Volume 70, Issue 1

January-February 2022

Pages iii-viii, 1-640, C2-C3

Article Information

Received: April 15, 2018
Accepted: June 26, 2020
Published Online: March 12, 2021

Cite as

Jing Fu, Bill Moran, Peter G. Taylor (2021) A Restless Bandit Model for Resource Allocation, Competition, and Reservation. Operations Research 70(1):416-431.

https://doi.org/10.1287/opre.2020.2066
