A Faster Index Algorithm and a Computational Study for Bandits with Switching Costs
Published Online:25 Jan 2008https://doi.org/10.1287/ijoc.1070.0238
References
- Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost. IEEE Trans. Automatic Control (1988) 33:899–906Crossref, Google Scholar
- Multi-armed bandits with switching penalties. IEEE Trans. Automatic Control (1996) 41:328–348Crossref, Google Scholar
- Switching costs and the Gittins index. Econometrica (1994) 62:687–694Crossref, Google Scholar
- Optimal hysteresis for a class of deterministic deteriorating two-armed bandit problem with switching costs. Automatica (2003) 39:1947–1955Crossref, Google Scholar
- Bandit processes and dynamic allocation indices (with discussion). J. Roy. Statist. Soc. Ser. B (1979) 41:148–177Google Scholar
- , Gani J., Sarkadi K., Vincze I. A dynamic allocation index for the sequential design of experiments. Progress in Statistics (European Meeting of Statisticians, Budapest, 1972) (1974) (North-Holland, Amsterdam) 241–266Google Scholar
- A survey on the bandit problem with switching costs. De Economist (2004) 152:513–541Crossref, Google Scholar
- Restless bandits, partial conservation laws and indexability. Adv. Appl. Probab. (2001) 33:76–98Crossref, Google Scholar
- Dynamic allocation indices for restless projects and queueing admission control: A polyhedral approach. Math. Programming (2002) 93:361–413Crossref, Google Scholar
- Restless bandit marginal productivity indices, diminishing returns and optimal control of make-to-order/make-to-stock M/G/1 queues. Math. Oper. Res. (2006) 31:50–84Link, Google Scholar
- A (2/3)n3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain. INFORMS J. Comput. (2007a) 19:596–606Link, Google Scholar
- Dynamic priority allocation via restless bandit marginal productivity indices (with discussion). Top (2007b) 15:161–198Crossref, Google Scholar
- Computing an index policy for bandits with switching penalties. SMCtools '07: Proc. from the 2007 Workshop on Tools for Solving Structured Markov Chains (2007c) (ICST, Brussels) Crossref, Google Scholar
- Dynamic scheduling of a two-class queue with setups. Oper. Res. (1998) 46:532–547Link, Google Scholar
- Optimal stochastic scheduling of forest networks with switching penalties. Adv. Appl. Probab. (1994) 26:474–497Crossref, Google Scholar
- Extensions of the multiarmed bandit problem: The discounted case. IEEE Trans. Automatic Control (1985) 30:426–439Crossref, Google Scholar
- Restless bandits: Activity allocation in a changing world. J. Appl. Probab. (1988) 25:287–298A Celebration of Applied Probability, J. Gani, ed. Applied Probability Trust, Sheffield, UKCrossref, Google Scholar

