Evaluating Strategies for Markov Decision Processes in Parallel

Published Online:https://doi.org/10.1287/moor.15.1.17

The authors' work on the use of Gittins indices in the evaluation of strategies for families of alternative bandit processes has found many applications. Among these are procedures for sensitivity analysis in stochastic scheduling. This theoretical paper aims at developing results which will form the basis of an approach to strategy evaluation for a class of processes of greater complexity. These are Markov Decision Processes in parallel satisfying a condition first enunciated by Whittle. The theory of Gittins indices forms the basis of the analysis.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.