We consider the multi-armed bandit problem. We show that, when the state space is finite, the dynamic allocation indices (Gittins indices) can be computed by linear programming methods.
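As a rough illustration of how an index computation can reduce to a linear program, the sketch below uses the restart-in-state characterization of the index together with the standard linear program for a discounted Markov decision problem. This is a swapped-in formulation chosen for brevity and is not necessarily the program derived in the paper. It assumes a single arm with transition matrix `P`, reward vector `r`, and discount factor `beta`, and an index normalized as a reward rate, `(1 - beta) * V_i(i)`. The function name `gittins_indices_lp` is hypothetical, and the code relies on NumPy and SciPy's `linprog`.

```python
import numpy as np
from scipy.optimize import linprog


def gittins_indices_lp(P, r, beta):
    """Index of every state of one arm, via one LP per state (illustrative sketch).

    For a fixed state i, V solves the restart problem
        V(j) = max( r[j] + beta * P[j] @ V,   # continue from j
                    r[i] + beta * P[i] @ V )  # restart in i
    and the index of i is taken to be (1 - beta) * V(i).
    """
    P = np.asarray(P, dtype=float)
    r = np.asarray(r, dtype=float)
    n = len(r)
    eye = np.eye(n)
    indices = np.empty(n)
    for i in range(n):
        # "Continue" constraints: V(j) - beta * P[j] @ V >= r[j] for every j.
        cont = eye - beta * P
        # "Restart" constraints: V(j) - beta * P[i] @ V >= r[i] for every j.
        rest = eye - beta * P[i]  # broadcasting subtracts beta * P[i] from each row
        # linprog expects A_ub @ x <= b_ub, so both sides are negated.
        A_ub = -np.vstack([cont, rest])
        b_ub = -np.concatenate([r, np.full(n, r[i])])
        # Minimizing sum(V) over the feasible set recovers the optimal value function.
        res = linprog(np.ones(n), A_ub=A_ub, b_ub=b_ub,
                      bounds=[(None, None)] * n, method="highs")
        if not res.success:
            raise RuntimeError(res.message)
        indices[i] = (1.0 - beta) * res.x[i]
    return indices


# Toy arm (made up for illustration): state 0 pays 1 forever,
# state 1 pays 0 once and then moves to state 0.
example = gittins_indices_lp([[1, 0], [1, 0]], [1, 0], beta=0.5)
print(example)  # approximately [1.0, 0.5]
```

In the toy arm, state 1 earns nothing immediately but leads to the rewarding state, so its index equals the discounted average reward of playing forever, which is `beta = 0.5`. Solving one LP per state is the simplest arrangement, not the most efficient one.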
Yih Ren Chen, Michael N. Katehakis (1986) Linear Programming for Finite State Multi-Armed Bandit Problems. Mathematics of Operations Research 11(1):180-183.