Omar Besbes, Yonatan Gur, Assaf Zeevi (2019) Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards. Stochastic Systems 9(4):319-337.