Continuous-Time Red and Black: How to Control a Diffusion to a Goal

Published Online:https://doi.org/10.1287/moor.10.4.599

A player starts at x in (0, 1) and tries to reach 1. The process (Xt, t ≥ 0) of his positions moves according to a diffusion process (or, more generally, an Ito process) whose infinitesimal parameters μ, σ are chosen by the player at each instant of time from a set depending on his current position. To maximize the probability of reaching 1, the player should choose the parameters so as to maximize μ/σ2, at least when the maximum is achieved by bounded, measurable functions. This implies that bold (timid) play is optimal for subfair (superfair), continuous-time red-and-black. Furthermore, in superfair red-and-black, the strategy which maximizes the drift coefficient of {log Xt} minimizes the expected time to reach 1.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.