Markov Decision Processes with a Borel Measurable Cost Function—The Average Case

Published Online:https://doi.org/10.1287/moor.11.2.309

We consider a Markov decision process with a Borel measurable cost function. We introduce a p-step contraction property for the average cost case. By use of this method, the validity of the optimality equation and the existence of ϵ-optimal stationary policies are proved. As some applications, the sequential replacement model and the inventory model are considered.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.