On the Second Optimality Equation for Semi-Markov Decision Models

Published Online:https://doi.org/10.1287/moor.17.2.470

For a semi-Markov decision model with average return, the validity of the second optimality equation is shown in the (nonmodified) form where the actions run through the set of all admissible actions rather than through the set of maximum points (conserving actions) for the first optimal equation. As a consequence the existence of a strongly optimal stationary policy is shown. The results seem to be known only for finite state finite action models whereas here countable state compact action models with unbounded rewards are considered.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.