A test for nonoptimal actions in undiscounted Markov decision chains is proposed. The test eliminates actions for one or more stages after which they may re-enter the set of possibly optimal actions, but as convergence proceeds such re-entries cease.
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.