Higher-Order and Average Reward Myopic-Affine Dynamic Models

Published Online:https://doi.org/10.1287/moor.15.2.299

The dynamics and rewards in a higher-order model depend directly on states and actions in earlier periods. Here, the expected values of the single-period reward and of the state in the next period are affine functions of the states in the current and earlier periods. It is shown that computations in higher-order infinite-horizon models with myopic solutions are not essentially harder than in the corresponding first-order models. These results are obtained for discounted and average-reward criteria, and similar conclusions are indicated for sequential games. The results are applied to an aquaculture harvesting model and an advertising model with delayed responses.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.