Affine Structure and Invariant Policies for Dynamic Programs

Published Online: https://doi.org/10.1287/moor.8.3.342

This paper studies a family of sequential decision processes whose affine structure is shown to imply that attention can be restricted to policies that select the same decision for all states. A sequence of n invariant policies (decisions) is shown to be optimal when the planning horizon is n epochs. When the planning horizon is infinite, added structure is imposed, and a stationary invariant policy is shown to be optimal. Computational methods and examples are included.
