Easy Affine Markov Decision Processes: Properties and Applications

Published Online: 3 Oct 2017
https://doi.org/10.1287/educ.2017.0170

Abstract

This tutorial introduces a class of decomposable affine Markov decision processes (MDPs) that have continuous multidimensional endogenous states and actions, and an exogenous state that follows a Markov chain. We show that, unlike most MDPs with continuous states and actions, decomposable affine MDPs are free of the curse of dimensionality and can be solved easily and exactly. These properties stem from their affine dynamics and affine single-period rewards, their decomposable action spaces, and the polyhedral structure of the decomposed action spaces. Exploiting this structure, we demonstrate that a decomposable affine MDP with a finite-horizon criterion has a value function that is affine in the endogenous state and has an extremal optimal policy; both the value function and the extremal optimal policy are determined by the solution of a set of auxiliary equations. At the end of the tutorial, we illustrate the potential applicability of decomposable affine MDPs with two examples: fishery management and dynamic capacity portfolio management.
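The structural claim in the abstract can be illustrated with a minimal sketch. The toy model below is an assumption made for illustration only and is not the tutorial's formulation: a scalar endogenous state s >= 0, a two-state exogenous Markov chain, an action a = s * y with y restricted to an interval that depends on the exogenous state, and hypothetical parameter names (P, Y, e, d, k, alpha, beta). Under these assumptions the value function takes the form V(s, i) = w[i] * s + c[i], the slopes w follow from a small backward recursion (a stand-in for the auxiliary equations the abstract mentions), and the maximizing y always sits at an endpoint of its interval.

```python
# Toy decomposable affine MDP. Every number and name here is an illustrative
# assumption, not taken from the tutorial.
# Exogenous state i in {0, 1} follows a Markov chain with transition matrix P.
# Endogenous state s >= 0 is scalar; the action is a = s * y, y in interval Y[i].
# Reward:   e[i] * s + d[i] * a + k[i]        (affine in s and a)
# Dynamics: s' = alpha[i] * s + beta[i] * a   (affine in s and a, stays >= 0)
P = [[0.7, 0.3], [0.4, 0.6]]
Y = [(0.0, 0.5), (0.0, 0.8)]
e = [0.0, 0.0]
d = [1.0, 1.5]
k = [0.1, -0.2]
alpha = [1.2, 1.1]
beta = [-1.2, -1.1]
STATES = range(len(P))


def slope(i, y, w_next):
    """Coefficient on s in exogenous state i when fraction y is chosen."""
    cont = sum(P[i][j] * w_next[j] for j in STATES)
    return e[i] + d[i] * y + (alpha[i] + beta[i] * y) * cont


def solve_affine(horizon):
    """Backward recursion giving V_0(s, i) = w[i] * s + c[i]; terminal value 0."""
    w = [0.0 for _ in STATES]
    c = [0.0 for _ in STATES]
    policy = []
    for _ in range(horizon):
        # A linear function on an interval is maximized at an endpoint,
        # so only the two extreme points of Y[i] need to be compared.
        best = [max(Y[i], key=lambda y, i=i: slope(i, y, w)) for i in STATES]
        c = [k[i] + sum(P[i][j] * c[j] for j in STATES) for i in STATES]
        w = [slope(i, best[i], w) for i in STATES]
        policy.append(best)
    policy.reverse()  # policy[t][i] is the fraction chosen at time t
    return w, c, policy


def brute_force(t, s, i, horizon, grid=10):
    """Direct dynamic programming over a grid of y values, for comparison."""
    if t == horizon:
        return 0.0
    lo, hi = Y[i]
    values = []
    for n in range(grid + 1):
        y = lo + (hi - lo) * n / grid
        a = s * y
        s_next = alpha[i] * s + beta[i] * a
        future = sum(P[i][j] * brute_force(t + 1, s_next, j, horizon, grid)
                     for j in STATES)
        values.append(e[i] * s + d[i] * a + k[i] + future)
    return max(values)


if __name__ == "__main__":
    w, c, policy = solve_affine(3)
    for i in STATES:
        # The two numbers on each line should agree up to rounding.
        print(i, w[i] * 2.0 + c[i], brute_force(0, 2.0, i, 3))
```

In this sketch the recursion touches only one slope and one intercept per exogenous state, so its cost does not depend on how finely s is discretized. That is the sense in which such a model avoids the curse of dimensionality here. The brute-force routine is included only as a cross-check on the toy numbers.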

INFORMS TutORials in Operations Research

Leading Developments from INFORMS Communities

September 2017

Cite as

(2017) Easy Affine Markov Decision Processes: Properties and Applications. INFORMS TutORials in Operations Research: 28-47.

https://doi.org/10.1287/educ.2017.0170
