Dynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational Efficiency

Qingyin Ma
Qingyin Ma
[email protected]
https://orcid.org/0000-0001-8862-4210
International School of Economics and Management, Capital University of Economics and Business, Beijing 100070, China;
Search for more papers by this author
,
John Stachurski
Corresponding Author
John Stachurski
[email protected]
https://orcid.org/0000-0001-6716-0111
Research School of Economics, Australian National University, Acton, ACT 2601, Australia
Search for more papers by this author

International School of Economics and Management, Capital University of Economics and Business, Beijing 100070, China;

Search for more papers by this author

John Stachurski

Corresponding Author

John Stachurski

[email protected]

https://orcid.org/0000-0001-6716-0111

Research School of Economics, Australian National University, Acton, ACT 2601, Australia

Search for more papers by this author

Published Online:8 Feb 2021https://doi.org/10.1287/opre.2020.2006

Abstract

Some approaches to solving challenging dynamic programming problems, such as Q-learning, begin by transforming the Bellman equation into an alternative functional equation to open up a new line of attack. Our paper studies this idea systematically with a focus on boosting computational efficiency. We provide a characterization of the set of valid transformations of the Bellman equation, for which validity means that the transformed Bellman equation maintains the link to optimality held by the original Bellman equation. We then examine the solutions of the transformed Bellman equations and analyze correspondingly transformed versions of the algorithms used to solve for optimal policies. These investigations yield new approaches to a variety of discrete time dynamic programming problems, including those with features such as recursive preferences or desire for robustness. Increased computational efficiency is demonstrated via time complexity arguments and numerical experiments.

Volume 69, Issue 5

September-October 2021

Pages ii-iv, 1349-1650, C2

Article Information

Metrics

Information

Received:January 24, 2019
Accepted:January 03, 2020
Published Online:February 08, 2021

Cite as

Qingyin Ma , John Stachurski (2021) Dynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational Efficiency. Operations Research 69(5):1591-1607.

https://doi.org/10.1287/opre.2020.2006

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Dynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational Efficiency

Abstract

Volume 69, Issue 5

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News