The Optimal Reward Operator in Negative Dynamic Programming

A. Maitra
School of Statistics, University of Minnesota, 270 Vincent Hall, 206 Church Street SE, Minneapolis, MN 55455

W. Sudderth
School of Statistics, University of Minnesota, 270 Vincent Hall, 206 Church Street SE, Minneapolis, MN 55455

Published Online: 1 Nov 1992
https://doi.org/10.1287/moor.17.4.921

Abstract

We consider the negative dynamic programming model of Strauch [12] and prove that the optimal reward function can be obtained by a transfinite iteration of the optimal reward operator. We show that a player loses nothing by restricting himself to measurable policies, if the returns from nonmeasurable policies are evaluated by lower integrals.
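For orientation, a transfinite iteration of this kind is commonly written as below. This is a sketch under assumed notation, not the paper's own formulation: the state space $S$, the action set $A$, the nonpositive reward $r$, the transition law $q$, the operator symbol $T$, and the starting function $u_0 = 0$ are all assumptions introduced here for illustration.

```latex
% Assumed notation (not from the source): states x in S, actions a in A,
% reward r(x,a) <= 0, transition law q(. | x,a), ordinals alpha, lambda.
\begin{align*}
(Tu)(x) &= \sup_{a \in A} \Big[\, r(x,a) + \int u(y)\, q(dy \mid x,a) \Big], \\
u_0 &= 0, \\
u_{\alpha+1} &= T u_\alpha, \\
u_\lambda &= \inf_{\alpha < \lambda} u_\alpha \quad \text{for limit ordinals } \lambda .
\end{align*}
```

Under these assumptions $T$ is monotone and $T u_0 \le u_0$, so the iterates are nonincreasing in $\alpha$. Measurability of the iterates, which the abstract handles through lower integrals, is not addressed in this sketch.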

Mathematics of Operations Research

Volume 17, Issue 4

November 1992

Pages 765-1020

Cite as

A. Maitra, W. Sudderth (1992) The Optimal Reward Operator in Negative Dynamic Programming. Mathematics of Operations Research 17(4):921-931.

https://doi.org/10.1287/moor.17.4.921
