Bias and Variance Approximation in Value Function Estimates

Shie Mannor
Shie Mannor
[email protected]
Department of Electrical and Computer Engineering, McGill University, Montreal, Quebec, Canada H3A 2A7
Search for more papers by this author
,
Duncan Simester
Duncan Simester
[email protected]
Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Search for more papers by this author
,
Peng Sun
Peng Sun
[email protected]
Fuqua School of Business, Duke University, Durham, North Carolina 27708
Search for more papers by this author
,
John N. Tsitsiklis
John N. Tsitsiklis
[email protected]
Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Search for more papers by this author

Shie Mannor

[email protected]

Department of Electrical and Computer Engineering, McGill University, Montreal, Quebec, Canada H3A 2A7

Search for more papers by this author

Duncan Simester

[email protected]

Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

Search for more papers by this author

Peng Sun

[email protected]

Fuqua School of Business, Duke University, Durham, North Carolina 27708

Search for more papers by this author

John N. Tsitsiklis

[email protected]

Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

Search for more papers by this author

Published Online:1 Feb 2007https://doi.org/10.1287/mnsc.1060.0614

Abstract

We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance, which can then be used to derive confidence intervals around the value function estimates. We illustrate and validate our findings using a large database describing the transaction and mailing histories for customers of a mail-order catalog firm.

Volume 53, Issue 2

February 2007

Pages iv-355

Article Information

Supplemental Material

Metrics

Information

Received:July 14, 2004
Published Online:February 01, 2007

Cite as

Shie Mannor, Duncan Simester, Peng Sun, John N. Tsitsiklis, (2007) Bias and Variance Approximation in Value Function Estimates. Management Science 53(2):308-322.

https://doi.org/10.1287/mnsc.1060.0614

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Bias and Variance Approximation in Value Function Estimates

Abstract

Volume 53, Issue 2

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News