Normalized Markov Decision Chains I; Sensitive Discount Optimality

Uriel G. Rothblum
Uriel G. Rothblum
New York University, New York, New York
Search for more papers by this author

New York University, New York, New York

Published Online:1 Aug 1975https://doi.org/10.1287/opre.23.4.785

Abstract

In this paper we study sensitive discount optimality criteria for finite state and action, discrete time parameter, stationary generalized Markov decision chains. We extend previous results obtained by Miller and Veinott and Veinott for substochastic transition matrices to arbitrary non-negative matrices with spectral radius not exceeding one. In particular, we generalize their policy improvement algorithm for finding a stationary policy maximizing the expected discounted reward for all sufficiently small positive interest rates.

Volume 23, Issue 4

July-August 1975

Pages 603-840

Article Information

Metrics

Information

Published Online:August 01, 1975

Cite as

Uriel G. Rothblum, (1975) Normalized Markov Decision Chains I; Sensitive Discount Optimality. Operations Research 23(4):785-795.

https://doi.org/10.1287/opre.23.4.785

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Normalized Markov Decision Chains I; Sensitive Discount Optimality

Abstract

Volume 23, Issue 4

Article Information

Metrics

Information

Cite as

Sign Up for INFORMS Publications Updates and News