The Value Iteration Algorithm in Risk-Sensitive Average Markov Decision Chains with Finite State Space

References

  • Arapostathis A., Borkar V. S., Fernández-Gaucherand E., Ghosh M. K., Marcus S. I. Discrete-time controlled Markov processes with average cost criteria: A survey. SIAM J. Control and Optim. (1993) 31:282–334CrossrefGoogle Scholar
  • Bielecki T., Hernández-Hernández D., Pliska S. R. Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management. Math. Methods Oper. Res. (1999) 50:167–188CrossrefGoogle Scholar
  • Borkar V. S., Meyn S. P. Risk sensitive optimal control for Markov decision processes with monotone cost. Mathematics Oper. Res. (2002) 27:192–209LinkGoogle Scholar
  • Cavazos-Cadena R. Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains. Math. Methods Oper. Res. (2002) 56:181–196CrossrefGoogle Scholar
  • Cavazos-Cadena R. Solution to the risk-sensitive average cost optimality equation in a class of Markov decision processes with finite state space. Math. Methods Oper. Res. (2003) 57:253–285CrossrefGoogle Scholar
  • Cavazos-Cadena R., Fernández-Gaucherand E. Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions. Math. Methods Oper. Res. (1999) 43:121–139Google Scholar
  • Cavazos-Cadena R., Fernández-Gaucherand E., Dror M., L'Ecuyer P., Szydarovszky F. Risk-sensitive optimal control in communicating average Markov decision chains. Modeling Uncertainty: An Examination of Stochastic Theory, Methods, and Applications (2001) (Kluwer Academic Publishers, Boston, MA) 515–553Google Scholar
  • Di Masi G. B., Stettner L. Risk-sensitive control of discrete-time Markov processes with infinite horizon. SIAM J. Control Optim. (1999) 38:61–78CrossrefGoogle Scholar
  • Hernández-Hernández D., Marcus S. I. Risk sensitive control of Markov processes in countable state space. Systems & Control Lett. (1996) 29:147–155Corrigendum in: Systems & Control Lett. 34 105–106CrossrefGoogle Scholar
  • Hernández-Lerma O.Adaptive Markov Control Processes (1988) (Springer-Verlag, New York) Google Scholar
  • Howard A. R., Matheson J. E. Risk-sensitive Markov decision processes. Managment Sci. (1972) 18:356–369LinkGoogle Scholar
  • Puterman M. L., Heyman D. P., Sobel M. J. Markov decision processes. Handbook on Operations Research and Management Science (1990) 2(North Holland, Amsterdam, The Netherlands) 331–434Google Scholar
  • Puterman M. L.Markov Decision Processes: Discrete Stochastic Dynamic Programming (1994) (Wiley, New York) CrossrefGoogle Scholar
  • Schweitzer P. J. Iterative solution of the functional equations of undiscounted Markov renewal programming. J. Math. Anal. Appl. (1971) 34:495–501CrossrefGoogle Scholar
  • Thomas L. C., Hartley R., Thomas L. C., White D. J. Connectedness conditions for denumerable state Markov decision processes. Recent Advances in Markov Decision Processes (1980) (Academic Press, New York) 181–204Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.