Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Published Online:https://doi.org/10.1287/moor.2017.0893

References

  • Alanís-Durán A, Cavazos-Cadena R (2012) An optimality system for finite average Markov decision chains under risk-aversion. Kybernetika 48(1):83–104.Google Scholar
  • Balaji S, Meyn SP (2000) Multiplicative ergodicity and large deviations for an irreducible Markov chain. Stochastic Proc. Appl. 90(1):123–144.CrossrefGoogle Scholar
  • Bäuerle N, Rieder U (2011) Markov Decision Processes with Applications to Finance (Springer, New York).CrossrefGoogle Scholar
  • Bäuerle N, Rieder U (2014) More risk-sensitive Markov decision processes. Math. Oper. Res. 39(1):105–120.LinkGoogle Scholar
  • Billingsley P (1995) Probability and Measure, 3rd ed. (John Wiley & Sons, New York).Google Scholar
  • Borkar VS, Meyn SP (2002) Risk-sensitive optimal control for Markov decision process with monotone cost. Math. Oper. Res. 27(1):192–209.LinkGoogle Scholar
  • Canbolat PG (2014) Optimal halting policies in Markov population decision chains with constant risk posture. Ann. Opr. Res. 222:227–237.CrossrefGoogle Scholar
  • Cavazos-Cadena R (2009) The risk-sensitive Poisson equation for a communicating Markov chain on a denumerable state space. Kybernetika 45(5):716–736.Google Scholar
  • Cavazos-Cadena R (2009) Solutions of the average cost optimality equation for finite Markov decision chains: Risk-sensitive and risk-neutral criteria. Math. Method Oper. Res. 70(3):541–566.CrossrefGoogle Scholar
  • Cavazos-Cadena R, Fernández-Gaucherand E (2002) Risk-sensitive control in communicating average Markov decision chains. Dror M, L’Ecuyer P, Szidarovsky F, eds. Modelling Uncertainty: An Examination of Stochastic Theory, Methods and Applications (Kluwer, Boston), 525–544.CrossrefGoogle Scholar
  • Cavazos-Cadena R, Hernández-Hernández D (2005) A characterization of the optimal risk-sensitive average cost in finite controlled Markov chains. Ann. Appl. Probab. 15(1):175–212.CrossrefGoogle Scholar
  • Cavazos-Cadena R, Hernández-Hernández D (2006) A system of Poisson equations for a nonconstant Varadhan functional on a finite state space. Appl. Math. Optim. 53(1):101–119.CrossrefGoogle Scholar
  • Cavazos-Cadena R, Salem-Silva F (2010) The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on borel spaces. Appl. Math. Optim. 61(2):167–190.CrossrefGoogle Scholar
  • Denardo EV, Rothblum UG (2006) A turnpike theorem for a risk-sensitive Markov decision process with stopping. SIAM J. Control Optim. 45(2):414–431.CrossrefGoogle Scholar
  • Di Masi GB, Stettner L (1999) Risk-sensitive control of discrete time Markov processes with infinite horizon. SIAM J. Control Optim. 38(1):61–78.CrossrefGoogle Scholar
  • Di Masi GB, Stettner L (2000) Infinite horizon risk sensitive control of discrete time Markov processes with small risk. Systems Control Lett. 40:15–20.CrossrefGoogle Scholar
  • Di Masi GB, Stettner L (2007) Infinite horizon risk sensitive control of discrete time Markov processes under minorization property. SIAM J. Control Optim. 46(1):231–252.CrossrefGoogle Scholar
  • Feinberg EA, Huang J (2017) On the reduction of total-cost and average-cost MDPs to discounted MDPs. Naval Res. Logist. ePub ahead of print May 25, http://dx.doi.org/10.1002/nav.21743.CrossrefGoogle Scholar
  • Hernández-Hernández D, Marcus SI (1999) Existence of risk sensitive optimal stationary polices for controlled Markov processes. Appl. Math. Opt. 40(3):273–285.CrossrefGoogle Scholar
  • Hernández-Lerma O (1989) Adaptive Markov Control Processes (Springer, New York).CrossrefGoogle Scholar
  • Hordijk A (1974) Dynamic Programming and Markov Potential Theory. Mathematical Centre tracts (Mathematisch Centrum, Netherlands).Google Scholar
  • Howard RA, Matheson JE (1972) Risk-sensitive Markov decision processes. Manage. Sci. 18(7):356–369.LinkGoogle Scholar
  • Jaśkiewicz A (2007) Average optimality for risk sensitive control with general state space. Ann. Appl. Probab. 17(2):654–675.CrossrefGoogle Scholar
  • Kontoyiannis I, Meyn SP (2003) Spectral theory and limit theorems for geometrically ergodic Markov processes. Ann. App. Probab. 13(1):304–362.CrossrefGoogle Scholar
  • Meyer CD (2000) Matrix Analysis and Applied Linear Algebra (SIAM, Philadelphia).CrossrefGoogle Scholar
  • Pitera M, Stettner L (2016) Long run risk sensitive portfolio with general factors. Math. Methods Oper. Res. 82(2):265–293.CrossrefGoogle Scholar
  • Puterman ML (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming (John Wiley & Sons, New York).Google Scholar
  • Shen Y, Stannat W, Obermayer K (2013) Risk-sensitive Markov control processes. SIAM J. Control Optim. 51(5):3652–3672.CrossrefGoogle Scholar
  • Sladký K (2008) Growth rates and average optimality in risk-sensitive Markov decision chains. Kybernetika 44(2):205–226.Google Scholar
  • Stettner L (1999) Risk sensitive portfolio optimization. Math. Methods Oper. Res. 50(3):463–474.CrossrefGoogle Scholar
  • Thomas LC (2002) Connectedness conditions for denumerable state Markov decision processes. Hartley R, Thomas LC, White DJ, eds. Recent Developments in Markov Decision Processes (Academic Press, New York), 181–204.Google Scholar
  • Tijms HC (2003) A First Course in Stochastic Models (John Wiley & Sons, New York).CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.