Nonzero-Sum Risk-Sensitive Stochastic Games on a Countable State Space

Published Online:https://doi.org/10.1287/moor.2017.0870

References

  • Aliprantis C, Border K (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide, 3rd ed. (Springer, Berlin).Google Scholar
  • Altman E, Hordijk A, Spieksma F (1997) Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards. Math. Oper. Res. 22(3):588–618.LinkGoogle Scholar
  • Balaji S, Meyn SP (2000) Multiplicative ergodicity and large deviations for an irreducible Markov chain. Stochastic Processes Their Appl. 90(1):123–144.CrossrefGoogle Scholar
  • Başar T (1999) Nash equilibria of risk-sensitive nonlinear stochastic differential games. J. Optim. Theory Appl. 100(3):479–498.CrossrefGoogle Scholar
  • Basu A, Ghosh MK (2012) Zero-sum risk-sensitive stochastic differential games. Math. Oper. Res. 37(3):437–449.LinkGoogle Scholar
  • Basu A, Ghosh MK (2014) Zero-sum risk-sensitive stochastic games on a countable state space. Stochastic Processes Their Appl. 124(1):961–983.CrossrefGoogle Scholar
  • Bäuerle N, Rieder U (2013) More risk-sensitive Markov decision processes. Math. Oper. Res. 39(1):105–120.LinkGoogle Scholar
  • Bellman R (1957) Dynamic Programming (Princeton University Press, Princeton, NJ).Google Scholar
  • Beneš V (1970) Existence of optimal strategies based on specified information, for a class of stochastic decision problems. SIAM J. Control 8(2):179–188.CrossrefGoogle Scholar
  • Bertsekas D, Shreve S (1996) Stochastic Optimal Control: The Discrete-Time Case (Athena Scientific, Belmont, MA).Google Scholar
  • Bielecki TR, Pliska SR (2003) Economic properties of the risk sensitive criterion for portfolio management. Rev. Accounting and Finance 2(2):3–17.CrossrefGoogle Scholar
  • Borkar VS (1991) Topics in Controlled Markov Chains (Longman Scientific & Technical, Harlow, Essex, UK).Google Scholar
  • Borkar VS (2012) Probability Theory: An Advanced Course (Springer, New York).Google Scholar
  • Borkar VS, Meyn SP (2002) Risk-sensitive optimal control for Markov decision processes with monotone cost. Math. Oper. Res. 27(1):192–209.LinkGoogle Scholar
  • Cavazos-Cadena R, Fernández-Gaucherand E (1999) Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions. Math. Methods Oper. Res. 49(2):299–324.Google Scholar
  • Cavazos-Cadena R, Fernández-Gaucherand E (2000) The vanishing discount approach in Markov chains with risk-sensitive criteria. IEEE Trans. Automatic Control 45(10):1800–1816.CrossrefGoogle Scholar
  • Cavazos-Cadena R, Hernández-Hernández D (2011) Discounted approximations for risk-sensitive average criteria in Markov decision chains with finite state space. Math. Oper. Res. 36(1):133–146.LinkGoogle Scholar
  • Dekker R, Hordijk A (1992) Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math. Oper. Res. 17(2):271–289.LinkGoogle Scholar
  • Dekker R, Hordijk A, Spieksma F (1994) On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Math. Oper. Res. 19(3):539–559.LinkGoogle Scholar
  • Di Masi Get al. (2000) Infinite horizon risk sensitive control of discrete time Markov processes with small risk. Systems and Control Lett. 40(1):15–20.CrossrefGoogle Scholar
  • Di Masi GB, Stettner L (1999) Risk-sensitive control of discrete-time Markov processes with infinite horizon. SIAM J. Control Optim. 38(1):61–78.CrossrefGoogle Scholar
  • Di Masi GB, Stettner Ł (2007) Infinite horizon risk sensitive control of discrete time Markov processes under minorization property. SIAM J. Control Optim. 46(1):231–252.CrossrefGoogle Scholar
  • El-Karoui N, Hamadene S (2003) BSDES and risk-sensitive control, zero-sum and nonzero-sum game problems of stochastic functional differential equations. Stochastic Processes Their Appl. 107(1):145–169.CrossrefGoogle Scholar
  • Fan K (1952) Fixed-point and minimax theorems in locally convex topological linear spaces. Proc. National Acad. Sci. 38(2):121–126.CrossrefGoogle Scholar
  • Fleming W, Hernández-Hernández D (1997) Risk-sensitive control of finite state machines on an infinite horizon i. SIAM J. Control Optim. 35(5):1790–1810.CrossrefGoogle Scholar
  • Fleming WH, McEneaney WM (1995) Risk-sensitive control on an infinite time horizon. SIAM J. Control Optim. 33(6):1881–1915.CrossrefGoogle Scholar
  • Hansen LP, Sargent TJ (1995) Discounted linear exponential quadratic Gaussian control. IEEE Trans. Automatic Control 40(5):968–971.CrossrefGoogle Scholar
  • Hernández-Hernández D, Marcus SI (1996) Risk sensitive control of Markov processes in countable state space. Systems and Control Lett. 29(3):147–155.CrossrefGoogle Scholar
  • Hordijk A (1974) Dynamic Programming and Markov Potential Theory, Math. Centre. Tracts; 51 (Mathematisch Centrum, Amsterdam).Google Scholar
  • Hordijk A, Spieksma F (1992) On ergodicity and recurrence properties of a Markov chain by an application to an open Jackson network. Adv. Appl. Probab. 24(02):343–376.CrossrefGoogle Scholar
  • Howard RA, Matheson JE (1972) Risk-sensitive Markov decision processes. Management Sci. 18(7):356–369.LinkGoogle Scholar
  • Jacobson D (1973) Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games. IEEE Trans. Automatic Control 18(2):124–131.CrossrefGoogle Scholar
  • James MR, Baras JS, Elliott RJ (1994) Risk-sensitive control and dynamic games for partially observed discrete-time nonlinear systems. IEEE Trans. Automatic Control 39(4):780–792.CrossrefGoogle Scholar
  • Jaśkiewicz A (2007) Average optimality for risk-sensitive control with general state space. Ann. Appl. Probab. 17(2):654–675.CrossrefGoogle Scholar
  • Klenke A (2013) Probability Theory: A Comprehensive Course (Springer, Berlin).Google Scholar
  • Klompstra MB (2000) Nash equilibria in risk-sensitive dynamic games. IEEE Trans. Automatic Control 45(7):1397–1401.CrossrefGoogle Scholar
  • Lim AE, Zhou XY (2001) Risk-sensitive control with HARA utility. IEEE Trans. Automatic Control 46(4):563–578.CrossrefGoogle Scholar
  • Meyn S, Tweedie R (2009) Markov Chains and Stochastic Stability, 2nd ed. (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Nagai H (2003) Optimal strategies for risk-sensitive portfolio optimization problems for general factor models. SIAM J. Control Optim. 41(6):1779–1800.CrossrefGoogle Scholar
  • Nowak AS (2005) Notes on risk-sensitive Nash equilibria. Advances in Dynamic Games (Birkhäuser, Boston), 95–109.CrossrefGoogle Scholar
  • Rothblum UG (1984) Multiplicative Markov decision chains. Math. Oper. Res. 9(1):6–24.LinkGoogle Scholar
  • Spieksma F, Tweedie R (1994) Strengthening ergodicity to geometric ergodicity for Markov chains. Stochastic Models 10(1):45–74.CrossrefGoogle Scholar
  • Whittle P (1981) Risk-sensitive linear/quadratic/Gaussian control. Adv. Appl. Probab. 13(04):764–777.CrossrefGoogle Scholar
  • Whittle P (1990) Risk-Sensitive Optimal Control (John Wiley & Sons, New York).Google Scholar
  • Whittle P (1996) Optimal Control: Basics and Beyond (John Wiley & Sons, Chichester, UK).Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.