Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

Rolando Cavazos-Cadena
Corresponding Author
Rolando Cavazos-Cadena
[email protected]
http://orcid.org/0000-0002-0973-9296
Departamento de Estadística y Cálculo, Universidad Autónoma Agraria Antonio Narro, Buenavista, Saltillo, Coahuila 25315, México
Search for more papers by this author

Corresponding Author

Rolando Cavazos-Cadena

Departamento de Estadística y Cálculo, Universidad Autónoma Agraria Antonio Narro, Buenavista, Saltillo, Coahuila 25315, México

Search for more papers by this author

Published Online:16 Mar 2018https://doi.org/10.1287/moor.2017.0893

References

Alanís-Durán A, Cavazos-Cadena R (2012) An optimality system for finite average Markov decision chains under risk-aversion. Kybernetika 48(1):83–104.Google Scholar
Balaji S, Meyn SP (2000) Multiplicative ergodicity and large deviations for an irreducible Markov chain. Stochastic Proc. Appl. 90(1):123–144.Crossref, Google Scholar
Bäuerle N, Rieder U (2011) Markov Decision Processes with Applications to Finance (Springer, New York).Crossref, Google Scholar
Bäuerle N, Rieder U (2014) More risk-sensitive Markov decision processes. Math. Oper. Res. 39(1):105–120.Link, Google Scholar
Billingsley P (1995) Probability and Measure, 3rd ed. (John Wiley & Sons, New York).Google Scholar
Borkar VS, Meyn SP (2002) Risk-sensitive optimal control for Markov decision process with monotone cost. Math. Oper. Res. 27(1):192–209.Link, Google Scholar
Canbolat PG (2014) Optimal halting policies in Markov population decision chains with constant risk posture. Ann. Opr. Res. 222:227–237.Crossref, Google Scholar
Cavazos-Cadena R (2009) The risk-sensitive Poisson equation for a communicating Markov chain on a denumerable state space. Kybernetika 45(5):716–736.Google Scholar
Cavazos-Cadena R (2009) Solutions of the average cost optimality equation for finite Markov decision chains: Risk-sensitive and risk-neutral criteria. Math. Method Oper. Res. 70(3):541–566.Crossref, Google Scholar
Cavazos-Cadena R, Fernández-Gaucherand E (2002) Risk-sensitive control in communicating average Markov decision chains. Dror M, L’Ecuyer P, Szidarovsky F, eds. Modelling Uncertainty: An Examination of Stochastic Theory, Methods and Applications (Kluwer, Boston), 525–544.Crossref, Google Scholar
Cavazos-Cadena R, Hernández-Hernández D (2005) A characterization of the optimal risk-sensitive average cost in finite controlled Markov chains. Ann. Appl. Probab. 15(1):175–212.Crossref, Google Scholar
Cavazos-Cadena R, Hernández-Hernández D (2006) A system of Poisson equations for a nonconstant Varadhan functional on a finite state space. Appl. Math. Optim. 53(1):101–119.Crossref, Google Scholar
Cavazos-Cadena R, Salem-Silva F (2010) The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on borel spaces. Appl. Math. Optim. 61(2):167–190.Crossref, Google Scholar
Denardo EV, Rothblum UG (2006) A turnpike theorem for a risk-sensitive Markov decision process with stopping. SIAM J. Control Optim. 45(2):414–431.Crossref, Google Scholar
Di Masi GB, Stettner L (1999) Risk-sensitive control of discrete time Markov processes with infinite horizon. SIAM J. Control Optim. 38(1):61–78.Crossref, Google Scholar
Di Masi GB, Stettner L (2000) Infinite horizon risk sensitive control of discrete time Markov processes with small risk. Systems Control Lett. 40:15–20.Crossref, Google Scholar
Di Masi GB, Stettner L (2007) Infinite horizon risk sensitive control of discrete time Markov processes under minorization property. SIAM J. Control Optim. 46(1):231–252.Crossref, Google Scholar
Feinberg EA, Huang J (2017) On the reduction of total-cost and average-cost MDPs to discounted MDPs. Naval Res. Logist. ePub ahead of print May 25, http://dx.doi.org/10.1002/nav.21743.Crossref, Google Scholar
Hernández-Hernández D, Marcus SI (1999) Existence of risk sensitive optimal stationary polices for controlled Markov processes. Appl. Math. Opt. 40(3):273–285.Crossref, Google Scholar
Hernández-Lerma O (1989) Adaptive Markov Control Processes (Springer, New York).Crossref, Google Scholar
Hordijk A (1974) Dynamic Programming and Markov Potential Theory. Mathematical Centre tracts (Mathematisch Centrum, Netherlands).Google Scholar
Howard RA, Matheson JE (1972) Risk-sensitive Markov decision processes. Manage. Sci. 18(7):356–369.Link, Google Scholar
Jaśkiewicz A (2007) Average optimality for risk sensitive control with general state space. Ann. Appl. Probab. 17(2):654–675.Crossref, Google Scholar
Kontoyiannis I, Meyn SP (2003) Spectral theory and limit theorems for geometrically ergodic Markov processes. Ann. App. Probab. 13(1):304–362.Crossref, Google Scholar
Meyer CD (2000) Matrix Analysis and Applied Linear Algebra (SIAM, Philadelphia).Crossref, Google Scholar
Pitera M, Stettner L (2016) Long run risk sensitive portfolio with general factors. Math. Methods Oper. Res. 82(2):265–293.Crossref, Google Scholar
Puterman ML (2005) Markov Decision Processes: Discrete Stochastic Dynamic Programming (John Wiley & Sons, New York).Google Scholar
Shen Y, Stannat W, Obermayer K (2013) Risk-sensitive Markov control processes. SIAM J. Control Optim. 51(5):3652–3672.Crossref, Google Scholar
Sladký K (2008) Growth rates and average optimality in risk-sensitive Markov decision chains. Kybernetika 44(2):205–226.Google Scholar
Stettner L (1999) Risk sensitive portfolio optimization. Math. Methods Oper. Res. 50(3):463–474.Crossref, Google Scholar
Thomas LC (2002) Connectedness conditions for denumerable state Markov decision processes. Hartley R, Thomas LC, White DJ, eds. Recent Developments in Markov Decision Processes (Academic Press, New York), 181–204.Google Scholar
Tijms HC (2003) A First Course in Stochastic Models (John Wiley & Sons, New York).Crossref, Google Scholar

cover image Mathematics of Operations Research

Volume 43, Issue 3

August 2018

Pages 693-1050, C2

Article Information

Metrics

Information

Received:June 26, 2016
Accepted:July 25, 2017
Published Online:March 16, 2018

Cite as

Rolando Cavazos-Cadena (2018) Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains. Mathematics of Operations Research 43(3):1025-1050.

https://doi.org/10.1287/moor.2017.0893

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains

References

Volume 43, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News