Distributionally Robust Markov Decision Processes and Their Connection to Risk Measures
Published Online:23 Nov 2021https://doi.org/10.1287/moor.2021.1187
References
- [1] (2012) Entropic value-at-risk: A new coherent risk measure. J. Optim. Theory Appl. 155(3):1105–1123.Crossref, Google Scholar
- [2] (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide, 3rd ed. (Springer-Verlag, Berlin).Google Scholar
- [3] (2015) Optimal management of wind energy with storage: Structural implications for policy and market design. J. Energy Engrg. 141(1):B4014002.Crossref, Google Scholar
- [4] (2012) Convexity and Optimization in Banach Spaces, 4th ed. (Springer Netherlands, Dordrecht, Netherlands).Crossref, Google Scholar
- [5] (2017) Optimal dividend payout model with risk sensitive preferences. Insurance Math. Econom. 73:82–93.Crossref, Google Scholar
- [6] (2018) Stochastic optimal growth model with risk sensitive preferences. J. Econom. Theory 173:181–200.Crossref, Google Scholar
- [7] (2011) Markov decision processes with average-value-at-risk criteria. Math. Methods Oper. Res. 74(3):361–379.Crossref, Google Scholar
- [8] (2011) Markov Decision Processes with Applications to Finance (Springer-Verlag, Berlin).Crossref, Google Scholar
- [9] (2020) Markov Decision Processes Under Ambiguity, vol. 122 (Banach Center Publications), 25–39.Google Scholar
- [10] (2019) Adaptive robust control under model uncertainty. SIAM J. Control Optim. 57(2):925–946.Crossref, Google Scholar
- [11] (2015) Risk-sensitive and robust decision-making: A CVaR optimization approach. Adv. Neural Inform. Processing Systems 28:1522–1530.Google Scholar
- [12] (2010) Families of alpha- beta- and gamma- divergences: Flexible and robust measures of similarities. Entropy 12(6):1532–1568.Crossref, Google Scholar
- [13] (1961) Risk, ambiguity, and the Savage axioms. Quart. J. Econom. 75(4):643–669.Crossref, Google Scholar
- [14] (2003) Recursive multiple-priors. J. Econom. Theory 113(1):1–31.Crossref, Google Scholar
- [15] (2013) Rényi divergence measures for commonly used univariate continuous distributions. Inform. Sci. 249:124–131.Crossref, Google Scholar
- [16] (1989) Maxmin expected utility with a non-unique prior. J. Math. Econom. 18(2):141–153.Crossref, Google Scholar
- [17] (2020) Robust and risk-sensitive Markov decision processes with applications to dynamic optimal reinsurance. PhD thesis, Karlsruhe Institute of Technology, Karlsruhe, Germany.Google Scholar
- [18] (2002) Minimax control of discrete-time stochastic systems. SIAM J. Control Optim. 41(5):1626–1659.Crossref, Google Scholar
- [19] (1999) Five games and two objective functions that promote robustness. Manuscript, University of Chicago, Stanford University, and Hoover Institution, Chicago. https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.35.2668&rep=rep1&type=pdf.Google Scholar
- [20] (1989) Adaptive Markov Control Processes, Applied Mathematical Sciences, vol. 79 (Springer-Verlag, New York).Crossref, Google Scholar
- [21] (1996) Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer-Verlag, New York).Crossref, Google Scholar
- [22] (1999) Further Topics on Discrete-Time Markov Control Processes (Springer-Verlag, New York).Crossref, Google Scholar
- [23] (2005) Robust dynamic programming. Math. Oper. Res. 30(2):257–280.Link, Google Scholar
- [24] (2011) Stochastic games with unbounded payoffs: Applications to robust control in economics. Dynam. Games Appl. 1(2):253–279.Crossref, Google Scholar
- [25] (2014) Robust Markov control processes. J. Math. Anal. Appl. 420(2):1337–1353.Crossref, Google Scholar
- [26] (2006) Ambiguity aversion, robustness, and the variational representation of preferences. Econometrica 74(6):1447–1498.Crossref, Google Scholar
- [27] (2001) Functional Analysis: An Introduction to Banach Space Theory (John Wiley & Sons, New York).Google Scholar
- [28] (1997) How does the value function of a Markov decision process depend on the transition probabilities? Math. Oper. Res. 22(4):872–885.Link, Google Scholar
- [29] (2005) Robust control of Markov decision processes with uncertain transition matrices. Oper. Res. 53(5):780–798.Link, Google Scholar
- [30] (2007) Metric Spaces (Springer-Verlag, London).Google Scholar
- [31] (2015) Premiums and reserves, adjusted by distortions. Scandinavian Actuarial J. 2015(4):332–351.Crossref, Google Scholar
- [32] (2020) Entropy based risk measures. Eur. J. Oper. Res. 285(1):223–236.Crossref, Google Scholar
- [33] (1978) Measurable selection theorems for optimization problems. Manuscripta Math. 24(1):115–131.Google Scholar
- [34] (2009) On the distributional transform, Sklar’s theorem, and the empirical copula process. J. Statist. Planning Inference 139(11):3921–3927.Crossref, Google Scholar
- [35] (2013) Mathematical Risk Analysis: Dependence, Risk Bounds, Optimal Allocations and Portfolios (Springer-Verlag, Berlin).Crossref, Google Scholar
- [36] (2010) Risk-averse dynamic programming for Markov decision processes. Math. Programming 125(2):235–261.Crossref, Google Scholar
- [37] (2013) Risk-sensitive Markov control processes. SIAM J. Control Optim. 51(5):3652–3672.Crossref, Google Scholar
- [38] (1958) On general minimax theorems. Pacific J. Math. 8(1):171–176.Crossref, Google Scholar
- [39] (2013) Robust Markov decision processes. Math. Oper. Res. 38(1):153–183.Link, Google Scholar
- [40] (2010) Distributionally robust Markov decision processes. Adv. Neural Inform. Processing Systems 23(2):2505–2513.Google Scholar
- [41] (2017) A convex optimization approach to distributionally robust Markov decision processes with Wasserstein distance. IEEE Control Systems Lett. 1(1):164–169.Crossref, Google Scholar

