Distributionally Robust Markov Decision Processes
Published Online: 1 May 2012
https://doi.org/10.1287/moor.1120.0540

