A Pareto Dominance Principle for Data-Driven Optimization
Published Online:19 Jan 2024https://doi.org/10.1287/opre.2021.0609
References
- (2007) Infinite Dimensional Analysis: A Hitchhiker’s Guide (Springer, Berlin).Google Scholar
- (2000) Probability and Measure Theory (Academic Press, New York).Google Scholar
- (2019) The big data newsvendor: Practical insights from machine learning. Oper. Res. 67(1):90–108.Link, Google Scholar
- (2014) Information and Exponential Families in Statistical Theory (John Wiley & Sons, Hoboken, NJ).Crossref, Google Scholar
- (2013) Robust solutions of optimization problems affected by uncertain probabilities. Management Sci. 59(2):341–357.Link, Google Scholar
- (2021) Learning and decision-making with data: Optimal formulations and phase transitions. Preprint, submitted September 14, https://arxiv.org/abs/ 2109.06911.Google Scholar
- (1997) Topological Spaces: Including a Treatment of Multi-Valued Functions, Vector Spaces, and Convexity (Courier Corporation, North Chelmsford, MA).Google Scholar
- (2009) Convex Optimization Theory (Athena Scientific, Nashua, NH).Google Scholar
- (2020) From predictive to prescriptive analytics. Management Sci. 66(3):1025–1044.Link, Google Scholar
- (2018) Robust sample average approximation. Math. Programming 171(1–2):217–282.Crossref, Google Scholar
- (1961) Statistical methods in Markov chains. Ann. Math. Statist. 32:12–40.Crossref, Google Scholar
- (1947) Conditional expectation and unbiased sequential estimation. Ann. Math. Statist. 18:105–110.Crossref, Google Scholar
- (2019) Robust Wasserstein profile inference and applications to machine learning. J. Appl. Probabilities 56(3):830–857.Crossref, Google Scholar
- (2006) Elements of Information Theory (Wiley, New York).Google Scholar
- (2010) Distributionally robust optimization under moment uncertainty with application to data-driven problems. Oper. Res. 58(3):595–612.Link, Google Scholar
- (2009) Large Deviations Techniques and Applications (Springer, Berlin).Google Scholar
- (2020) Distributional robustness and regularization in reinforcement learning. Preprint, submitted March 5, https://arxiv.org/abs/ 2003.02894.Google Scholar
- (2019) Distributionally robust optimization with correlated data from vector autoregressive processes. Oper. Res. Lett. 47(4):294–299.Crossref, Google Scholar
- (2021) Statistics of robust optimization: A generalized empirical likelihood approach. Math. Oper. Res. 46(3):946–969.Link, Google Scholar
- (2010) Probability: Theory and Examples (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2023) Finite-sample guarantees for Wasserstein distributionally robust optimization: Breaking the curse of dimensionality. Oper. Res. 71(6):2291–2306.Google Scholar
- (2019) Near-optimal Bayesian ambiguity sets for distributionally robust optimization. Management Sci. 65(9):4242–4260.Link, Google Scholar
- (2016) A comment on “computational complexity of stochastic programming problems”. Math. Programming 159(1):557–569.Crossref, Google Scholar
- (2022) Fast rates for contextual linear optimization. Management Sci. 68(6):4236–4245.Link, Google Scholar
- (1958) On a space of totally additive functions. Vestnik Leningradskogo Universiteta 13(7):52–59.Google Scholar
- (1993) Asymptotic theory for solutions in statistical estimation and stochastic programming. Math. Oper. Res. 18(1):148–162.Link, Google Scholar
- (1991) Epi-consistency of convex stochastic programs. Stochastic Stochastic Rep. 34(1–2):83–92.Crossref, Google Scholar
- (1936) On distributions admitting a sufficient statistic. Trans. Amer. Math. Soc. 39(3):399–409.Crossref, Google Scholar
- (2006) Exponential Families of Stochastic Processes (Springer, Berlin).Google Scholar
- (2019) Wasserstein distributionally robust optimization: Theory and applications in machine learning. Operations Research & Management Science in the Age of Analytics (INFORMS, Catonsville, MD), 130–166.Link, Google Scholar
- (2019) Recovering best statistical guarantees via the empirical divergence-based distributionally robust optimization. Oper. Res. 67(4):1090–1105.Abstract, Google Scholar
- (2010) Introduction: Uncertainty Quantification and Propagation (Springer, Berlin).Google Scholar
- (1998) Theory of Point Estimation (Springer, Berlin).Google Scholar
- (2021) Distributionally robust optimization with Markovian data. Meila M, Zhang T, eds. Proc. 38th Internat. Conf. Machine Learn., (PMLR, New York), 6493–6503.Google Scholar
- (2018) Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Program. 171(1–2):115–166.Crossref, Google Scholar
- (1994) Large sample estimation and hypothesis testing. Handbook of Econometrics (Elsevier, New York), 2111–2245.Crossref, Google Scholar
- (1945) Information and the accuracy attainable in the estimation of statistical parameters. Bull. Calcutta Math. Soc. 37:81–91.Google Scholar
- (1970) Convex Analysis (Princeton University Press, Princeton, NJ).Crossref, Google Scholar
- (1998) Wets Variational Analysis (Springer, Berlin).Crossref, Google Scholar
- (2010) Introduction to Probability Models (Elsevier, New York).Google Scholar
- (1989) Asymptotic properties of statistical estimators in stochastic programming. Ann. Statist. 17(2):841–858.Crossref, Google Scholar
- (1990) On differential stability in stochastic programming. Math. Programming 47(1):107–116.Crossref, Google Scholar
- (1991) Asymptotic analysis of stochastic programs. Ann. Oper. Res. 30(1–4):169–186.Crossref, Google Scholar
- (1993) Asymptotic behavior of optimal solutions in stochastic programming. Math. Oper. Res. 18(4):829–845.Link, Google Scholar
- (2003) Monte Carlo sampling methods. Stochastic Programming (Elsevier, New York), 353–425.Crossref, Google Scholar
- (2014) Lectures on Stochastic Programming: Modeling and Theory (SIAM, Philadelphia).Crossref, Google Scholar
- (2006) The optimizer’s curse: Skepticism and postdecision surprise in decision analysis. Management Sci. 52(3):311–322.Link, Google Scholar
- (2021) Robust generalization despite distribution shift via minimum discriminating information. Ranzato M, Beygelzimer A, Dauphin Y, Liang PS, Wortman Vaughan J, eds. Advances in Neural Information Processing Systems, vol. 34 (Curran Associates, Inc., Red Hook, NY), 29754–29767.Google Scholar
- (2021) Efficient data-driven optimization with noisy data. Preprint, submitted February 8, https://arxiv.org/abs/ 2102.04363.Google Scholar
- (2021) From data to decisions: Distributionally robust optimization is optimal. Management Sci. 67(6):3387–3402.Link, Google Scholar

