Small-Data, Large-Scale Linear Optimization with Uncertain Objectives

Published Online:https://doi.org/10.1287/mnsc.2019.3554

References

  • Ban G-Y, Rudin C (2014) The big data newsvendor: Practical insights from machine learning. Oper. Res. 67(1):90–108.LinkGoogle Scholar
  • Ban G-Y, Gallien J, Mersereau A (2018) Dynamic procurement of new products with covariate information: The residual tree method. Manufacturing Service Oper. Management 21(4):798–815.Google Scholar
  • Belloni A, Chernozhukov V (2011) ℓ1-penalized quantile regression in high-dimensional sparse models. Ann. Statist. 39(1):82–130.CrossrefGoogle Scholar
  • Ben-Tal A, Nemirovski A (2000) Robust solutions of linear programming problems contaminated with uncertain data. Math. Programming 88(3):411–424.CrossrefGoogle Scholar
  • Ben-Tal A, Nemirovski A (2002) Robust optimization–methodology and applications. Math. Programming 92(3):453–480.CrossrefGoogle Scholar
  • Berger JO (2013) Statistical Decision Theory and Bayesian Analysis (Springer-Verlag, New York).Google Scholar
  • Bertsimas D, Kallus N (2019) From predictive to prescriptive analytics. Management Sci., ePub ahead of print August 23, https://pubsonline.informs.org/doi/10.1287/mnsc.2018.3253.Google Scholar
  • Bertsimas D, Sim M (2004) The price of robustness. Oper. Res. 52(1):35–53.LinkGoogle Scholar
  • Bertsimas D, Tsitsiklis JN (1997) Introduction to Linear Optimization (Athena Scientific, Belmont, MA).Google Scholar
  • Bertsimas D, Gupta V, Kallus N (2018a) Data-driven robust optimization. Math. Programming 167(2):235–292.CrossrefGoogle Scholar
  • Bertsimas D, Gupta V, Kallus N (2018b) Robust sample average approximation. Math. Programming 171(1–2):217–282.CrossrefGoogle Scholar
  • Bezanson J, Edelman A, Karpinski S, Shah VB (2017) Julia: A fresh approach to numerical computing. SIAM Rev. 59(1):65–98.CrossrefGoogle Scholar
  • Bickel PJ, Ritov Y, Tsybakov AB (2009) Simultaneous analysis of Lasso and Dantzig selector. Ann. Statist. 37(4):1705–1732.CrossrefGoogle Scholar
  • Bousquet O, Elisseeff A (2002) Stability and generalization. J. Machine Learn. Res. 2(3):499–526.Google Scholar
  • Bühlmann P, Geer SA (2011) Statistics for High-Dimensional Data (Springer-Verlag, Berlin).CrossrefGoogle Scholar
  • Candès EJ, Recht B (2009) Exact matrix completion via convex optimization. Foundations Comput. Math. 9(6):717–772.CrossrefGoogle Scholar
  • Candès EJ, Sing-Long CA, Trzasko JD (2013) Unbiased risk estimates for singular value thresholding and spectral estimators. IEEE Trans. Signal Processing 61(19):4643–4657.CrossrefGoogle Scholar
  • Chen X, Sim M, Sun P (2007) A robust optimization perspective on stochastic programming. Oper. Res. 55(6):1058–1071.LinkGoogle Scholar
  • Chen X, Owen Z, Pixton C, Simchi-Levi D (2015) A statistical learning approach to personalization in revenue management. Working paper, Massachussetts Institute of Technology, Cambridge.Google Scholar
  • Delage E, Ye Y (2010) Distributionally robust optimization under moment uncertainty with application to data-driven problems. Oper. Res. 58(3):595–612.LinkGoogle Scholar
  • Donoho DL, Johnstone IM (1995) Adapting to unknown smoothness via wavelet shrinkage. J. Amer. Statist. Assoc. 90(432):1200–1224.CrossrefGoogle Scholar
  • Efron B (2012) Large-scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction (Cambridge University Press, Cambridge, UK).Google Scholar
  • Efron B, Morris C (1973) Stein’s estimation rule and its competitors—An empirical Bayes approach. J. Amer. Statist. Assoc. 68(341):117–130.Google Scholar
  • Efron B, Morris C (1975) Data analysis using Stein’s estimator and its generalizations. J. Amer. Statist. Assoc. 70(350):311–319.CrossrefGoogle Scholar
  • Elmachtoub AN, Grigas P (2017) Smart “predict, then optimize.” Working paper, Columbia University, New York.Google Scholar
  • Esfahani PM, Kuhn D (2018) Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Programming 171(1–2):115–166.CrossrefGoogle Scholar
  • Ferreira KJ, Lee BHA, Simchi-Levi D (2015) Analytics for an online retailer: Demand forecasting and price optimization. Manufacturing Service Oper. Management 18(1):69–88.LinkGoogle Scholar
  • Frees EW (1991) Linear regression and U-statistics. Sankhya Ser. A 53(1):84–96.Google Scholar
  • Friedman J, Hastie T, Tibshirani R (2001) The Elements of Statistical Learning (Springer-Verlag, New York).Google Scholar
  • Gupta V (2019) Near-optimal Bayesian ambiguity sets for distributionally robust optimization. Management Sci. 65(9):3949–4450.Google Scholar
  • Johnstone IM (2015) Gaussian estimation: Sequence and wavelet models. Working paper, Stanford University, Stanford, CA.Google Scholar
  • Kleywegt AJ, Shapiro A, Homem-de Mello T (2002) The sample average approximation method for stochastic discrete optimization. SIAM J. Optim. 12(2):479–502.CrossrefGoogle Scholar
  • Lan G (2012) An optimal method for stochastic composite optimization. Math. Programming 133(1):365–397.CrossrefGoogle Scholar
  • Lim AEB, Shanthikumar JG, Vahn GY (2011) Conditional value-at-risk in portfolio optimization: Coherent but fragile. Oper. Res. Lett. 39(3):163–171.CrossrefGoogle Scholar
  • Liyanage LH, Shanthikumar JG (2005) A practical inventory control policy using operational statistics. Oper. Res. Lett. 33(4):341–348.CrossrefGoogle Scholar
  • Morris CN (1983) Parametric empirical Bayes inference: Theory and applications. J. Amer. Statist. Assoc. 78(381):47–55.CrossrefGoogle Scholar
  • Mukherjee G, Brown LD, Rusmevichientong P (2015) Empirical Bayes prediction for the multivariate newsvendor loss function. Working paper, University of Southern California, Los Angeles.Google Scholar
  • Negahban S, Yu B, Wainwright MJ, Ravikumar PK (2012) A unified framework for high-dimensional analysis of m-estimators with decomposable regularizers. Statist. Sci. 27(4):538–557.CrossrefGoogle Scholar
  • Nemirovski A, Juditsky A, Lan G, Shapiro A (2009) Robust stochastic approximation approach to stochastic programming. SIAM J. Optim. 19(4):1574–1609.CrossrefGoogle Scholar
  • Nesterov Y (2005) Smooth minimization of non-smooth functions. Math. Programming 103(1):127–152.CrossrefGoogle Scholar
  • Nesterov Y (2009) Primal-dual subgradient methods for convex problems. Math. Programming 120(1):221–259.CrossrefGoogle Scholar
  • Pani A, Raghavan S, Sahin M (2017) Large-scale advertising portfolio optimization in online marketing. Working paper, Adobe Systems Inc., San Jose, CA.Google Scholar
  • Pollard D (1990) Empirical Processes: Theory and Applications, NSFCBMS Regional Conference Series in Probability and Statistics, vol. 2 (Institute of Mathematical Statistics, Hayward, CA).Google Scholar
  • Ross N (2011) Fundamentals of Stein’s method. Probab. Surveys 8:210–293.CrossrefGoogle Scholar
  • Rusmevichientong P, Williamson DP (2006) An adaptive algorithm for selecting profitable keywords for search-based advertising services. Feigenbaum J, Chuang J, Pennock DM, eds. Proc. 7th ACM Conf. Electronic Commerce (ACM, New York), 260–269.Google Scholar
  • Ruszczynski AP, Shapiro A (2003) Stochastic Programming, vol. 10 (Elsevier, Amsterdam).CrossrefGoogle Scholar
  • Shapiro A, Dentcheva D, Ruszczyński A (2009) Lectures on Stochastic Programming: Modeling and Theory (Society for Industrial and Applied Mathematics, Philadelphia).CrossrefGoogle Scholar
  • Stein CM (1981) Estimation of the mean of a multivariate normal distribution. Ann. Statist. 9(6):1135–1151.CrossrefGoogle Scholar
  • Tibshirani RJ, Taylor J (2012) Degrees of freedom in lasso problems. Ann. Statist. 40(2):1198–1232.CrossrefGoogle Scholar
  • Van der Vaart AW (2000) Asymptotic Statistics, Cambridge Series in Statistical and Probabilistic Mathematics, vol. 3 (Cambridge University Press, Cambridge, UK).Google Scholar
  • Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans. Neural Networks 10(5):988–999.CrossrefGoogle Scholar
  • Wainwright MJ (2009) Information-theoretic limits on sparsity recovery in the high-dimensional and noisy setting. IEEE Trans. Inform. Theory 55(12):5728–5741.CrossrefGoogle Scholar
  • Wainwright MJ (2019) High-Dimensional Statistics: A Non-Asymptotic Viewpoint, Cambridge Series in Statistical and Probabilistic Mathematics, vol. 48 (Cambridge University Press, Cambridge, UK).Google Scholar
  • Xie X, Kou SC, Brown LD (2012) Sure estimates for a heteroscedastic hierarchical model. J. Amer. Statist. Assoc. 107(500):1465–1479.CrossrefGoogle Scholar
  • Zhang C-H (2003) Compound decision theory and empirical Bayes methods. Ann. Statist. 31(2):379–390.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.