On Data-Driven Prescriptive Analytics with Side Information: A Regularized Nadaraya–Watson Approach

Published Online:https://doi.org/10.1287/msom.2024.0997

References

  • Alam MM, Uddin G (2009) Relationship between interest rate and stock price: Empirical evidence from developed and developing countries. Internat. J. Bus. Management 4(3):43–51.Google Scholar
  • Ban GY, Rudin C (2019) The big data newsvendor: Practical insights from machine learning. Oper. Res. 67(1):90–108.LinkGoogle Scholar
  • Ban GY, Gallien J, Mersereau AJ (2019) Dynamic procurement of new products with covariate information: The residual tree method. Manufacturing Service Oper. Management 21(4):798–815.LinkGoogle Scholar
  • Bazier-Matte T, Delage E (2020) Generalization bounds for regularized portfolio selection with market side information. INFOR Inform. Systems Oper. Res. 58(2):374–401.CrossrefGoogle Scholar
  • Bennouna M, Van Parys BPG (2025) Learning and decision-making with data: Optimal formulations and phase transitions. Math. Programming, 1–93.Google Scholar
  • Bertsimas D, Kallus N (2020) From predictive to prescriptive analytics. Management Sci. 66(3):1025–1044.LinkGoogle Scholar
  • Bertsimas D, McCord C (2019) From predictions to prescriptions in multistage optimization problems. Preprint, submitted April 26, https://arxiv.org/abs/1904.11637.Google Scholar
  • Bertsimas D, Van Parys B (2022) Bootstrap robust prescriptive analytics. Math. Programming 195(1):39--78.Google Scholar
  • Bertsimas D, McCord C, Sturt B (2019) Dynamic optimization with side information. Preprint, submitted July 17, https://arxiv.org/abs/1907.07307.Google Scholar
  • Bhatti HJ, Danilovic M (2018) Making the world more sustainable: Enabling localized energy generation and distribution on decentralized smart grid systems. World J. Engrg. Tech. 6(2):350–382.CrossrefGoogle Scholar
  • Brandt MW, Santa-Clara P, Valkanov R (2009) Parametric portfolio policies: Exploiting characteristics in the cross-section of equity returns. Rev. Financial Stud. 22(9):3411–3447.CrossrefGoogle Scholar
  • Chaudhuri K, Kakade SM, Livescu K, Sridharan K (2009) Multi-view clustering via canonical correlation analysis. Proc. 26th Annual Internat. Conf. Machine Learn. (Association for Computing Machinery, New York), 129–136.Google Scholar
  • Chen L, Plambeck EL (2008) Dynamic inventory management with learning about the demand distribution and substitution probability. Manufacturing Service Oper. Management 10(2):236–256.LinkGoogle Scholar
  • Chen X, Owen Z, Pixton C, Simchi-Levi D (2022) A statistical learning approach to personalization in revenue management. Management Sci. 68(3):1923–1937.LinkGoogle Scholar
  • Conejo AJ, Castillo E, Mínguez R, Milano F (2005) Locational marginal price sensitivities. IEEE Trans. Power Systems 20(4):2026–2033.CrossrefGoogle Scholar
  • Dembo A, Zeitouni O (1998) Large Deviations Techniques and Applications, Stochastic Modelling and Applied Probability, vol. 38 (Springer, Berlin).CrossrefGoogle Scholar
  • Duchi J, Namkoong H (2019) Variance-based regularization with convex objectives. J. Machine Learn. Res. 20(1):2450–2504.Google Scholar
  • Eichelsbacher P, Löwe M (2003) Moderate deviations for I.I.D. random variables. ESAIM Probab. Statist. 7:209–218.CrossrefGoogle Scholar
  • El Balghiti O, Elmachtoub AN, Grigas P, Tewari A (2019) Generalization bounds in the predict-then-optimize framework. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, Garnett R, eds. Advances in Neural Information Processing Systems, vol. 32 (Curran Associates, Red Hook, NY).Google Scholar
  • Elmachtoub AN, Grigas P (2022) Smart “predict, then optimize”. Management Sci. 68(1):9–26.LinkGoogle Scholar
  • Esteban-Pérez A, Morales JM (2022) Distributionally robust stochastic programs with side information based on trimmings. Math. Programming 195(1):1069–1105.Google Scholar
  • Estes A, Richard JP (2023) Smart predict-then-optimize for two-stage linear programs with side information. INFORMS J. Optim. 5(3):295–320.Google Scholar
  • Genton MG (2001) Classes of kernels for machine learning: A statistics perspective. J. Machine Learn. Res. 2:299–312.Google Scholar
  • Gotoh Jy, Kim MJ, Lim AEB (2018) Robust empirical optimization is almost the same as mean–variance optimization. Oper. Res. Let. 46(4):448–452.CrossrefGoogle Scholar
  • Gupta V (2019) Near-optimal Bayesian ambiguity sets for distributionally robust optimization. Management Sci. 65(9):4242–4260.LinkGoogle Scholar
  • Gurobi Optimization, LLC (2024) Gurobi Optimizer reference manual. https://www.gurobi.com.Google Scholar
  • Györfi L, Kohler M, Krzyżak A, Walk H (2006) A Distribution-Free Theory of Nonparametric Regression (Springer, New York).Google Scholar
  • Hanasusanto GA, Kuhn D (2013) Robust data-driven dynamic programming. Burges CJ, Bottou L, Welling M, Ghahramani Z, Weinberger KQ, eds. Advances in Neural Information Processing Systems, vol. 26 (Curran Associates, Red Hook, NY), 827–835.Google Scholar
  • Hannah L, Dunson DB (2011) Approximate dynamic programming for storage problems. Proc. 28th Internat. Conf. Machine Learn. (Omnipress, Madison, WI), 337–344.Google Scholar
  • Hannah L, Powell W, Blei D (2010) Nonparametric density estimation for stochastic optimization with an observable state variable. Lafferty J, Williams C, Shawe-Taylor J, Zemel R, Culotta A, eds. Advances in Neural Information Processing Systems, vol. 23 (Curran Associates, Red Hook, NY), 820–828.Google Scholar
  • Ho-Nguyen N, Kılınç-Karzan F (2022) Risk guarantees for end-to-end prediction and optimization processes. Management Sci. 68(12):8680–8698.LinkGoogle Scholar
  • Hu Y, Kallus N, Mao X (2022) Fast rates for contextual linear optimization. Management Sci. 68(6):4236–4245.Google Scholar
  • Kannan R, Bayraksan G, Luedtke JR (2024) Residuals-based distributionally robust optimization with covariate information. Math. Programming 207(1):369–425. Google Scholar
  • Kannan R, Bayraksan G, Luedtke JR (2025) Technical note--data-driven sample average approximation with covariate information. Oper. Res. 73(6):3245–3259.Google Scholar
  • Kannan R, Bayraksan G, Luedtke J (2021) Heteroscedasticity-aware residuals-based contextual stochastic optimization. Preprint, submitted January 8, https://arxiv.org/abs/2101.03139.Google Scholar
  • Kim JH, Powell WB (2011) Optimal energy commitments with storage and intermittent supply. Oper. Res. 59(6):1347–1360.LinkGoogle Scholar
  • Kleywegt AJ, Shapiro A, Homem-de-Mello T (2002) The sample average approximation method for stochastic discrete optimization. SIAM J. Optim. 12(2):479–502.CrossrefGoogle Scholar
  • Lam H (2016) Robust sensitivity analysis for stochastic systems. Math. Oper. Res. 41(4):1248–1275.LinkGoogle Scholar
  • Lam H (2019) Recovering best statistical guarantees via the empirical divergence-based distributionally robust optimization. Oper. Res. 67(4):1090–1105.AbstractGoogle Scholar
  • Loke G, Tang Q, Xiao Y (2020) Decision-driven regularization: Harmonizing the predictive and prescriptive. Preprint, submitted June 17, https://doi.org/10.2139/ssrn.3623006.Google Scholar
  • Maurer A, Pontil M (2009) Empirical Bernstein bounds and sample variance penalization. Preprint, submitted July 21, https://arxiv.org/abs/0907.3740.Google Scholar
  • Miller HR (2011) Optimization: Foundations and Applications (John Wiley & Sons, Hoboken, NJ).Google Scholar
  • Mokkadem A, Pelletier M, Thiam B (2008) Large and moderate deviations principles for kernel estimators of the multivariate regression. Math. Methods Statist. 17(2):146–172.CrossrefGoogle Scholar
  • Nadaraya EA (1964) On estimating regression. Theory Probab. Its Appl. 9(1):141–142.CrossrefGoogle Scholar
  • Parzen E (1962) On estimation of a probability density function and mode. Ann. Math. Statist. 33(3):1065–1076.CrossrefGoogle Scholar
  • Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, et al. (2011) Scikit-learn: Machine learning in Python. J. Machine Learn. Res. 12(85):2825–2830. Google Scholar
  • Sen S, Deng Y (2018) Learning enabled optimization: Towards a fusion of statistical learning and stochastic programming. Technical report, University of Southern California, Los Angeles.Google Scholar
  • Shapiro A, Dentcheva D, Ruszczynski A (2009) Lectures on Stochastic Programming: Modeling and Theory (Society For Industrial and Applied Mathematics, Philadelphia).CrossrefGoogle Scholar
  • Silverman BW (1986) Density Estimation for Statistics and Data Analysis, Monographs on Statistics and Applied Probability, vol. 26 (Chapman & Hall, London).Google Scholar
  • Sim M, Tang Q, Zhou M, Zhu T (2021) The analytics of robust satisficing—Predict, optimise, satisfice, then fortify. Preprint, submitted April 20, https://doi.org/10.2139/ssrn.3829562.Google Scholar
  • Srivastava PR, Sarkar P, Hanasusanto GA (2019) A robust spectral clustering algorithm for sub-Gaussian mixture models with outliers. Preprint, submitted December 16, https://arxiv.org/abs/1912.07546.Google Scholar
  • Vershynin R (2010) Introduction to the non-asymptotic analysis of random matrices. Preprint, submitted November 12, https://arxiv.org/abs/1011.3027.Google Scholar
  • Wainwright MJ (2019) High-Dimensional Statistics: A Non-Asymptotic Viewpoint, Cambridge Series in Statistical and Probabilistic Mathematics, vol. 48 (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Ward DM (2013) The effect of weather on grid systems and the reliability of electricity supply. Climatic Change 121(1):103–113.CrossrefGoogle Scholar
  • Watson GS (1964) Smooth regression analysis. Sankhyā Indian J. Statist. Ser. A 26(4):359–372.Google Scholar
  • Xia Y, Cosgrove BA, Ek MB, Sheffield J, Luo L, Wood EF, Mo K, NDLAS team (2013) Overview of the North American Land Data Assimilation System (NLDAS). Land Surface Observation, Modeling and Data Assimilation (World Scientific, Singapore), 337–377.CrossrefGoogle Scholar
  • Xu H, Caramanis C, Mannor S (2016) Statistical optimization in high dimensions. Oper. Res. 64(4):958–979.LinkGoogle Scholar
  • Yan B, Sarkar P (2021) Covariate regularized community detection in sparse graphs. J. Amer. Statist. Assoc. 116(534):734–745.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.