On Data-Driven Prescriptive Analytics with Side Information: A Regularized Nadaraya–Watson Approach

Yijie Wang
https://orcid.org/0000-0002-5705-892X
School of Economics and Management, Tongji University, Shanghai 200092, China

Prateek R. Srivastava
https://orcid.org/0000-0002-1188-4206
Graduate Program in Operations Research and Industrial Engineering, Cockrell School of Engineering, The University of Texas at Austin, Austin, Texas 78712

Grani A. Hanasusanto
https://orcid.org/0000-0003-4900-2958
Department of Industrial and Enterprise Systems Engineering, University of Illinois Urbana–Champaign, Urbana, Illinois 61801

Chin Pang Ho (Corresponding Author)
https://orcid.org/0000-0002-2143-978X
Department of Data Science, City University of Hong Kong, Hong Kong

Published Online: 5 Jan 2026
https://doi.org/10.1287/msom.2024.0997


Manufacturing & Service Operations Management

Volume 28, Issue 3

May-June 2026

Pages iv-xix, 687-1009, iii

Article Information

Received: April 16, 2024
Accepted: November 20, 2025
Published Online: January 5, 2026

Cite as

Yijie Wang, Prateek R. Srivastava, Grani A. Hanasusanto, Chin Pang Ho (2026) On Data-Driven Prescriptive Analytics with Side Information: A Regularized Nadaraya–Watson Approach. Manufacturing & Service Operations Management 28(3):841–859.

https://doi.org/10.1287/msom.2024.0997

Acknowledgments

The authors thank the area editor, associate editor, and two anonymous referees whose reviews helped substantially improve the quality of this paper. Yijie Wang and Prateek R. Srivastava contributed equally to this work.
