Debiasing ML- or AI-Generated Regressors in Partially Linear Models

Published Online:https://doi.org/10.1287/isre.2024.1370

References

  • Abbasi A, Somanchi S, Kelley K (2025) The critical challenge of using large-scale digital experiment platforms for scientific discovery. MIS Quart. 49(1):1–28.CrossrefGoogle Scholar
  • Abbasi A, Parsons J, Pant G, Sheng ORL, Sarker S (2024) Pathways for design research on artificial intelligence. Inform. Systems Res. 35(2):441–459.LinkGoogle Scholar
  • Allon G, Chen D, Jiang Z, Zhang D (2023) Machine learning and prediction errors in causal inference. Preprint, submitted June 25, https://doi.org/10.2139/ssrn.4480696.Google Scholar
  • Burtch G, McFowland E III, Yang M, Adomavicius G (2026) EnsembleIV: Creating instrumental variables from ensemble learners for robust statistical inference. Management Sci. Forthcoming.Google Scholar
  • Carroll RJ, Ruppert D, Stefanski LA (1995) Measurement Error in Nonlinear Models, vol. 105 (CRC Press, Boca Raton, FL).CrossrefGoogle Scholar
  • Chen X, Hong H, Tamer E (2005) Measurement error models with auxiliary data. Rev. Econom. Stud. 72(2):343–366.CrossrefGoogle Scholar
  • Chernozhukov V, Escanciano JC, Ichimura H, Newey WK, Robins JM (2022) Locally robust semiparametric estimation. Econometrica 90(4):1501–1535.CrossrefGoogle Scholar
  • Chernozhukov V, Chetverikov D, Demirer M, Duflo E, Hansen C, Newey W, Robins J (2018) Double/debiased machine learning for treatment and structural parameters: Double/debiased machine learning. Econom. J. 21(1):C1–C68.CrossrefGoogle Scholar
  • Christensen T, Compiani G (2026) From unstructured data to demand counterfactuals: Theory and practice. Preprint, submitted January 8, https://arxiv.org/abs/2601.05374.Google Scholar
  • de Laplace PS (1820) Théorie Analytique Des Probabilités, vol. 7 (Courcier, Paris).Google Scholar
  • Fingerhut N, Sesia M, Romano Y (2022) Coordinated double machine learning. Proc. 39th Internat. Conf. Machine Learn. (PMLR, New York), 6499–6513.Google Scholar
  • Fong C, Tyler M (2021) Machine learning predictions as regression covariates. Political Anal. 29(4):467–484.CrossrefGoogle Scholar
  • Godes D, Mayzlin D (2004) Using online conversations to study word-of-mouth communication. Marketing Sci. 23(4):545–560.LinkGoogle Scholar
  • Greene WH (2003) Econometric Analysis (Pearson Education India, Noida, India).Google Scholar
  • Guan Y, Tan Y, Wei Q, Chen G (2023) When images backfire: The effect of customer-generated images on product rating dynamics. Inform. Systems Res. 34(4):1641–1663.LinkGoogle Scholar
  • Hausman JA (1978) Specification tests in econometrics. Econometrica 46(6):1251–1271.CrossrefGoogle Scholar
  • Qiao M, Huang K-W (2021) Correcting misclassification bias in regression models with variables generated via data mining. Inform. Systems Res. 32(2):462–480.LinkGoogle Scholar
  • Schecter A, Li W (2025) A robust optimization approach to reliable statistical inference with variables generated by machine learning. Inform. Systems Res., ePub ahead of print December 24, https://doi.org/10.1287/isre.2023.0340.LinkGoogle Scholar
  • Shi B, Mao X, Yang M, Li B (2025) What, why, and how: An empiricist’s guide to double/debiased machine learning. Inform. Systems Res., ePub ahead of print December 5, https://doi.org/10.1287/isre.2024.0888.LinkGoogle Scholar
  • Shin D, He S, Lee GM, Whinston AB, Cetintas S, Lee K-C (2020) Enhancing social media analysis with visual data analytics: A deep learning approach. MIS Quart. 44(4):1459–1492.CrossrefGoogle Scholar
  • Somanchi S, Abbasi A, Kelley K, Dobolyi D, Yuan TT (2023) Examining user heterogeneity in digital experiments. ACM Trans. Inform. Systems 41(4):1–34.Google Scholar
  • Song T, Huang J, Tan Y, Yu Y (2019) Using user-and marketer-generated content for box office revenue prediction: Differences between microblogging and third-party platforms. Inform. Systems Res. 30(1):191–203.LinkGoogle Scholar
  • Wang J, Yu Y, Xue W, Tan Y (2021) Understanding the dynamics between urban transportation modes and air pollutants: Evidence from China’s COVID-19 shock. Preprint, submitted June 10, https://doi.org/10.2139/ssrn.3859071.Google Scholar
  • Wei Y, Malik N (2026) Estimation bias from machine-learned features in econometric models. Management Sci. Forthcoming.Google Scholar
  • Yang M, Adomavicius G, Burtch G, Ren Y (2018) Mind the gap: Accounting for measurement error and misclassification in variables generated via data mining. Inform. Systems Res. 29(1):4–24.LinkGoogle Scholar
  • Yang M, McFowland E III, Burtch G, Adomavicius G (2022) Achieving reliable causal inference with data-mined variables: A random forest approach to the measurement error problem. INFORMS J. Data Sci. 1(2):138–155.LinkGoogle Scholar
  • Ye Z, Zhang Z, Zhang DJ, Zhang H, Zhang R (2025) Deep learning-based causal inference for large-scale combinatorial experiments: Theory and empirical evidence. Management Sci., ePub ahead of print October 15, https://doi.org/10.1287/mnsc.2024.04625.LinkGoogle Scholar
  • Yin D, Bond SD, Zhang H (2017) Keep your cool or let it out: Nonlinear effects of expressed arousal on perceptions of consumer reviews. J. Marketing Res. 54(3):447–463.CrossrefGoogle Scholar
  • Yu Y, Tan X, Tan Y (2024) Understanding volunteer crowdsourcing from a multiplex perspective. Inform. Systems Res. 36(1):107–125.LinkGoogle Scholar
  • Yu Y, Huang S, Liu Y, Tan Y (2025) Emotions in online content diffusion. Inform. Systems Res. 37(1):398–415.LinkGoogle Scholar
  • Yu Y, Yang Y, Huang J, Tan Y (2023) Unifying algorithmic and theoretical perspectives: Emotions in online reviews and sales. MIS Quart. 47(1):127–160.CrossrefGoogle Scholar
  • Zhang S, Lee D, Singh PV, Srinivasan K (2022) What makes a good image? Airbnb demand analytics leveraging interpretable image features. Management Sci. 68(8):5644–5666.LinkGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.