Debiasing ML- or AI-Generated Regressors in Partially Linear Models
Published Online: 27 May 2026
https://doi.org/10.1287/isre.2024.1370

