Debiasing ML- or AI-Generated Regressors in Partially Linear Models

Published Online:https://doi.org/10.1287/isre.2024.1370

Information systems (IS) researchers are increasingly leveraging machine learning (ML) or artificial intelligence (AI) technologies to predict feature variables from data and use them as regressors in subsequent econometric models. However, because ML/AI predictions are imperfect, these generated regressors would inevitably contain measurement errors. The direct use of such regressors in subsequent econometric models can result in biased estimation, ultimately leading to inaccurate conclusions. In light of this, we examine the problem of debiasing ML/AI-generated regressors in partially linear regression models. We propose estimators that utilize two-stage least squares (TSLS) and generalized method of moments (GMM) under the double machine learning (DML) framework. We demonstrate the asymptotic consistency and normality of our estimators. Moreover, we conduct extensive Monte Carlo simulations and empirical applications to show the outperformance of our estimators compared with other methods. Our work advances causal inference in addressing measurement error problems arising from ML/AI-generated regressors in partially linear models and hence provides valuable practical implications for designing experimental systems and overcoming ML/AI bias.

History: Ahmed Abbasi, Senior Editor; Gordon Burtch, Associate Editor.

Supplemental Material: The electronic companion is available at https://doi.org/10.1287/isre.2024.1370.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.