What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning

Published Online:https://doi.org/10.1287/isre.2024.0888

References

  • Abadie A, Imbens GW (2011) Bias-corrected matching estimators for average treatment effects. J. Bus. Econom. Statist. 29(1):1–11.CrossrefGoogle Scholar
  • Abadie A, Imbens GW (2016) Matching on the estimated propensity score. Econometrica 84(2):781–807.CrossrefGoogle Scholar
  • Angrist JD, Pischke JS (2009) Mostly Harmless Econometrics: An Empiricist’s Companion (Princeton University Press, Princeton, NJ).CrossrefGoogle Scholar
  • Bach P, Chernozhukov V, Kurz MS, Spindler M (2022) DoubleML: An object-oriented implementation of double machine learning in python. J. Machine Learn. Res. 23(53):1–6.Google Scholar
  • Bach P, Schacht O, Chernozhukov V, Klaassen S, Spindler M (2024) Hyperparameter tuning for causal inference with double machine learning: A simulation study. Proc. 3rd Conf. Causal Learn. Reasoning (PMLR, New York), 1065–1117.Google Scholar
  • Belkin M, Hsu D, Ma S, Mandal S (2019) Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proc. Natl. Acad. Sci USA 116(32):15849–15854.CrossrefGoogle Scholar
  • Bickel PJ (1982) On adaptive estimation. Ann. Statist. 10(3):647–671.CrossrefGoogle Scholar
  • Breiman L, Friedman JH (1985) Estimating optimal transformations for multiple regression and correlation. J. Amer. Statist. Assoc. 80(391):580–598.CrossrefGoogle Scholar
  • Chen Q, Syrgkanis V, Austern M (2022) Debiased machine learning without sample-splitting for stable estimators. Adv. Neural Inform. Processing Systems 35:3096–3109.Google Scholar
  • Chernozhukov V, Chetverikov D, Demirer M, Duflo E, Hansen C, Newey W, Robins J (2018) Double/debiased machine learning for treatment and structural parameters. Econom. J. 21(1):C1–C68.CrossrefGoogle Scholar
  • Chiang HD, Kato K, Ma Y, Sasaki Y (2022) Multiway cluster robust double/debiased machine learning. J. Bus. Econom. Statist. 40(3):1046–1056. CrossrefGoogle Scholar
  • Cinelli C, Forney A, Pearl J (2024) A crash course in good and bad controls. Sociol. Methods Res. 53(3):1071–1104.CrossrefGoogle Scholar
  • Ding Z, Chen S, Li Q, Wright SJ (2022) Overparameterization of deep ResNet: Zero loss and mean-field analysis. J. Machine Learn. Res. 23(48):1–65.Google Scholar
  • Freedman DA, Berk RA (2008) Weighting regressions by propensity scores. Evaluation Rev. 32(4):392–409.CrossrefGoogle Scholar
  • Goh KY, Heng CS, Lin Z (2013) Social media brand community and consumer behavior: Quantifying the relative impact of user-and marketer-generated content. Inform. Systems Res. 24(1):88–107.LinkGoogle Scholar
  • Hansen B (2022) Econometrics (Princeton University Press, Princeton, NJ).Google Scholar
  • Härdle W, Liang H, Gao J (2007) Partially linear models. Statistical Methods for Biostatistics and Related Fields (Springer-Verlag, Berlin, Heidelberg), 87–103.CrossrefGoogle Scholar
  • Hastie T, Tibshirani R, Friedman JH, Friedman JH (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics, 2nd ed. (Springer, New York).CrossrefGoogle Scholar
  • Ibragimov I, Has’minskii RZ (1981) Statistical Estimation: Asymptotic Theory, Stochastic Modelling and Applied Probability, vol. 16 (Springer, New York).CrossrefGoogle Scholar
  • Imai K, Khanna K (2016) Improving ecological inference by predicting individual ethnicity from voter registration records. Political Anal. 24(2):263–272.CrossrefGoogle Scholar
  • Levit BY (1976) On the efficiency of a class of non-parametric estimates. Theory Probab. Appl. 20(4):723–740.CrossrefGoogle Scholar
  • Li Q, Racine JS (2007) Nonparametric Econometrics: Theory and Practice (Princeton University Press, Princeton, NJ).Google Scholar
  • Manzoor E, Chen GH, Lee D, Smith MD (2023) Influence via ethos: On the persuasive power of reputation in deliberation online. Management Sci. 70(3):1613–1634.LinkGoogle Scholar
  • Newey WK (1990) Semiparametric efficiency bounds. J. Appl. Econometrics 5(2):99–135.CrossrefGoogle Scholar
  • Robins J, Li L, Tchetgen E, van der Vaart A (2008) Higher order influence functions and minimax estimation of nonlinear functionals. Probability and Statistics: Essays in Honor of David A. Freedman, vol. 2 (Institute of Mathematical Statistics, Beachwood, OH), 335–422.CrossrefGoogle Scholar
  • Robinson PM (1988) Root-n-consistent semiparametric regression. Econometrica 56(4):931–954.CrossrefGoogle Scholar
  • Speckman P (1988) Kernel smoothing in partial linear models. J. Roy. Statist. Soc. Ser. B: Statist. Methodology 50(3):413–436.CrossrefGoogle Scholar
  • Tsiatis AA (2006) Semiparametric Theory and Missing Data (Springer, New York).Google Scholar
  • Van der Vaart A (1991) On differentiable functionals. Ann. Statist. 19(1):178–204.Google Scholar
  • Velez A (2024) On the asymptotic properties of debiased machine learning estimators. Preprint, submitted November 4, https://arxiv.org/abs/2411.01864.Google Scholar
  • Wasserman L (2006) All of Nonparametric Statistics, Springer Texts in Statistics (Springer, New York).Google Scholar
  • Wooldridge JM (2010) Econometric Analysis of Cross Section and Panel Data (MIT Press, Cambridge, MA).Google Scholar
  • Xie T, Ge Y, Kannan K (2025) Crypto airdrop success blueprint: A high-dimensional causal study using double machine learning. Preprint, submitted May 6, https://doi.org/10.2139/ssrn.5215263.Google Scholar
  • Xu Y, Ghose A, Xiao B (2024) Mobile payment adoption: An empirical investigation of Alipay. Inform. Systems Res. 35(2):807–828.LinkGoogle Scholar
  • Yang M, Ren Y, Adomavicius G (2019) Understanding user-generated content and customer engagement on Facebook business pages. Inform. Systems Res. 30(3):839–855.LinkGoogle Scholar
  • Zhang S, Lee D, Singh PV, Srinivasan K (2022) What makes a good image? Airbnb demand analytics leveraging interpretable image features. Management Sci. 68(8):5644–5666.LinkGoogle Scholar
  • Zheng W, van der Laan MJ (2011) Cross-validated targeted minimum-loss-based estimation. Targeted Learning: Causal Inference for Observational and Experimental Data, Springer Series in Statistics (Springer, New York), 459–474.CrossrefGoogle Scholar
  • Zhou M, Abhishek V, Kennedy EH, Srinivasan K, Sinha R (2024) Linking clicks to bricks: Understanding the effects of email advertising on multichannel sales. Inform. Systems Res. 36(1):225–238.LinkGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.