What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning
References
- (2011) Bias-corrected matching estimators for average treatment effects. J. Bus. Econom. Statist. 29(1):1–11.Crossref, Google Scholar
- (2016) Matching on the estimated propensity score. Econometrica 84(2):781–807.Crossref, Google Scholar
- (2009) Mostly Harmless Econometrics: An Empiricist’s Companion (Princeton University Press, Princeton, NJ).Crossref, Google Scholar
- (2022) DoubleML: An object-oriented implementation of double machine learning in python. J. Machine Learn. Res. 23(53):1–6.Google Scholar
- (2024) Hyperparameter tuning for causal inference with double machine learning: A simulation study. Proc. 3rd Conf. Causal Learn. Reasoning (PMLR, New York), 1065–1117.Google Scholar
- (2019) Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proc. Natl. Acad. Sci USA 116(32):15849–15854.Crossref, Google Scholar
- (1982) On adaptive estimation. Ann. Statist. 10(3):647–671.Crossref, Google Scholar
- (1985) Estimating optimal transformations for multiple regression and correlation. J. Amer. Statist. Assoc. 80(391):580–598.Crossref, Google Scholar
- (2022) Debiased machine learning without sample-splitting for stable estimators. Adv. Neural Inform. Processing Systems 35:3096–3109.Google Scholar
- (2018) Double/debiased machine learning for treatment and structural parameters. Econom. J. 21(1):C1–C68.Crossref, Google Scholar
- (2022) Multiway cluster robust double/debiased machine learning. J. Bus. Econom. Statist. 40(3):1046–1056. Crossref, Google Scholar
- (2024) A crash course in good and bad controls. Sociol. Methods Res. 53(3):1071–1104.Crossref, Google Scholar
- (2022) Overparameterization of deep ResNet: Zero loss and mean-field analysis. J. Machine Learn. Res. 23(48):1–65.Google Scholar
- (2008) Weighting regressions by propensity scores. Evaluation Rev. 32(4):392–409.Crossref, Google Scholar
- (2013) Social media brand community and consumer behavior: Quantifying the relative impact of user-and marketer-generated content. Inform. Systems Res. 24(1):88–107.Link, Google Scholar
- (2022) Econometrics (Princeton University Press, Princeton, NJ).Google Scholar
- (2007) Partially linear models. Statistical Methods for Biostatistics and Related Fields (Springer-Verlag, Berlin, Heidelberg), 87–103.Crossref, Google Scholar
- (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics, 2nd ed. (Springer, New York).Crossref, Google Scholar
- (1981) Statistical Estimation: Asymptotic Theory, Stochastic Modelling and Applied Probability, vol. 16 (Springer, New York).Crossref, Google Scholar
- (2016) Improving ecological inference by predicting individual ethnicity from voter registration records. Political Anal. 24(2):263–272.Crossref, Google Scholar
- (1976) On the efficiency of a class of non-parametric estimates. Theory Probab. Appl. 20(4):723–740.Crossref, Google Scholar
- (2007) Nonparametric Econometrics: Theory and Practice (Princeton University Press, Princeton, NJ).Google Scholar
- (2023) Influence via ethos: On the persuasive power of reputation in deliberation online. Management Sci. 70(3):1613–1634.Link, Google Scholar
- (1990) Semiparametric efficiency bounds. J. Appl. Econometrics 5(2):99–135.Crossref, Google Scholar
- (2008) Higher order influence functions and minimax estimation of nonlinear functionals. Probability and Statistics: Essays in Honor of David A. Freedman, vol. 2 (Institute of Mathematical Statistics, Beachwood, OH), 335–422.Crossref, Google Scholar
- (1988) Root-n-consistent semiparametric regression. Econometrica 56(4):931–954.Crossref, Google Scholar
- (1988) Kernel smoothing in partial linear models. J. Roy. Statist. Soc. Ser. B: Statist. Methodology 50(3):413–436.Crossref, Google Scholar
- (2006) Semiparametric Theory and Missing Data (Springer, New York).Google Scholar
- (1991) On differentiable functionals. Ann. Statist. 19(1):178–204.Google Scholar
- (2024) On the asymptotic properties of debiased machine learning estimators. Preprint, submitted November 4, https://arxiv.org/abs/2411.01864.Google Scholar
- (2006) All of Nonparametric Statistics, Springer Texts in Statistics (Springer, New York).Google Scholar
- (2010) Econometric Analysis of Cross Section and Panel Data (MIT Press, Cambridge, MA).Google Scholar
- (2025) Crypto airdrop success blueprint: A high-dimensional causal study using double machine learning. Preprint, submitted May 6, https://doi.org/10.2139/ssrn.5215263.Google Scholar
- (2024) Mobile payment adoption: An empirical investigation of Alipay. Inform. Systems Res. 35(2):807–828.Link, Google Scholar
- (2019) Understanding user-generated content and customer engagement on Facebook business pages. Inform. Systems Res. 30(3):839–855.Link, Google Scholar
- (2022) What makes a good image? Airbnb demand analytics leveraging interpretable image features. Management Sci. 68(8):5644–5666.Link, Google Scholar
- (2011) Cross-validated targeted minimum-loss-based estimation. Targeted Learning: Causal Inference for Observational and Experimental Data, Springer Series in Statistics (Springer, New York), 459–474.Crossref, Google Scholar
- (2024) Linking clicks to bricks: Understanding the effects of email advertising on multichannel sales. Inform. Systems Res. 36(1):225–238.Link, Google Scholar

