What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning

Bowen Shi
Bowen Shi
[email protected]
School of Economics and Management, Tsinghua University, Beijing 100084, China
Search for more papers by this author
,
Xiaojie Mao
Corresponding Author
Xiaojie Mao
[email protected]
https://orcid.org/0000-0003-2985-1741
School of Economics and Management, Tsinghua University, Beijing 100084, China; and Research Center for Contemporary Management, Tsinghua University, Beijing 100084, China
Search for more papers by this author
,
Mochen Yang
Mochen Yang
[email protected]
https://orcid.org/0000-0001-5101-9041
Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455
Search for more papers by this author
,
Bo Li
Bo Li
[email protected]
https://orcid.org/0000-0001-5599-8857
School of Economics and Management, Tsinghua University, Beijing 100084, China
Search for more papers by this author

Bowen Shi

[email protected]

School of Economics and Management, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Xiaojie Mao

Corresponding Author

Xiaojie Mao

[email protected]

https://orcid.org/0000-0003-2985-1741

School of Economics and Management, Tsinghua University, Beijing 100084, China; and Research Center for Contemporary Management, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Mochen Yang

[email protected]

https://orcid.org/0000-0001-5101-9041

Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455

Search for more papers by this author

Bo Li

[email protected]

https://orcid.org/0000-0001-5599-8857

School of Economics and Management, Tsinghua University, Beijing 100084, China

Search for more papers by this author

Published Online:5 Dec 2025https://doi.org/10.1287/isre.2024.0888

References

Abadie A, Imbens GW (2011) Bias-corrected matching estimators for average treatment effects. J. Bus. Econom. Statist. 29(1):1–11.Crossref, Google Scholar
Abadie A, Imbens GW (2016) Matching on the estimated propensity score. Econometrica 84(2):781–807.Crossref, Google Scholar
Angrist JD, Pischke JS (2009) Mostly Harmless Econometrics: An Empiricist’s Companion (Princeton University Press, Princeton, NJ).Crossref, Google Scholar
Bach P, Chernozhukov V, Kurz MS, Spindler M (2022) DoubleML: An object-oriented implementation of double machine learning in python. J. Machine Learn. Res. 23(53):1–6.Google Scholar
Bach P, Schacht O, Chernozhukov V, Klaassen S, Spindler M (2024) Hyperparameter tuning for causal inference with double machine learning: A simulation study. Proc. 3rd Conf. Causal Learn. Reasoning (PMLR, New York), 1065–1117.Google Scholar
Belkin M, Hsu D, Ma S, Mandal S (2019) Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proc. Natl. Acad. Sci USA 116(32):15849–15854.Crossref, Google Scholar
Bickel PJ (1982) On adaptive estimation. Ann. Statist. 10(3):647–671.Crossref, Google Scholar
Breiman L, Friedman JH (1985) Estimating optimal transformations for multiple regression and correlation. J. Amer. Statist. Assoc. 80(391):580–598.Crossref, Google Scholar
Chen Q, Syrgkanis V, Austern M (2022) Debiased machine learning without sample-splitting for stable estimators. Adv. Neural Inform. Processing Systems 35:3096–3109.Google Scholar
Chernozhukov V, Chetverikov D, Demirer M, Duflo E, Hansen C, Newey W, Robins J (2018) Double/debiased machine learning for treatment and structural parameters. Econom. J. 21(1):C1–C68.Crossref, Google Scholar
Chiang HD, Kato K, Ma Y, Sasaki Y (2022) Multiway cluster robust double/debiased machine learning. J. Bus. Econom. Statist. 40(3):1046–1056. Crossref, Google Scholar
Cinelli C, Forney A, Pearl J (2024) A crash course in good and bad controls. Sociol. Methods Res. 53(3):1071–1104.Crossref, Google Scholar
Ding Z, Chen S, Li Q, Wright SJ (2022) Overparameterization of deep ResNet: Zero loss and mean-field analysis. J. Machine Learn. Res. 23(48):1–65.Google Scholar
Freedman DA, Berk RA (2008) Weighting regressions by propensity scores. Evaluation Rev. 32(4):392–409.Crossref, Google Scholar
Goh KY, Heng CS, Lin Z (2013) Social media brand community and consumer behavior: Quantifying the relative impact of user-and marketer-generated content. Inform. Systems Res. 24(1):88–107.Link, Google Scholar
Hansen B (2022) Econometrics (Princeton University Press, Princeton, NJ).Google Scholar
Härdle W, Liang H, Gao J (2007) Partially linear models. Statistical Methods for Biostatistics and Related Fields (Springer-Verlag, Berlin, Heidelberg), 87–103.Crossref, Google Scholar
Hastie T, Tibshirani R, Friedman JH, Friedman JH (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics, 2nd ed. (Springer, New York).Crossref, Google Scholar
Ibragimov I, Has’minskii RZ (1981) Statistical Estimation: Asymptotic Theory, Stochastic Modelling and Applied Probability, vol. 16 (Springer, New York).Crossref, Google Scholar
Imai K, Khanna K (2016) Improving ecological inference by predicting individual ethnicity from voter registration records. Political Anal. 24(2):263–272.Crossref, Google Scholar
Levit BY (1976) On the efficiency of a class of non-parametric estimates. Theory Probab. Appl. 20(4):723–740.Crossref, Google Scholar
Li Q, Racine JS (2007) Nonparametric Econometrics: Theory and Practice (Princeton University Press, Princeton, NJ).Google Scholar
Manzoor E, Chen GH, Lee D, Smith MD (2023) Influence via ethos: On the persuasive power of reputation in deliberation online. Management Sci. 70(3):1613–1634.Link, Google Scholar
Newey WK (1990) Semiparametric efficiency bounds. J. Appl. Econometrics 5(2):99–135.Crossref, Google Scholar
Robins J, Li L, Tchetgen E, van der Vaart A (2008) Higher order influence functions and minimax estimation of nonlinear functionals. Probability and Statistics: Essays in Honor of David A. Freedman, vol. 2 (Institute of Mathematical Statistics, Beachwood, OH), 335–422.Crossref, Google Scholar
Robinson PM (1988) Root-n-consistent semiparametric regression. Econometrica 56(4):931–954.Crossref, Google Scholar
Speckman P (1988) Kernel smoothing in partial linear models. J. Roy. Statist. Soc. Ser. B: Statist. Methodology 50(3):413–436.Crossref, Google Scholar
Tsiatis AA (2006) Semiparametric Theory and Missing Data (Springer, New York).Google Scholar
Van der Vaart A (1991) On differentiable functionals. Ann. Statist. 19(1):178–204.Google Scholar
Velez A (2024) On the asymptotic properties of debiased machine learning estimators. Preprint, submitted November 4, https://arxiv.org/abs/2411.01864.Google Scholar
Wasserman L (2006) All of Nonparametric Statistics, Springer Texts in Statistics (Springer, New York).Google Scholar
Wooldridge JM (2010) Econometric Analysis of Cross Section and Panel Data (MIT Press, Cambridge, MA).Google Scholar
Xie T, Ge Y, Kannan K (2025) Crypto airdrop success blueprint: A high-dimensional causal study using double machine learning. Preprint, submitted May 6, https://doi.org/10.2139/ssrn.5215263.Google Scholar
Xu Y, Ghose A, Xiao B (2024) Mobile payment adoption: An empirical investigation of Alipay. Inform. Systems Res. 35(2):807–828.Link, Google Scholar
Yang M, Ren Y, Adomavicius G (2019) Understanding user-generated content and customer engagement on Facebook business pages. Inform. Systems Res. 30(3):839–855.Link, Google Scholar
Zhang S, Lee D, Singh PV, Srinivasan K (2022) What makes a good image? Airbnb demand analytics leveraging interpretable image features. Management Sci. 68(8):5644–5666.Link, Google Scholar
Zheng W, van der Laan MJ (2011) Cross-validated targeted minimum-loss-based estimation. Targeted Learning: Causal Inference for Observational and Experimental Data, Springer Series in Statistics (Springer, New York), 459–474.Crossref, Google Scholar
Zhou M, Abhishek V, Kennedy EH, Srinivasan K, Sinha R (2024) Linking clicks to bricks: Understanding the effects of email advertising on multichannel sales. Inform. Systems Res. 36(1):225–238.Link, Google Scholar

cover image Information Systems Research

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:April 04, 2024
Accepted:November 04, 2025
Published Online:December 05, 2025

Cite as

Bowen Shi, Xiaojie Mao, Mochen Yang, Bo Li (2025) What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning. Information Systems Research 0(0).

https://doi.org/10.1287/isre.2024.0888

Keywords

Acknowledgments

The authors sincerely thank the senior editor, associate editor, and anonymous reviewers for their insights and suggestions, which have led to significant improvement of this paper.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

What, Why, and How: An Empiricist’s Guide to Double/Debiased Machine Learning

References

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News