False Discovery in A/B Testing

Ron Berman
Ron Berman
[email protected]
https://orcid.org/0000-0002-8594-3627
Marketing, The Wharton School of the University of Pennsylvania, Philadelphia, Pennsylvania 19104
Search for more papers by this author
,
Christophe Van den Bulte
Christophe Van den Bulte
[email protected]
https://orcid.org/0000-0001-9708-1596
Marketing, The Wharton School of the University of Pennsylvania, Philadelphia, Pennsylvania 19104
Search for more papers by this author

Marketing, The Wharton School of the University of Pennsylvania, Philadelphia, Pennsylvania 19104

Search for more papers by this author

Christophe Van den Bulte

[email protected]

https://orcid.org/0000-0001-9708-1596

Marketing, The Wharton School of the University of Pennsylvania, Philadelphia, Pennsylvania 19104

Search for more papers by this author

Published Online:30 Dec 2021https://doi.org/10.1287/mnsc.2021.4207

Abstract

We investigate what fraction of all significant results in website A/B testing is actually null effects (i.e., the false discovery rate (FDR)). Our data consist of 4,964 effects from 2,766 experiments conducted on a commercial A/B testing platform. Using three different methods, we find that the FDR ranges between 28% and 37% for tests conducted at 10% significance and between 18% and 25% for tests at 5% significance (two sided). These high FDRs stem mostly from the high fraction of true null effects, about 70%, rather than from low power. Using our estimates, we also assess the potential of various A/B test designs to reduce the FDR. The two main implications are that decision makers should expect one in five interventions achieving significance at 5% confidence to be ineffective when deployed in the field and that analysts should consider using two-stage designs with multiple variations rather than basic A/B tests.

This paper was accepted by Eric Anderson, marketing.

Volume 68, Issue 9

September 2022

Pages 6355-7064, iii-iv

Article Information

Supplemental Material

Metrics

Information

Received:March 10, 2020
Accepted:June 02, 2021
Published Online:December 30, 2021

Cite as

Ron Berman, Christophe Van den Bulte (2022) False Discovery in A/B Testing. Management Science 68(9):6762-6782.

https://doi.org/10.1287/mnsc.2021.4207

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

False Discovery in A/B Testing

Abstract

Volume 68, Issue 9

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News