Artificial Collusion: Examining Supracompetitive Pricing by Q-Learning Algorithms

Arnoud den Boer
Arnoud den Boer
[email protected]
Korteweg-de Vries Instituut, University of Amsterdam, 1090 GE Amsterdam, Netherlands
Search for more papers by this author
,
Janusz M. Meylahn
Janusz M. Meylahn
[email protected]
https://orcid.org/0000-0002-4388-805X
Department of Applied Mathematics, University of Twente, 7522 NB Enschede, Netherlands
Search for more papers by this author
,
Maarten Pieter Schinkel
Corresponding Author
Maarten Pieter Schinkel
[email protected]
https://orcid.org/0000-0001-7774-1315
Department of Economics and Tinbergen Institute, University of Amsterdam, 1018WB Amsterdam, Netherlands
Search for more papers by this author

Arnoud den Boer

[email protected]

Korteweg-de Vries Instituut, University of Amsterdam, 1090 GE Amsterdam, Netherlands

Search for more papers by this author

Janusz M. Meylahn

[email protected]

https://orcid.org/0000-0002-4388-805X

Department of Applied Mathematics, University of Twente, 7522 NB Enschede, Netherlands

Search for more papers by this author

Maarten Pieter Schinkel

Corresponding Author

Maarten Pieter Schinkel

[email protected]

https://orcid.org/0000-0001-7774-1315

Department of Economics and Tinbergen Institute, University of Amsterdam, 1018WB Amsterdam, Netherlands

Search for more papers by this author

Published Online:9 Jun 2026https://doi.org/10.1287/mnsc.2024.08557

Abstract

We examine concerns that pricing algorithms, employing reinforcement learning, used by competitors would autonomously and systematically learn to collude. Findings of supracompetitive prices with Q-learning have recently raised that alarm. However, a detailed analysis of the inner workings of this algorithm type reveals that it often does not satisfy conditions for what constitutes “autonomous” algorithmic collusion that would be a cartel risk in practice. We find that Q-learning can learn collusive equilibria only on timescales irrelevant to the firm’s objective. Competitors are committed to using the same Q-learning algorithm, starting at the same moment, with the same hyperparameters and action spaces, although it is outperformed by the first alternative pricing rule. This level of synchronization suggests the need for an explicit cartel agreement. Our analysis gives criteria for practically relevant, explicitly and tacitly colluding pricing algorithms that would constitute a threat to competition. Whether autonomous algorithmic collusion is a potential threat to competition remains to be seen. There is not yet reason for competition agencies to be overly suspicious of pricing algorithms, other than of “collusion by algorithm,” in which pricing software is used to implement cartel agreements or is coded with collusive intent.

This paper was accepted by Martin Bichler, market design, platform, and demand analytics.

Supplemental Material: The data files are available at https://doi.org/10.1287/mnsc.2024.08557.

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:November 15, 2024
Accepted:January 30, 2026
Published Online:June 09, 2026

Cite as

Arnoud den Boer, Janusz M. Meylahn, Maarten Pieter Schinkel (2026) Artificial Collusion: Examining Supracompetitive Pricing by Q-Learning Algorithms. Management Science 0(0).

https://doi.org/10.1287/mnsc.2024.08557

Keywords

Acknowledgments

The authors thank Ibrahim Abada, Ali Aouad, John Asker, Martin Bichler (the editor), Giacomo Calzolari, Vincenzo Denicolò, Sylvian Chassang, Joe Harrington, Justin Johnson, Timo Klein, Xavier Lambin, Steve Tadelis, Ulrich Schwalbe, and Rein Wesseling, as well as the anonymous associate editor and numerous reviewers, for discussion and comments that helped to improve upon earlier versions of this paper. The authors also benefitted from presentations of this paper and comments by various audiences, including those at the Chinese University of Hong Kong, Hong Kong University of Science and Technology, Chinese University of Hong Kong in Shenzhen, Hong Kong University, National University of Singapore, Maastricht University, Universität Zürich, Paris Nanterre University, Oxford University, Stellenbosch University, Het Nederlands Mathematisch Congres, the Organisation for Economic Co-operation and Development, and Jiangxi University.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Artificial Collusion: Examining Supracompetitive Pricing by Q-Learning Algorithms

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News