Adaptive Learning in Uncertain and Sequential Competition

Shukai Li
Shukai Li
[email protected]
https://orcid.org/0000-0003-3540-5035
New York University Shanghai, Shanghai 200124, China
Search for more papers by this author
,
Sanjay Mehrotra
Corresponding Author
Sanjay Mehrotra
[email protected]
https://orcid.org/0000-0003-1106-1901
Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, Illinois 60208
Search for more papers by this author

New York University Shanghai, Shanghai 200124, China

Search for more papers by this author

Sanjay Mehrotra

Corresponding Author

Sanjay Mehrotra

[email protected]

https://orcid.org/0000-0003-1106-1901

Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, Illinois 60208

Search for more papers by this author

Published Online:4 Nov 2025https://doi.org/10.1287/opre.2024.0825

Abstract

We investigate an individual’s decision-making problem in a competitive and uncertain environment, where N learners (decision makers) confront unknown objective functions, lack competitor data, and optimize actions over a finite horizon of T epochs. Within a general framework, we explore what conditions ensure good performance of learning policies solely based on individual data. We show that when learner objective functions exhibit a tatônnement stability property and individual data are informative regarding the learner’s best response to competitor actions, individual data alone are sufficient for designing a learning policy that, when employed by all learners, leads to Nash equilibrium. Specifically, under our learning policy, the worst-off learners within each epoch make progress toward Nash equilibrium. The convergence rate is $O (1 / T)$ under noise-free feedback and $O (T^{- 1 / 3} \log T)$ under noisy feedback, with constants independent of N. Simultaneously, each learner attains sublinear regret relative to a dynamic benchmark: $O (\log T)$ under noise-free feedback and $O (T^{2 / 3} \log T)$ under noisy feedback. We illustrate our informative individual data conditions and learning policy using applications from a repeated newsvendor-type competition with demand substitution and a multiseller multiproduct repeated price competition.

Funding: The work of the authors was supported by the National Science Foundation [grant CMMI-1763035].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2024.0825.

Volume 74, Issue 1

January-February 2026

Pages iii-vii, 1-571, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:February 18, 2024
Accepted:July 18, 2025
Published Online:November 04, 2025

Cite as

Shukai Li, Sanjay Mehrotra (2025) Adaptive Learning in Uncertain and Sequential Competition. Operations Research 74(1):301-338.

https://doi.org/10.1287/opre.2024.0825

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Adaptive Learning in Uncertain and Sequential Competition

Abstract

Volume 74, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News