Network Revenue Management with Nonparametric Demand Learning: $\sqrt{T}$ -Regret and Polynomial Dimension Dependency

Sentao Miao
Sentao Miao
[email protected]
Leeds School of Business, University of Colorado at Boulder, Boulder, Colorado 80309
Search for more papers by this author
,
Yining Wang
Corresponding Author
Yining Wang
[email protected]
https://orcid.org/0000-0001-9410-0392
Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080
Search for more papers by this author

Sentao Miao

[email protected]

Leeds School of Business, University of Colorado at Boulder, Boulder, Colorado 80309

Search for more papers by this author

Yining Wang

Corresponding Author

Yining Wang

[email protected]

https://orcid.org/0000-0001-9410-0392

Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080

Search for more papers by this author

Published Online:10 Oct 2025https://doi.org/10.1287/moor.2022.0086

Abstract

This paper studies the classic price-based network revenue management (NRM) problem with demand learning. The retailer dynamically decides prices of n products over a finite selling season (of length T) subject to m resource constraints, with the purpose of maximizing the cumulative revenue. In this paper, we focus on a nonparametric demand model with some mild technical assumptions which are satisfied by most of the commonly used demand functions. We propose a robust ellipsoid method adapted to the NRM setting in a nontrivial manner. This is the first result which achieves the regret of the form $O (poly (n, m, \ln (T)) \sqrt{T})$ (where $poly (n, m, \ln (T))$ is a polynomial function of $n, m, \ln (T)$ ) in the current literature on the nonparametric NRM problem.

Funding: S. Miao gratefully acknowledges financial support provided by the Ruegg Family Scholar and the Leeds School of Business.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/moor.2022.0086.

cover image Mathematics of Operations Research

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:March 28, 2022
Accepted:August 01, 2025
Published Online:October 10, 2025

Cite as

Sentao Miao, Yining Wang (2025) Network Revenue Management with Nonparametric Demand Learning:

\sqrt{T}

-Regret and Polynomial Dimension Dependency. Mathematics of Operations Research 0(0).

https://doi.org/10.1287/moor.2022.0086

Keywords

Acknowledgments

Author names are listed in alphabetical order.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Network Revenue Management with Nonparametric Demand Learning: $\sqrt{T}$ -Regret and Polynomial Dimension Dependency

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News

Available Issues

Available Issues

Network Revenue Management with Nonparametric Demand Learning: T-Regret and Polynomial Dimension Dependency

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Network Revenue Management with Nonparametric Demand Learning: $\sqrt{T}$ -Regret and Polynomial Dimension Dependency