Beyond $O (\sqrt{T})$ Regret: Decoupling Learning and Decision Making in Online Linear Programming

Wenzhi Gao
Wenzhi Gao
[email protected]
https://orcid.org/0000-0003-0167-7868
Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305
Search for more papers by this author
,
Dongdong Ge
Corresponding Author
Dongdong Ge
[email protected]
https://orcid.org/0009-0006-9328-527X
Antai College of Economics and Management, Shanghai Jiao Tong University, Shanghai 200030, China; and Shanghai Institute for Mathematics and Interdisciplinary Sciences, Shanghai 200433, China
Search for more papers by this author
,
Chunlin Sun
Chunlin Sun
[email protected]
https://orcid.org/0000-0001-7389-4808
Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305
Search for more papers by this author
,
Chenyu Xue
Corresponding Author
Chenyu Xue
[email protected]
https://orcid.org/0009-0004-6695-8147
School of Business, East China University of Science and Technology, Shanghai 200237, China
Search for more papers by this author
,
Yinyu Ye
Yinyu Ye
[email protected]
https://orcid.org/0009-0001-3239-2622
Antai College of Economics and Management, Shanghai Jiao Tong University, Shanghai 200030, China; and Shanghai Institute for Mathematics and Interdisciplinary Sciences, Shanghai 200433, China
Search for more papers by this author

Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305

Corresponding Author

Dongdong Ge

Antai College of Economics and Management, Shanghai Jiao Tong University, Shanghai 200030, China; and Shanghai Institute for Mathematics and Interdisciplinary Sciences, Shanghai 200433, China

Search for more papers by this author

Chunlin Sun

[email protected]

https://orcid.org/0000-0001-7389-4808

Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305

Search for more papers by this author

Chenyu Xue

Corresponding Author

Chenyu Xue

[email protected]

https://orcid.org/0009-0004-6695-8147

School of Business, East China University of Science and Technology, Shanghai 200237, China

Search for more papers by this author

Yinyu Ye

[email protected]

https://orcid.org/0009-0001-3239-2622

Antai College of Economics and Management, Shanghai Jiao Tong University, Shanghai 200030, China; and Shanghai Institute for Mathematics and Interdisciplinary Sciences, Shanghai 200433, China

Search for more papers by this author

Published Online:2 Apr 2026https://doi.org/10.1287/opre.2024.1575

Abstract

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of first-order methods, they typically achieve a regret no better than $O (\sqrt{T})$ , which is suboptimal compared with the $O (\log T)$ bound guaranteed by the state-of-the-art linear programming (LP)-based online algorithms. This paper establishes a general framework that improves on the $O (\sqrt{T})$ result when the LP dual problem exhibits certain error bound conditions. For the first time, we show that first-order learning algorithms achieve $o (\sqrt{T})$ regret in the continuous support setting and $O (\log T)$ regret in the finite support setting beyond the nondegeneracy assumption. Our results significantly improve the state-of-the-art regret results and provide new insights for sequential decision making.

Funding: This research was supported by the National Natural Science Foundation of China [Grants 72225009, 72394360, and 72394365].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2024.1575.

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:December 26, 2024
Accepted:January 28, 2026
Published Online:April 02, 2026

Cite as

Wenzhi Gao, Dongdong Ge, Chunlin Sun, Chenyu Xue, Yinyu Ye (2026) Beyond

O (\sqrt{T})

Regret: Decoupling Learning and Decision Making in Online Linear Programming. Operations Research 0(0).

https://doi.org/10.1287/opre.2024.1575

Keywords

Acknowledgments

The authors thank the area editor, associate editor, and two anonymous reviewers for insightful comments and constructive suggestions that significantly improved this work.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Beyond $O (\sqrt{T})$ Regret: Decoupling Learning and Decision Making in Online Linear Programming

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News

Available Issues

Available Issues

Beyond O(T) Regret: Decoupling Learning and Decision Making in Online Linear Programming

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Beyond $O (\sqrt{T})$ Regret: Decoupling Learning and Decision Making in Online Linear Programming