Rate-Optimal Online Learning for Dynamic Assortment Selection with Positioning

Yiyun Luo
Yiyun Luo
[email protected]
https://orcid.org/0009-0004-3788-1902
School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433, China
Search for more papers by this author
,
Will Wei Sun
Will Wei Sun
[email protected]
https://orcid.org/0000-0002-8412-6430
Daniels School of Business, Purdue University, West Lafayette, Indiana 47907
Search for more papers by this author
,
Yufeng Liu
Corresponding Author
Yufeng Liu
[email protected]
https://orcid.org/0000-0002-1686-0545
Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
Search for more papers by this author

School of Statistics and Data Science, Shanghai University of Finance and Economics, Shanghai 200433, China

Search for more papers by this author

Will Wei Sun

[email protected]

https://orcid.org/0000-0002-8412-6430

Daniels School of Business, Purdue University, West Lafayette, Indiana 47907

Search for more papers by this author

Yufeng Liu

Corresponding Author

Yufeng Liu

[email protected]

https://orcid.org/0000-0002-1686-0545

Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599

Search for more papers by this author

Published Online:11 Aug 2025https://doi.org/10.1287/opre.2024.1556

Abstract

In online retailing, the seller aims to offer assortment of items with maximized revenue. We introduce a new online learning problem called dynamic assortment selection with positioning (DAP) that additionally learns the optimal positioning within the assortment. Specifically, the customers make purchases based on the item attractiveness as the product of the position effect and unknown preference parameter through a multinomial logit choice model. We first demonstrate that any assortment-only algorithm that neglects position effects results in linear regrets. To address this gap, we propose the truncated linear regression upper confidence bound (TLR-UCB) policy. TLR-UCB utilizes a novel geometric linear bandit–type feedback structure for UCB construction under random and adaptive position effects. In addition, TLR-UCB conducts well-designed truncations before applying linear regression to handle conditional geometric responses. In theory, we establish a regret upper bound of $\tilde{O} (T^{1 / 2})$ for TLR-UCB, matching our derived $Ω (T^{1 / 2})$ lower bound. Moreover, we develop an explore-in-TLR-UCB (EI-TLR) policy to tackle unknown position effects. It first conducts a joint learning procedure to estimate unknown preferences and position effects, and then implements a generalized TLR-UCB procedure driven by estimated position effects. Extensive experiments demonstrate the superior performance of TLR-UCB and EI-TLR over other benchmark policies.

Funding: This research was partially supported by the National Science Foundation [Grant NSF-SES 2217440].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2024.1556.

Volume 74, Issue 1

January-February 2026

Pages iii-vii, 1-571, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:March 04, 2024
Accepted:June 23, 2025
Published Online:August 11, 2025

Cite as

Yiyun Luo, Will Wei Sun, Yufeng Liu (2025) Rate-Optimal Online Learning for Dynamic Assortment Selection with Positioning. Operations Research 74(1):224-242.

https://doi.org/10.1287/opre.2024.1556

Keywords

Acknowledgments

The authors thank the editor-in-chief (Amy R. Ward) and area editor (Xi Chen) for guidance and oversight throughout the review process and the associate editor and anonymous reviewers for insightful comments and constructive suggestions. The code and data to support the numerical experiments in this paper can be found at https://github.com/yiyun851/Assortment-Positioning/blob/main/Code_Data.zip.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Rate-Optimal Online Learning for Dynamic Assortment Selection with Positioning

Abstract

Volume 74, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News