Bid Shading in First-Price Auction: Nonstationary Bayesian Multiarmed Bandit Methods for Real-Time Bidding

Mengzhuo Guo
https://orcid.org/0000-0003-3559-733X
Business School, Sichuan University, Chengdu, Sichuan Province 610065, China
Wuqi Zhang
https://orcid.org/0000-0003-4236-1827
Tencent, Nanshan District, Shenzhen, Guangdong Province 518054, China
Yiwen Shen
https://orcid.org/0000-0002-9170-9044
School of Business and Management, Hong Kong University of Science and Technology, Kowloon, Hong Kong 999077, China
Qingpeng Zhang
Corresponding Author
https://orcid.org/0000-0002-6819-0686
Musketeers Foundation Institute of Data Science, The University of Hong Kong, Hong Kong, China; and Department of Pharmacology and Pharmacy, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China; and HKU Shanghai Intelligent Computing Research Center, Shanghai, China; and Shenzhen Loop Area Institute, Shenzhen, China

Published Online: 1 Jun 2026
https://doi.org/10.1287/isre.2025.1837

Abstract

In real-time bidding systems, ad exchanges and supply-side platforms are switching from the second-price auction (SPA) to the first-price auction (FPA), in which advertisers pay the amount they bid if they win the auction. To mitigate the risk of overpaying, advertisers employ bid shading strategies that adjust their bids below their true valuations. Such strategies balance the trade-off between maximizing the probability of winning and minimizing costs, and they adopt the simplifying assumption that the market price distribution is stationary over time. However, the real-world market price distribution is inherently nonstationary, and current bidding strategies might fail under such conditions, especially because advertisers lack visibility into competitors’ bids. Therefore, we propose two complementary Bayesian multiarmed bandit (MAB) methods for nonstationary bid shading, namely, BayesMAB-NS and BayesMAB-CD. Our methods incorporate dependencies among arms, enabling the outcome of one bid to inform the rewards and selection criteria for others while progressively refining the market price distribution through Bayesian updates. BayesMAB-NS employs predefined segmentation and time discounting to adapt to evolving environments when prior structural knowledge about market dynamics is available. By contrast, BayesMAB-CD introduces adaptive change-point detection and soft posterior resets to track unknown changes in the market price distribution automatically. Empirical evaluations using simulated data, real-world offline data sets, and online replay demonstrate strong performance of the BayesMAB family over nonstationary MAB baselines. In addition, the performance of BayesMAB and BayesMAB-NS is further validated in large-scale online A/B tests on a large Chinese online display advertising platform. The online results show reductions in cost per mille and cost per action by up to 18.68% and 17.71%, respectively, along with a 17.78% increase in return on investment without compromising winning rates. Our methods have been deployed online and are used in practice to handle large volumes of traffic daily.
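
To make the ideas in the abstract concrete, the following Python sketch illustrates three of them on a toy discrete bid grid: the first-price trade-off between winning probability and cost, dependencies among arms, and time discounting. It is a minimal illustration under stated assumptions and not the authors' BayesMAB-NS or BayesMAB-CD. The class name `DiscountedShadingBandit`, the per-arm Beta posteriors, the Thompson-sampling selection rule, and the discount factor `gamma` are all hypothetical choices made here for illustration. The only structural assumption taken from the first-price setting is that a win at bid b implies a win at every higher bid, and a loss at bid b implies a loss at every lower bid.

```python
import random


def expected_surplus(value, bid, win_prob):
    """First-price trade-off: the margin (value - bid) is earned only on a win."""
    return (value - bid) * win_prob


class DiscountedShadingBandit:
    """Illustrative sketch only (not the paper's algorithm).

    Each bid level is an arm with a Beta posterior over its win probability.
    Pseudo-counts are discounted every round so old evidence fades, which is
    one simple way to cope with a drifting market price distribution.
    """

    def __init__(self, bids, gamma=0.99):
        self.bids = sorted(bids)
        self.gamma = gamma  # hypothetical discount factor in (0, 1]
        self.wins = [0.0] * len(self.bids)
        self.losses = [0.0] * len(self.bids)

    def choose(self, value, rng):
        """Thompson sampling: draw a win probability per arm and pick the
        arm with the largest sampled surplus."""
        best_arm, best_score = 0, float("-inf")
        for i, bid in enumerate(self.bids):
            p = rng.betavariate(1.0 + self.wins[i], 1.0 + self.losses[i])
            score = expected_surplus(value, bid, p)
            if score > best_score:
                best_arm, best_score = i, score
        return best_arm

    def update(self, arm, won):
        """Discount all counts, then share the outcome across dependent arms."""
        for i in range(len(self.bids)):
            self.wins[i] *= self.gamma
            self.losses[i] *= self.gamma
        if won:
            # A win at this bid implies every higher bid would also have won.
            for i in range(arm, len(self.bids)):
                self.wins[i] += 1.0
        else:
            # A loss at this bid implies every lower bid would also have lost.
            for i in range(arm + 1):
                self.losses[i] += 1.0
```

In a simulation loop, one would call `choose` with the impression's valuation and a `random.Random` instance, submit `bids[arm]`, and pass the win or loss outcome to `update`. Setting `gamma` closer to 1 retains more history, while smaller values react faster to change at the cost of noisier estimates. A change-point variant in the spirit of BayesMAB-CD would replace the fixed discount with a detector that shrinks the counts when a shift is flagged; that part is not sketched here.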

History: Martin Bichler, Senior Editor; Mochen Yang, Associate Editor.

Funding: This research was supported by the National Natural Science Foundation of China (NSFC) [Grant 72401210], the China Postdoctoral Science Foundation funded project [Grant 2024M752282], the Natural Science Foundation of Sichuan Province [Grant 2025NSFSC1998], the Hong Kong University of Science and Technology [Grant B000-0172-R9281], the General Research Fund of the Research Grants Council of Hong Kong [Grant 17209225], the seed grant of the HKU Shanghai Intelligent Computing Research Center, and the seed grant of Shenzhen Loop Area Institute.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/isre.2025.1837.

Information Systems Research, Articles in Advance


Information

Received: January 22, 2025
Accepted: April 10, 2026
Published Online: June 01, 2026

Cite as

Mengzhuo Guo, Wuqi Zhang, Yiwen Shen, Qingpeng Zhang (2026) Bid Shading in First-Price Auction: Nonstationary Bayesian Multiarmed Bandit Methods for Real-Time Bidding. Information Systems Research 0(0).

https://doi.org/10.1287/isre.2025.1837
