Dual-Directed Algorithm Design for Efficient Pure Exploration

Chao Qin
[email protected]
https://orcid.org/0000-0002-4600-6147
Stanford Graduate School of Business, Stanford University, Stanford, California 94305
Wei You
Corresponding Author
[email protected]
https://orcid.org/0000-0003-0844-4194
Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China

Published Online: 15 Jul 2025
https://doi.org/10.1287/opre.2023.0590

Abstract

Although experimental design often focuses on selecting the single best alternative from a finite set (e.g., in ranking and selection or best-arm identification), many pure-exploration problems pursue richer goals. Given a specific goal, adaptive experimentation aims to achieve it by strategically allocating sampling effort, with the underlying sample complexity characterized by a maximin optimization problem. By introducing dual variables, we derive necessary and sufficient conditions for an optimal allocation, yielding a unified algorithm design principle that extends the top-two approach beyond best-arm identification. This principle gives rise to information-directed selection, a hyperparameter-free rule that dynamically evaluates and chooses among candidates based on their current informational value. We prove that, when combined with information-directed selection, top-two Thompson sampling attains asymptotic optimality for Gaussian best-arm identification, resolving a notable open question in the pure-exploration literature. Furthermore, our framework produces asymptotically optimal algorithms for pure-exploration thresholding bandits and $\varepsilon$-best-arm identification (i.e., ranking and selection with probability-of-good-selection guarantees), and more generally establishes a recipe for adapting Thompson sampling across a broad class of pure-exploration problems. Extensive numerical experiments highlight the efficiency of our proposed algorithms compared with existing methods.
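
As a rough illustration of the maximin structure mentioned in the abstract, the sketch below evaluates the inner minimum for a fixed sampling allocation in the standard Gaussian best-arm identification setting. It is not taken from the paper: the function name `gaussian_bai_rate`, the common known variance `sigma**2`, and the assumption of a unique best arm are all choices made here for illustration, and the formula is the textbook equal-variance Gaussian form rather than the authors' general formulation.

```python
from typing import Sequence


def gaussian_bai_rate(theta: Sequence[float], p: Sequence[float], sigma: float = 1.0) -> float:
    """Inner minimum of the maximin problem for a fixed allocation p.

    Assumptions (illustrative only): Gaussian arms with common known
    variance sigma**2, a unique best arm, and strictly positive p.
    """
    best = max(range(len(theta)), key=lambda i: theta[i])
    rates = []
    for j in range(len(theta)):
        if j == best:
            continue
        gap = theta[best] - theta[j]
        # Information available for separating arm j from the best arm
        # when sampling proportions are p[best] and p[j].
        rates.append(gap ** 2 / (2.0 * sigma ** 2 * (1.0 / p[best] + 1.0 / p[j])))
    # The hardest-to-separate alternative determines the rate.
    return min(rates)


uniform = gaussian_bai_rate([1.0, 0.0, 0.0], [1 / 3, 1 / 3, 1 / 3])
tilted = gaussian_bai_rate([1.0, 0.0, 0.0], [0.4, 0.3, 0.3])
```

Under these assumptions, an optimal allocation would maximize this quantity over the probability simplex; in the example, shifting some effort toward the best arm (`tilted`) yields a larger value than the uniform allocation.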

Funding: W. You’s research is generously supported by the Hong Kong Research Grants Council [Grants ECS 26212320 and GRF 16212823].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2023.0590.

Volume 74, Issue 2

March-April 2026

Pages v-ix, 573-1152, iii-iv

Information

Received: October 30, 2023
Accepted: May 13, 2025
Published Online: July 15, 2025

Cite as

Chao Qin, Wei You (2025) Dual-Directed Algorithm Design for Efficient Pure Exploration. Operations Research 74(2):1104-1125.

https://doi.org/10.1287/opre.2023.0590

Acknowledgments

The authors thank the anonymous reviewers and associate editor for valuable feedback that significantly improved the paper and Daniel Russo, Sandeep Juneja, Po-An Wang, Shane Henderson, Xiaowei Zhang, Jun Luo, the participants of the 2023 INFORMS Applied Probability Society Conference, the 2023 INFORMS Annual Meeting, and the 2024 INFORMS MSOM Conference for their insightful comments on this work. A preliminary version of this paper appeared as an extended abstract in the Proceedings of the 36th Annual Conference on Learning Theory, with the title “Information-Directed Selection for Top-Two Algorithms.”
