Feature Misspecification in Sequential Learning Problems

Dohyun Ahn
https://orcid.org/0000-0002-0304-0636
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Sha Tin, New Territories, Hong Kong

Dongwook Shin (Corresponding Author)
https://orcid.org/0000-0002-2984-0148
HKUST Business School, Clear Water Bay, Kowloon, Hong Kong

Assaf Zeevi
https://orcid.org/0000-0003-1075-6664
Graduate School of Business, Columbia University, New York, New York 10025

Published Online: 29 Aug 2024
https://doi.org/10.1287/mnsc.2022.00328

Abstract

We consider a class of sequential learning problems where a decision maker must learn the unknown statistical characteristics of a finite set of alternatives (or systems) using sequential sampling to ultimately select a subset of “good” alternatives. A salient feature of our problem is that system performance is governed by a set of features. The decision maker postulates the dependence on these features to be linear, but this model may not precisely represent the true underlying system structure. We show that this misspecification, if not managed properly, can lead to suboptimal performance because of a phenomenon identified as sample-selection endogeneity. We propose a prospective sampling principle—a new approach that eliminates the adverse effects of misspecification as the number of samples grows large. The proposed principle applies across a very general class of widely used sampling policies, enjoys strong asymptotic performance guarantees, and exhibits effective finite-sample performance in numerical experiments.
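As a rough illustration of why misspecification interacts with the sampling rule, the sketch below fits a straight line to mean performances that are not actually linear in the feature. It is not the paper's model or its prospective sampling principle; the feature values, true means, and allocation weights are hypothetical numbers chosen only to show that the fitted line, and hence the alternative it ranks best, can change with how the samples are spread across alternatives.

```python
def weighted_linear_fit(x, mu, w):
    """Fit a line to points (x_i, mu_i) by weighted least squares.

    The weights w_i stand in for the fraction of samples allocated to
    alternative i (an illustrative assumption, not the paper's setup).
    Returns the fitted value at each x_i.
    """
    total = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / total
    mu_bar = sum(wi * mi for wi, mi in zip(w, mu)) / total
    cov = sum(wi * (xi - x_bar) * (mi - mu_bar) for wi, xi, mi in zip(w, x, mu))
    var = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    slope = cov / var
    intercept = mu_bar - slope * x_bar
    return [intercept + slope * xi for xi in x]


def best_index(values):
    """Index of the largest value."""
    return max(range(len(values)), key=lambda i: values[i])


# Hypothetical example: one feature, three alternatives, nonlinear true means.
x = [0.0, 1.0, 2.0]
true_mu = [1.0, 0.0, 0.8]          # alternative 0 is truly the best

uniform = [1 / 3, 1 / 3, 1 / 3]    # samples spread evenly
skewed = [0.05, 0.475, 0.475]      # samples concentrated on alternatives 1 and 2

fit_uniform = weighted_linear_fit(x, true_mu, uniform)
fit_skewed = weighted_linear_fit(x, true_mu, skewed)

print("true best:", best_index(true_mu))
print("best under uniform allocation:", best_index(fit_uniform))
print("best under skewed allocation:", best_index(fit_skewed))
```

With even sampling the fitted line slopes downward and still ranks alternative 0 first; with the skewed allocation the slope turns positive and the line ranks alternative 2 first. Because an adaptive policy chooses its allocation from earlier estimates, the fitted model and the sampling decisions feed back into each other, which is one way to read the endogeneity the abstract describes.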

This paper was accepted by Vivek Farias, data science.

Funding: This work was supported by the United States-Israel Binational Science Foundation [Grant 2020063] and the Hong Kong Research Grant Council [GRF Grant 16501821 and ECS Grant 24210420].

Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2022.00328.

Volume 71, Issue 5

May 2025

Pages iv-vi, 3641-4531

Information

Received: February 02, 2022
Accepted: March 20, 2024
Published Online: August 29, 2024

Cite as

Dohyun Ahn, Dongwook Shin, Assaf Zeevi (2024) Feature Misspecification in Sequential Learning Problems. Management Science 71(5):4066-4086.

https://doi.org/10.1287/mnsc.2022.00328

Acknowledgments

The authors thank Vivek Farias, the department editor, for providing valuable feedback that helped improve the paper. The authors also thank the associate editor and the referees for their thoughtful and detailed comments on the earlier version of the manuscript; their suggestions greatly enhanced the quality and the presentation of the paper.
