Cost-Restricted Feature Selection for Data Acquisition

Published Online:https://doi.org/10.1287/mnsc.2022.4551

When acquiring consumer data for marketing or new business initiatives, it is important to decide what attributes or features of potential customers should be acquired. We study a new feature selection problem in the context of customer data acquisition in which different features have different acquisition costs. This feature selection problem is studied for linear regression and logistic regression. We formulate the feature selection and acquisition problems as nonlinear discrete optimization problems that minimize prediction errors subject to a budget constraint. We derive the analytical properties of the solutions for the problems, develop a computational procedure for solving the problems, provide an intuitive interpretation for the feature selection criteria, and discuss managerial implications of the solution approach. The results of the experimental study demonstrate the effectiveness of our approach.

This paper was accepted by Kartik Hosanagar, information systems.

Supplemental Material: Data are available at https://doi.org/10.1287/mnsc.2022.4551.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.