Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

Zhiqiang Zheng
Zhiqiang Zheng
[email protected]
A. Gary Anderson Graduate School of Management, University of California, Riverside, 18 Anderson Hall, Riverside, California 92521
Search for more papers by this author
,
Balaji Padmanabhan
Balaji Padmanabhan
[email protected]
The Wharton School, University of Pennsylvania, 3730 Walnut Street, Philadelphia, Pennsylvania 19104
Search for more papers by this author

A. Gary Anderson Graduate School of Management, University of California, Riverside, 18 Anderson Hall, Riverside, California 92521

Search for more papers by this author

Balaji Padmanabhan

[email protected]

The Wharton School, University of Pennsylvania, 3730 Walnut Street, Philadelphia, Pennsylvania 19104

Search for more papers by this author

Published Online:1 May 2006https://doi.org/10.1287/mnsc.1050.0488

References

Atkinson A. The usefulness of optimum experimental designs. J. Roy. Statist. Soc. (1996) 58:59–76Google Scholar
Atkinson A., Bailey R. A. One hundred years of the design of experiments on and off the pages of Biometrika. Biometrika (2001) 88:53–97Crossref, Google Scholar
Atkinson A., Donev A.Optimum Experimental Designs (1992) (Oxford Science Publications, Oxford, UK) Google Scholar
Blake L., Merz J. UCI repository of machine learning databases. (1998) (Department of Computer Science, University of California, Irvine, CA) . http://www.ics.uci.edu/∼mlearn/MLRepository.htmlGoogle Scholar
Box G. E. P., Tiao G. C.Bayesian Inference in Statistical Analysis (1992) (Addison-Wesley, Reading, MA) Crossref, Google Scholar
Box G. E. P., Hunter W. G., Hunter J. S.Statistics for Experimenters (1978) (John Wiley & Sons, New York) Google Scholar
Chaloner K., Larntz K. Optimal Bayesian design applied to logistic regression experiments. J. Statist. Planning Inference (1989) 21:191–208Crossref, Google Scholar
Chaloner K., Verdinelli I. Bayesian experimental design: A review. Statist. Sci. (1995) 10:273–304Crossref, Google Scholar
Chaudhuri Arijit, Stenger H.Survey Sampling: Theory and Methods (1992) (Marcel Dekker Inc., Florence, KY.) Google Scholar
Cochran W. G.Sampling Techniques (1977) 3rd ed.(John Wiley & Sons, New York) Google Scholar
Cohn D. Neural network exploration using optimal experiment design. J. Econometrics (1996) 37:87–114Google Scholar
Cohn D., Ghahramani Z., Jordan M. Active learning with statistical models. J. Artificial Intelligence Res. (1996) 4:129–143Crossref, Google Scholar
Cook T. D., Campbell D. T.Quasi-Experimentation: Design and Analysis Issues for Field Settings (1979) (Rand McNally College Publishing Company, Chicago, IL) Google Scholar
Cook R., Wong W. On the equivalence of constrained and compound optimal designs. J. Amer. Statist. Assoc. (1994) 89:426–434Crossref, Google Scholar
Dietterich T. Ensemble methods in machine learning. Multiple Classifier Systems (2000) 18:1–15Crossref, Google Scholar
Dos Santos B., Mookerjee V. Expert system design: Minimizing information acquisition costs. Decision Support Systems (1993) 9:161–181Crossref, Google Scholar
Dunn L. F., Kim T. An empirical investigation of credit card default: Ponzi schemes and other behaviors. (2004) . Working paper, Department of Economics, Ohio State University, Columbus, OHGoogle Scholar
Engelson S., Dagan G. Committee-based sample selection for probabilistic classifiers. J. Artificial Intelligence Res. (1999) 11:335–360Crossref, Google Scholar
Ford I., Silvey S. A sequentially constructed design for estimating nonlinear parameter functions. Biometrika (1980) 67:381–388Crossref, Google Scholar
Freund Y., Seung H., Shamir E., Tishby N. Selective sampling using query by committee algorithm. Mach. Learning (1997) 28:133–168Crossref, Google Scholar
Greene W.Econometric Analysis (2000) (Prentice Hall, Upper Saddle River, NJ) Google Scholar
Greiner R., Grove D. Learning cost-sensitive active classifiers. Artificial Intelligence (2002) 139:137–174Crossref, Google Scholar
Hasenjäger M., Ritter H. Active learning in neural networks. (1999) . Working paper, University of Bielefeld, Bielefeld, Germany, http://citeseer.nj.nec.com/404108.htmlGoogle Scholar
Hu I. On sequential designs in non-linear problems. Biometrika (1998) 85:496–503Crossref, Google Scholar
Kallberg J., Udell G. The value of private sector credit information sharing: The U.S. case. J. Banking Finance (2003) 27:449–469Crossref, Google Scholar
Kifer J. C. Optimum experimental design. J. Roy. Statist. Soc., Ser. B (1959) 21:272–304Google Scholar
Kuhfeld W., Tobias R., Garratt M. Efficient experimental design with marketing research applications. J. Marketing Res. (1994) 31:545–557Crossref, Google Scholar
Kullback S.Information Theory and Statistics (1959) (Wiley, New York) Google Scholar
Lindley D. V.Bayesian Statistics—A Review (1972) (SIAM, Philadelphia, PA) Crossref, Google Scholar
Little R. J. A., Rubin D.Statistical Analysis with Missing Data (1987) (John Wiley & Sons, New York) Google Scholar
MacKay D. J. C. Information-based objective functions for active data selection. Neural Comput. (1992) 4:590–604Crossref, Google Scholar
Markoff John. Pentagon plans a computer system that would peek at personal data of Americans. New York Times (2002) November 9Google Scholar
Melville P., Saar-Tsechansky M., Provost F., Mooney R. Active feature-value acquisition for classification induction. Proc. ICDM-2004, Brighton, UK (2004) (IEEE Computer Society)483–486Google Scholar
Mookerjee V., Dos Santos B. Inductive expert system design: Maximizing system value. Inform. Systems Res. (1993) 4:111–131Link, Google Scholar
Mookerjee Vijay S., Mannino M. Redesigning case retrieval to reduce information acquisition costs. Inform. Systems Res. (1997) 8:51–69Link, Google Scholar
Moore J., Whinston A. A model of decision-making with sequential information-acquisition (Part 1). Decision Support Systems (1986) 2:285–307Crossref, Google Scholar
Moore J., Whinston A. A model of decision-making with sequential information-acquisition (Part 2). Decision Support Systems (1987) 3:47–73Crossref, Google Scholar
Moore J. C., Rao H. R., Whinston A., Nam K., Raghu T. S. Information acquisition policies for resources allocation among multiple agents. Inform. Systems Res. (1997) 8:151–181Link, Google Scholar
Padmanabhan B., Zheng Z., Kimbrough S. An empirical analysis of the value of complete information for eCRM models. MIS Quart. (2005) . ForthcomingGoogle Scholar
Park Y., Fader P. Modeling browsing behavior at multiple websites. Marketing Sci. (2004) 23:280–303Link, Google Scholar
Perlich C., Provost F. ACORA: Distribution-based aggregation for relational learning from identified attributes. (2004) . Working paper CeDER-04-04, Stern School of Business, New York University, New YorkGoogle Scholar
Rosenberger W., Hu M. On the use of generalized linear models following a sequential design. Statist. Probab. Lett. (2002) 56:155–161Crossref, Google Scholar
Royall R. M. On finite population sampling theory under certain linear regression models. Biometrika (1970) 57:377–387Crossref, Google Scholar
Royall R. M. Likelihood functions in finite population sampling theory. Biometrika (1976) 63:605–614Crossref, Google Scholar
Rubin D.Multiple Imputation for Nonresponse in Surveys (1987) (J. Wiley & Sons, New York) Crossref, Google Scholar
Rubin D. Missing data, data imputation, bootstrap. J. Amer. Statist. Assoc. (1994) 89:426–434Crossref, Google Scholar
Saar-Tsechansky M., Provost F. J. Active learning for class probability estimation and ranking. Proc. Internat. Joint Conf. Artificial Intelligence (IJCAI 01), Seattle, WA (2001) (American Association for Artificial Intelligence)911–920Google Scholar
Schafer M. K., Olsen J. L. Multiple imputation for multivariate missing-data problems: A data analyst’s perspective. Multivariate Behavioral Res. (1998) 33:545–571Crossref, Google Scholar
Schuurmans D., Greiner R. Practical PAC learning. Proc. 14th Internat. Conf. Artificial Intelligence (IJCAI 95), Montreal, Canada (1995) (American Association for Artificial Intelligence)1–7Google Scholar
Simon H., Lea G., Gregg L. W. Problem solving and rule induction: A unified view. Knowledge and Cognition (1974) (Erlbaum, Potomac, MD) . Chapter 5Google Scholar
Singh R., Mangat N.Elements of Survey Sampling (1996) (Kluwer Academic Publishers, Amsterdam, Netherlands) Crossref, Google Scholar
Smith T. M. F. Populations and selection: Limitations of statistics. J. Roy. Statist. Soc. (Ser. A) (1993) 156:144–166Crossref, Google Scholar
Smith T. M. F. Biometrika centenary: Sample surveys. Biometrika (2001) 88:167–194Crossref, Google Scholar
Steeleand S., Brown J., Chambers R. A controlled donor imputation system for a one-number census. J. Roy. Statist. Soc. (Ser. A) (2002) 165:495–522Crossref, Google Scholar
Steuer E.Multiple Criteria Optimization: Theory, Computation and Application (1986) (Wiley, New York) Google Scholar
Tong S., Koller D. Active learning for structure in Bayesian networks. Proc. Internat. Joint Conf. Artificial Intelligence 2001, Seattle, WA (2001) (American Association for Artificial Intelligence)647–653Google Scholar
Valiant L. A theory of the learnable. Comm. ACM (1984) 27:1134–1142Crossref, Google Scholar
Wu C. Asymptotic inference from sequential design in a non-linear situation. Biometrika (1985) 72:552–558Crossref, Google Scholar
Zheng Z., Padmanabhan B. On active learning for data acquisition. Proc. IEEE Internat. Conf. Data Mining 2002 (2002) 562–569Crossref, Google Scholar
Zheng Z., Padmanabhan B. Constructing ensembles from data envelopment analysis. INFORMS J. Comput. (2005) . ForthcomingGoogle Scholar

Volume 52, Issue 5

May 2006

Pages iv-811

Article Information

Supplemental Material

Metrics

Information

Received:July 27, 2003
Published Online:May 01, 2006

Cite as

Zhiqiang Zheng, Balaji Padmanabhan, (2006) Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution. Management Science 52(5):697-712.

https://doi.org/10.1287/mnsc.1050.0488

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

References

Volume 52, Issue 5

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News