Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

Published Online:https://doi.org/10.1287/mnsc.1050.0488

References

  • Atkinson A. The usefulness of optimum experimental designs. J. Roy. Statist. Soc. (1996) 58:59–76Google Scholar
  • Atkinson A., Bailey R. A. One hundred years of the design of experiments on and off the pages of Biometrika. Biometrika (2001) 88:53–97CrossrefGoogle Scholar
  • Atkinson A., Donev A.Optimum Experimental Designs (1992) (Oxford Science Publications, Oxford, UK) Google Scholar
  • Blake L., Merz J. UCI repository of machine learning databases. (1998) (Department of Computer Science, University of California, Irvine, CA) . http://www.ics.uci.edu/∼mlearn/MLRepository.htmlGoogle Scholar
  • Box G. E. P., Tiao G. C.Bayesian Inference in Statistical Analysis (1992) (Addison-Wesley, Reading, MA) CrossrefGoogle Scholar
  • Box G. E. P., Hunter W. G., Hunter J. S.Statistics for Experimenters (1978) (John Wiley & Sons, New York) Google Scholar
  • Chaloner K., Larntz K. Optimal Bayesian design applied to logistic regression experiments. J. Statist. Planning Inference (1989) 21:191–208CrossrefGoogle Scholar
  • Chaloner K., Verdinelli I. Bayesian experimental design: A review. Statist. Sci. (1995) 10:273–304CrossrefGoogle Scholar
  • Chaudhuri Arijit, Stenger H.Survey Sampling: Theory and Methods (1992) (Marcel Dekker Inc., Florence, KY.) Google Scholar
  • Cochran W. G.Sampling Techniques (1977) 3rd ed.(John Wiley & Sons, New York) Google Scholar
  • Cohn D. Neural network exploration using optimal experiment design. J. Econometrics (1996) 37:87–114Google Scholar
  • Cohn D., Ghahramani Z., Jordan M. Active learning with statistical models. J. Artificial Intelligence Res. (1996) 4:129–143CrossrefGoogle Scholar
  • Cook T. D., Campbell D. T.Quasi-Experimentation: Design and Analysis Issues for Field Settings (1979) (Rand McNally College Publishing Company, Chicago, IL) Google Scholar
  • Cook R., Wong W. On the equivalence of constrained and compound optimal designs. J. Amer. Statist. Assoc. (1994) 89:426–434CrossrefGoogle Scholar
  • Dietterich T. Ensemble methods in machine learning. Multiple Classifier Systems (2000) 18:1–15CrossrefGoogle Scholar
  • Dos Santos B., Mookerjee V. Expert system design: Minimizing information acquisition costs. Decision Support Systems (1993) 9:161–181CrossrefGoogle Scholar
  • Dunn L. F., Kim T. An empirical investigation of credit card default: Ponzi schemes and other behaviors. (2004) . Working paper, Department of Economics, Ohio State University, Columbus, OHGoogle Scholar
  • Engelson S., Dagan G. Committee-based sample selection for probabilistic classifiers. J. Artificial Intelligence Res. (1999) 11:335–360CrossrefGoogle Scholar
  • Ford I., Silvey S. A sequentially constructed design for estimating nonlinear parameter functions. Biometrika (1980) 67:381–388CrossrefGoogle Scholar
  • Freund Y., Seung H., Shamir E., Tishby N. Selective sampling using query by committee algorithm. Mach. Learning (1997) 28:133–168CrossrefGoogle Scholar
  • Greene W.Econometric Analysis (2000) (Prentice Hall, Upper Saddle River, NJ) Google Scholar
  • Greiner R., Grove D. Learning cost-sensitive active classifiers. Artificial Intelligence (2002) 139:137–174CrossrefGoogle Scholar
  • Hasenjäger M., Ritter H. Active learning in neural networks. (1999) . Working paper, University of Bielefeld, Bielefeld, Germany, http://citeseer.nj.nec.com/404108.htmlGoogle Scholar
  • Hu I. On sequential designs in non-linear problems. Biometrika (1998) 85:496–503CrossrefGoogle Scholar
  • Kallberg J., Udell G. The value of private sector credit information sharing: The U.S. case. J. Banking Finance (2003) 27:449–469CrossrefGoogle Scholar
  • Kifer J. C. Optimum experimental design. J. Roy. Statist. Soc., Ser. B (1959) 21:272–304Google Scholar
  • Kuhfeld W., Tobias R., Garratt M. Efficient experimental design with marketing research applications. J. Marketing Res. (1994) 31:545–557CrossrefGoogle Scholar
  • Kullback S.Information Theory and Statistics (1959) (Wiley, New York) Google Scholar
  • Lindley D. V.Bayesian Statistics—A Review (1972) (SIAM, Philadelphia, PA) CrossrefGoogle Scholar
  • Little R. J. A., Rubin D.Statistical Analysis with Missing Data (1987) (John Wiley & Sons, New York) Google Scholar
  • MacKay D. J. C. Information-based objective functions for active data selection. Neural Comput. (1992) 4:590–604CrossrefGoogle Scholar
  • Markoff John. Pentagon plans a computer system that would peek at personal data of Americans. New York Times (2002) November 9Google Scholar
  • Melville P., Saar-Tsechansky M., Provost F., Mooney R. Active feature-value acquisition for classification induction. Proc. ICDM-2004, Brighton, UK (2004) (IEEE Computer Society)483–486Google Scholar
  • Mookerjee V., Dos Santos B. Inductive expert system design: Maximizing system value. Inform. Systems Res. (1993) 4:111–131LinkGoogle Scholar
  • Mookerjee Vijay S., Mannino M. Redesigning case retrieval to reduce information acquisition costs. Inform. Systems Res. (1997) 8:51–69LinkGoogle Scholar
  • Moore J., Whinston A. A model of decision-making with sequential information-acquisition (Part 1). Decision Support Systems (1986) 2:285–307CrossrefGoogle Scholar
  • Moore J., Whinston A. A model of decision-making with sequential information-acquisition (Part 2). Decision Support Systems (1987) 3:47–73CrossrefGoogle Scholar
  • Moore J. C., Rao H. R., Whinston A., Nam K., Raghu T. S. Information acquisition policies for resources allocation among multiple agents. Inform. Systems Res. (1997) 8:151–181LinkGoogle Scholar
  • Padmanabhan B., Zheng Z., Kimbrough S. An empirical analysis of the value of complete information for eCRM models. MIS Quart. (2005) . ForthcomingGoogle Scholar
  • Park Y., Fader P. Modeling browsing behavior at multiple websites. Marketing Sci. (2004) 23:280–303LinkGoogle Scholar
  • Perlich C., Provost F. ACORA: Distribution-based aggregation for relational learning from identified attributes. (2004) . Working paper CeDER-04-04, Stern School of Business, New York University, New YorkGoogle Scholar
  • Rosenberger W., Hu M. On the use of generalized linear models following a sequential design. Statist. Probab. Lett. (2002) 56:155–161CrossrefGoogle Scholar
  • Royall R. M. On finite population sampling theory under certain linear regression models. Biometrika (1970) 57:377–387CrossrefGoogle Scholar
  • Royall R. M. Likelihood functions in finite population sampling theory. Biometrika (1976) 63:605–614CrossrefGoogle Scholar
  • Rubin D.Multiple Imputation for Nonresponse in Surveys (1987) (J. Wiley & Sons, New York) CrossrefGoogle Scholar
  • Rubin D. Missing data, data imputation, bootstrap. J. Amer. Statist. Assoc. (1994) 89:426–434CrossrefGoogle Scholar
  • Saar-Tsechansky M., Provost F. J. Active learning for class probability estimation and ranking. Proc. Internat. Joint Conf. Artificial Intelligence (IJCAI 01), Seattle, WA (2001) (American Association for Artificial Intelligence)911–920Google Scholar
  • Schafer M. K., Olsen J. L. Multiple imputation for multivariate missing-data problems: A data analyst’s perspective. Multivariate Behavioral Res. (1998) 33:545–571CrossrefGoogle Scholar
  • Schuurmans D., Greiner R. Practical PAC learning. Proc. 14th Internat. Conf. Artificial Intelligence (IJCAI 95), Montreal, Canada (1995) (American Association for Artificial Intelligence)1–7Google Scholar
  • Simon H., Lea G., Gregg L. W. Problem solving and rule induction: A unified view. Knowledge and Cognition (1974) (Erlbaum, Potomac, MD) . Chapter 5Google Scholar
  • Singh R., Mangat N.Elements of Survey Sampling (1996) (Kluwer Academic Publishers, Amsterdam, Netherlands) CrossrefGoogle Scholar
  • Smith T. M. F. Populations and selection: Limitations of statistics. J. Roy. Statist. Soc. (Ser. A) (1993) 156:144–166CrossrefGoogle Scholar
  • Smith T. M. F. Biometrika centenary: Sample surveys. Biometrika (2001) 88:167–194CrossrefGoogle Scholar
  • Steeleand S., Brown J., Chambers R. A controlled donor imputation system for a one-number census. J. Roy. Statist. Soc. (Ser. A) (2002) 165:495–522CrossrefGoogle Scholar
  • Steuer E.Multiple Criteria Optimization: Theory, Computation and Application (1986) (Wiley, New York) Google Scholar
  • Tong S., Koller D. Active learning for structure in Bayesian networks. Proc. Internat. Joint Conf. Artificial Intelligence 2001, Seattle, WA (2001) (American Association for Artificial Intelligence)647–653Google Scholar
  • Valiant L. A theory of the learnable. Comm. ACM (1984) 27:1134–1142CrossrefGoogle Scholar
  • Wu C. Asymptotic inference from sequential design in a non-linear situation. Biometrika (1985) 72:552–558CrossrefGoogle Scholar
  • Zheng Z., Padmanabhan B. On active learning for data acquisition. Proc. IEEE Internat. Conf. Data Mining 2002 (2002) 562–569CrossrefGoogle Scholar
  • Zheng Z., Padmanabhan B. Constructing ensembles from data envelopment analysis. INFORMS J. Comput. (2005) . ForthcomingGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.