Using Conditional Restricted Boltzmann Machines to Model Complex Consumer Shopping Patterns

Published Online:https://doi.org/10.1287/mksc.2019.1162

References

  • Blei DM, Kucukelbir A, McAuliffe JD (2017) Variational inference: A review for statisticians. J. Amer. Statist. Assoc. 112(518):859–877.CrossrefGoogle Scholar
  • Bottou L (2010) Large-scale machine learning with stochastic gradient descent. Proc. COMPSTAT’2010 (Physica-Verlag, Berlin, Heidelberg, Germany), 177–186.Google Scholar
  • Boztug Y, Hildebrandt L (2008) Modeling joint purchases with a multivariate MNL approach. Schmalenbach Bus. Rev. 60(4):400–422.CrossrefGoogle Scholar
  • Burnap A, Ren Y, Lee H, Gonzalez R, Papalambros PY (2014) Improving preference prediction accuracy with feature learning. Proc. ASME 2014 Internat. Design Engrg. Tech. Conf. Comput. Inform. Engrg. Conf. (American Society of Mechanical Engineers, New York), V02AT03A012.Google Scholar
  • Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, Brubaker M, Guo J, Li P, Riddell A (2017) Stan: A probabilistic programming language. J. Statist. Software 76(1):1–32.CrossrefGoogle Scholar
  • Cotter A, Shamir O, Srebro N, Sridharan K (2011) Better mini-batch algorithms via accelerated gradient methods. Adv. Neural Inform. Processing Systems 24:1647–1655.Google Scholar
  • Duvvuri SD, Gruca TS (2010) A Bayesian multi-level factor analytic model of consumer price sensitivities across categories. Psychometrika 75(3):558–578.CrossrefGoogle Scholar
  • Duvvuri SD, Ansari A, Gupta TS (2007) Consumers’ price sensitivities across complementary categories. Management Sci. 53(12):1933–1945.LinkGoogle Scholar
  • Edwards YD, Allenby GM (2003) Multivariate analysis of multiple response data. J. Marketing Res. 40(3):321–334.CrossrefGoogle Scholar
  • Fader PS, Hardie BG, Lee KL (2005) “Counting your customers” the easy way: An alternative to the Pareto/NBD model. Marketing Sci. 24(2):275–284.LinkGoogle Scholar
  • Friedman J, Hastie T, Tibshirani R (2001) The Elements of Statistical Learning, Springer Series in Statistics (Springer, Berlin).Google Scholar
  • Girolami M, Rogers S (2006) Variational Bayesian multinomial probit regression with Gaussian process priors. Neural Comput. 18(8):1790–1817.CrossrefGoogle Scholar
  • Hansen K, Singh V, Chintagunta P (2006) Understanding store-brand purchase behavior across categories. Marketing Sci. 25(1):75–90.LinkGoogle Scholar
  • Hastie T, Tibishirani R, Wainwright M (2016) Statistical Learning with Sparsity (CRC Press, Boca Raton, FL).Google Scholar
  • Hinton G (2010) A practical guide to training restricted Boltzmann machines. Momentum 9(1):926.Google Scholar
  • Hinton G (2002) Training products of experts by minimizing contrastive divergence. Neural Comput. 14(8):1771–1800.CrossrefGoogle Scholar
  • Hinton G, Salakhutdinov R (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507.CrossrefGoogle Scholar
  • Hruschka H (2014) Analyzing market baskets by restricted Boltzmann machines. OR Spectrum 36(1):209–228.CrossrefGoogle Scholar
  • Keenan DM (1982) A time series analysis of binary data. J. Amer. Statist. Assoc. 77(380):816–821.CrossrefGoogle Scholar
  • Kucukelbir A, Tran D, Ranganath R, Gelman A, Blei DM (2017) Automatic differentiation variational inference. J. Machine Learn. Res. 18(1):430–474.Google Scholar
  • Lawrence ND (2004) Gaussian process latent variable models for visualisation of high dimensional data. Adv. Neural Inform. Processing Systems 16(3):329–336.Google Scholar
  • Makhzani A, Frey BJ (2015) Winner-take-all autoencoders. Adv. Neural Inform. Processing Systems 28:2791–2799.Google Scholar
  • Manchanda P, Ansari A, Gupta S (1999) The “shopping basket”: A model for multicategory purchase incidence decisions. Marketing Sci. 18(2):95–114.LinkGoogle Scholar
  • Mehta N (2007) Investigating consumers’ purchase incidence and brand choice decisions across multiple product categories: A theoretical and empirical analysis. Marketing Sci. 26(2):196–217.LinkGoogle Scholar
  • Miles S (2013) 7 strategies for boosting digital coupon conversions. Retrieved March 3, 2014, http://streetfightmag.com/2013/10/03/7-strategies-for-boosting-digital-coupon-conversions/.Google Scholar
  • Pearl J (2014) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference (Elsevier, Amsterdam).Google Scholar
  • Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J. Retailing 76(3):367–392.CrossrefGoogle Scholar
  • Salakhutdinov R, Larochelle H (2010) Efficient learning of deep Boltzmann machines. Proc. 13th Internat. Conf. Artificial Intelligence Statist., vol. 9 (Microtome Publishing, Brookline, MA), 693–700.Google Scholar
  • Salakhutdinov R, Mnih A, Hinton G (2007) Restricted Boltzmann machines for collaborative filtering. Proc. 24th Internat. Conf. Machine Learn. (ACM, New York), 791–798.Google Scholar
  • Salakhutdinov R, Tenenbaum JB, Torralba A (2013) Learning with hierarchical-deep models. IEEE Trans. Pattern Anal. Machine Intelligence 35(8):1958–1971.CrossrefGoogle Scholar
  • Seetharaman PB, Chib S, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multi-category choice behavior. Marketing Lett. 16(3–4):239–254.CrossrefGoogle Scholar
  • Shai S, Ambuj T (2011) Stochastic method for L1 regularized loss minimization. J. Machine Learn. Res. 12(June):1865–1892.Google Scholar
  • Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learn. Res. 15(1):1929–1958.Google Scholar
  • Shocker AD, Bayus BL, Kim N (2004) Product complements and substitutes in the real world: The relevance of “other products.” J. Marketing 68(1):28–40.CrossrefGoogle Scholar
  • Talhouk A, Doucet A, Murphy K (2012) Efficient Bayesian inference for multivariate probit models with sparse inverse correlation matrices. J. Comput. Graphical Statist. 21(3):739–757.CrossrefGoogle Scholar
  • Taylor GW, Hinton G (2009) Factored conditional restricted Boltzmann machines for modeling motion style. Proc. 26th Annual Internat. Conf. Machine Learn. (ACM, New York), 1025–1032.Google Scholar
  • Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J. Roy. Statist. Soc. Ser. B. Statist. Methodology 67(2):301–320.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.