Using Conditional Restricted Boltzmann Machines to Model Complex Consumer Shopping Patterns
Published Online:12 Jul 2019https://doi.org/10.1287/mksc.2019.1162
References
- (2017) Variational inference: A review for statisticians. J. Amer. Statist. Assoc. 112(518):859–877.Crossref, Google Scholar
- (2010) Large-scale machine learning with stochastic gradient descent. Proc. COMPSTAT’2010 (Physica-Verlag, Berlin, Heidelberg, Germany), 177–186.Google Scholar
- (2008) Modeling joint purchases with a multivariate MNL approach. Schmalenbach Bus. Rev. 60(4):400–422.Crossref, Google Scholar
- (2014) Improving preference prediction accuracy with feature learning. Proc. ASME 2014 Internat. Design Engrg. Tech. Conf. Comput. Inform. Engrg. Conf. (American Society of Mechanical Engineers, New York), V02AT03A012.Google Scholar
- (2017) Stan: A probabilistic programming language. J. Statist. Software 76(1):1–32.Crossref, Google Scholar
- (2011) Better mini-batch algorithms via accelerated gradient methods. Adv. Neural Inform. Processing Systems 24:1647–1655.Google Scholar
- (2010) A Bayesian multi-level factor analytic model of consumer price sensitivities across categories. Psychometrika 75(3):558–578.Crossref, Google Scholar
- (2007) Consumers’ price sensitivities across complementary categories. Management Sci. 53(12):1933–1945.Link, Google Scholar
- (2003) Multivariate analysis of multiple response data. J. Marketing Res. 40(3):321–334.Crossref, Google Scholar
- (2005) “Counting your customers” the easy way: An alternative to the Pareto/NBD model. Marketing Sci. 24(2):275–284.Link, Google Scholar
- (2001) The Elements of Statistical Learning, Springer Series in Statistics (Springer, Berlin).Google Scholar
- (2006) Variational Bayesian multinomial probit regression with Gaussian process priors. Neural Comput. 18(8):1790–1817.Crossref, Google Scholar
- (2006) Understanding store-brand purchase behavior across categories. Marketing Sci. 25(1):75–90.Link, Google Scholar
- (2016) Statistical Learning with Sparsity (CRC Press, Boca Raton, FL).Google Scholar
- (2010) A practical guide to training restricted Boltzmann machines. Momentum 9(1):926.Google Scholar
- (2002) Training products of experts by minimizing contrastive divergence. Neural Comput. 14(8):1771–1800.Crossref, Google Scholar
- (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507.Crossref, Google Scholar
- (2014) Analyzing market baskets by restricted Boltzmann machines. OR Spectrum 36(1):209–228.Crossref, Google Scholar
- (1982) A time series analysis of binary data. J. Amer. Statist. Assoc. 77(380):816–821.Crossref, Google Scholar
- (2017) Automatic differentiation variational inference. J. Machine Learn. Res. 18(1):430–474.Google Scholar
- (2004) Gaussian process latent variable models for visualisation of high dimensional data. Adv. Neural Inform. Processing Systems 16(3):329–336.Google Scholar
- (2015) Winner-take-all autoencoders. Adv. Neural Inform. Processing Systems 28:2791–2799.Google Scholar
- (1999) The “shopping basket”: A model for multicategory purchase incidence decisions. Marketing Sci. 18(2):95–114.Link, Google Scholar
- (2007) Investigating consumers’ purchase incidence and brand choice decisions across multiple product categories: A theoretical and empirical analysis. Marketing Sci. 26(2):196–217.Link, Google Scholar
- (2013) 7 strategies for boosting digital coupon conversions. Retrieved March 3, 2014, http://streetfightmag.com/2013/10/03/7-strategies-for-boosting-digital-coupon-conversions/.Google Scholar
- (2014) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference (Elsevier, Amsterdam).Google Scholar
- (2000) Analysis of cross category dependence in market basket selection. J. Retailing 76(3):367–392.Crossref, Google Scholar
- (2010) Efficient learning of deep Boltzmann machines. Proc. 13th Internat. Conf. Artificial Intelligence Statist., vol. 9 (Microtome Publishing, Brookline, MA), 693–700.Google Scholar
- (2007) Restricted Boltzmann machines for collaborative filtering. Proc. 24th Internat. Conf. Machine Learn. (ACM, New York), 791–798.Google Scholar
- (2013) Learning with hierarchical-deep models. IEEE Trans. Pattern Anal. Machine Intelligence 35(8):1958–1971.Crossref, Google Scholar
- (2005) Models of multi-category choice behavior. Marketing Lett. 16(3–4):239–254.Crossref, Google Scholar
- (2011) Stochastic method for L1 regularized loss minimization. J. Machine Learn. Res. 12(June):1865–1892.Google Scholar
- (2014) Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learn. Res. 15(1):1929–1958.Google Scholar
- (2004) Product complements and substitutes in the real world: The relevance of “other products.” J. Marketing 68(1):28–40.Crossref, Google Scholar
- (2012) Efficient Bayesian inference for multivariate probit models with sparse inverse correlation matrices. J. Comput. Graphical Statist. 21(3):739–757.Crossref, Google Scholar
- (2009) Factored conditional restricted Boltzmann machines for modeling motion style. Proc. 26th Annual Internat. Conf. Machine Learn. (ACM, New York), 1025–1032.Google Scholar
- (2005) Regularization and variable selection via the elastic net. J. Roy. Statist. Soc. Ser. B. Statist. Methodology 67(2):301–320.Crossref, Google Scholar

