Ensemble Experiments to Optimize Interventions Along the Customer Journey: A Reinforcement Learning Approach
References
- (2020) An optimistic perspective on offline reinforcement learning. Internat. Conf. Machine Learn. (PMLR, New York), 104–114.Google Scholar
- (2022) Measuring business uncertainty in developing and emerging economies (Brookings Institution), https://policycommons.net/artifacts/4141097/measuring-business-uncertainty-in-developing-and-emerging-economies/4949875/.Google Scholar
- (2018) Efficient exploration through Bayesian deep Q-networks. 2018 Inform. Theory Appl. Workshop ITA 2018 (IEEE, Piscataway, NJ), 1–9.Google Scholar
- (2018) AE: A domain-agnostic platform for adaptive experimentation. Conf. Neural Inform. Processing Systems (San Diego, CA), 1–8.Google Scholar
- (1995) Neuro-dynamic programming: An overview. Proc. 1995 34th IEEE Conf. Decision Control, vol. 1 (IEEE, Piscataway, NJ), 560–564.Google Scholar
- (2016) Zooming in on choice: How do consumers search for cameras online? Marketing Sci. 35(5):693–712.Link, Google Scholar
- (1998) A survey of POMDP applications. Working Notes AAAI 1998 Fall Sympos. Planning Partially Observable Markov Decision Processes, vol. 1724 (AAAI Press, Palo Alto, CA).Google Scholar
- (2016) Xgboost: A scalable tree boosting system. Proc. 22nd ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM, New York), 785–794.Google Scholar
- (1998) Bayesian Q-learning. Mostow J, Rich C, eds. AAAI 98 (AAAI Press/MIT Press, Cambridge, MA), 761–768.Google Scholar
- (2018) Learning to collaborate: Multi-scenario ranking via multi-agent reinforcement learning. Proc. 2018 World Wide Web Conf. (International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva), 1939–1948.Google Scholar
- (2018) Bayesian optimization, chapter 11. Gel E, Ntaimo L, eds. Recent Advances in Optimization and Modeling of Contemporary Problems (INFORMS, Catonsville, MD), 255–278.Link, Google Scholar
- (2017) A refresher on A/B testing. Harvard Bus. Rev. (June 28), https://hbr.org/2017/06/a-refresher-on-ab-testing.Google Scholar
- (2009) An empirical analysis of search engine advertising: Sponsored search in electronic markets. Management Sci. 55(10):1605–1622.Link, Google Scholar
- (2019) Modeling consumer footprints on search engines: An interplay with social media. Management Sci. 65(3):1363–1385.Link, Google Scholar
- (1969) Linear Bayesian methods. J. Roy. Statist. Soc. B 31(3):446–454.Crossref, Google Scholar
- (2014) Website morphing 2.0: Switching costs, partial exposure, random exit, and when to morph. Management Sci. 60(6):1594–1616.Link, Google Scholar
- (2009) Website morphing. Marketing Sci. 28(2):202–223.Link, Google Scholar
- (2015) Deep recurrent Q-learning for partially observable MDPs. AAAI 2015 Fall Symposium (AAAI Press, Palo Alto, CA), 29–37.Google Scholar
- (2018) Rainbow: Combining improvements in deep reinforcement learning. Thirty-Second AAAI Conf. Artificial Intelligence (AAAI Press, Palo Alto, CA), 3215–3222.Google Scholar
- (2019) Word-of-mouth system implementation and customer conversion: A randomized field experiment. Inform. Systems Res. 30(3):805–818.Link, Google Scholar
- (1994) Do coupon expiration dates affect consumer behavior? J. Marketing Res. 31(3):423–428.Crossref, Google Scholar
- (1987) The multi-armed bandit problem: Decomposition and computation. Math. Oper. Res. 12(2):262–268.Link, Google Scholar
- (2008) Remedying hyperopia: The effects of self-control regret on consumer behavior. J. Marketing Res. 45(6):676–689.Crossref, Google Scholar
- (2021) Demand-aware career path recommendations: A reinforcement learning approach. Management Sci. 67(7):4362–4383.Link, Google Scholar
- (1964) A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise. J. Fluids Engrg. 86(1):97–106.Google Scholar
- (2021) How do product attributes and reviews moderate the impact of recommender systems through purchase stages? Management Sci. 67(1):524–546.Link, Google Scholar
- (2014) Attributing conversions in a multichannel online marketing environment: An empirical model and a field experiment. J. Marketing Res. 51(1):40–56.Crossref, Google Scholar
- (2019) The right music at the right time: Adaptive personalized playlists based on sequence modeling. MIS Quart. 43(3):765–786.Crossref, Google Scholar
- (2016) Offline evaluation of online reinforcement learning algorithms. Proc. AAAI Conf. Artificial Intelligence, vol. 30 (AAAI Press, Palo Alto, CA), 1926–1933.Google Scholar
- (2020) Principles of Economics (Cengage Learning, Boston).Google Scholar
- (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533.Crossref, Google Scholar
- (1975) On Bayesian methods for seeking the extremum. Optimization Techniques IFIP Tech. Conf. (Springer, Berlin), 400–404.Crossref, Google Scholar
- (2004) Dynamic conversion behavior at e-commerce sites. Management Sci. 50(3):326–335.Link, Google Scholar
- (2019) Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inform. Processing Systems 32 (NIPS, San Diego, CA).Google Scholar
- (2013) A reinforcement learning approach to autonomous decision-making in smart electricity markets. Machine Learn. 92(1):5–39.Crossref, Google Scholar
- (2020) Beyond exploding and vanishing gradients: Analysing RNN training using attractors and smoothness. PMLR 2020 (PMLR), 2370–2380.Google Scholar
- (2015) Prioritized experience replay. Internat. Conf. Learn. Representations 2016 (ICLR, Appleton, WI).Google Scholar
- (2016) Mastering the game of go with deep neural networks and tree search. Nature 529(7587):484–489.Crossref, Google Scholar
- (2022) Uncovering characteristic response paths of a population. INFORMS J. Comput. 34(3):1661–1680.Link, Google Scholar
- (2018) Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA).Google Scholar
- (1933) On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3/4):285–294.Crossref, Google Scholar
- (2016) Deep reinforcement learning with double Q-learning. Proc. AAAI Conf. Artificial Intelligence, vol. 30 (AAAI Press, Palo Alto, CA), 2094–2100.Google Scholar
- (2022) Deep reinforcement learning for sequential targeting. Management Sci. 69(9):5439–5460.Google Scholar
- (1992) Q-learning. Machine Learn. 8(3–4):279–292.Crossref, Google Scholar
- (2020) The long-term and spillover effects of price promotions on retailing platforms: Evidence from a large randomized experiment on Alibaba. Management Sci. 66(6):2589–2609.Link, Google Scholar
- (2019) Personalized mobile targeting with user engagement stages: Combining a structural hidden Markov model and field experiment. Inform. Systems Res. 30(3):787–804.Link, Google Scholar

