Contextual Search in the Presence of Adversarial Corruptions
References
- (2021) Stochastic dueling bandits with adversarial corruption. Feldman V, Ligett K, Sabato S, eds. Algorithmic Learn. Theory, March 16–19, Virtual Conference, Worldwide, vol. 132 (PMLR), 217–248.Google Scholar
- (2013) Learning prices for repeated auctions with strategic buyers. Burges CJC, Bottou L, Ghahramani Z, Weinberger KQ, eds. Proc. 27th Annual Conf. on Neural Inform. Processing Systems, 1169–1177.Google Scholar
- (2014) Repeated contextual auctions with strategic buyers. Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 27, 622–630.Google Scholar
- (2020) Prediction with corrupted expert advice. Larochelle H, Ranzato M, Hadsell R, Balcan M-F, Lin H-T, eds. Proc. 32nd Adv. Neural Processing Systems. https://proceedings.neurips.cc/paper/2020/hash/a512294422de868f8474d22344636f16-Abstract.html.Google Scholar
- (1991) Searching in the presence of linearly bounded errors. Koutsougeras C, Vitter JS, eds. Proc. 23rd Annual ACM Sympos. on Theory of Comput. (ACM, New York), 486–493.Google Scholar
- (2002) The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32(1):48–77.Crossref, Google Scholar
- (2021) Personalized dynamic pricing with machine learning: High-dimensional features and heterogeneous elasticity. Management Sci. 67(9):5549–5568.Google Scholar
- (2020) Online decision making with high-dimensional covariates. Oper. Res. 68(1):276–294.Link, Google Scholar
- (2015) Non-stationary stochastic optimization. Oper. Res. 63(5):1227–1244.Link, Google Scholar
- (2016) Foundations of Data Science (Cambridge University Press, Cambridge, UK).Google Scholar
- (2020) Corruption-tolerant gaussian process bandit optimization. Chiappa S, Calandra R, eds. Proc. Internat. Conf. Artificial Intelligence Statist., vol. 108 (PMLR), 1071–1081.Google Scholar
- (2012) The best of both worlds: Stochastic and adversarial bandits. Mannor S, Srebro N, Williamson RC, eds. Proc. 25th Annual Conf. on Learn. Theory, vol. 23 (JMLR), 42.1–42.23.Google Scholar
- (2019) Dynamic pricing with finitely many unknown valuations. Garivier A, Kale S, eds. Proc. Algorithmic Learn. Theory, vol. 98 (PMLR), 247–273.Google Scholar
- (2022) Robust dynamic pricing with demand learning in the presence of outlier customers. Oper. Res., ePub ahead of print April 14, https://doi.org/10.1287/opre.2022.2280.Link, Google Scholar
- (2019) Robust dynamic assortment optimization in the presence of outlier customers. Preprint, submitted October 9, https://arxiv.org/abs/1910.04183.Google Scholar
- (2021) A statistical learning approach to personalization in revenue management. Management Sci. 68(3):1923–1937.Google Scholar
- (2021) Hedging the drift: Learning to optimize under non-stationarity. Management Sci. 68(3):1696–1713.Google Scholar
- (2020) Feature-based dynamic pricing. Management Sci. 66(11):4921–4943.Link, Google Scholar
- (2021) The entropy of lies: Playing twenty questions with a liar. Lee JR, ed. Proc. 12th Innovations in Theoretical Comput. Sci. Conf., vol. 185 (Schloss Dagstuhl - Leibniz-Zentrum für Informatik, Germany), 1:1–1:16.Google Scholar
- (2017) Horizon-independent optimal pricing in repeated auctions with truthful and strategic buyers. Barrett R, Cummings R, Agichtein E, Gabrilovich E, eds. Proc. 26th Internat. Conf. on World Wide Web, 33–42.Google Scholar
- (2016) Online pricing with strategic and patient buyers. Lee DD, Sugiyama M, von Luxburg U, Guyon I, Garnett R, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 29, 3864–3872.Google Scholar
- (2013) A linear response bandit problem. Stochastic Systems 3(1):230–261.Link, Google Scholar
- (2019a) Incentive-aware contextual pricing with non-parametric market noise. Preprint, submitted, November 8 https://arxiv.org/abs/1911.03508.Google Scholar
- (2019b) Dynamic incentive-aware learning: Robust pricing in contextual auctions. Adv. Neural Inform. Processing Systems. 69(1):297–314.Google Scholar
- (2021) Learning product rankings robust to fake users. Biró P, Chawla S, Echenique F, eds. Proc. 22nd ACM Conf. on Econom. and Comput. (ACM, New York), 560–561.Google Scholar
- (2019) Better algorithms for stochastic bandits with adversarial corruptions. Beygelzimer A, Hsu D, eds. Proc. Conf. on Learn. Theory, vol. 99 (PMLR), 1562–1578.Google Scholar
- (2014) Stochastic multi-armed-banditproblem with non-stationary rewards. Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ, eds. Proc. 27th Annual Conf. Neural Inform. Processing Systems, 199–207.Google Scholar
- (1998) Tracking the best expert. Machine Learn. 32(2):151–178.Crossref, Google Scholar
- (2019) Dynamic pricing in high-dimensions. J. Machine Learn. Res. 20:9:1–9:49.Google Scholar
- (2021) Incentive-compatible learning of reserve prices for repeated auctions. Oper. Res. 69(2):509–524.Link, Google Scholar
- (2007) Noisy binary search and its applications. Bansal N, Pruhs K, Stein C, eds. Proc. 18th Annual ACM-SIAM Sympos. on Discrete Algorithms (SIAM, Philadelphia), 881–890.Google Scholar
- (2014) Dynamic pricing with an unknown demand model: Asymptotically optimal semi-myopic policies. Oper. Res. 62(5):1142–1167.Link, Google Scholar
- (2017) Chasing demand: Learning and earning in a changing environment. Math. Oper. Res. 42(2):277–307.Link, Google Scholar
- (2003) The value of knowing a demand curve: Bounds on regret for online posted-price auctions. Proc. 44th Annual IEEE Sympos. on Foundations of Comput. Sci. (IEEE, New York), 594–605.Google Scholar
- (2021) Contextual search in the presence of irrational agents. Khuller S, Williams VV, eds. Proc. 53rd Annual ACM SIGACT Sympos. Theory Comput. (ACM, New York), 910–918.Google Scholar
- (2022) Contextual search via intrinsic volumes. SIAM J. Comput. 51(4):1096–1125.Google Scholar
- (2019) Stochastic linear optimization with adversarial corruption. Preprint, submitted, https://arxiv.org/abs/1909.02109.Google Scholar
- (2021) Optimal contextual pricing and extensions. Marx D, ed. Proc. ACM-SIAM Sympos. on Discrete Algorithms (SIAM, Philadelphia), 1059–1078.Google Scholar
- (2018) Learning optimal reserve price against non-myopic bidders. Bengio S, Wallach HM, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 31, 2042–2052.Google Scholar
- (2018) Multidimensional binary search for contextual decision-making. Oper. Res. 66(5):1346–1361.Google Scholar
- (2018) Stochastic bandits robust to adversarial corruptions. Diakonikolas I, Kempe D, Henzinger M, eds. Proc. 50th Annual ACM SIGACT Sympos. on Theory of Comput. (ACM, New York), 114–122.Google Scholar
- (2021) Corruption-robust exploration in episodic reinforcement learning. Belkin M, Kpotufe S, eds. Proc. Conf. Learn. Theory, vol. 134 (PMLR), 3242–3245.Google Scholar
- (2018) Contextual pricing for lipschitz buyers. Bengio S, Wallach HM, Larochelle H, Grauman K, and Cesa-Bianchi N, Garnett R, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 31, 5648–5656.Google Scholar
- (2014) Optimal regret minimization in posted-price auctions with strategic buyers. Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 27, 1871–1879.Google Scholar
- (2015) Revenue optimization against strategic buyers. Cortes C, Lawrence ND, Lee DD, Sugiyama M, Garnett R, eds. Proc. Annual Conf. Adv. Neural Inform. Processing Systems, vol. 28, 2530–2538.Google Scholar
- (2019) Dynamic learning and pricing with model misspecification. Management Sci. 65(11):4980–5000.Link, Google Scholar
- (2008) Generalized binary search. Proc. 46th Annual Allerton Conf. on Communication, Control, and Comput. (IEEE, New York), 568–574.Google Scholar
- (2009) Noisy generalized binary search. Bengio Y, Schuurmans D, Lafferty JD, Williams CKI, Culotta A, eds. Proc. 23rd Annual Conf. Adv. Neural Inform. Processing Systems, vol. 22 (Curran Associates, Red Hook, NY), 1366–1374.Google Scholar
- (1987) Coding with bounded error fraction. Ars Combin. 24:17–22.Google Scholar
- (2002) Searching games with errors-fifty years of coping with liars. Theoretical Comput. Sci. 270(1-2):71–109.Crossref, Google Scholar
- (2016) Dynamic pricing with demand covariates. Preprint, submitted April 25, https://arxiv.org/abs/1604.07463.Google Scholar
- (2007) Approximating the centroid is hard. Proc. 23rd Annual Sympos. on Comput. Geometry. 302–305.Google Scholar
- (2020) Low-regret algorithms for strategic buyers with unknown valuations in repeated posted-price auctions. Joint European Conference on Machine Learning and Knowledge Discovery in Databases (Springer, Berlin), 416–436.Google Scholar
- (1980) Coping with errors in binary search procedures. J. Comput. System Sci. 20(3):396–404.Crossref, Google Scholar
- (2021) Online posted pricing with unknown time-discounted valuations. Proc. Conf. AAAI Artificial Intelligence 6:5682–5689.Crossref, Google Scholar
- (1958) The perceptron: A probabilistic model for information storage and organization in the brain. Psych. Rev. 65(6):386–408.Crossref, Google Scholar
- (2019) Semi-parametric dynamic contextual pricing. Wallach HM, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox EB, Garnett R, eds. Proc. 32nd Adv. Neural Inform. Processing Systems, 2360–2370.Google Scholar
- (2008) Adapting to a changing environment: The Brownian restless bandits. Rocco A. Servedio RA, Zhang T, eds. 21st COLT Conf. (Omni Press) 343–354.Google Scholar
- (1992) Ulam’s searching game with a fixed number of lies. Theoretical Comput. Sci. 95(2):307–321.Crossref, Google Scholar
- (1992) Three thresholds for a liar. Combinatorial Probability Comput. 1:81–93.Crossref, Google Scholar
- (1976) Adventures of a Mathematician (Charles Scribner’s Sons, New York).Crossref, Google Scholar
- (2021) Non-stationary reinforcement learning without prior knowledge: An optimal black-box approach. Belkin M, Kpotufe S, eds. Proc. Conf. Learn. Theory, vol. 134, (PMLR), 4300–4354.Google Scholar
- (2020) Bisection-based pricing for repeated contextual auctions against strategic buyer. Hal Daumé III H, Aarti S, eds. Proc. Internat. Conf. Machine Learn., vol. 119 (PMLR), 11469–11480.Google Scholar
- (2021) Tsallis-inf: An optimal algorithm for stochastic and adversarial bandits. J. Machine Learn. Res. 22:28:1–28:49.Google Scholar

