Technical Note—Nonstationary Stochastic Optimization Under Lp,q-Variation Measures
Published Online:23 Sep 2019https://doi.org/10.1287/opre.2019.1843
References
- (2010) Optimal algorithms for online convex optimization with multi-point bandit feedback. Kalai AT, Mohri M, eds. Proc. 23rd Conf. Learn. Theory (Omnipress, Madison, WI), 28–40.Google Scholar
- (2013) Stochastic convex optimization with bandit feedback. SIAM J. Optim. 23(1):213–240.Crossref, Google Scholar
- (2015) Non-stationary stochastic optimization. Oper. Res. 63(5):1227–1244.Link, Google Scholar
- (2004) Convex Optimization (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2017) Kernel-based methods for bandit convex optimization. Hatami H, McKenzie P, eds. Proc. 49th Annual ACM SIGACT Sympos. Theory Comput. (STOC 2017) (ACM, New York), 72–85.Crossref, Google Scholar
- (2008) Minimax bounds for active learning. IEEE Trans. Inform. Theory 54(5):2339–2353.Crossref, Google Scholar
- (2006) Elements of Information Theory, 2nd ed. (John Wiley & Sons, Hoboken, NJ).Google Scholar
- (2015) Strongly adaptive online learning. Bach F, Blei D, eds. J. Machine Learn. Res. 37: 1405–1411.Google Scholar
- (2015) Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys Oper. Res. Management Sci. 20(1):1–18.Crossref, Google Scholar
- (2015) Dynamic pricing and learning with finite inventories. Oper. Res. 63(4):965–978.Link, Google Scholar
- (2005) Online convex optimization in the bandit setting: Gradient descent without a gradient. Buchsbaum A, ed. Proc. 16th Annual ACM-SIAM Sympos. Discrete Algorithms (SODA ‘05) (SIAM, Philadelphia), 385–394.Google Scholar
- (2014) Sequential optimization in changing environments: theory and application to online content recommendation services. PhD thesis, Columbia University, New York.Google Scholar
- (2015) Online convex optimization in dynamic environments. IEEE J. Selected Topics Signal Processing 9(4):647–662.Crossref, Google Scholar
- (2016) Introduction to online convex optimization. Foundations Trends Optim. 2(3–4):157–325.Crossref, Google Scholar
- (2014) Beyond the regret minimization barrier: Optimal algorithms for stochastic strongly-convex optimization. J. Machine Learn. Res. 15(July):2489–2512.Google Scholar
- (2014) Bandit convex optimization: Towards tight bounds. Gharamani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ, eds. Proc. Adv. Neural Inform. Processing Systems 27 (NIPS 2014) (Curran Associates, Red Hook, NY), 784–792.Google Scholar
- (2007) Logarithmic regret algorithms for online convex optimization. Machine Learn. 69(2):169–192.Crossref, Google Scholar
- (1981) Statistical Estimation: Asymptotic Theory (Springer-Verlag, New York).Crossref, Google Scholar
- (2015) Online optimization: Competing with dynamic comparators. Lebanon G, Vishwanathan SVN, eds. Proc. 18th Internat. Conf. Artificial Intelligence Statist. (AISTATS), San Diego, CA, 398–406.Google Scholar
- (2012) Query complexity of derivative-free optimization. Pereira F, Burges CJC, Bottou L, Weinberger KQ, eds. Proc. Adv. Neural Inform. Processing Systems 25 (NIPS 2012) (Curran Associates, Red Hook, NY), 2672–2680.Google Scholar
- (2016) Multi-armed bandits: Competing with optimal sequences. Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R, eds. Proc. Adv. Neural Inform. Processing Systems 29 (NIPS 2016) (Curran Associates, Red Hook, NY), 199–207.Google Scholar
- (2017) Chasing demand: Learning and earning in a changing environment. Math. Oper. Res. 42(2):277–307.Link, Google Scholar
- (2017) A survey of algorithms and analysis for adaptive online learning. J. Machine Learn. Res. 18(90):1–50.Google Scholar
- (2016) Online optimization in dynamic environments: Improved regret rates for strongly convex problems. Proc. IEEE Conf. Decision Control (CDC) (IEEE, Piscataway, NJ), 7196–7201.Crossref, Google Scholar
- (1994) Interior-Point Polynomial Algorithms in Convex Programming (SIAM, Philadelphia).Crossref, Google Scholar
- (2017) An optimal algorithm for bandit and zero-order convex optimization with two-point feedback. J. Machine Learn. Res. 18(1):1703–1713.Google Scholar
- (2009) Introduction to Nonparametric Estimation, Springer Series in Statistics (Springer, New York).Crossref, Google Scholar
- (1997) Assouad, Fano, and Le Cam. Pollard D, Torgersen E, Yang GL, eds. Festschrift for Lucien Le Cam (Springer, New York), 423–435.Crossref, Google Scholar
- (1983) Problem Complexity and Method Efficiency in Optimization (John Wiley & Sons, New York).Google Scholar
- (2018) Dynamic regret of strongly adaptive models. Dy J, Krause A, eds. Proc. 35th Internat. Conf. Machine Learn. (ICML), Stockholm, Sweden, 5877–5886.Google Scholar
- (2003) Online convex programming and generalized infinitesimal gradient ascent. Fawcett T, Mishra N, eds. Proc. 20th Internat. Conf. Machine Learn. (ICML-2003) (AAAI Press, Menlo Park, CA), 928–935.Google Scholar

