Open Access

Applied Probability Society Student Paper Competition: Abstracts of 2020 Finalists

Published Online:6 Oct 2021https://doi.org/10.1287/stsy.2021.0069

Abstract

The journal is pleased to publish the abstracts of the winner and finalists of the 2020 Applied Probability Society’s student paper competition.

The 2020 student paper prize committee was chaired by Amy Ward. The 2020 committee members are (in alphabetical order by last name): Alessandro Arlotto, Sayan Banerjee, Junfei Huang, Jefferson Huang, Rouba Ibrahim, Peter Jacko, Henry Lam, Nan Liu, Yunan Liu, Siva Theja Maguluri, Giang Nguyen, Mariana Olvera-Cravioto, Lerzan Örmeci, Erhun Özkan, Jamol Pender, Weina Wang, Amy Ward (chair), Linwei Xin, Kuang Xu, Galit Yom-Tov, Assaf Zeevi, Jiheng Zhang, Zeyu Zheng, Yuan Zhong, Enlu Zhou, and Serhan Ziya.

The 2020 prize winners are as follows:

First Prize
Asymptotic Optimality of the Binomial-Exhaustive Policy for Polling Systems with Large Switchover Times
Yue Hu, Columbia University

Finalists (in alphabetical order according to student’s last name):
Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes
Yichun Hu and Xiaojie Mao, Cornell University

Sparsity-Agnostic Lasso Bandit
Min-hwan Oh, Columbia University
Correlated Randomly Growing Graphs
Anirudh Sridhar, Princeton University

Asymptotic Optimality of the Binomial-Exhaustive Policy for Polling Systems with Large Switchover Times

Yue Hu

Columbia University, [email protected]

Advisors: Jing Dong, Columbia University;

Ohad Perry, Northwestern University

We study an optimal-control problem of polling systems with large switchover times when a holding cost is incurred on the queues. In particular, we consider a stochastic network with a single server that switches between several buffers (queues) according to a prespecified order, assuming that the switchover times between the queues are large relative to the processing times of individual jobs. Due to its complexity, computing an optimal control for such a system is prohibitive, so we instead search for an asymptotically optimal control. To this end, we first solve an optimal control problem for a deterministic relaxation (namely, for a fluid model), that is represented as a hybrid dynamical system. We then translate the solution to that fluid problem to a binomial-exhaustive policy for the underlying stochastic system, and prove that this policy is asymptotically optimal in a large-switchover-time scaling regime, provided a certain uniform integrability (UI) condition holds. Finally, we demonstrate that the aforementioned UI condition holds in the following cases: (i) the holding cost has (at most) linear growth, and all service times have finite second moments; (ii) the holding cost grows at most at a polynomial rate (of any degree), and the service-time distributions possess finite moment generating functions. Our proofs that the UI condition holds in these two cases may be of independent interest.

Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes

Yichun Hu

Cornell University, [email protected]

Xiaojie Mao

Cornell University, [email protected]

Advisor: Nathan Kallus, Cornell University

We study a nonparametric contextual bandit problem where the expected reward functions belong to a Hölder class with smoothness parameter $β$ . We show how this interpolates between two extremes that were previously studied in isolation: nondifferentiable bandits ( $β \leq 1$ ), where rate-optimal regret is achieved by running separate noncontextual bandits in different context regions, and parametric-response bandits (satisfying $β = \infty$ ), where rate-optimal regret can be achieved with minimal or no exploration due to infinite extrapolatability. We develop a novel algorithm that carefully adjusts to all smoothness settings and we prove its regret is rate-optimal by establishing matching upper and lower bounds, recovering the existing results at the two extremes. In this sense, our work bridges the gap between the existing literature on parametric and nondifferentiable contextual bandit problems and between bandit algorithms that exclusively use global or local information, shedding light on the crucial interplay of complexity and regret in contextual bandits.

Sparsity-Agnostic Lasso Bandit

Min-hwan Oh

Columbia University, [email protected]

Advisors: Garud Iyengar, Columbia University;

Assaf Zeevi, Columbia University

We consider a stochastic contextual bandit problem where the dimension $d$ of the feature vectors is potentially large, however, only a sparse subset of features of cardinality $s_{0} ≪ d$ affect the reward function. Essentially all existing algorithms for sparse bandits require a priori knowledge of the value of the sparsity index $s_{0}$ . This knowledge is almost never available in practice, and misspecification of this parameter can lead to severe deterioration in the performance of existing methods. The main contribution of this paper is to propose an algorithm that does not require prior knowledge of the sparsity parameter $s_{0}$ and establish tight regret bounds under mild conditions. We comprehensively evaluate our proposed algorithm numerically and show that it consistently outperforms existing methods, even when the correct sparsity parameter is revealed to these methods but is kept hidden from our algorithm.

Correlated Randomly Growing Graphs

Anirudh Sridhar

Princeton University, [email protected]

Advisor: Miklós K. Rácz, Princeton University

We introduce a new model of correlated randomly growing graphs and study the fundamental questions of detecting correlation and estimating aspects of the correlated structure. The model is simple and starts with any model of randomly growing graphs, such as uniform attachment (UA) or preferential attachment (PA). Given such a model, a pair of graphs $(G 1, G 2)$ is grown in two stages: until time $t_{⋆}$ they are grown together (i.e., $G_{1} = G_{2}$ ), after which they grow independently according to the underlying growth model.

We show that whenever the seed graph has an influence in the underlying graph growth model—this has been shown for PA and UA trees and is conjectured to hold broadly—then correlation can be detected in this model, even if the graphs are grown together for just a single time step. We also give a general sufficient condition (which holds for PA and UA trees) under which detection is possible with probability going to 1 as $t_{⋆} \to 1$ . Finally, we show for PA and UA trees that the amount of correlation, measured by $t_{⋆}$ , can be estimated with vanishing relative error as $t_{⋆} \to \infty$ .

Volume 11, Issue 3

September 2021

Pages 193-305

Article Information

Metrics

Information

Published Online:October 06, 2021

Cite as

(2021) Applied Probability Society Student Paper Competition: Abstracts of 2020 Finalists. Stochastic Systems 11(3):304-305.

https://doi.org/10.1287/stsy.2021.0069

PDF download

Available Issues

Available Issues

Available Issues

Applied Probability Society Student Paper Competition: Abstracts of 2020 Finalists

Abstract

Asymptotic Optimality of the Binomial-Exhaustive Policy for Polling Systems with Large Switchover Times

Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes

Sparsity-Agnostic Lasso Bandit

Correlated Randomly Growing Graphs

Volume 11, Issue 3

Article Information

Metrics

Information

Cite as

Sign Up for INFORMS Publications Updates and News