Improving Upon the Generalized cμ Rule: A Whittle Approach

Published Online:https://doi.org/10.1287/opre.2025.1987

Scheduling a stream of jobs whose holding cost changes over time is a classic and practical problem. Specifically, each job is associated with a holding cost (penalty), and a job’s instantaneous holding cost is some nondecreasing function of its class and current age (the time it has spent in the system since its arrival). The goal is to schedule the jobs to minimize the time-average total holding cost across all jobs. The seminal paper on this problem, by Van Mieghem in 1995, introduced the generalized c-mu rule for scheduling jobs. Since then, this problem has attracted significant interest but remains challenging because of the absence of a finite-dimensional state space formulation. Consequently, subsequent works focus on more tractable versions of this problem. This paper returns to the original problem for a k-class M/M/1 system. We derive a heuristic that empirically improves upon the generalized c-mu rule and all existing heuristics. Our key idea is to first translate the holding cost minimization problem to a novel restless multiarmed bandit (R-MAB) problem with a finite number of arms, in which each arm’s state corresponds to the age of the oldest job in one class. Based on our R-MAB, we next derive a novel Whittle index policy, which is both elegant and intuitive.

Funding: This work was supported by the National Science Foundation Division of Computing and Communication Foundations [Grants CIF-2403194, CMMI-2307008, III-2322973].

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2025.1987.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.