Algorithmic Challenges in Ensuring Fairness at the Time of Decision

Published Online:https://doi.org/10.1287/opre.2022.0304

Algorithmic decision making in societal contexts, such as retail pricing, loan administration, recommendations on online platforms, etc., can be framed as stochastic optimization under bandit feedback, which typically requires experimentation with different decisions for the sake of learning. Such experimentation often results in perceptions of unfairness among people impacted by these decisions; for instance, there have been several recent lawsuits accusing companies that deploy algorithmic pricing practices of price-gouging. Motivated by the changing legal landscape surrounding algorithmic decision making, we introduce the well-studied fairness notion of envy-freeness within the context of stochastic convex optimization. Our notion requires that upon receiving decisions in the present time, groups do not envy the decisions received by any of the other groups, both in the present as well as the past. This results in a novel trajectory-constrained stochastic optimization problem that renders existing techniques inapplicable. The main technical contribution of this work is to show problem settings where there is no gap in achievable regret (up to logarithmic factors) when envy-freeness is imposed. In particular, in our main result, we develop a near-optimal envy-free algorithm that achieves O˜(T) regret for smooth convex functions that satisfy the Polyak-Łojasiewicz (PL) inequality. This algorithm has a coordinate descent structure, in which we carefully leverage gradient information to ensure monotonic sampling along each dimension, while avoiding overshooting the constrained optimum with high probability. The latter aspect critically uses smoothness and the structure of the envy-freeness constraints, whereas the PL inequality allows for sufficient progress toward the optimal solution. We discuss several open questions that arise from this analysis, which may be of independent interest.

Funding: S. Gupta’s research was partially supported by the National Science Foundation CAREER [Grant 2239824], received by the Massachusetts Institute of Technology.

Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2022.0304.

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.