Distributionally Robust Losses for Latent Covariate Mixtures

John Duchi
John Duchi
[email protected]
Departments of Electrical Engineering and Statistics, Stanford University, Stanford, California 94305;
Search for more papers by this author
,
Tatsunori Hashimoto
Tatsunori Hashimoto
[email protected]
Department of Computer Science, Stanford University, Stanford, California 94305;
Search for more papers by this author
,
Hongseok Namkoong
Corresponding Author
Hongseok Namkoong
[email protected]
https://orcid.org/0000-0002-5708-4044
Decision, Risk, and Operations Division, Columbia Business School, New York, New York 10027
Search for more papers by this author

John Duchi

[email protected]

Departments of Electrical Engineering and Statistics, Stanford University, Stanford, California 94305;

Search for more papers by this author

Tatsunori Hashimoto

[email protected]

Department of Computer Science, Stanford University, Stanford, California 94305;

Search for more papers by this author

Hongseok Namkoong

Corresponding Author

Hongseok Namkoong

[email protected]

https://orcid.org/0000-0002-5708-4044

Decision, Risk, and Operations Division, Columbia Business School, New York, New York 10027

Search for more papers by this author

Published Online:2 Sep 2022https://doi.org/10.1287/opre.2022.2363

Abstract

While modern large-scale data sets often consist of heterogeneous subpopulations—for example, multiple demographic groups or multiple text corpora—the standard practice of minimizing average loss fails to guarantee uniformly low losses across all subpopulations. We propose a convex procedure that controls the worst case performance over all subpopulations of a given size. Our procedure comes with finite-sample (nonparametric) convergence guarantees on the worst-off subpopulation. Empirically, we observe on lexical similarity, wine quality, and recidivism prediction tasks that our worst case procedure learns models that do well against unseen subpopulations.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/opre.2022.2363.

Volume 71, Issue 2

March-April 2023

Pages iii-vi, 397-790, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:July 03, 2020
Accepted:July 11, 2022
Published Online:September 02, 2022

Cite as

John Duchi, Tatsunori Hashimoto, Hongseok Namkoong (2022) Distributionally Robust Losses for Latent Covariate Mixtures. Operations Research 71(2):649-664.

https://doi.org/10.1287/opre.2022.2363

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Distributionally Robust Losses for Latent Covariate Mixtures

Abstract

Volume 71, Issue 2

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News