Wasserstein Robust Classification with Fairness Constraints
Published Online:30 Apr 2024https://doi.org/10.1287/msom.2022.0230
References
- (2018) A reductions approach to fair classification. Internat. Conf. Machine Learn. (PMLR, New York), 60–69.Google Scholar
- (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide (Springer, Berlin).Google Scholar
- (2022) Machine bias. Ethics of Data and Analytics (Auerbach Publications, Boca Raton, FL), 254–264.Google Scholar
- (2023) Detecting customer trends for optimal promotion targeting. Manufacturing Service Oper. Management 25(2):448–467.Link, Google Scholar
- (2021) Learning Theory from First Principles. Draft of a book, version of, September 6, 2021, https://www.di.ens.fr/~fbach/ltfp_book.pdf.Google Scholar
- (2018) AI fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias. Preprint, submitted October 3, https://arxiv.org/abs/1810.01943.Google Scholar
- (2021) Fairness in criminal justice risk assessments: The state of the art. Sociol. Methods Res. 50(1):3–44.Crossref, Google Scholar
- (2018) A data-driven approach to multi-stage linear optimization. Optimization Online (November 3), https://optimization-online.org/2018/11/6907/.Google Scholar
- (2022) Two-stage sample robust optimization. Oper. Res. 70(1):624–640.Google Scholar
- (2020) Fairlearn: A toolkit for assessing and improving fairness in AI. Technical report, Microsoft, Redmond, WA.Google Scholar
- (2019) Quantifying distributional model risk via optimal transport. Math. Oper. Res. 44(2):565–600.Link, Google Scholar
- (2019) Robust Wasserstein profile inference and applications to machine learning. J. Appl. Probabilities 56(3):830–857.Crossref, Google Scholar
- (2001) Business data mining—A machine learning perspective. Inform. Management 39(3):211–225.Crossref, Google Scholar
- (2004) Convex Optimization (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2009) Evaluating the predictive validity of the compas risk and needs assessment system. Criminal Justice Behav. 36(1):21–40.Crossref, Google Scholar
- (2009) Building classifiers with independency constraints. Proc. IEEE Internat. Conf. Data Mining Workshops (IEEE, Piscataway, NJ), 13–18.Google Scholar
- (2006) Applying data mining to predict college admissions yield: A case study. New Directions Institutional Res. 131:53–68.Crossref, Google Scholar
- (2008) Tighter bounds for structured estimation. Koller D, Schuurmans D, Bengio Y, Bottou L, eds. Adv. Neural Inform. Processing Systems, vol. 21 (Curran Associates, Inc., Red Hook, NY).Google Scholar
- (2022) A statistical learning approach to personalization in revenue management. Management Sci. 68(3):1923–1937.Google Scholar
- (2020) A snapshot of the frontiers of fairness in machine learning. Comm. ACM 63(5):82–89.Crossref, Google Scholar
- Consumer Financial Protection Bureau (2013) CFPB and DOJ order ally to pay $80 million to consumers harmed by discriminatory auto loan pricing. Accessed October 21, 2020, https://www.consumerfinance.gov/about-us/newsroom/cfpb-and-doj-order-ally-to-pay-80-million-to-consumers-harmed-by-discriminatory-auto-loan-pricing.Google Scholar
- (2017) Algorithmic decision making and the cost of fairness. Proc. 23rd ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM, New York), 797–806.Google Scholar
- (2022) Amazon scraps secret AI recruiting tool that showed bias against women. Ethics of Data and Analytics (Auerbach Publications, Boca Raton, FL), 296–299.Google Scholar
- (2015) Automated experiments on ad privacy settings: A tale of opacity, choice, and discrimination. Proc. Privacy Enhancing Tech. 2015(1):92–112.Crossref, Google Scholar
- (2018) Empirical risk minimization under fairness constraints. Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R, eds. Adv. Neural Inform. Processing Systems, vol. 31 (Curran Associates, Inc., Red Hook, NY). 2791–2801.Google Scholar
- (2017) UCI machine learning repository. Accessed October 15, 2020, http://archive.ics.uci.edu/ml.Google Scholar
- (1988) On the complexity of computing the volume of a polyhedron. SIAM J. Comput. 17(5):967–974.Crossref, Google Scholar
- (2023) Distributionally robust stochastic optimization with Wasserstein distance. Math. Oper. Res. 48(2):603–655.Google Scholar
- (1984) A class of Wasserstein metrics for probability distributions. Michigan Math. J. 31(2):231–240.Crossref, Google Scholar
- (2014) Real-time optimization of personalized assortments. Management Sci. 60(6):1532–1551.Link, Google Scholar
- Gurobi Optimization, LLC (2023) Gurobi Optimizer Reference Manual (Gurobi Optimizer, Beaverton, OR).Google Scholar
- (2016) Equality of opportunity in supervised learning. Lee D, Sugiyama M, Luxburg U, Guyon I, Garnett R, eds. Adv. Neural Inform. Processing Systems, vol. 29 (Curran Associates, Inc., Red Hook, NY), 3315–3323.Google Scholar
- (2018) Fairness without demographics in repeated loss minimization. Proc. 35th Internat. Conf. Machine Learn., 1929–1938.Google Scholar
- (2009) The Elements of Statistical Learning (Springer, Berlin).Crossref, Google Scholar
- (2023) Adversarial classification via distributional robustness with Wasserstein ambiguity. Math. Programming 198(2):1411–1447.Google Scholar
- (2003) Bank lending policy, credit scoring and value-at-risk. J. Bank. Finance 27(4):615–633.Crossref, Google Scholar
- (1987) Representability in mixed integer programming, I: Characterization results. Discrete Appl. Math. 17(3):223–243.Crossref, Google Scholar
- (2013) Predicting student performance by using data mining methods for classification. Cybernetics Inform. Tech. 13(1):61–72.Crossref, Google Scholar
- (2019) Wasserstein distributionally robust optimization: Theory and applications in machine learning. INFORMS TutORials in Operations Research (INFORMS, Catonsville, MD), 130–166.Link, Google Scholar
- (1978) Integer programming solution of a classification problem. Management Sci. 24(14):1515–1525.Link, Google Scholar
- (2013) Big data, trying to build better workers. The New York Times (April 21).Google Scholar
- (2014) Appointment scheduling with limited distributional information. Management Sci. 61(2):316–334.Link, Google Scholar
- (2021) A survey on bias and fairness in machine learning. ACM Comput. Surveys (CSUR) 54(6):1–35.Google Scholar
- (2020) Data analytics in operations management: A review. Manufacturing Service Oper. Management 22(1):158–169.Link, Google Scholar
- (2018) Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Programming 171(1–2):115–166.Crossref, Google Scholar
- (2016) Risk assessment in criminal sentencing. Annu. Rev. Clinical Psych. 12:489–513.Crossref, Google Scholar
- MOSEK ApS (2024) Mosek optimizer API for python. Version 10. Accessed April 11, 2024, https://docs.mosek.com/10.1/pythonapi.pdf.Google Scholar
- (2022) Distributionally robust inverse covariance estimation: The Wasserstein shrinkage estimator. Oper. Res. 70(1):490–515.Google Scholar
- (2020) Distributionally robust local non-parametric conditional estimation. Larochelle H, Ranzato M, Hadsell R, Balcan MF, Lin H, eds. Adv. Neural Inform. Processing Systems, vol. 33 (Curran Associates, Inc., Red Hook, NY), 15232–15242.Google Scholar
- (2016) Predicting the future—Big data, machine learning, and clinical medicine. New England J. Medicine 375(13):1216.Crossref, Google Scholar
- (2017) Recycling privileged learning and distribution matching for fairness. Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, eds. Adv. Neural Inform. Processing Systems, vol. 30 (Curran Associates, Inc., Red Hook, NY), 677–688.Google Scholar
- (2020) Fairness for robust log loss classification. Proc. AAAI Conf. Artificial Intelligence.Google Scholar
- (1964) Principles of Mathematical Analysis, vol. 3 (McGraw-Hill, New York).Google Scholar
- (2022) Overbooked and overlooked: Machine learning and racial bias in medical appointment scheduling. Manufacturing Service Oper. Management 24(6):2825–2842.Link, Google Scholar
- (2019) Regularization via mass transportation. J. Machine Learn. Res. 20(103):1–68.Google Scholar
- (2014) Understanding Machine Learning: From Theory to Algorithms (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (1988) Using an expert system with inductive learning to evaluate business loans. Financial Management 17(3):45–56.Crossref, Google Scholar
- (2002) Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nature Medicine 8(1):68–74.Crossref, Google Scholar
- (2021) A statistical test for probabilistic fairness. Proc. ACM Conf. Fairness Accountability Transparency (ACM, New York), 648–665.Google Scholar
- (2020) A distributionally robust approach to fair classification. Preprint, submitted July 18, https://arxiv.org/abs/2007.09530.Google Scholar
- (2009) A new learning paradigm: Learning using privileged information. Neural Networks 22(5–6):544–557.Crossref, Google Scholar
- (2020) Robust optimization for fairness with noisy protected groups. Larochelle H, Ranzato M, Hadsell R, Balcan MF, Lin H, eds. Adv. Neural Inform. Processing Systems 33 (Curran Associates, Inc., Red Hook, NY), 5190–5203.Google Scholar
- (2020) Tractable reformulations of two-stage distributionally robust linear programs over the type ∞ Wasserstein ball. Oper. Res. Lett. 48(4):513–523.Crossref, Google Scholar
- (2020) Unbiased subdata selection for fair classification: A unified framework and scalable algorithms. Preprint, submitted December 24, https://arxiv.org/abs/2012.12356.Google Scholar
- (2020) Training individually fair ML models with sensitive subspace robustness. Proc. 8th Internat. Conf. Learn. Representations (Curran Associates, Inc., Red Hook, NY).Google Scholar
- (2017) Fairness beyond disparate treatment & disparate impact: Learning classification without disparate mistreatment. Proc. 26th Internat. Conf. World Wide Web (ACM, New York), 1171–1180.Google Scholar

