Relatively Robust Multicriteria Decisions

Published Online:https://doi.org/10.1287/mnsc.2025.00510

Abstract

For a general multicriteria decision problem with linear scalarization and unknown weights, we propose relatively robust decisions, which are Pareto-efficient and at the same time maximize a performance index. The latter measures the worst-case ratio, attained by the weighted objective relative to its maximum value, with respect to all possible weights. The main results include a simple boundary representation of the performance index as the minimum of criterion-specific performance ratios, and a computationally simple method of determining a relatively robust decision up to any prespecified performance tolerance by maximizing an ε-augmented performance index. The proposed method relies merely on the continuity of all criterion functions and the compactness of the set of feasible decisions which may be nonconvex. This imposes no restrictions at all for any finite action set. A notable feature of our method is that it endogenously yields the tradeoffs between all criteria, including a performance guarantee relative to decisions justified by any other weighting. A number of structural results, examples, and applications are provided, as well as generalizations to allow for limited weight ambiguity, criterion ambiguity, and generalized aggregation of criteria based on an axiomatic foundation.

This paper was accepted by Peng Sun, optimization and decision analytics.

1. Introduction

In real-world decision making, evaluating alternatives often involves multiple, sometimes conflicting criteria. Whether considering investment portfolios under Environmental, Social, and Governance (ESG) parameters, selecting products based on bundles of attributes, or valuing companies for both profitability and sustainability, decision makers must navigate tradeoffs between competing objectives. Similarly, lifecycle environmental impact assessments—such as comparing vehicles with electric, combustion, or hybrid engines—require reconciling diverse metrics like emissions, cost, and resource consumption. These complex but often inevitable comparisons highlight the critical need for robust multicriteria optimization frameworks.

Multicriteria optimization involves identifying solutions that balance conflicting objectives in a manner consistent with the decision maker’s priorities. Traditional approaches often rely on scalarization techniques, where multiple criteria are combined into a single weighted objective function using weighted sums. However, these methods depend heavily on the precise specification of weights, which are rarely known a priori and can be difficult to justify. This uncertainty complicates the search for decisions that are both Pareto-efficient and robust to variations in tradeoff preferences. To address these challenges, we specialize the concept of relatively robust decisions by Weber (2023) so as to refer to decisions that achieve Pareto-efficiency among all relevant criteria while also maximizing a performance index designed to account for the ambiguity in weights. Specifically, the performance index measures the worst-case (WC) ratio attained by the weighted objective relative to its maximum value over all possible weight configurations. By focusing on worst-case performance, this framework provides guarantees of robustness and tradeoff transparency, which are particularly valuable in high-stakes or uncertain decision contexts.

1.1. Practical Examples

The approach developed here can be used for virtually all multicriteria decision problems, such as the following three example applications.1

  • ESG Investing: Investors face the challenge of balancing financial returns with social and environmental impact. For example, a fund manager might evaluate portfolios based on criteria such as profitability, carbon footprint, and diversity inclusion. Weighting these criteria is inherently subjective, and the optimal portfolio might vary widely depending on the chosen weights. The relatively robust optimization framework proposed here enables the identification of investment strategies that remain robust across different weight configurations, offering a performance guarantee regardless of the specific preferences.2

  • Lifecycle Assessment of Vehicles: Consider evaluating the environmental impact of electric, combustion, and hybrid cars. Criteria might include greenhouse gas emissions, energy efficiency, and resource use (see, e.g., Hawkins et al. 2012). A relatively robust decision could pinpoint vehicle types or designs that perform well across a broad range of plausible weightings, resulting in a balanced and defensible choice.

  • Product Evaluation and Design: Companies frequently design products to optimize attributes such as cost, durability, and aesthetic appeal. For example, in designing a smartphone, decision makers must weigh the importance of battery life, screen quality, and price.3 Relatively robust multicriteria optimization helps determine design specifications that ensure competitive performance across various market segments with diverse preferences, valuing the availability of different attributes with different weights.

For the application of our method, one only needs that there exists a default action, that is, a baseline alternative that performs adequately across all criteria (which can always be achieved by reindexing evaluation scales), together with the technical assumption that criteria are continuous in actions, and that the finite-dimensional action set is closed and bounded (i.e., compact).

1.2. Literature

1.2.1. Origins of Multicriteria Decision Making.

The idea of multicriteria optimization in Economics can be traced back to the distribution of resources among different individuals, leading to a set of undominated solutions such as the “contract curve” proposed by Edgeworth (1881, p. 21) for a simple exchange economy, and more generally a set of efficient outcomes as implied by Pareto (1894; 1897, sections 721–723),4 which cannot be improved upon for one agent without making another agent worse off. The latter avoids a direct interpersonal comparison of individuals’ utilities (or “ophelimities” in Pareto’s terminology); see also Harsanyi (1955) as well as Keeney and Raiffa (1993, chapter 10) who explore group utility functions. The drawback of such an agnostic approach to optimality with multiple criteria is that the set of Pareto-optimal allocations, because of its typically large cardinality, offers only imperfect guidance about which solution should actually be implemented. For example, in the two-agent exchange economy, the contract curve usually includes allocations that attribute all resources to any single individual, which almost completely undermines the notion of multicriteria optimization.

In Operations Research, “goal programming” refers to the notion of minimizing the weighted deviation from criterion-specific targets (Charnes and Cooper 1961). The technique was first employed in the context of executive compensation based on different “factors” (i.e., criteria) using linear programming techniques (Charnes et al. 1955). This basic approach is usually applied to a “utopian” (or “ideal”) target point, which corresponds to the (generically infeasible) vector of individually maximized criteria. Depending on the distance measure (e.g., a weighted Chebyshev distance; see, e.g., Steuer 1986, chapter 14),5 the corresponding solutions trade off among criteria according to the specified weights, and they are naturally Pareto-efficient.6 Instead of minimizing the distance to the ideal point (in the outcome space), it is also possible to maximize the distance to a (generically infeasible) “nadir” point, which contains the minimum value of each individual criterion on the Pareto-efficient set.7 In this approach, known as “compromise programming” (Zeleny 1974), the appropriate choice of the weights for the different criteria remains the critical point, and in our view, very little satisfying progress has been made in the assignment of weights without imposing subjectivity, which arguably amounts to picking a solution from the Pareto-efficient set. For instance, based on an exogenous ranking of the criterion importance, it is possible to apply an ordered weighted average (Yager 1988) which in turn can be related to compromise programming (Zarghami and Szidarovszky 2010, Wang and Fu 2020).8 Besides being subjective from the start by requiring the imposition of a preference order on criteria, this method does not provide any nontrivial performance guarantee relative to other choices of weights and/or importance rankings that might be plausible for other decision makers.9

1.2.2. Relative Robustness.

By contrast, we approach the selection of weights from the standpoint of relative robustness (Weber 2023), involving only comparisons between feasible points, thus avoiding fictitious targets such as the aforementioned utopian and nadir points. Rather than minimizing a fixed distance metric with specific weights, the proposed method evaluates performance in terms of the worst-case tradeoffs among criteria, ensuring a solution that remains defensible regardless of the exact weighting chosen ultimately. The underlying robustness measure, equivalent to relative regret, has been used in computer science to evaluate the relative performance of algorithms (Sleator and Tarjan 1985, Ben-David and Borodin 1994), for the scenario-based evaluation of operational decisions (Kouvelis and Yu 1997), in price discrimination (Han and Weber 2023), robust optimization (Weber 2024), and fair resource allocation (Goel et al. 2009). The idea of absolute regret (AR) is due to Savage (1951), based on the maximin robustness approach by Wald (1945) in his general treatment of sequential decision problems. The main issue with absolute regret is that it is inherently sensitive to the scale of the reference point. This sensitivity may make it impossible to derive reasonable performance guarantees, such as ensuring positive profits in a monopoly pricing problem with unknown demand (Weber 2025), because a zero-profit reference point is intrinsically small-scale. A relative robustness approach, we argue, yields more acceptable results, particularly in the context of multicriteria optimization, where changing the units of any given criterion would usually affect solutions that are based on absolute performance measures.

1.2.3. Connection to Distributionally Robust Optimization.

Because one can reinterpret a (normalized) weight vector as a probability distribution, our approach is naturally related to distributionally robust optimization (DRO), where the true probability distribution governing uncertain parameters is unknown but assumed to lie within a known ambiguity set. Important early contributions include Delage and Ye (2010), who studied DRO with moment-based sets, and Ben-Tal et al. (2013), who developed tractable reformulations for DRO with Wasserstein ambiguity sets. More recently, Blanchet and Murthy (2019) and Mohajerin Esfahani and Kuhn (2018) developed general formulations based on Wasserstein balls, offering strong out-of-sample guarantees. Whereas DRO typically focuses on expectations or risk measures over stochastic uncertainty, our approach generalizes worst-case robustness to a multicriteria setting without requiring a probabilistic model. This positions our proposed framework as a deterministic analogue to DRO, where ambiguity arises from unknown relative preferences and state-dependent criteria rather than unknown probability distributions.

1.2.4. Relative Evaluations.

The proposed approach to robust multicriteria decisions is entirely relative, in the sense that the question underlying all of our developments is “How well am I doing relative to how well I could be doing?” Indeed, the idea that the size of an object can be judged only relative to other objects goes back at least to the Taoist writings of Zhuang Zhou in the fourth century BC.10 In Economics, Cournot (1838) was among the first to note that there is no absolute value (“Il n’y a pas de valeurs absolues,” p. 22) and that inference from a social system can be likened to the observation of astronomical objects and their relative positions to each other (pp. 15–16), concluding by analogy that the concept of value is fundamentally relative (“Il y en a en ce sense que des valeurs relatives,” p. 18). In fact, human perception is inherently relative, as demonstrated by Weber (1846) and his student Fechner (1860) in extensive experiments which showed that across different senses (e.g., hearing, touch, and vision) the minimum perceptible difference is proportional to the current level of the stimulus, giving rise to the Weber-Fechner law of psychophysics. Similarly, in an economic context, it is often relative reference points such as one’s current wealth level (Kahneman and Tversky 1979) or the outcomes experienced by others (Loewenstein et al. 1989) that tend to determine human behavior. For example, when pondering whether to purchase a product from a cheaper store within walking distance, humans base decisions less on absolute gains than on the prospective relative savings in expenditure (Kahneman and Tversky 1984).

Beyond the aforementioned congruence with human perception, there are other practical arguments for relative evaluations. First, there is the insensitivity to scale, common to all relative measures such as internal rate of return (IRR), demand elasticity, profit margin, or relative regret, which allows for a direct comparison across different sizes. For example, with IRR one can readily benchmark projects of different financial magnitudes against outside investment options (of comparable risk), whereas the equivalent absolute indicator of net present value (NPV) remains silent about the required absolute investment (see, e.g., Weber 2014).11 Second, normalization facilitates fairness and equity. When stakeholders have differing capacities or baselines, a relative comparison may help to ensure fairness (Goel et al. 2009). Third, relative evaluation criteria allow for comparability across contexts. For example, demand elasticity (as introduced by Marshall 1890, p. 162) is a unit-free relative measure that allows one to compare the changes of demand relative to price changes across widely differing goods, irrespective of the underlying unity of measurement (e.g., units of cars versus units of electric power). This last point is especially salient for multicriteria decision making, as the units (and inherent scale) for different criteria generally differ, so that a robustness criterion (and robust decision) should remain unaffected if the values of a given criterion are all multiplied by 10, for example. The relatively robust framework developed here uses a relative worst-case performance perspective. This methodology not only identifies solutions with reliable tradeoff characteristics, but also provides a transparent representation of the tradeoffs themselves. These tradeoffs are reflected in a robust weight vector consistent with a robust solution.

1.3. Outline

The remainder of this paper is organized as follows: Section 2 discusses the multicriteria optimization problem and associated comparative statics, together with the performance index for a robust evaluation of different decisions. Section 3 introduces pseudo-robustness and Pareto-efficiency, which together characterize relatively robust decisions. Here we also provide a computational approach for determining a relatively robust decision up to any given performance tolerance, and we allow for the possibility of close-to-arbitrary restrictions in the set of admissible weights, for example, based on a priori knowledge about physical constraints or a given priority ranking of criteria. In addition, there are extensions to criterion ambiguity and general aggregation of criteria, followed by a practical guide for how to apply the method. Section 4 focuses on discrete applications where our framework relies on virtually no assumptions, so the approach can be entirely data-driven. Section 5 concludes.

2. Basic Framework

Let XRm be a nonempty, compact action set, for a given integer m1. Consider a decision maker who faces the multicriteria optimization problem of having to select an action (or decision, or point) xX so as to “simultaneously maximize” the continuous functions fi:XR+, for iN={1,,n}, each of which is referred to as a criterion, where n1 is a given integer.

Remark 1.

(i) The continuity of each criterion fi ensures that small perturbations in the action x yield correspondingly small changes in the objective value. Notably, this assumption is trivially satisfied at any isolated point of X.12 In particular, this means that there is no imposed regularity requirement when the action set is finite. (ii) The requirement that each fi be nonnegative is without loss of generality: any real-valued (continuous) criterion f^i can be translated as fi=f^if^i0, where f^i=minxXf^i(x) denotes the minimum value of f^i.13

2.1. Decision Problem

To evaluate goal achievement for any decision xX, the decision maker considers a scalarization of his multicriteria optimization problem by means of a weighted objective,

F(x|λ)=i=1nλifi(x),xX,(1)
where λ=(λ1,,λn)Δ={wR+n:w1=1} is a (normalized) vector of weights (with λi0 for all iN, and λ1++λn=1). Maximizing the weighted objective yields the set of ex post optimal decisions,
X(λ)=argmaxxXF(x|λ),λΔ,(2)
which by the extreme-value theorem (Rudin 1976, theorem 4.16, p. 89) is nonempty. Moreover, by the maximum theorem (Berge 1963, p. 116) the (set-valued) mapping X:ΔX is compact-valued and upper semicontinuous.

Remark 2.

The weighted objective is homogeneous of degree one, that is, for all α>0 and λΔ, it is F(·|αλ)=αF(·|λ), whereas the set of ex post optimal decisions is homogeneous of degree zero, in the sense that X(αλ)=X(λ), for all α>0 and λΔ. It is therefore possible to extend the definition of F(x|·) in Equation (1) and the definition of X(·) in Equation (2) to a domain containing any nonzero weight vector wR+n\{0}, because a unique normalized weight λ=αwΔ, with α=1/(i=1nwi)>0, is always available:14

F(x|w)(1/α)F(x|λ),xX,(1′)
and
X(w)X(λ)=argmaxxX(1/α)F(x|λ).(2′)

Remark 3.

Limited ambiguity, that is, allowing for weights in a (nonempty, compact) subset of Δ, is discussed in Section 3.7. Criterion ambiguity is treated in Section 3.8, and Section 3.9 investigates the use of general multicriteria objectives. All three generalizations can be treated, after suitable adjustment, within the basic framework.

To keep matters nontrivial, we assume that there exists a (feasible) default decision (xd) such that the decision maker’s weighted objective is positive, that is,

xdX:F(xd|λ)>0,λΔ.(N)

The nontriviality condition (N) ensures that the ex post optimal objective (or value function) is positive:

F*(λ)=maxxXF(x|λ)=F(x^|λ)F(xd|λ)>0,x^X(λ),λΔ.

The sign-definiteness of the ex post optimal objective is critical for its role as a reference, against which the goal achievement of any feasible decision can be compared.

Remark 4.

The nontriviality condition (N) can be satisfied without affecting X(·), that is, without changing any set of ex post optimal decisions. It is sufficient to consider the translated criterion f^i=fi+si instead of fi, for all iN, using a suitable shift s=(s1,,sn)R++n, in which case F^(x|λ)=F(x|λ)+c0(λ)>0, because c0(λ)=λ·s is strictly positive (as it is bounded from below by min{s1,,sn}>0).

2.2. Comparative Statics

What happens to the criteria at the optimum when shifting weight from one criterion to another? At the optimum, one would naturally expect that a criterion which receives relatively more weight than before cannot decrease, whereas a criterion that receives relatively less weight cannot increase. The following result formalizes this intuition for a weight shift from one criterion to another.

Proposition 1.

Let δij=eiej, where ei and ej denote the i-th and j-th Euclidean unit vectors, respectively. Consider λ,λ^Δ such that λ^=λ+εδij for some ε>0 and some i,jN with ij. Then for any (x,x^)X(λ)×X(λ^) it is fi(x)fi(x^) and fj(x^)fj(x).

A transfer from the j-th criterion weight λj to the i-th criterion weight λi augments the optimal value of criterion i and lowers the optimal value of criterion j (at least weakly). Criteria other than i and j, whose weights remain constant but whose values are affected when decisions change, may go either way as a result of the weight shift. Similarly, the value function F*(λ) could go up or down, depending primarily on the difference between fi and fj at the optimum.

Remark 5.

The conclusion of Proposition 1 can be applied multiple times. In particular, it can also be used for shifts in nonnormalized weights w=(wj,wj)R+n\{0}; see Remark 2. Thus, increasing wj is equivalent to (at most) n1 successive weight transfers in the normalized weight from the components of λj to λj, resulting in an increase of the j-th criterion at the optimum.

Remark 6.

For small shifts of weight from criterion j to criterion i, the value of the ex post optimal objective F*(λ)=F(x*(λ)|λ) goes up (resp., down) when the corresponding score difference, fi(x*(λ))fj(x*(λ)), is positive (resp., negative), at the selection x*(λ)X(λ).

The following result states that for a simple nonnormalized increase of the i-th weight, the value function goes up, as long as the i-th criterion is always positive at an optimum.

Lemma 1.

Let w,w^Rn\{0} be nonnormalized weights, such that w^=w+δei, for a given δ>0 and a given iN. Provided that fi(x)>0, for any xX(w), it is F*(w^)>F*(w).15

2.3. Performance Index

The decision maker may a priori have no knowledge about which weight λΔ should be used to compute the weighted objective F(·|λ) in Equation (1).16 To deal with this ambiguity, the decision maker evaluates any feasible decision xX with respect to any given weight λΔ by the performance ratio,17

φ(x|λ)=F(x|λ)F*(λ)[0,1],(3)
which is continuous on X×Δ and naturally bounded from above by one. Its minimum with respect to all possible weights (cf. Endnote 13),
ρ(x)=minλΔφ(x|λ)[0,1],(4)
is called the performance index (evaluated at x). The performance index measures the performance of the action x relative to all possible weighted-sum scalarizations of the decision maker’s multicriteria decision problem. It turns out that this performance criterion depends only on the maximized individual criteria,
fi*=maxxXfi(x),iN.(5)

By the nontriviality condition (N), it is fi*=F*(ei)F(xd|ei)>0, for all iN. Thus, all maximized individual criteria are strictly positive. The next result provides an important representation of the performance index, in terms of relative goal achievement of a decision, relative to the various maximized individual criteria.

Proposition 2.

The performance index is equal to

ρ(x)=miniNϕi(x),xX,(6)
where ϕi(x)=fi(x)/fi*[0,1], for all (x,i)X×N.

The representation in Equation (6) expresses the performance index as the minimum of the criterion-specific performance ratios ϕi. Hence, to compute ρ(x) as the minimum of φ(x|λ) over all weights λΔ, it is sufficient to restrict attention to the Euclidean unit vectors eiΔ, as ϕi(·)=F(·|ei)/F*(ei), for all iN. This simplification arises because φ(x|·) is quasiconcave for fixed x, implying that its minimum over Δ is attained at a vertex. This highlights a “perfect complementarity” among the criterion-specific performance ratios in the determination of the performance index.

Remark 7.

The idea of perfect complementarity, discussed by Cournot (1838) and Edgeworth (1897), describes elements contributing to a common goal in fixed proportions. This is equivalent to evaluating the criterion (i.e., the performance index) using the Leontief production function,18 that is, taking the minimum among the inputs ϕi, resulting in ρ(x)=miniNϕi(x), for all xX, as in Proposition 2.

Example 1.

Let XR+m be a compact action set with XR++m, where m2 is an integer. Consider a decision problem with n=2 criteria (containing an egalitarian and a utilitarian evaluation),

f1(x)=min{x1,,xm}andf2(x)=(1/m)(x1++xm),
for all x=(x1,,xm), with maximized values fi*=maxfi(X)>0, for iN. The corresponding weighted objective becomes
F(x|)=(1)f1(x)+f2(x),[0,1],
where, for simplicity, the parameter replaces the normalized weight λ=(1,). The optimal value, F*()=maxF(X|), can be obtained by solving a two-stage maximization problem,
F*()=maxtT{(1)t+μ(t)},[0,1].

Here T=[t1,t2] is a compact interval, with 0<t1t2<, and the best average coordinate (subject to all coordinates being at least of size t),

μ(t)=(1/m)maxxX{x1++xm:tx1,,txm},tT,
is a (weakly) decreasing function on T, independent of . Because t<μ(t), for all tT, it is clear that F*() must be increasing in . The interval T is adapted to the problem in the sense that at t1 no constraint is imposed on the computation of the mean μ(t), and at t2 the maximum of the smallest coordinate of any point in X is reached. Accordingly, one can verify that t2=F*(0)=f1*<f2*=F*(1)=μ(t1), where X()=argmaxxXF(x|) and 0<t1=minξX(1){ξ1,,ξm}t2, where ξ=(ξ1,,ξm); see Figure 1. By means of Proposition 2 the performance index can then be written in the form
ρ(x)=min{f1(x)f1*,f2(x)f2*}=min{f1(x)t2,f2(x)μ(t1)},xX.

Figure 1. Nonconvex Action Set XR++2 in Example 1

Here the criterion-specific performance ratios are ϕ1=f1/t2 and ϕ2=f2/μ(t1).

3. Robust Multicriteria Decision Making

3.1. Pseudo-Robustness

We refer to a decision x^X which maximizes the performance index ρ(·) as pseudo-robust. The corresponding (compact, nonempty) set of pseudo-robust decisions,19

Ψ=argmaxxXρ(x),(7)
is not necessarily a singleton, as illustrated by the following example.

Example 2.

Optimizing the performance index determined in Example 1, using the same two-stage maximization procedure, yields the optimal performance index,

ρ*=maxxXρ(x)=maxtTmin{tt2,μ(t)μ(t1)}.

Here, the first term in the minimand increases continuously in t, whereas the second is (weakly) decreasing and may be discontinuous; see Figure 2. As a result, at the optimal value t^T=[t1,t2] the two terms are about to cross, resulting in a “balancedness condition,”

t^=sup{tT:tt2<μ(t)μ(t1)},(8)

Figure 2. Qualitative Behavior of Performance Ratio μ(t)/μ(t1) as a Function of t/t2 in Example 2, for the Nonconvex Action Set in Figure 1
Note.f1*=t2>t1 and f2*=μ(t1)>μ(t2).

so that ρ*=t^/t2=t^/f1*, as shown in Figure 2. Because for any tT it is μ(t)t, one obtains μ(t^)μ(t2)t2=f1*. This implies that the optimal performance index,

ρ*=t^f1*[t1f1*,μ(t^)f2*],
is always nontrivial (i.e., strictly positive). It attains its maximum possible value (of one) if and only if μ(·) is constant on T, that is, when μ(t1)=μ(t2). The set of pseudo-robust actions, Ψ={xX:min{x1,,xm}=t^/ρ*}, may generally contain multiple elements.

3.2. Efficiency

When Ψ is not a singleton, the decision maker may have good reason to prefer one pseudo-robust decision over another, based on “efficiency” (or a lack thereof). Specifically, given two feasible actions x,xX, we say that decision x is more efficient than decision x with respect to the vector of criteria f=(f1,,fn), if and only if x strictly improves on at least one criterion while weakly improving on all other criteria (over their values achieved at x). The corresponding preference relation on X is defined by20

xx(iN:fi(x)>fi(x),f(x)f(x)),(9)
for all x,xX. The resulting set of efficient (or Pareto-optimal) decisions is
P={xX:(f(x)f(x)f(x)=f(x)),xX}.(10)

It is easy to see that a pseudo-robust decision x^Ψ is not necessarily efficient, in the sense that it may be possible to find a different decision which strictly improves on at least one criterion-specific performance ratio while maintaining the optimal performance index achieved by x^.

Example 3.

Following up on our analysis in Example 1 and Example 2, consider the (nonconvex, compact) action set X=[0,1]m\(1/2,1]m, for some integer m2. Because μ(t)=11/(2m), for all tT=[t1,t2], where t1t2=f1*=1/2 and f2*=μ(t1), the optimality condition t^/t2=μ(t^)/μ(t1)=1 yields t^=1/2. Hence, the optimal performance index attains its maximum possible value, ρ*=1. This makes sense, because it is feasible (in X) to attain simultaneously the highest possible minimum coordinate and the highest possible average coordinate, regardless of the weights assigned to the two objectives. Note also that the set of pseudo-robust actions,

Ψ={xX:min{x1,,xm}=t^/ρ*}={x[1/2,1]m:iN s.t. xi=1/2},
has a continuum of elements. For instance, compared with the pseudo-robust action (1,,1)/2, all other actions in Ψ are more efficient, particularly those in the set of Pareto-optimal decisions, P={(1,,1)(ei/2):iN}, which is a subset of Ψ. For m=2, we obtain that P={(0.5,1),(1,0.5)}. Figure 3 depicts the situation, including the set of Pareto-optimal decisions, for m=3.

Figure 3. Nonconvex Action Set in Example 3, for m=3
Note. The set of Pareto-optimal decisions is P={(1,1,0.5),(1,0.5,1),(0.5,1,1)}.
Remark 8.

It is well known that for any weight vector λ with strictly positive components (ensuring all criteria are considered) an ex post optimal decision must also be efficient (see, e.g., Ehrgott 2010, proposition 3.9, p. 71). That is, for any λint(Δ), the corresponding optimal decision set satisfies X(λ)P.

3.3. Robust Decision Set

Requiring efficiency in addition to pseudo-robustness is important, because, by avoiding unnecessary shortfalls, it can only improve the weighted objective in Equation (1)—at least weakly. A decision x^*X is called robust if it is pseudo-robust and efficient. Thus, our goal becomes to examine the properties of the robust decision set,

R=ΨP,(11)
which contains all available robust decisions in X, and to then determine a direct method for the computation of a robust decision (or an arbitrarily close approximation thereof).

Example 4.

Following up on our analysis in Example 1 and Example 2, consider the (convex, compact) action set X={xRm:x21}, which is equal to the unit ball in the standard Euclidean distance, where the dimension of the underlying space is given by some integer m2. By direct computation, μ(t)=1/m for all tT=[t1,t2], and t1=f1*=1/m=f2*=μ(t2)=t2. As in Example 3, the balancedness condition (8) yields t^=t2, and thus an optimal performance index of ρ*=1. The set of pseudo-robust actions,

Ψ={xX:min{x1,,xm}=t^/ρ*}={(1/m,,1/m)},
is a singleton. On the other hand, the set of Pareto-optimal actions,
P={xR+m:x2=1},
contains a continuum of elements, including the only pseudo-robust decision, (1,,1)/m.

In Example 3, we examined a problem where PΨ (so R=P), whereas in Example 4, for the same multicriteria decision problem with a different action set, it was ΨP (so R=Ψ). Neither of these two extremes might apply, in which case R{Ψ,P}, as illustrated next.

Example 5.

Mixing and matching features from Example 3 and Example 4, let us consider the (nonconvex, compact) action set X={xR2:x21}\(1/2,1]2 in the Euclidean plane. The corresponding set of Pareto-optimal actions is given by the intersection of the unit circle with both the action set X and the positive quadrant R+2, so P={xX+2:x2=1}. Meanwhile, the set of pseudo-robust actions is Ψ={xX:min{x1,x2}=1/2}. Thus, by Equation (11) it is R=ΨP={(1/2,3/2),(3/2,1/2)}, which in this setting means R{Ψ,P}.

The existence of robust actions is ensured by the next result, together with the fact that the robust decision set R must be closed and bounded (i.e., compact).

Lemma 2.

The robust decision set R is nonempty and compact.

The proof of Lemma 2 starts by noting that the set P of efficient actions is compact because it must be bounded (by the boundedness of the encompassing action set X) and closed (by the continuity of the criteria). One can then construct a sequence of pseudo-robust actions in the (nonempty) compact set Ψ which might be inefficient (or else R holds true immediately). But each inefficient pseudo-robust action suggests the existence of a more efficient action, which incidentally must also be pseudo-robust. Given that Ψ is compact, the Bolzano-Weierstrass theorem (Berge 1963, p. 67) then guarantees the existence of a converging subsequence of pseudo-robust actions, with a limit that must be a feasible decision which is both pseudo-robust and efficient, so R. Compactness of the robust decision set then follows, as it has to be both closed and bounded.

Lemma 3.

The optimal performance index ρ* is such that maxρ(P)=ρ*=maxρ(X) and ρ(R)=ρ(Ψ)={ρ*}.

The preceding result (re)states the fact that the decision maker can restrict attention to efficient actions when maximizing the performance index, meaning that there always exists an efficient action which attains the optimal performance index ρ*; this action is by definition robust. Conversely, any robust action necessarily achieves a performance index of ρ*, which is quite straightforward in light of both Lemma 2 and the definition of the robust decision set in Equation (11).

3.4. Robust Decisions: Computation

How can one determine a (relatively) robust decision? By Lemma 3 we can limit attention to maximizing the performance index over all efficient decisions in our search for robust decisions. Thus, to be guaranteed an efficient decision which is also approximately pseudo-robust, by virtue of Proposition 2 we introduce the ε-augmented performance index,

Φε(x)=(1ε)miniNϕi(x)+(ε/n)iNϕi(x),(x,ε)X×[0,1],(12)
so that Φ0=ρ=miniNϕi, and Φ1=(1/n)iNϕi. Because Φε(·) is a continuous function, its maximizer, referred to as the set of ε-robust actions,
ε=argmaxxXΦε(x),ε[0,1],(13)
is nonempty and compact, again by virtue of the extreme-value theorem and the maximum theorem. It is useful, for our further analysis, to shift the perspective from the available decisions in X to their respective consequences (or “outcomes”) in terms of their robustness performance, given the prevailing weight ambiguity.

Remark 9.

The (nonempty, compact) outcome space,

Y={y[0,1]n:yi=ϕi(x),(x,i)X×N},(14)
contains the vectors y of criterion-specific performance ratios, achieved by the available decisions x, so Y=ϕ(X), where ϕ=(ϕ1,,ϕn). As several feasible decisions might result in the same score vector, the mapping ϕ:XY may not be one-to-one. Consider now the ε-augmented performance index in the outcome space,
Φ^ε(y)=(1ε)miniNyi+(ε/n)iNyi,(y,ε)Y×R+.

Using ideas from Example 1, maximization of this weighted objective yields

Φ^ε*=maxyYΦε(y)=maxtT{(1ε)t+εμ^(t)},ε[0,1],
where T=[t1,t2] is a suitable compact interval, and
μ^(t)=max{(1/n)iNyi:tyi,(y,i)Y×N},tT,
denotes the average (criterion-specific) performance ratio. The interval boundaries t1,t2 of T, with 0<t1t21, are given by
t1=minyYminiNyiandt2=maxyYminiNyi.

In addition, one can easily verify that

Φ^0*=t2=ρ*=maxρ(X)andΦ^1*=μ^(t1)=(1/n)maxyYiNyit2.

Proposition 1 implies that any selection tεTε=argmaxtT{(1ε)t+εμ^(t)} must be decreasing in ε, and similarly, μ^(tε) must be increasing in ε, at least weakly. The latter also follows directly from the monotonicity of tε, because μ^(t) must be nonincreasing in t.

Lemma 4.

Let ε(0,1]. (i) 0=Ψ. (ii) εP. (iii) If xε\Ψ and x^Ψ, then there exists ε(0,ε) such that Φε^(x)<Φε^(x^), for all ε^[0,ε].

Part (i) of the preceding result notes that without ε-augmentation (i.e., for ε=0) the weighted objective Φε specializes to the performance index (via Proposition 2), the maximization of which produces the set of pseudo-robust decisions, as in Equation (7). Part (ii) stipulates that whenever the ε-augmentation is nontrivial (i.e., for ε>0), maximization yields efficient actions. Finally, part (iii) means that if a decision x maximizes the ε-augmented performance index in Equation (12), for some nontrivial ε(0,1], without being pseudo-robust, then any pseudo-robust decision x^Ψ would strictly improve upon x in terms of any ε^-augmented performance index, as long as ε^ (smaller than ε) lies in a sufficiently small right-neighborhood of the origin.

We are now able to establish a cornerstone property for the practice of robust multicriteria optimization, in the sense that a robust decision (i.e., an element of R) can be obtained as a lower limit of the set of ε-robust actions, for ε0+.21 In Section 3.5, we then show that ε>0 can be chosen so as to guarantee an approximation of the optimal performance ρ* up to any desired precision.

Proposition 3.

Let Q=Lim¯ε0+ε. Then QR and Q.

The following example shows that it is possible that QR. In other words, some points in the robust decision set might not be reached using the proposed approximation procedure.

Example 6.

Consider a (nonconvex, compact) action set as shown in Figure 4(a), specified by

X={x[0,1]×R+:x22+1x1}{(2,32)}.

Figure 4. Action Sets in Examples 6 and 7
Notes. (a) Action set in Example 6 with Lim¯ε0+ε={(2,32)}R={(1,2),(2,32)}. (b) Action set in Example 7 with Lim¯ε0+ε={(1,2)}=R.

There are n=2 criteria that simply measure the coordinate achievement, so fi(x)=xi, for all (x,i)X×N. The set of efficient actions is

P={x[0,1]×R+:x2=2+1x1}{(2,32)},
whereas the set of pseudo-robust actions is given by Ψ={(1,x2):1x22}{(2,32)}. By its definition in Equation (11) the set of robust actions can be obtained as the intersection of the preceding sets:
R=ΨP={(1,2),(2,32)}.

At this point, let us consider the ε-augmented performance index in Equation (12). By Lemma 4 we have 0=Ψ, and for ε(0,1] the maximizer of Φε is efficient, so

Φε*=maxxPΦε(x)=max{maxx1[0,1]{(1ε)x12+ε2(x12+2+1x13)},Φε(2,32)},
where ϕi=fi/fi*, for iN, with (f1*,f2*)=(2,3), and Φε(2,32)=(2+ε)/4. By direct computation one finds that the maximizer of the (nontrivially) ε-augmented performance index,22
ε={(2,32)},ε(0,1],
is a singleton. Proposition 3 then guarantees that the lower limit of this maximizer is robust:
Q=Lim¯ε0+ε={(2,32)}{(1,2),(2,32)}=R.

The fact that (2,32) is an isolated point (cf. Endnote 12) is not important, because one could easily connect it to X by adding points to the action set that are always suboptimal.23

3.5. Approximation Error of ε-Robust Decisions

Maximizing the ε-augmented performance index Φε in Equation (12) enables us to approximate a robust decision to an arbitrary prespecified precision, as a function of ε(0,1]. The quality of any approximate decision xεε in Equation (11) is gauged by its approximation error,

ψε=ρ*ρε[0,1],ε[0,1],(15)
that is, the difference between the optimal performance index, ρ*=maxρ(X), and the achieved performance index, ρε=ρ(xε). This quantity measures the loss in robustness from using the ε-approximation rather than the true robust decision. For the remainder of this discussion, we keep the selection xεε fixed, for all ε[0,1]. If we denote by με=μ(xε)=(1/n)iNϕi(xε) the average performance ratio achieved by xε, the optimal ε-augmented performance index becomes
Φε*=maxΦε(X)=(1ε)ρε+εμε,ε[0,1].(16)

The following result guarantees continuity, as well as first- and second-order monotonicity of the optimal ε-augmented performance index.

Lemma 5.

Φε* is continuous, increasing, and convex in ε[0,1].

The proof of the first-order monotonicity uses the fact that μερε (for all ε[0,1]), together with natural properties of an optimal solution to Equation (13) in order to establish that Φε* in Equation (16) must increase in ε (at least weakly). That the optimal ε-augmented performance index also exhibits a second-order monotonicity means that tightening the approximation parameter further and further leads to progressively slower decreases of Φε* toward Φ0*=ρ* (as ε0+). The convexity of the optimal ε-augmented performance index also implies its smoothness (almost everywhere), as pointed out next.

Remark 10.

By the Rademacher theorem (Villani 2008, theorem 10.8, p. 222), the convexity of Φε* in ε[0,1], which implies Lipschitz continuity, guarantees that the optimal ε-augmented performance index is differentiable almost everywhere (a.e.) on [0, 1]. The Alexandrov theorem (Villani 2008, theorem 14.25, p. 402) goes even further by establishing its second-order differentiability a.e., in the sense of having a Taylor expansion with a smaller-than-quadratic local error at almost every point ε[0,1]. By the envelope theorem (see, e.g., Mas-Colell et al. 1995, theorem M.L.1, pp. 965–966), at points of differentiability one therefore obtains:

dΦε*dε=μερε0,˙ε[0,1].

In other words, the gradient of the optimal ε-augmented performance index is (a.e.) equal to the difference between the average performance ratio (με) and the minimum performance ratio (ρε) attained by the approximately robust decision xε.

The maximized weighted objective in Equation (16) is a convex combination of ρε and με; the latter both exhibit “natural” comparative statics as implied by the reweighting result in Proposition 1.

Lemma 6.

(i) ρε is decreasing in ε[0,1]. (ii) ρ*ρε, for all ε[0,1]. (iii) limε0+ρε=ρ*. (iv) με is increasing in ε[0,1]. (v) μερ*, for all ε[0,1].

Parts (i) and (iv) of Lemma 6 assert that decreasing the augmentation parameter ε can only decrease the average criterion-specific performance ratio με and at the same time increase the performance index ρε, where both are achieved at a given ε-robust selection xε. Meanwhile, by parts (ii) and (v) the optimal performance ratio ρ*=maxρ(X) is bracketed by these two values, in the sense that ρερ*με, for all ε[0,1]. Finally, parts (i) and (iii) establish the monotone convergence of ρε to ρ*, as ε0+.

The following result provides a performance guarantee for any ε-approximation of our robust multicriteria decision problem, alluded to at the outset of our discussion.

Proposition 4.

Fix any δ>0. If εδ/μ1, then ψεδ.

A special case of the preceding result is that for any δ>0 the approximation error ψδ/μ1 cannot exceed δ. In other words, any ε-robust decision xε, for ε=δ/μ1, attains a performance index ρε[ρ*δ,ρ*], where ρ* is the optimal performance index. The following example applies this result in an already familiar context.

Example 7.

Somewhat similar to Example 6, we consider the (nonconvex, compact) action set

X={x[0,1]×R+:x22+1x1}{(3,12)},
together with the same n=2 criteria fi(x)=xi, for all (x,i)X×N. Consequently, the set of efficient actions is P={x[0,1]×R+:x2=2+1x1}{(3,12)}, and the set of pseudo-robust actions is Ψ={(1,x2):1x22}. Thus, by Equation (11) the set of robust actions amounts to R=ΨP={(1,2)}, which is a singleton. By Lemma 4 the ε-augmented performance index Φε in Equation (12) is efficient (i.e., εP), for all ε(0,1], and 0=Ψ. In particular, the optimal value of the ε-augmented performance index is
Φε*=maxxPΦε(x)=max{maxx1[0,1]{(1ε)x13+ε2(x13+2+1x13)},Φε(3,12)},
where ϕi=fi/fi*, for iN, with (f1*,f2*)=(3,3), and Φε(3,12)=(2+5ε)/12. For sufficiently small ε, the maximizer,
ε={(1(ε/22ε)2,2+ε/22ε)},0<ε<47(212)0.7388,
is single-valued, achieving Φε*=(1/24)(163ε2)/(2ε).24 By Proposition 3 the lower limit of this maximizer for ε0+ is robust, and in our case:
Q=Lim¯ε0+ε={(1,2)}=R;
see Figure 4(b). Note also that 1={(3,12)}, which in turn yields the average criterion-specific performance ratio achieved by the decision xεε, for ε=1:
μ1=12(x1f1*+x2f2*)|(x1,x2)=(3,12)=7120.5833.

Thus, to guarantee that the approximation error ψε in Equation (15) cannot exceed δ=5%, it is by Proposition 4 enough to find an action xε that maximizes the ε-augmented performance index Φε in Equation (12) for some ε(0,δ/μ1](0,8.6%]. Incidentally, for ε=8%, we find ρε33.32%, so that the realized approximation error of ψε0.01% (with respect to ρ*=1/3) is actually much smaller than the prespecified 5% approximation-error bound.

It is also possible to derive a priori performance estimates without knowledge of the optimal performance index (ρ*) just by solving the ε-approximation problem in Equation (13) for some admissible ε. Indeed, Lemma 6 (ii) and Lemma 5 together imply that

ρερ*Φε*,ε[0,1].(17)

For any ε[0,1], let us now consider the midpoint estimator,

ρ^ε=ρε+Φε*2=ρε+εμερε2=ρ(xε)+εμ(xε)ρ(xε)2,ε[0,1].(18)

The latter means that ρ^ε effectively approximates ρ*, for ε0+.

Lemma 7.

|ρ^ερ*|ε(μερε)/2, for all ε[0,1].

Because μερε[0,1], a somewhat simpler (though generally less precise) approximation inequality than the one given in the preceding Lemma 7 is |ρ^ερ*|ε/2, for all ε[0,1].

Remark 11.

For a robust decision x^R let y^=(f1(x^),,fn(x^)); in addition, let y*=(f1*,,fn*) be the utopian point in the outcome space. Because by construction y^iρ*yi*, for all iN, it is

ρ*ρ¯=sup{r[0,1]:ry*Y},
where sup=0. In the balanced case, where fi(x^)=ρ*fi*, for all iN, the lower bound becomes tight (i.e., ρ*=ρ¯). However, in general ρ¯ may not be very reliable (e.g., generically worse than the performance index ρd=ρ(xd), attained by the default decision xd).

3.6. Robust Weights

3.6.1. General Case.

An interesting and useful byproduct of a robust decision x^R is its associated robust (normalized) weight,

λ^=(h^f1(x^),,h^fn(x^))Δ,(19)
where the positive constant
h^=(iN1fi(x^))1(20)
denotes the harmonic mean of the criterion scores at the robust decision. By construction one obtains that the contribution to the robustly weighted objective F(·|λ^), evaluated at the robust decision x^, is equal for all criteria, in the sense that
maxxXminiNλ^ifi(x)=h^=λ^ifi(x^),iN.(21)

When the action set is convex, then the chosen robust action naturally also maximizes the robustly weighted criterion, that is, x^X(λ^). One can think of the robust weight as an endogenous belief that can be used to evaluate the expected criterion achievement. It defines the tradeoffs compatible with the robust decision, where the latter was found while remaining agnostic over all possible weights (in Δ).

3.6.2. Balanced Case.

In the case where fi(x^)=ρ*fi*, for all iN (cf. Remark 11), we have

λ^=(h*f1*,,h*fn*),(19′)
where
h*=(iN1fi*)1(20′)
is the harmonic mean of the maximum criterion scores. The balancedness condition then becomes
ρ*=fi(x^)fi*=h^h*,iN.

In the balanced case, the robust weight can therefore be determined based on the maximized individual criteria alone. If all maximized individual criteria are equal, the robust weight becomes uniform. Indeed, fi*=c>0, for all iN, implies that h*=c/n, so λ^=(1/n,,1/n).25

Example 8.

In Example 1, we determined the value function F*()=(1)t^+μ(t^), so that by Equations (19′) and (20′) in this balanced case (as established in Example 2):

λ^=(1^,^)=(μ(t^)t^+μ(t^),t^t^+μ(t^))=(f2*f1*+f2*,f1*f1*+f2*).

Generically, it is F*(^)F(x^|^), even when the decision set is convex. To see this, let m=n=2 and consider the convex domain, X={xR2+:ν1x1+ν2x21}, where the constants ν1,ν2 are such that 0<ν1<ν2<. Then μ(t)=(1ν2t)/ν1+t, so that μ(t)=1(ν2/ν1)<0. Meanwhile, F(t|)=(1)t+μ(t), for t[0,f1*]=[0,1/(ν1+ν2)], and

F(t|)=(1)+μ(t)=1ν2ν10ν1ν2.

Therefore, F*()=F(t*()|)=(/ν1)+max{0,1(ν2/ν1)}f1*, where

t*(){{f1*},if <ν1/ν2,[0,f1*],if =ν1/ν2,{0},if >ν1/ν2,
and f1*=1/(ν1+ν2). Because f2*=1/ν1, we find ν1/ν2=f1*/(f2*f1*). By virtue of the fact that ^=f1*/(f1*+f2*)<ν1/ν2, it is t*(^)=f1*. The balancedness condition,
t^f1*=μ(t^)f2*,
is equivalent to t^/f1*=1(t^/f1*), so t^=f1*/2. Hence, we find that t^t*(^). Let us check the corresponding performance index. Indeed,
ρ(t^)=0.5>0=min{f1*f1*,1f1*f1*}=ρ(t*(^)).

Note also,

^=f1*f1*+f2*=12+(ν2/ν1)<13.

That is, the robust weight puts more than twice as much emphasis on the Rawlsian (or egalitarian) objective as on the utilitarian objective. One can conclude that in general there is no “robustness equivalence principle,” in the sense that substituting the robust parameter into the original scalarization (via maximization of the corresponding weighted objective) might not lead to a robust decision.26

Example 9.

Consider the robust allocation of two resources to n=2 agents. The total amount of each resource has been normalized to one. Allocations are determined as decisions x=(x1,x2)[0,1]2, under which agent 1 obtains x1 of resource 1 and x2 of resource 2, whereas agent 2 obtains 1x1 of resource 1 and 1x2 of resource 2. Agent 1’s utility is f1(x)=x1αx21α, and agent 2’s utility is f2(x)=(1x1)β(1x2)1β, where α,β(0,1) are given scalars. Figure 5 provides an illustration in the corresponding Edgeworth box (see, e.g., Pareto 1906, p. 187). Because a robust allocation is necessarily Pareto-optimal, both agents’ marginal rates of substitution for the two goods must be equal, so xP if and only if

f1(x)/x1f1(x)/x2=α1αx2x1=β1β1x21x1=f2(x)/x1f2(x)/x2.(22)

Figure 5. Robust Allocation of Resources x^=(x^1,x^2) in Edgeworth Box X=[0,1]×[0,1]
Note. For two agents with utility functions (f1 and f2) specified in Example 9 (for α>β), the unique robust allocation satisfies Pareto-efficiency in Equation (22) and the balancedness condition in Equation (23), under full ambiguity.

In addition, a robust allocation must maximize the performance ratio, and one can verify that this requires the boundary performance ratios to be equal (i.e., a balancedness condition), so

f1(x)=x1αx21α=(1x1)β(1x2)1β=f2(x),(23)
because f1*=f2*=1. In the interesting special case where α+β=1, Equations (22) and (23) together yield the unique robust solution x^=(α,β), resulting in an optimal performance index of ρ*=ααββ=αα(1α)1α, with the last expression being a strictly convex function in α(0,1) achieving its minimum (of 1/2) at α=1/2 and ρ* going to one as α{0+,1}.27

3.7. Limited Ambiguity

Consider the (nonempty, compact) subset ΩR+n\{0} which may reflect the decision maker’s a priori knowledge about the relevant weights for the problem at hand, and let

ρ(x|Ω)=infwΩφ(x|w),xX,(24)
be the corresponding Ω-conditioned performance index. The different weight combinations considered feasible may not necessarily be normalized, especially at an initial stage in practice when information about reasonable weight ranges is being compiled. The function π:R+n\{0}Δ, with π(w)=w/w1, describes the radial projection of any nonzero weight w in R+n onto the unit simplex Δ. It allows for the normalization under weight ambiguity; see Remark 2. The radial projection is a nonlinear function which is homogeneous of degree zero (i.e., π(αw)=π(w), for all α>0 and all wR+n\{0}). It can be represented using the conical closure (see, e.g., Berge 1963, p. 14), C(Ω)={w^R+n:w^=αw,wΩ,αR+}, as described, among other useful properties of π(·), in the following auxiliary result.

Lemma 8.

Let Ω be a (nonempty, compact) subset of R+n\{0}. Then (i)(a) π(Ω)=C(Ω)Δ is nonempty and compact, and (b) π(Ω)=π(Ω). (ii) If ΩΩ is open (in R+n), then π(Ω) is open (in Δ). (iii) If ΩΩ and Ω, then π(Ω)=π(Ω) is nonempty and compact. (iv) If Ω is convex, then π(Ω) is convex. (v) If ΩΩ is a straight line segment, then π(Ω) is a straight line segment (or a point). (vi) π(co(Ω))=co(π(Ω)). (vii) If Ω=ΩΩ for some Ω,ΩR+n, then π(Ω)=π(Ω)π(Ω). (viii) If Ω=ΩΩ for some Ω,ΩR+n, then π(Ω)=C(ΩΩ)Δ.

Part (i) of Lemma 8 notes that (a) the radial projection of Ω onto Δ can be obtained by simply intersecting the conical closure of Ω with Δ, and (b) one can limit attention to the boundary Ω (ignoring interior points of Ω). Although a continuous function generally does not map open sets to open sets (e.g., a constant function would not), part (ii) guarantees that π(·) does exactly that, provided the “interiority” of a point in Ω is assessed in R+n and then, after its projection, in the lower-dimensional Δ. Part (iii) ensures that weight ambiguity in any (nonempty) subset Ω of Ω leads to a compact domain π(Ω) for a robust decision according to Proposition 5 below. Part (iv) establishes that convex sets are projected to convex sets.28 Following (v) and (vi), the radial projection leaves straight-line geometries intact, thus, for example, converting (bounded) polyhedra in Rn+ to polytopes in Δ. Finally, parts (vii) and (viii) note that the radial projection of a union of sets is the union of the corresponding single-set projections, but the same does generally not apply to an intersection.29

Example 10.

Assume the (nonnormalized) weights w considered reasonable by the decision maker are such that each of its components wi, for iN, is known to lie in some interval, resulting in a rectangular ambiguity set,

Ω=[w¯,w¯]=iN[w¯i,w¯i],
with bounds w¯=(w¯1,,w¯n)0 and w¯=(w¯1,,w¯n)w¯. Using Lemma 8, leveraging the preservation of straight-line geometries, it can be verified that the radial projection of Ω onto Δ has the form
π(Ω)=co({w¯+δ1e1w¯1+δ1,,w¯+δnenw¯1+δn}),(25)
where δ=(δ1,,δn)=w¯w¯ and w¯1=l=1nw¯l. As an example, Figure 6 illustrates that the radial projection of a box in n=3 dimensions is a triangle in Δ. Thus, the shape of π(Ω) depends on (at most) n of the original 2n vertices of Ω. Moreover, if the modulus of the lower bound of the (nontrivial) box-shaped ambiguity set Ω becomes very small, then the situation tends to become equivalent to full ambiguity. To see this, consider a w¯ with only positive components and let w¯=εe1, for a sufficiently small ε>0. Then, by Equation (25) it is π(Ω)=co({λ(1),,λ(n)}), where λ(1)=e1 and λ(i)=(εe1+w¯iei)/(ε+w¯i), for i{2,,n}. Hence, limε0+λ(i)=ei, for all iN, implying full ambiguity in the limit, as Δ=co({e1,,en})=π(R+n\{0}).

Figure 6. Radial Projection of Box-Shaped Ambiguity Set Ω=[w¯,w¯]
Note. With w¯=(1,1,1) and w¯=(3,2,2), a radial projection onto Δ yields π(Ω)=co({(3/5,1/5,1/5),(1/4,1/2,1/4),(1/4,1/4,1/2)}), as described in Example 10.

Under limited ambiguity, a coordinate-wise decomposition is no longer available. However, because φ(x|·) is quasiconcave, for any given xX, as established in the proof of Proposition 2, the upper contour sets of the performance ratio in the space of weights are necessarily convex. This in turn allows restricting attention, for the representation of the performance index, to the extreme points of the convex hull of the ambiguity set, which yields a representation much in the same spirit as before.

Proposition 5.

The Ω-conditioned performance index in Equation (24) is such that

ρ(x|Ω)=minλπ(Ω)φ(x|λ)=minλΛφ(x|λ),xX,
for some “extremal base” Λ which is a (compact) subset of π(Ω), such that co(Λ)=co(π(Ω)).

Although choosing Λ=π(Ω)=π(Ω) (cf. Lemma 8 (i) (b)) is always possible, it is usually advantageous to opt for the smallest possible extremal base Λ, so it consists only of the “extreme points” of co(π(Ω)), where the latter cannot be represented as convex combinations of other points in Λ (see, e.g., Rockafellar 1970, section 18, p. 162).30 If Ω is finite or a (finite) polytope, then the smallest extremal base Λ of extreme points of co(π(Ω)) is also finite.31 Because any bounded convex set can be approximated by a finite polytope (Bronstein 2008), a finite extremal base can be used to represent the convex hull of the given ambiguity set (or its radial projection onto Δ) up to any desired precision. That π(Ω) may be nonconvex is not important because the level sets of the performance ratio are convex, so that the minima of φ(x|·) on π(Ω) are attained on the boundary of its convexification, co(π(Ω)).

Example 11.

Consider the robust allocation of resources to n=2 agents as in Example 9, given the box-shaped ambiguity set Ω as in Example 11, with its radial projection specified by Equation (25),

π(Ω)=co(Λ)=co({(w¯1,w¯2)w¯1+w¯2,(w¯1,w¯2)w¯1+w¯2})Δ,
where Λ={λ(1),λ(2)}. Thus, by virtue of Proposition 5 the Ω-conditioned performance index becomes
ρ(x|Ω)=min{w¯1f1(x)+w¯2f2(x)maxx^X{w¯1f1(x^)+w¯2f2(x^)},w¯1f1(x)+w¯2f2(x)maxx^X{w¯1f1(x^)+w¯2f2(x^)}},xX.

For a pseudo-robust decision x (in Ψ), balancedness must hold, so that, by substituting f1,f2 from Example 9,

F(x|λ(1))F(x|λ(2))=w¯1x1αx21α+w¯2(1x1)β(1x2)1βw¯1x1αx21α+w¯2(1x1)β(1x2)1β=F*(λ(1))F*(λ(2)).(26)

Meanwhile, a feasible allocation decision x=(x1,x2) is Pareto-optimal (in P) if and only if it satisfies Equation (22), as before. When the box-shaped ambiguity set becomes smaller, the elements of the extremal base Λ become more similar and coincide in the limit (i.e., λ(1)λ(2)0 as w¯w¯10+). Figure 7 shows the robust allocation x^=x^(Ω) on the contract curve between the Pareto-optimal allocations x*(λ(i)) that maximize the weighted objective F(x|λ(i)) for iN. Under full ambiguity (i.e., for Ω=Δ), it is λ(i)ei; see Example 9.

Figure 7. Robust Allocation of Resources, {x^(Ω)}=argmaxxPρ(x|Ω), to Two Agents in Example 11 (for α>β)
Note. Under limited ambiguity Ω=[w¯,w¯] as in Example 10, the robust allocation x^(Ω) satisfies Pareto-efficiency in Equation (22) and balancedness in Equation (26); it differs from the allocation x^(Δ) under full ambiguity, which maximizes ρ(x)=ρ(x|Δ) on P.
Example 12.

In some applications, the weights are naturally ranked by importance (see, e.g., Wang and Fu 2020). The ordered set of weights, Ω={(w1,,wn)R+n:w1w2wn}=C(Ω), has a radial projection onto Δ of the form π(Ω)=co({λ(i):iN}), where λ(i)=1ij=1iej, for all iN. Combining this importance-ranking with a box-shaped ambiguity set Ω=[w¯,w¯+δ] as in Example 10, with π(Ω)=co({λ(i):iN}), where λ(i)=(w¯+δiei)/(w¯1+δi), for all iN, leads to

Λ={𝟙{w¯+δieiΩ}λ(i)+𝟙{w¯+δieiΩ}λ(i):iN}.

This extremal base (of cardinality n) suits practical applications where a decision maker disposes of plausible ranges for the weights to be placed on the criteria, together with a ranking of their importance (cf. Section 4.3.2).

Remark 12.

(i) Limiting ambiguity can only increase robustness performance. That is, if Ω,Ω^R+n\{0} are nonempty and compact and satisfy π(Ω^)π(Ω), then ρ(·|Ω)ρ(·|Ω^), which follows directly from the definition of the Ω-conditioned performance index in Equation (24). (ii) In the absence of ambiguity, the optimal robustness performance has to be maximal. That is, if Ω^={w}, then ρ*(Ω^)=maxxXρ(x|{w})=F*(w)/F*(w)=1.

3.8. Criterion Ambiguity

Given a (nonempty, compact) action set XRm as in Section 2, we now allow for ambiguity in each of the n criteria, given by the continuous functions gi:X×ΘR+, for iN={1,,n}, which map (x,θ) to real numbers, where xX is a feasible action and θ is an ex ante unknown state in the (nonempty, compact) state space ΘRl, and where l,m,n1 are given integers. The unknown state θ represents the decision maker’s uncertainty about the exact value that each of his n objectives may attain under a chosen action x.

Remark 13.

(i) Continuity of the criteria in the state implies that a small perturbation in the state can have only a small impact on the decision maker’s objective. This continuity is automatically satisfied when the state space Θ is finite.32 (ii) The unknown state may introduce dependencies between the different gi. Indeed, even if θ=(θ1,,θn) (for l=n) and criteria are such that gi(x,θ)gi(x,θi), for all iN, then the different criteria’s ambiguity may still be linked via common constraints in Θ.33 But if in addition the state space is a Cartesian product, so Θ=Θ1×Θn, and θiΘi, for all iN, then the decoupling of ambiguity across the different criteria is complete, in the sense that observing the part of the state that determines one criterion does not reduce ambiguity for any other criterion.

To assess goal achievement of an action in a given state, the decision maker considers a scalarization of his multicriteria optimization problem by means of a weighted objective,

G(x,θ|λ)=i=1nλigi(x,θ),(x,θ,λ)X×Θ×Δ,
where Δ={w=(w1,,wn)R+n:w1++wn=1} denotes the set of all (normalized) weights. As in the nontriviality condition (N) (cf. Section 2), we assume that there exists a default decision (xd) such that the decision maker’s weighted objective is positive across all states, that is,
xdX:G(xd,θ|λ)>0,(θ,λ)Θ×Δ.(N′)

The modified nontriviality condition (N′) ensures that the ex post optimal objective is always positive, so

G*(θ|λ)=maxxXG(x,θ|λ)G(xd,θ|λ)>0,
for all (θ,λ)Θ×Δ.

Remark 14.

The modified nontriviality condition (N′) can always be satisfied, without altering the set of ex post optimal decisions, by using a (positive, translated) criterion g^i=gi+ε, for some ε>0; see also Remark 4.

Given a (nonempty, compact) ambiguity set ΩR+n, the decision maker’s robustness objective, as in Section 3.7, is to maximize the (Ω,Θ)-conditioned performance index,

ρ(x|Ω,Θ)=min(λ,θ)π(Ω)×ΘG(x,θ|λ)G*(θ|λ),xX.

The following result recasts the robustness objective into a by-now-familiar representation.

Proposition 6.

Let ΛΔ be an extremal base of co(π(Ω)). Then

ρ(x|Ω,Θ)=minλΛ{minθΘG(x,θ|λ)G*(θ|λ)},xX.(27)

Based on Proposition 6, if the (smallest) extremal base is finite, we can reduce the general multicriteria decision problem to our basic framework in Section 2, with full weight ambiguity and no criterion ambiguity.

Corollary 1.

Assume that there exists a (finite, smallest) integer n1 so that Λ={λ(1),,λ(n)} and co(Λ)=co(π(Ω)), and let fi(x)=minθΘ{G(x,θ|λ(i))/G*(θ|λ(i))}, for all (x,i)X×N. Then

ρ(x)=miniN{fi(x)fi*},xX,(28)

where fi*=maxxXfi(x)=1, for all iN={1,,n}, represents ρ(·|Ω,Θ) in Equation (27).

Example 13.

(i) In the case of full ambiguity, it is Λ=Δ, so that n=n and Δ=co({e1,,en})=Δ, with fi(x)=minθΘ{gi(x,θ)/G*(θ|ei)}, for all (x,i)X×N. (ii) Consider now a multicriteria decision problem under full ambiguity, with criteria of the form gi(x,θ)=(g^i(x))θi, for all iN, where the state θ=(θ1,,θn) is only known to lie in the (nonempty, compact) set ΘR+n (with Θ{0} to avoid trivialities), and where the functions g^i:XR++ are continuous. Let g^i*=maxg^i(X)>0 and θ¯i=maxθΘθi; furthermore, set

fi(x)=(g^i(x)g^i*)θ¯i=minθΘ(g^iθi(x)maxxXg^iθi(x)),xX,
for all iN, taking into account that ξθ decreases in θ, for all ξ[0,1]. The extreme state θ¯=(θ¯1,,θ¯n) determines the robust choice, as it leads ceteris paribus to the smallest criterion values—thus following a precautionary principle. Then, by Corollary 1 we follow our basic framework, maximizing the performance index in Equation (28) with respect to all Pareto-optimal decisions in Equation (10). For instance, if Θ=Δ, the reduction to the weighted objective in Equation (1) obtains with fi=g^i/g^i*, for all iN.

Remark 15.

(i) The minimization over the state space in Example 13 reflects a robust, precautionary stance: the decision maker evaluates actions under the most adverse plausible realization of the state with respect to the achievable performance ratio, consistent with a notion of relative worst-case robustness. (ii) The treatment of criterion ambiguity in this section also bears a conceptual resemblance to models of DRO, where decisions are evaluated against worst-case distributions within a specified ambiguity set. In our framework, the uncertainty is not over probability distributions but over states θΘ, with performance assessed under the worst-case realization of both the state and the weights. This can be viewed as a nonprobabilistic analogue of DRO, where the ambiguity set Ω over the weights plays a role similar to ambiguity sets over distributions in DRO models. In particular, the two-layer minimization in Equation (27)—over weights and states—mirrors the inner DRO minimization over distributions, highlighting a parallel structure between worst-case evaluation across distributional and multicriteria settings.

3.9. General Multicriteria Objectives

In certain practical applications, it may seem appealing to consider alternatives to the arithmetic mean in Equation (1), such as a harmonic or geometric mean, with suitable weights. One could even think of using a (weighted) power mean which would accommodate each of the earlier options as a special case; see Example 14 below, which illustrates aggregation ambiguity. Rather than commit to a particular functional form for aggregating multiple criteria, however, we propose a general class of multicriteria objectives that adhere to a set of reasonable axioms. These axioms reduce (in the case of uniform weights) to those known to characterize the quasiarithmetic mean. A key finding, notably, is that maximizing this general weighted objective is entirely equivalent to maximizing the arithmetic mean objective in Equation (1), provided the criteria are suitably transformed.

Let H:Y×ΔR denote a (continuous) multicriteria objective, where YRn is a (nonempty) domain such that f^(X)Y, with the vector of criteria f^=(f^1,,f^n), and where ΔR+n is the unit simplex; see Section 2.1. For any decision xX and weight λΔ, the multicriteria objective produces an overall score H^(x|λ)=H(f^(x)|λ), which the decision maker would like to maximize by choosing an appropriate decision—in the presence of (possibly limited) ambiguity about λ (and possibly also about f^) as discussed in Section 3.7 (and Section 3.8). To guide the form of a sufficiently flexible and interpretable objective, we posit five axioms (Axioms 15), which nest those proposed by Kolmogorov (1930) in his seminal work “On the notion of mean.”

Axiom 1

(Monotonicity). For all y=(y1,,yn)Y,  y^=(y^1,,y^)Y,  λ=(λ1,,λn)Δ, and iN:

y^y=(y^iyi)ei,y^i>yi{H(y^|λ)>H(y|λ),if λi>0,H(y^|λ)=H(y|λ),if λi=0.

Axiom 2

(Symmetry). For all y=(y1,,yn)Y,  λ=(λ1,,λn)Δ, and i,jN:

(y^,λ^)=(y,λ)+(yiyj,λiλj)(ejei)H(y^|λ^)=H(y|λ).

Axiom 3

(Reflexivity). H(α1n|λ)=α, for all α>0 (with α1nY) and λΔ.34

Axiom 4

(Associativity). For all y=(y1,,yn)Y,  λ=(λ1,,λn)Δ, and mN\{n}:35

(λ1,,λm)0,H(i=1myiei|i=1mλieii=1mλi)=αH(α1m,ym+1,,yn|λ)=H(y|λ).

Axiom 5

(Coordinate Filter). H(y|ei)=yi, for all y=(y1,,yn)Y and iN.

The significance of these five basic requirements is as follows: Monotonicity (Axiom 1) means that as long as the weight component λi is positive, increasing the value yi of criterion iN must also increase the weighted objective, whereas for λi=0 the weighted objective becomes insensitive to criterion i; symmetry (Axiom 2) requires that the weighted objective is invariant with respect to any joint permutation of indices belonging to criteria and their associated weights; reflexivity (Axiom 3) imposes score-consistency in the sense that if all criterion scores are identical, then that should also be the value of the weighted objective, no matter what (normalized) weights are applied; in a similar vein, associativity (Axiom 4) postulates that replacing a group of inputs with their internal weighted average leaves the overall aggregation unchanged; finally, in order to guarantee that the weights provide a homotopic relation between all criteria in isolation, the weighted objective is a coordinate filter (Axiom 5) if it yields the i-th component of y when putting all weight on the i-th coordinate (i.e., for λ=ei).

Based on Kolmogorov’s result,36 we define the general h-mean,

H(y|λ)=h1(i=1nλih(yi)),(29)
where the kernel h:DR is a continuous strictly monotonic function, defined on a suitable domain DR.

Lemma 9.

The general h-mean H:Y×ΔR in Equation (29) satisfies Axioms 15.

The following example illustrates the flexibility afforded by the general h-mean.

Example 14.

Consider two well-known averages.

  1. Let Y=R++n. For any weight λΔ and power parameter pR, define the power mean:

    Mp(y|λ)={(i=1nλiyip)1/p,p0,i=1nyiλi,p=0,
    which corresponds to Equation (29) with h(y)=yp for p0, and h(y)=logy for p=0. This recovers the harmonic (p=1), geometric (p=0), and arithmetic (p=1) means, among others. In the limit, the power mean approaches the minimum (for p) or maximum (for p+) of all yi with positive weights. Importantly, Mp is increasing in p (see, e.g., Hardy et al. 1934, theorem 16, p. 26) and uniquely satisfies homogeneity: Mp(αy|λ)=αMp(y|λ) for all α>0 (see, e.g., Hardy et al. 1934, theorem 84, p. 68).37

  2. For h(·)=exp(·), the general h-mean in Equation (29) becomes a weighted mean in the log-semiring, related to the LogSumExp (LSE) function, because H(y|λ)=LSE(y1+logλ1,,yn+logλn), for all (y,λ)Rn×int(Δ), where LSE(y)=log(i=1nexp(yi)), for all yRn. Such a formulation carries fruit in probabilistic modeling and neural networks, where LSE and softmax appear naturally because of their smoothness properties.38

3.9.1. Reduction to Basic Framework.

Crucially, the general h-mean in Equation (29) reduces to our standard weighted objective in Equation (1) under a transformation of the criteria, by setting fi=hf^i, so that

H^(x|λ)=h1(F(x|λ)),(x,λ)X×Δ.

If the kernel h is increasing (resp., decreasing), then maximizing H^(·|λ) is equivalent to maximizing (resp., minimizing) F(·|λ). Thus, our earlier results carry over directly—with minor adjustments in the decreasing case, as detailed in Section 3.9.2.

Example 15.

Consider Example 13 (ii) for the “diagonal” state space Θ={θR+n:θ1==θn[p¯,p¯]}, where the constants p¯,p¯ are such that 0<p¯<p¯<. By Example 14 (i), this represents a situation in which the decision maker is uncertain about the appropriate aggregation method, except that a (homogeneous) power mean Mp should be used, for some p[p¯,p¯]. The result in Example 13 (ii) suggests as robust choice the largest available option: the power mean Mp¯.

Remark 16.

(i) As Example 15 suggests, the insights about criterion ambiguity in Section 3.8 may sometimes be combined with the general representation of multicriteria objectives in Section 3.9 to handle aggregation ambiguity, that is, uncertainty over which aggregation rule or kernel should be used. (ii) For the general h-mean in Equation (29), we define H=h1Fh with Fh=i=1nλi(hfi). However, in general, h1(Fh/Fh*)(h1Fh)/(h1Fh*) with Fh*(·)=maxFh(X|·), so our optimization focuses on Fh directly. (iii) For the practically very important power mean in Example 14 (i) and Example 15 (as long as p0), the robust optimization of Fh is equivalent to the robust optimization of Mp, because h(·)=(·)p and h1(·)=(·)1/p both feature multiplicative separability.

3.9.2. Special Case: Minimization Under Decreasing Kernel.

For a decreasing kernel, fi=hf^i transforms larger f^i into smaller fi, so that the objective reflects a smaller-is-better interpretation (akin to optimizing a loss function). Assuming R++nY, by Axiom 3 it is limα0+H(α1n|λ)=H(0+|λ)=0. Because in most practical applications (e.g., for logarithmic or inverse transformations in multiobjective loss minimization) it is h(0+)=, we obtain h1()=0+, so that on the compact action set X it is fi=minfi(X)=h(f^i*)>0, for all iN, as f^i*=maxf^i(X) is necessarily finite. Hence, we can consider the adjusted performance ratio,

φ(x|λ)=F(λ)F(x|λ)(0,1],(x,λ)X×Δ,(30)
where F(λ)=minF(X|λ), for all λΔ. A representation of the performance index ρ(x) in Equation (4) then obtains, analogous to Equation (6), for all xX, as the minimum of the (adjusted) criterion-specific performance ratios ϕi(x)=fi/fi(x) with respect to iN. This defines the (nonempty, compact) set of pseudo-robust actions,
Ψ=argmaxxX{miniNfifi(x)}.

The set of Pareto-optimal actions P needs to be based on the original vector of criteria f^, so that

P={xX:(f^(x)f^(x)f^(x)=f^(x)),xX},
analogous to Equation (10). The robust decision set is then R=ΨP as before, in Equation (11).

3.10. Applying the Method

“What do I do? What do I get? How do I adapt it to my case?”—We now address these practitioner questions by outlining the method’s core, the interpretation of its results, and important extensions for customization, as a navigation device for approaching the multicriteria decision problem introduced in Section 2.1 and its generalizations.

  1. (Core) To determine an approximately robust decision x^ε (at any prespecified precision ε) with respect to n criteria, one needs to solve just n+1 optimization problems: one to maximize the ε-augmented performance index Φε in Equations (12) and (13) and n to compute the normalization constants fi* in Equation (5) for the criterion-specific performance ratios ϕi in Equation (6). The associated normalized robust weight λ^Δ, which encapsulates the tradeoffs between the criteria embedded in both the action set and the shape of the criterion functions (independent of any scaling), is then obtained (approximately) from Equations (19) and (20) by setting x^=x^ε.

  2. (Interpretation) The performance index ρ(x^ε)[0,1], with ρ(·) given in Equation (6), guarantees a minimum percentage that the robust solution x^ε achieves of the optimal weighted objective in Equation (1), for any weight in Δ. This means that the performance of x^ε is guaranteed to be no worse than ρ(x^ε) times the best achievable performance under any possible weight vector, ensuring robustness to unknown or contested preferences. In this manner, the weight-induced subjectivity in multicriteria optimization can be removed. The robust weight rationalizes the robust decision as an optimum of the weighted objective in Equation (1) for λ=λ^. The vector λ^ can be interpreted as the revealed weight structure that best justifies the robust solution, based on the problem’s internal tradeoffs.

  3. (Customization) The method accommodates several practically important extensions:

    • Partial weight information: Prior knowledge or constraints on weights can be imposed by requiring weight vectors (not necessarily normalized) to belong to a suitable subset (cf. Section 3.7).

    • State-dependent criteria: Uncertain or context-dependent criteria fi can be captured by worst-case performance ratios, effectively reducing the problem to the core framework (cf. Section 3.8).

    • Generalized aggregation: Rather than relying on the arithmetic mean in Equation (1), one may adopt alternative scalarization functions consistent with an axiomatic foundation (cf. Section 3.9).

Overall, the relatively robust methodology provides a computationally tractable and conceptually transparent toolkit for balancing multiple, possibly ambiguous objectives in diverse real-world settings (cf. Section 4.3).

4. Discrete Applications

4.1. Finite Action Set

4.1.1. Relative Robustness Criterion.

Consider a finite set of alternatives X={1,,J}, evaluated according to n positive criteria f1,,fn:XR++. Each alternative jX receives a score fi(j)=sij>0, for all iN. If we set s^i=maxjXsij, then the performance index for the j-th option becomes

ρj=miniNsijs^i,jX.(31)

By Proposition 3, a robust decision j*R can be found by computing the lower limit, as follows:

j*Lim¯ε0+argmaxjX{(1ε)ρj+εniNsijs^i}.(32)

This robust decision achieves the optimal performance index,

ρ*=maxjXρj=ρj*=miniNsij*s^i.(33)

As discussed in Section 3, simply maximizing the performance index ρj yields pseudo-robust solutions, which are generically inefficient; see, for instance, Example 6. We now compare the proposed approach with several alternative robustness criteria.

4.1.2. Alternative Robustness Criteria.

The following three robustness criteria, defined in the present context of discrete action sets, are frequently used in the literature for dealing with parameter ambiguity.

  1. The Laplace criterion corresponds to a weighted objective F(·|λLaplace) in Equation (1) with a uniform weight, λLaplace=(1,,1)/nΔ, leading to the “Laplace solution,”

    jLaplaceargmaxjXF(j|λLaplace)=argmaxjXi=1nsij.(34)

  2. The worst-case criterion is defined as the minimum payoff across the admissible weights, which yields a so-called “maximin solution,”

    jWCargmaxjXminλΔF(j|λ)=argmaxjXminiNsij.(35)

  3. The absolute-regret criterion evaluates absolute regret, that is, the maximum difference ex post between what is and the best that could have been, resulting in the “(minimax) absolute-regret solution,”

    jARargminjXmaxλΔ{F*(λ)F(j|λ)}=argminjXmaxiN{s^isij}.(36)

4.1.3. Comparison.

The following example, which features a finite action set, illustrates the proposed robust solution in Equation (32) against decisions recommended by alternative robustness criteria, notably the Laplace criterion in Equation (34), the WC (or maximin) solution in Equation (35), and the solution minimizing the (maximum) AR in Equation (36).

Example 16.

Consider a discrete-choice situation for J=5 options, for which the scores sij across n=3 criteria are recorded in Table 1. In the context of relative robustness, options 2 and 5 are tied for the highest performance index in Equation (31) and are thus both pseudo-robust, so Ψ={2,5}. At the same time, option 5 Pareto-dominates option 2 (with P={1,3,4,5}), so that j*=5R=ΨP is the unique robust choice. This solution can also be obtained directly from Equation (32), written in the form

j*Lim¯ε0+argmaxjXΦε(j),

Table

Table 1. Discrete Decision Options in Example 16, Evaluated with Different Robustness Criteria

Table 1. Discrete Decision Options in Example 16, Evaluated with Different Robustness Criteria

Option (j)CriteriaPerformance ratios/indexAlternative evaluations
s1js2js3js1j/s^1s2j/s^2s3j/s^3ρjLaplaceWCAR
1242016110/114/154/15201644
276327/243/118/153/1115628
312601/241/1111/2421123
422221611/1214/154/15201644
5126361/23/113/53/1118624


Note. Values in bold indicate optimality for the corresponding robustness criterion.

where Φε(j)=(1ε)ρj+(ε/n)i=1n(sij/s^i), for all jX and all ε[0,1]. Indeed, because ρ2=ρ5=3/11>ρ1=ρ4=4/15>ρ3=1/24, for sufficiently small ε(0,1] the ε-augmented performance index is

maxj{2,5}Φε(j)>Φε(4)>Φε(3).

Meanwhile, it is

Φε(5)Φε(2)=ε3(12724+9815)=ε3(1140)>0,ε(0,1].

Therefore, ε={5}, for all sufficiently small ε>0, so j*Lim¯ε0+ε={5}, and one finds j*=5 as the unique robust option. Regarding alternative robustness evaluation, both the Laplace criterion in Equation (34) and the absolute-regret criterion in Equation (36) produce the solution jLaplace=jAR=3 with very poor performance in at least one criterion. In fact, changing the payoff s13 from one to zero would produce even a zero performance index and zero worst-case performance guarantee for that same decision (still optimal under these criteria),39 which may be rather difficult to justify in any real-world scenario. Finally, maximizing the worst-criterion performance in Equation (35) leads to indifference between options 1 and 4, at a suboptimal performance index and largest absolute regret. By contrast, the proposed robust solution j*=5 provides (by construction) the best performance index and a reasonable compromise solution in terms of the other criteria. It guarantees a strictly positive performance across all criteria. From Equations (19) and (20) we find that the corresponding (normalized) robust weight vector is

λ^=(112+16+136)1(112,16,136)=(0.3,0.6,0.1).

Consistent with Equation (21), it is such that

λ^isij*=(iN1sij*)1=(112+16+136)1=3.6maxjX\{j*}miniNλ^isij,i{1,2,3},
where 3.6 corresponds to the harmonic mean of the criterion scores for the robust option j*. Consider now the robustly reweighted scores σij=λ^isij, for all (i,j)N×X, displayed in Table 2, together with the associated robustness evaluations. It is interesting that after reweighting, option 3 goes from lowest to largest absolute regret, which highlights the sensitivity of the AR criterion to uncertainty about the relative importance of the criteria. The maximin payoff (WC) solution shifts from options 1 and 4 to option 5, whereas the Laplace solution shifts from option 3 to option 4. By contrast, the performance index remains unaffected by the robust reweighting (or any other reweighting), so option 5 is still the unique robust solution.

Table

Table 2. Discrete Decision Options in Example 16, Evaluated after Robust Reweighting

Table 2. Discrete Decision Options in Example 16, Evaluated after Robust Reweighting

Option (j)CriteriaPerformance ratios/indexAlternative evaluations
σ1jσ2jσ3jσ1j/σ^1σ2j/σ^2σ3j/σ^3ρ^jLaplaceWCAR
17.2121.6110/114/154/1520.81.64.4
22.13.63.27/243/118/153/118.92.19.6
30.31.261/241/1111/247.50.312
46.613.21.611/1214/154/1521.41.64.4
53.63.63.61/23/113/53/1110.83.69.6


Note. Values in bold indicate optimality for the corresponding robustness criterion.

4.2. Data-Driven Approach

Consider any (nonempty, compact) action set XRm for some integer m1, as in Section 2. Assume further that the decision maker does not know the functional form of the multicriterion f=(f1,,fn), but is still able to observe its value sk=(s1k,,snk) for different decisions xkX over the course of K1 experiments, where kK={1,,K}. Indeed, given the realized score set S^={sk:kK} and the realized action set X^={xk:kK}X, the average criterion scores are40

f^i(x)=kK1{xk=x}sikkK1{xk=x},(x,i)X^×N,
for all (x,i)X^×N. With this, the (data-driven) criterion-specific performance ratios are given by
ϕ^i(x)=f^i(x)f^i*,(x,i)X^×N,
where f^i*=maxf^i(X^) is the i-th maximum average criterion score. The (data-driven) performance index,
ρ^(x)=miniNϕ^i(x),xX^,
is defined for all observed decisions. A (data-driven) robust decision x^* (among all sampled actions in X^) can then be determined in the standard way by applying Proposition 3 to the observational analogues of our standard approach:
x^*Lim¯ε0+argmaxxX^{(1ε)ρ^(x)+εniNϕ^i(x)}.(37)

In other words, ρ^(x) captures the worst-case relative performance of action x across all observed criteria, and x^* identifies a robust decision, which is efficient and has the best-possible worst-case performance. Note that instead of going through the motions of actually taking the limit, it is sufficient to maximize the ε-augmented performance index, that is, the maximand in Equation (37), for a sufficiently small ε(0,1]. This approximates the optimal robustness performance, captured by the optimal performance index ρ*, arbitrarily closely by virtue of the ε-performance guarantee provided in Proposition 4. Finally, as in Equations (19) and (20), the (data-driven) robust weight is

λ^=(iN1f^i(x^*))1(1f^i(x^*))i=1nΔ.(38)

Based on the observed data, this vector provides a robust estimate of the tradeoffs between the different criteria from the vantage point of relative robustness.

Example 17.

Consider the joint evaluation of human performance on a certain task (such as giving a university lecture) by K evaluators who score J individuals in N different performance dimensions (or criteria) on a Likert scale (from one to seven),41 so K={1,,K}, N={1,,N}, and X={1,,J}. Figure 8 shows a spider plot comparing J=4 different individuals. Assuming that all evaluators scored all the individuals using a seven-point Likert scale, the realized score set S^ has elements sk{1,,7}N, for all kK, whereas the realized action set X^ is equal to X. Individual 3, although never achieving a highest criterion-specific average score, exhibits the best data-driven performance index (ρ^*=ρ^(3)=76.19%=maxj{1,2,3,4}ρ^(j)). Because in a noisy observation environment ties in the performance index are fairly unlikely, the set of pseudo-robust alternatives Ψ tends to be a singleton, thus also resulting in a singleton set of robust options (e.g., R={3}). This example illustrates how the data-driven approach identifies robust performance through balanced tradeoffs rather than peak performance in any one dimension, thus favoring individual 3, whose weakest dimension is relatively strong.

Figure 8. Average Scores for N=5 Criteria (Evaluated by K=3 Experts), and Resulting Data-Driven Performance Indices for J=4 Options in Example 17
Note. All options are Pareto-efficient (i.e., P={1,2,3,4}); only option 3 is also pseudo-robust (i.e., Ψ={3}), so R=ΨP={3}.

4.3. Real-World Applications

Relatively robust multicriteria optimization provides a flexible and transparent framework for decision-making under uncertainty. As developed in Sections 2 and 3, it is especially well suited for complex problems where the relative importance of multiple criteria is ambiguous or hard to justify. The method systematically balances competing objectives using only minimal assumptions on the decision maker’s preferences, and delivers solutions that are both Pareto-efficient and robust (cf. Proposition 3). We illustrate its practical relevance in three diverse contexts: the energy trilemma, quality-adjusted life year (QALY)-based health evaluations, and corporate resource allocation. In each case, the relatively robust methodology facilitates structured tradeoff management and data-informed robustness, whether operating in an abstract policy space or on a finite empirical action set (cf. Sections 4.1 and 4.2).

4.3.1. Energy Trilemma: Balancing Energy Security, Equity, and Sustainability.

The World Energy Council’s Energy Trilemma framework evaluates nations on three key dimensions, namely “energy security” (i.e., the reliability and resilience of energy supply), “energy equity” (i.e., the accessibility and affordability of energy for all citizens), and “environmental sustainability” (i.e., the reduction of greenhouse gas emissions and environmental impact). Nations face challenges in improving one dimension without compromising others (World Energy Council 2024).42 For instance, increasing energy equity by subsidizing fossil fuels may undermine sustainability, while prioritizing environmental sustainability through renewable energy investment might initially reduce energy security or equity. An optimal strategy depends on the relative importance attributed to each of the three criteria. In this setting, a relatively robust approach can

  • Identify Robust Policies: By modeling each nation’s energy policies and outcomes as feasible decisions, the method identifies those policies that achieve strong tradeoffs across all three dimensions. For example, a relatively robust policy might balance investment in renewable energy, grid modernization, and subsidies for low-income households, ensuring consistent performance regardless of variations of the relative importance across dimensions, which might still be importance-ranked (cf. Section 3.7).

  • Quantify Tradeoffs: The approach provides a clear representation of tradeoffs, such as how much equity might need to be sacrificed for a given improvement in sustainability under different weighting scenarios.

  • “Robustify” Strategic Goals: Policymakers can develop long-term strategies that are resilient to evolving societal priorities, such as shifts toward greater emphasis on the precautionary principle (cf. Section 3.9).

4.3.2. QALY Impact of Diseases: Evaluating Quality-Adjusted Life Years.

QALYs measure the impact of diseases and medical treatments by combining longevity and quality of life into a single index (Pliskin 1974, Pliskin and Beck 1976, Miyamoto et al. 1998).43 The impact of different impairments—such as mobility loss, chronic pain, or cognitive decline—depends on how these are weighted relative to each other in calculating overall health outcomes. Subject to restrictions on weights (e.g., a priority ranking; cf. Example 12) and with a flexible aggregation of criteria (cf. Section 3.9) the method can

  • Handle Uncertain Weightings: When precise utility-weightings for different impairments are unavailable (as one would generally assume), the method identifies health interventions or treatment plans that remain effective across a wide range of subjective assessments. For example, it might suggest treatments that optimize outcomes for both pain relief and mobility restoration, ensuring robust quality-of-life improvements.

  • Optimize Resource Allocation: Health authorities can use the method to prioritize interventions that deliver the highest QALY improvements per dollar spent, even when the relative importance of various health dimensions (e.g., physical versus mental health) is uncertain.44

  • Support Evidence-Based Policy: The method provides a performance guarantee for proposed policies, demonstrating their effectiveness regardless of the relative emphasis placed on specific impairments.

4.3.3. Resource Allocation in Companies: Balancing Risk, Resources, and Rival Assets.

In organizations, resource allocation involves deciding how to distribute limited resources—financial, human, or physical—across competing projects. Each project is evaluated based on multiple criteria, such as “risk of noncompletion” (i.e., the probability that a project fails because of delays or unforeseen issues), “use of human resources” (i.e., the availability and workload of skilled personnel required for the project), and “use of rival assets” (i.e., the occupation of assets that cannot be used by multiple projects simultaneously, for example, specialized machinery). Applying the framework of relatively robust decision making allows organizations to45

  • Identify Balanced Portfolios: By modeling projects and their criteria as feasible decisions xX, with performance criteria f1(x),,fn(x), the method selects a portfolio of projects that balances competing objectives, ensuring efficient resource use even when preferences over criteria are unclear.

  • Manage Tradeoffs: By aggregating criterion-specific performance ratios ϕi(x) into a performance index ρ(x), the method reveals, for example, how prioritizing low-risk projects impacts resource utilization and asset deployment, informing decisions about a robust balance between risk and resource efficiency.

  • Adapt to Uncertainty: As organizational priorities shift (e.g., toward innovation or risk aversion), the method ensures that allocation strategies can remain robust across changing criteria (cf. Section 3.8) and weight configurations for the criteria, and possibly across different aggregation methods (cf. Section 3.9).

In settings with discrete project sets and empirical observations, a data-driven approach can be applied (cf. Section 4.2).

5. Conclusion

Multicriteria optimization aims to resolve conflicts between competing objectives by finding Pareto-efficient decisions for which improving one criterion necessarily degrades at least one other. Although Pareto-efficiency sets a minimum standard of nonwastefulness, practical decision making often requires selecting a single compromise solution from the available Pareto frontier. Scalarization, such as assigning weights to criteria, is a common method for operationalizing this selection, but it assumes knowledge of the weights, which may not be available or easily justifiable, and it may also exclude Pareto-efficient solutions when the action set is nonconvex.

Here, we seek to mitigate the dependence on precise weight specifications by introducing a performance index that evaluates the worst-case weighted performance of a decision relative to its maximum potential. A Pareto-efficient decision that maximizes this index is viewed as relatively robust, balancing competing criteria in a way that offers resilience to weight uncertainty. A critical feature of the method is its computational simplicity, relying only on the compactness of the feasible set and the continuity of criterion functions to guarantee the existence and basic regularity of the solutions. This ensures broad applicability, even for complex, nonconvex problems. Criterion ambiguity and more general aggregation methods can be accommodated. In the case of finite sets, this method is completely general and can be operationalized through a data-driven approach under virtually no assumptions.

Appendix. Proofs

Proof of Proposition 1.

The proof proceeds by analyzing what happens when either one of the two inequalities is violated, and subsequently when both are violated. This yields three cases, each of which leads to a contradiction.

  • Case 1: If fi(x^)<fi(x) and fj(x^)fj(x), then

    F*(λ^)=F(x^|λ^)=l=1nλ^lfl(x^)<l=1nλ^lfl(x)=F(x|λ^),
    which is a contradiction.

  • Case 2: If fi(x^)fi(x) and fj(x^)>fj(x), then

    F*(λ)=F(x|λ)=l=1nλlfl(x)<l=1nλlfl(x^)=F(x^|λ),
    which is a contradiction.

  • Case 3: If fi(x^)<fi(x) and fj(x^)>fj(x), then

    F*(λ^)=F(x^|λ)+ε(fi(x^)fj(x^))F(x|λ^)=F(x|λ)+ε(fi(x)fj(x)).

But because ε(fi(x)fj(x))>ε(fi(x^)fj(x^)), this implies

F(x^|λ)F(x|λ^)ε(fi(x^)fj(x^))=F(x|λ)=F*(λ)+ε(fi(x)fj(x))ε(fi(x^)fj(x^))>0>F*(λ),
which is a contradiction.

Hence, fi(x^)fi(x) and fj(x^)fj(x), as claimed. □

Proof of Lemma 1.

Fix δ>0 and iN, and let (x,x^)X(w)×X(w^). Then

F*(w^)F*(w)=F(x^|w^)F(x|w)=F(x^|w^)F(x|w^)0+F(x|w^)F(x|w)=δfi(x)δfi(x),
because, by assumption, w^=w+δei. As the preceding inequality holds for any xX(w), it follows that
F*(w^)F*(w)δ(maxfi(X(w)))>0,
and thus F*(w^)>F*(w), which completes the proof. □

Proof of Proposition 2.

We first show that, for any given decision, the performance ratio is quasiconcave in the weight. To that end, fix any feasible action xX, and define

Uc(x)={λΔ:φ(x|λ)c}(A.1)
as the set of (normalized) weights λ that yield a performance ratio φ(x|λ) in Equation (3) at a level of at least c[0,1]. In fact, the upper contour sets in Equation (A.1) are nested in their level, that is,
cc^Uc^(x)Uc(x),(A.2)
for any c^[0,1]. Furthermore, for sufficiently small values of c, we have Uc(x)=Δ.46 Thus, we may choose c[0,1] such that Uc(x) is nonempty, and consider any two weights λ,λ^Uc(x). For any θ(0,1), their convex combination λθ=θλ+(1θ)λ^ belongs to Uc(x) if and only if
φ(x|λθ)=F(x|λθ)F*(λθ)c.(A.3)

Because φ(x|λ)c and φ(x|λ^)c, the definition of the weighted objective in Equation (1) implies

F(x|λθ)=F(x|θλ+(1θ)λ^)=θF(x|λ)+(1θ)F(x|λ^)c(θF*(λ)+(1θ)F*(λ^)),(A.4)
where we use that min{F*(λ),F*(λ^)}>0 by the nontriviality condition (N). Moreover, the convex combination of the individually optimized objectives F*(λ) and F*(λ^) (each obtained from possibly different decisions) cannot be smaller than the value of the jointly optimized objective (from a single decision):
θF*(λ)+(1θ)F*(λ^)=maxx,x^X{θF(x|λ)+(1θ)F(x^|λ^)}maxxX{θF(x|λ)+(1θ)F(x|λ^)}=maxxX{F(x|θλ+(1θ)λ^)}=F*(λθ).(A.5)

Therefore, as long as c0, the fact that F*(λθ)>0, together with Equations (A.4) and (A.5), implies that Equation (A.3) holds. As an immediate consequence, it is λθUc(x), which in turn means that Uc(x) is convex for all c[0,1] and all xX, as claimed.

The convexity of the upper contour sets Uc(x) implies that φ(x|λ) is quasiconcave in λ (see, e.g., Arrow and Enthoven 1961, p. 780). Hence, the minimum of the performance ratio over all weights in Δ is attained on the boundary:

minλΔφ(x|λ)=minλΔφ(x|λ).

The same quasiconcavity argument applies to any face of Δ, so the minimum must also lie on the boundary of each face. Repeating this argument recursively over edges and vertices, we conclude that the minimum is attained in the set Λ={e1,,en} of the n vertices of Δ. Thus, we obtain a form of “perfect complementarity,”

ρ(x)=minλ{e1,,en}{F(x|λ)F*(λ)}=miniN{F(x|ei)F*(ei)}=miniN{fi(x)fi*}=miniNϕi(x),
as claimed in Equation (6), which concludes the proof. □

Proof of Lemma 2.

For any xX, Proposition 2 provides for a representation of the performance index, which can be rewritten in the form ρ(x)=ϕι(x), where ι(x)(x)=argminiNϕi(x). The set of pseudo-robust decisions Ψ in Equation (7) is nonempty by the extreme-value theorem (Rudin 1976, theorem 4.16, p. 89), and it is compact by the maximum theorem (Berge 1963, p. 116). These two theorems also guarantee that the correspondence (·) is upper semicontinuous and nonempty-valued. Consider now P, which by Equations (9) and (10) can be written as

P={xX:xX s.t. xx}={xX:xx,xX},
where is the negation of . By the continuity of f, the preference is continuous, so that P is compact. We now show that it must also be nonempty. For this, start with any ξ0Ψ: If ξ0P, then immediately ξ0R, so the claim holds. Otherwise, if ξ0P, then there exists ξ1X such that ξ1ξ0. By Equation (9) there exists iN such that fi(ξ1)>fi(ξ0) and f(ξ1)f(ξ0). Equivalently, ϕi(ξ1)>ϕi(ξ0) and ϕj(ξ1)ϕj(ξ0), for all jN, so that ρ(ξ1)ρ(ξ0). Because by assumption ξ0Ψ, by Equation (7) it must be ρ(ξ1)ρ(ξ0)=ρ*=maxρ(X), so ξ1Ψ as well. In this manner one can now construct a sequence of decisions ξk, for k{0,1,}, continuing until ξkP for some k0, which then establishes that ΨP is indeed nonempty. Failing that, by the Bolzano-Weierstrass theorem (Berge 1963, p. 67) the sequence (ξk)k0 must have a convergent subsequence in the compact set X. Because the convergence is monotone in the (continuous) preference, there can only be a single accumulation point, which is equal to the limit of the sequence: x^=limkξk. Given that the set of all pseudo-robust decisions Ψ is compact (i.e., in particular closed) as noted earlier, it is x^Ψ. In addition, xx^ for all xX, as the upper and lower contour sets of a continuous preference are closed. But this means that x^ΨP, so the robust decision set R is both nonempty and compact, for it is the intersection of two compact sets. □

Proof of Lemma 3.

By Lemma 2, the robust decision set is nonempty and compact. Hence, by the extreme-value theorem there exists x^*R=ΨP, and Equation (7) yields ρ(R)=ρ(ΨP)={ρ*}=ρ(Ψ), as well as maxρ(P)=ρ*=maxρ(X), establishing the claim. □

Proof of Lemma 4.

(i) The claim follows by combining Equations (7) and (12). (ii) Fix ε(0,1], and—following the outcome-based logic discussed in Remark 9—consider a selection

yε*Yε=argmaxyY{iNyi:tεy1,,tεyn,tεTε}.

We now show that yε* is efficient (with respect to the coordinates of points in Y).47 Suppose it is not; then there exists a feasible y^yε* which achieves a strictly greater payoff, a contradiction, which in turn establishes the claimed efficiency of yε*. Moreover, it is clear that Yε=ϕ(ε), so that

ε=ϕ1(Yε)={xX:ϕ(x)Yε}.

As a result, εP, for all ε(0,1], as claimed. (iii) Let ε(0,1], and consider any (x,x^)ε×Ψ. If xΨ, then

ρ=miniNϕi(x)<miniNϕi(x^)=ρ*=maxρ(X).

On the other hand, xε implies that Φε(x)Φε(x^). Let κ=iN(ϕi(x)ϕi(x^)) be the total coordinate-wise difference in performance. Then for any ε^>0,

Φε^(x)Φε^(x^)ε^·κρ*ρ>0ε^ρ*ρκ>0.

This establishes the claim in part (iii), for any ε(0,min{ε,(ρ*ρ)/κ}) and ε^(0,ε). For ε^=0, the claim follows from part (i). □

Proof of Proposition 3.

We first note that ε is upper semicontinuous in ε[0,1], and set Q=Lim¯ε0+ε. The proof proceeds in two steps. We first show that QR and then Q.

Step 1: QR. By upper semicontinuity of ε we have that Q0=Ψ (see, e.g., Aubin and Frankowska 1990, p. 41), taking into account that Ψ is compact; cf. Endnote 19 for Section 3.1. Because εP, for all ε(0,1], we further obtain QclP. Assume that there is a point xQ\P, which means that x=limε0+xε, for some selection xεε, where ε(0,1]. Because xP, there exists x^P such that x^x. That is, ϕj(x^)>ϕj(x) for some jN, and ϕ(x^)ϕ(x). If we denote by A=(1/n)iNϕi the average performance over all criteria, then xP implies that A(x)<A(x^). By the continuity of A(·) on X, it is therefore limε0+A(xε)=A(x)<A(x^). Hence, there exists a ε¯(0,1) such that A(xε)<A(x^) for all ε(0,ε¯). Thus, for any given ε(0,ε¯):

Φε(x^)Φε(xε)=(1ε)(ρ(x^)ρ(xε))+ε(A(x^)A(xε))ε(A(x^)A(xε))>0,
where we have taken into account that x^Ψ, so ρ(x^)=maxρ(X)ρ(xε). But this means Φε(xε)<Φε(x^), contradicting xεε. Hence, we have shown that necessarily Q\P=. Because we already know that QΨ(clP), one obtains QΨP=R, as claimed.

Step 2: Q. Consider a monotone sequence (εk)k=0(0,1) with εk+1<εk, for all k0, and such that limkεk=0. Using the same selection xε as in Step 1, the sequence (xεk)k=0X is a sequence of points contained in the compact set X, so that by the Bolzano-Weierstrass theorem (Berge 1963, p. 67) there exists a convergent subsequence (xεkj)j=0(xεk)k=0 (with limit in X). That is, there exists qX, such that q=limjxεkj, and by the definition of Q we therefore obtain that qQ and thus, Q, as stated in the result.

This concludes the proof. □

Proof of Lemma 5.

Continuity of the optimal ε-augmented performance index Φε* over ε[0,1] follows from the maximum theorem (Berge 1963, p. 116), given the continuity of Φε in both arguments and the compactness of X. To establish monotonicity, fix ε,ε[0,1] with ε>ε. By optimality of xε for Φε and feasibility of xε at ε, we obtain

Φε*=(1ε)ρε+εμε(1ε)ρε+εμε=Φε(xε).

Subtracting Φε*=(1ε)ρε+εμε, we find

Φε*Φε*Φε(xε)Φε*=(εε)(μερε)0,
because μερε by definition. This confirms that Φε* is nondecreasing on [0, 1]. To establish convexity, we show that regular increments in ε increase the corresponding differences of the value function. If εˇ=(ε+ε)/2 denotes the midpoint between ε and ε, it is sufficient to show that the difference of successive differences,
Δ=(Φε*Φεˇ*)(Φεˇ*Φε*),
must be nonnegative. Indeed, setting δ=(εε)/2>0, it is
Φε*Φεˇ*δ(μεˇρεˇ)Φεˇ*Φε*,
where the first inequality follows from Φε*Φε(xεˇ), and the second from Φε*Φε(xεˇ). Together, these imply Δ0, which proves that Φε* is convex on [0, 1]. □

Proof of Lemma 6.

(i)/(iv) The function Φε(·) conforms to Equation (1) as a weighted objective comprising the two criteria ρ(·) and μ(·). By Proposition 1, increasing ε[0,1), which shifts weight from the first criterion to the second, can only decrease the optimal value of the first and increase that of the second. Thus, ρε is decreasing and με is increasing in ε[0,1], as claimed. (ii) Because Φ0=ρ, we have that ρ0=maxxXΦ0(x)=ρ*. Hence, by part (i) we have that ρ*ρε, for all ε[0,1]. (iii) Because ρε is bounded above by ρ* and nonincreasing in ε, the sequence (ρε)ε[0,1] converges as ε1 by the monotone convergence theorem (see, e.g., Rudin 1976, theorem 3.14, p. 55). Moreover, because ρ* is the least upper bound, the limit of ρε as ε1 must equal ρ*. (v) This result follows from the monotonicity of Φε* established in Lemma 5. Because Φε*=(1ε)ρε+εμε is nondecreasing in ε, and ρερ*, by parts (i) and (ii), it must be that μερ* for all ε[0,1]; otherwise, the convex combination would decrease, contradicting monotonicity. □

Proof of Proposition 4.

By Equations (15) and (16), along with Lemma 5, we have

0ψε=ρ*ρε=Φ0*Φε*εμε1εΦ0*Φε*+εμεεμε,ε[0,1].

Thus, given a desired approximation-error bound δ>0, for any ε^(0,1], it is

εmin{ε^,δ/με^}ψεδ.

Because, by Lemma 6 (iv), the function ε^δ/με^ is decreasing, a simpler implication follows by choosing ε^=1:

εδ/μ1ψεδ.

This establishes the claimed approximation guarantee. □

Proof of Lemma 7.

Fix any ε[0,1]. By Equation (16), the difference between the upper and lower bounds in Equation (17) is given by

dε=Φε*ρε=ε(μερε)0.

The midpoint estimator ρ^ε defined in Equation (18) is the arithmetic average of these bounds:

ρ^ε=Φε*+ρε2.

By construction, this implies

|ρ^ερ*|12(Φε*ρε)=dε2.

Therefore, ρ^ε approximates ρ* to within dε/2, as claimed. □

Proof of Lemma 8.

Let ΩR+n\{0} be a nonempty compact set, which is not reduced to the origin.

  1. (a) Because Ω is nonempty, its radial projection π(Ω) is nonempty as well. Moreover, π is continuous on the compact set Ω, so π(Ω) is compact (Apostol 1974, theorem 4.25, p. 82). We now show that π(Ω)=C(Ω)Δ. For the inclusion π(Ω)C(Ω)Δ, take any λπ(Ω). Then there exists wΩ such that π(w)=λ, i.e., λ=w/w1. Because ΩR+n\{0}, it follows that for any α>0, we have αwC(Ω), and in particular, λ=αw for α=1/w1. Hence λC(Ω)Δ. Conversely, suppose λ^C(Ω)Δ. Then there exists α^>0 such that w^=α^λ^Ω. Applying the radial projection yields π(w^)=w^/w^1=λ^, because λ^1=1. Therefore, λ^π(Ω), proving C(Ω)Δπ(Ω). Together, we obtain π(Ω)=C(Ω)Δ. (b) Because ΩΩ, we trivially have π(Ω)π(Ω). To prove the reverse inclusion, let λ^π(Ω). Then λ^=π(w^) for some w^=α^λ^Ω, with α^>0. Because Ω is compact and 0Ω, the ray αλ^ intersects Ω in a bounded interval of positive length. That is, there exist real numbers α¯,α¯, with 0<α¯α^α¯<, such that α¯λ^,α¯λ^Ω. Hence, λ^=π(α¯λ^)π(Ω), so π(Ω)π(Ω), completing the proof that π(Ω)=π(Ω).

  2. Let wint(Ω). Because ΩC(Ω), it follows that wint(C(Ω)). Because Δ is a smooth manifold and the intersection of an open set with it is open (in the relative topology), we obtain that π(int(Ω))=int(C(Ω))Δ is open in Δ,48 and thus π(w)int(π(Ω)). This proves that the radial projection maps interior points of Ω to interior points of π(Ω). It follows that if ΩΩ is open in R+n, then π(Ω) is open in Δ.

  3. Let ΩΩ be nonempty. Then π(Ω)Δ is nonempty. Its closure is compact because Ω is contained in the compact set Ω. Using continuity of π and part (i) (b) we have that

    π(Ω)=(cl(π(Ω)))=π(cl(Ω))=π((cl(Ω)))=π(Ω).

    Hence, the boundary of π(Ω) equals the boundary of the projection of the boundary of Ω.

  4. Suppose Ω is convex. If π(Ω) were not convex, then there would exist λ,λπ(Ω) and μ(0,1) such that λ=μλ+(1μ)λπ(Ω). Let wπ1({λ}), and wπ1({λ}). Then for any α>0,

    αλ=αμw1w+α(1μ)w1w.

    In particular, define

    α=(μw1+1μw1)1>0,andμ^=αμw1(0,1).

    Then

    αλ=μ^w+(1μ^)wΩ,
    by convexity of Ω. Hence, π(αλ)=λπ(Ω), contradicting our assumption. Therefore, π(Ω) must be convex.

  5. We now show that the image π(Ω) of a straight line segment ΩΩ is a straight line segment in Δ, with the possibility of a degenerate case when the straight line segment is projected to a single point in Δ. Indeed,

    π(μ^w+(1μ^)w)=μ^w+(1μ^)wμ^w+(1μ^)w1=μ^w1λ+(1μ^)w1λμ^w+(1μ^)w1=μ^w1μ^w1+(1μ^)w1λ+(1μ^w1μ^w1+(1μ^)w1)λ=μ(μ^)λ+(1μ(μ^))λ,
    where λ=w/w1,λ=w/w1, and
    μ(μ^)=μ^w1μ^w1+(1μ^)w1,μ^[0,1].

    The latter function is continuously differentiable, with

    dμ(μ^)dμ^=w1w1(μ^w1+(1μ^)w1)2>0,μ^[0,1],
    which implies that μ(·) is strictly increasing on [0, 1]. Thus, provided that λλ, the radial projection of a straight line is a bijection, which also preserves the orientation of any straight path between two different points in Ω, as long as they do not map to the same point in Δ (for λ=λ).

  6. Consider first the case where co(Ω) is a bounded polyhedron (i.e., a polytope; see, e.g., Nemhauser and Wolsey 1999, definition 2.2, p. 86). By part (i) (b) it is π(co(Ω))=π((co(Ω))). By part (iv), the set π(co(Ω)) is convex. Part (v) then guarantees that the straight lines between any two vertices of co(Ω) remain straight lines in their radial projection onto Δ, which implies it is enough to project the vertices of co(Ω) and then take the convex hull. Because the vertices are included in Ω, we therefore obtain that π(co(Ω))=co(π(Ω)) as claimed. If the bounded convex set co(Ω) is not a polytope, then it can be approximated arbitrarily closely by a polytope (see, e.g., Bronstein 2008), so that the claim—omitting some of the convergence-specific details—follows in the limit.

  7. Let Ω,ΩR+n such that Ω=ΩΩ. Then C(ΩΩ)=C(Ω)C(Ω), which implies the claim by part (i) (a) (cf. Endnote 48): π(Ω)=(C(Ω)Δ)(C(Ω)Δ)=π(Ω)π(Ω).

  8. Let Ω,ΩR+n such that Ω=ΩΩ. Then, by part (i) (a) we have π(Ω)=C(ΩΩ)Δ. If Ω=ΩΩ for some Ω,ΩR+n, then π(Ω)=C(ΩΩ)Δ, as claimed. □

Proof of Proposition 5.

Recall that the ambiguity set ΩR+n is compact, nonempty, and not reduced to the origin. For any feasible decision xX, the Ω-conditioned performance index corresponds to its worst-case relative performance. Using the radial projection of Ω and invoking Lemma 8 (i), we obtain:49

ρ(x|Ω)=minwΩF(x|w)F*(w)=minλπ(Ω)F(x|λ)F*(λ)=minλπ(Ω)F(x|λ)F*(λ).

As shown in the proof of Proposition 2, the performance ratio φ(x|λ)=F(x|λ)/F*(λ) is quasiconcave in λ for fixed xX. Because π(Ω) is compact, the minimum is attained on the boundary π(Ω), which equals π(Ω) by Lemma 8 (i) (b). Moreover, the quasiconcavity of φ(x|·) implies that all upper contour sets are convex, so the minimum is also attained on the boundary of the convex hull co(π(Ω)). Thus, if there exists an extremal base Λ such that co(Λ)=co(π(Ω)), then the minimum is attained on the compact set Λ, as claimed. □

Proof of Proposition 6.

Fix any θΘ, and consider the state-contingent Ω-conditioned performance index,

ϕ(x,θ|Ω)=minλπ(Ω)G(x,θ|λ)G*(θ|λ),xX,(A.6)
which measures the performance of an action relative to all feasible weighted-sum scalarizations of the multicriteria decision problem, in state θ. Because Λ is an extremal base of π(Ω), we have co(Λ)=co(π(Ω)). The quasiconcavity of G(x,θ|λ)/G*(θ|λ) in λ for fixed (x,θ), which obtains as in the proof of Proposition 2, ensures that the minimum in Equation (A.6) is attained at an extreme point of co(π(Ω)). Thus, as in Proposition 5, we obtain an equivalent representation,
ϕ(x,θ|Ω)=minλΛG(x,θ|λ)G*(θ|λ),xX.(A.7)

By Equation (27), the performance index is the minimum of the state-contingent Ω-conditioned performance index in Equation (A.7) over all θΘ, that is,

ρ(x|Ω,Θ)=minθΘϕ(x,θ)=minθΘ{minλΛG(x,θ|λ)G*(θ|λ)},xX.(A.8)

Thus, reversing the order of minimization in Equation (A.8) yields Equation (27), as claimed. □

Proof of Corollary 1.

Let Λ={λ(1),,λ(n)} be the (smallest) extremal base of π(Ω) (or, equivalently, of co(π(Ω))), which is finite by hypothesis, with n1. Then Equation (27) becomes

ρ(x|Ω,Θ)=minλΛ{minθΘG(x,θ|λ)G*(θ|λ)}=miniN{minθΘG(x,θ|λ(i))G*(θ|λ(i))}=miniN{fi(x)},xX,(A.9)
where we have set fi(x)=minθΘ{G(x,θ|λ(i))/G*(θ|λ(i))}, for all (x,i)X×N. Because by definition maxxXG(x,θ|λ(i))=G*(θ|λ(i)), it is fi*=maxxXfi(x)=1, for all iN, so that Equation (A.9) is in fact equivalent to Equation (28), which completes the proof. □

Proof of Lemma 9.

Fix i,j,mN (with mn),  λΔ, and δ>0. Then for any yY: If y^=y+δeiY, then

H(y|λ)=h1(l=1nλlh(yl))=h1(l=1nλlh(y^l)λi(h(y^i)h(yi))){<H(y^|λ),if λi>0,=H(y^|λ),if λi=0.

This implies Axiom 1.50 Consider now the case where (y^,λ^)=(y,λ)+(yiyj,λiλj)(ejei). Then for i=j, Axiom 2 holds trivially. For ij, it is

H(y^|λ^)=h1(lN\{i,j}λlh(yl)+λih(yi)+λjh(yj))=H(y|λ),
which also yields Axiom 2. Because λ is by assumption normalized, we also obtain that Axiom 3 holds, as
H(α1n)=h1(l=1nλlh(α))=h1(h(α))=α,
for any α>0. As far as associativity is concerned, let us assume that (λ1,,λm)0, and let
α=H(l=1mylel|l=1mλlell=1mλl)=h1(l=1mλlh(yl)l=1mλl).

With this, one obtains

H(α1m,ym+1,,yn|λ)=h1(hh1(l=1mλlh(yl)l=1mλl)·(l=1mλl)+l=m+1nλlh(yl))=H(y|λ),
whence Axiom 4 must hold (taking into account that hh1(·)=(·)). Finally, we observe that
H(y|ei)=h1h(yi)=yi,
so the coordinate-filter requirement, which constitutes Axiom 5, is also satisfied. This concludes the proof. □

Endnotes

1 Practical examples for multicriteria decision making are legion; see, for example, the collections of case studies by Berbel et al. (2018) in agriculture, Masri et al. (2018) in financial decision making, and Ravindran (2016) for supply chain management, to name just a few.

2 Extant ESG metrics differ widely. In their approach to “quantifying the impact of impact investing,” Lo and Zhang (2024) remain agnostic about the impact factors to be used, taking them as given and thus keeping at bay the difficulties of attributing weights to different ESG criteria and of dealing with this model uncertainty.

3 Lancaster (1966) already noted that a product can be viewed as a bundle of its attributes, with consumer valuations often empirically assessed using conjoint analysis (see, e.g., Green and Srinivasan 1990).

4 Example 9 in Section 3.6 discusses robust allocation in a two-agent exchange economy using the proposed framework.

5 This is notwithstanding the “norm equivalence” in a finite-dimensional Euclidean space, in the sense that any norm · can be bracketed by any other norm ^·^, so χ1 ^·^·χ2 ^·^ for suitable scalars χ1,χ2>0.

6 For the computation of Pareto sets, see Kung et al. (1975), Gabow et al. (1984), and Bentley et al. (1993).

7 See Yamamoto (2002) for details about maximizing a function on a Pareto-efficient set in a polyhedral setting.

8 A lexicographic evaluation of criteria based on perceived importance may justify an “elimination by aspects” (Tversky 1972), or more nuanced partial elimination heuristics using “attribute filters” (Kimya 2018).

9 Numerous ad hoc methods for determining weights exist, for example, in reliability engineering based on standards (Jiang and Chen 2020).

10 In his “Discussion on Making Things Equal,” Zhuang Zhou points out that “[t]here is nothing in the world bigger than the tip of an autumn hair, and Mount T’ai is tiny” (Watson 1968, p. 44), where Mount Tai is the highest point in the Shandong province of China.

11 For example, getting $100 in a month, in addition to repayment of the invested principal, would be attractive if obtained by investing a principal of $1 today, but not if it required investing a principal of $10 million today.

12 A point xX is said to be isolated if there exists an open set ORm such that OX={x}.

13 By the extreme-value theorem (Rudin 1976, theorem 4.16, p. 89), the minimum of a continuous function on a compact set exists and is finite.

14 It is R+n\{0}=α>0αΔ.

15 More precisely, as shown in the proof of Lemma 1, we have F*(w^)F*(w)+δ(maxfi(X(w))).

16 This “complete ignorance” (or full ambiguity) is relaxed in Section 3.7 where we allow the decision maker to face limited ambiguity.

17 Because F*(λ)minF*(Δ)>0, the function φ:X×Δ[0,1] is well defined.

18 Leontief (1941) employed this aggregation in fixed proportions as a simplification for his analysis of a larger economy.

19 By the extreme-value theorem, Ψ is nonempty, and by the maximum theorem, it is compact.

20 Given two vectors z,z^Rn, where z=(z1,,zn) and z^=(z^1,,z^n), the standard vector inequalities are defined as follows: (i) zz^ziz^i,iN; (ii) zz^zi<z^i,iN; (iii) z<z^(zz^andjNsuchthatzj<z^j). Here, N={1,,n}, with n2.

21 The relevant lower (set) limit is given by Lim¯ε0+ε={xX:limε0+(infxεxx)=0} (see, e.g., Aubin and Frankowska 1990, definition 1.4.6, p. 41). Given a sequence (εk)k=1(0,1] with limkεk=0, this lower limit contains the accumulation points of any sequence (xk)k=1 with elements xkεk, for all k1.

22 Note that: Φε(2,32)maxx1[0,1]{(1ε)x12+ε2(x12+2+1x13)}=ε31(7/12)ε2ε>0, for all ε(0,1].

23 The larger (path-connected, compact) action set X=X{(2,32)ζ:ζ[0,1]} leaves results unchanged.

24 It is 124(163ε2)/(2ε)Φε(3,12)=124(816ε+7ε2)/(2ε)>0 if and only if ε<47(212).

25 Normalizing the maximum criteria to the same score (e.g., c{1,10,100}) is natural in many applications.

26 The term “robustness equivalence principle” is analogous to the well-known “certainty equivalence principle,” for example, in linear-quadratic optimal control problems (Bertsekas 1995, p. 23), where it is optimal (i.e., maximizing the expected value of a quadratic objective functional) to replace uncertain parameters by their means.

27 For α=β, Equations (22) and (23) imply the unique robust allocation x^=(1/2,1/2), achieving ρ*=1/2.

28 The converse of part (iv) does not hold. That is, if Ω is nonconvex, then π(Ω) may still be convex.

29 The analysis in Example 10 yields an extreme counterexample for Ω={w}=ΩΩ, with Ω=[w¯,w], Ω=[w,w¯], and w¯ww¯. Indeed, for w¯10+ one obtains int(Δ)π(Ω), so π(Ω)π(Ω)=π(Ω), whereas π(Ω)=π({w}) is merely a singleton.

30 The underlying justification is Minkowski’s theorem, which states that any compact convex set is equal to the convex hull of its extreme points; see, for example, Rockafellar (1970, corollary 18.5.1, p. 167) or Schneider (2014, corollary 1.4.5, p. 17).

31 In general, the smallest Λ may not be finite (e.g., if ΩR+3 is a unit ball, then the smallest Λ=π(Ω) has a continuum of elements).

32 For analogous comments about the assumed continuity of the criteria in the action, see Remark 1 (i) in Section 2.

33 For example, in the case where Θ={(1,0),(0,1)} for l=n=2, observing θ1=1 implies θ2=0 (and vice versa).

34 For any mN, we define 1m=(1,,1)R+m.

35 It is (i=1mλiei)/(i=1mλi)=π(i=1mλiei), with the radial projection π(·), from R+n\{0} onto Δ; see Section 3.8.

36 Kolmogorov (1930) showed that if Axioms 14 are satisfied for λ=1n/n (cf. Endnote 34), then there exists a (continuous) strictly monotonic function h:DR, defined on a suitable domain DR, such that

H(y|1n/n)=h1(i=1n(1/n)h(yi)),yY.

Nagumo (1930) obtained a similar result, and the preceding “quasiarithmetic mean” is therefore also known as the “Kolmogorov-Nagumo mean.”

37 Homogeneity is not required by Axioms 15, because relatively robust decisions (cf. Proposition 2) are invariant under rescaling.

38 The gradient of LSE is the softmax function (Boltzmann 1871; Gibbs 1902, chapter XIV), widely used in machine learning (Bridle 1990).

39 The purpose of restricting all criteria to strictly positive values was merely to skip checking the nontriviality condition (N) in Section 2.1 which is satisfied here (e.g., xd=2). Hence, setting s13=0 poses no problem.

40 Under some mild additional assumptions (which we omit), the law of large numbers would guarantee convergence of the average scores to the true criterion-function values, at least when the number of available score observations for each realized action (i.e., each element of X^) goes to infinity, assuming that decisions can be properly recognized, as may naturally be the case on a sufficiently discrete underlying action set X.

41 Nathan and Lord (1983) propose “knowledge,” “delivery,” “relevance,” “interpersonal,” and “organization” as criteria for the quality of a university lecturer; see also DeNisi and Murphy (2017) for a survey.

42 This is not the only “energy trilemma”; Tilman et al. (2009) discuss a food-energy-environment trilemma in the context of biofuels.

43 Klarman et al. (1968) recognized the need for quality-of-life considerations in cost-effective treatment options (for chronic renal disease).

44 Cutler and Richardson (1997), Buxton (2005), and Newhouse (2021) discuss QALY for assessing cost-effectiveness in healthcare.

45 Baker and Freeland (1975) provide an overview of early scoring models. Stewart (1991) proposes a multicriteria decision support system for project selection using essentially equal weights for different objectives (akin to the Laplacian approach; cf. Section 4.1). Finally, Hall et al. (2015) suggest a project evaluation with an “underperformance riskiness index” relying on (typically unavailable) past statistical data.

46 Indeed, because φ(x|λ)miniN{fi(x)/F¯(xd)}=c¯(x)[0,1], for all λΔ, Equation (A.2) implies that cc¯(x)Uc(x)=Δ.

47 Efficiency in the outcome set corresponds directly to efficiency in the action set (cf. Section 3.2), because the latter is evaluated via a criterion vector that produces points in the outcome set.

48 The representation π(Ω)=C(Ω)Δ was given in part (i) for a compact set Ω, but it also applies to any subset of Ω.

49 By part (a) of Lemma 8 (i), π(Ω) is nonempty and compact, so the minimum with respect to λ must be attained by the extreme-value theorem.

50 When h is increasing, the argument of h1 cannot become smaller by dropping the term λi(h(y^i)h(yi)). Conversely, when h is decreasing, the argument of h1 weakly decreases, so the value of that inverse cannot go down.

References

  • Apostol TM (1974) Mathematical Analysis, 2nd ed. (Addison-Wesley, Reading, MA).Google Scholar
  • Arrow KJ, Enthoven AC (1961) Quasi-concave programming. Econometrica 29(4):779–800.CrossrefGoogle Scholar
  • Aubin J-P, Frankowska H (1990) Set-Valued Analysis (Birkhäuser, Boston).Google Scholar
  • Baker N, Freeland J (1975) Recent advances in R&D benefit measurement and project selection methods. Management Sci. 21(10):1164–1175.LinkGoogle Scholar
  • Ben-David S, Borodin A (1994) A new measure for the study of on-line algorithms. Algorithmica 11(1):73–91.CrossrefGoogle Scholar
  • Ben-Tal A, den Hertog D, Waegenaere AD, Melenberg B, Rennen G (2013) Robust solutions of optimization problems affected by uncertain probabilities. Management Sci. 59(2):341–357.LinkGoogle Scholar
  • Bentley JL, Clarkson KL, Levine DB (1993) Fast linear expected-time algorithms for computing maxima and convex hulls. Algorithmica 9(2):168–183.CrossrefGoogle Scholar
  • Berbel J, Bournaris T, Manos B, Matsatsinis N, Viaggi D, eds. (2018) Multicriteria Analysis in Agriculture: Current Trends and Recent Applications (Springer, New York).CrossrefGoogle Scholar
  • Berge C (1963) Topological Spaces (Oliver and Boyd, Edinburgh, UK).Google Scholar
  • Bertsekas DP (1995) Dynamic Programming and Optimal Control, vol. 1 (Athena Scientific, Belmont, MA).Google Scholar
  • Blanchet J, Murthy K (2019) Quantifying distributional model risk via optimal transport. Math. Oper. Res. 44(2):565–600.LinkGoogle Scholar
  • Boltzmann L (1871) Über das Wärmegleichgewicht zwischen mehratomigen Gasmolekülen. Sitzungsberichte der Mathematisch-Naturwissenschaftlichen Classe der Kaiserlichen Akademie der Wissenschaften, Wien 63(3):397–418.Google Scholar
  • Bridle JS (1990) Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition. Fogelman Soulié F, Hérault J, eds. Neurocomputing: Algorithms, Architectures and Applications, NATO ASI Series, vol. 68 (Springer, New York), 227–236.CrossrefGoogle Scholar
  • Bronstein EM (2008) Approximation of convex sets by polytopes. J. Math. Sci. 153(6):727–762. [Translation of Russian original, which appeared in Sovremennaya Matematika / Fundamental’nye Napravleniya (Contemporary Mathematics / Fundamental Directions), Vol. 22, Geometry, 2007.]Google Scholar
  • Buxton MJ (2005) How much are health-care systems prepared to pay to produce QALY? Eur. J. Health Econom. 6(4):285–287.CrossrefGoogle Scholar
  • Charnes A, Cooper WW (1961) Management Models and Industrial Applications of Linear Programming, vol. 1 (Wiley, New York).Google Scholar
  • Charnes A, Cooper WW, Ferguson RO (1955) Optimal estimation of executive compensation by linear programming. Management Sci. 1(2):138–151.LinkGoogle Scholar
  • Cournot A (1838) Recherches sur les Principes Mathématiques de la Théorie des Richesses (Hachette, Paris).Google Scholar
  • Cutler DM, Richardson E (1997) Measuring the health of the U.S. population. Brookings Papers Econom. Activity Microeconom. 28(1997):217–282.CrossrefGoogle Scholar
  • Delage E, Ye Y (2010) Distributionally robust optimization under moment uncertainty with application to data-driven problems. Oper. Res. 58(3):595–612.LinkGoogle Scholar
  • DeNisi AS, Murphy KR (2017) Performance appraisal and performance management: 100 years of progress? J. Appl. Psych. 102(3):421–433.CrossrefGoogle Scholar
  • Edgeworth FY (1881) Mathematical Psychics (C. Kegan Paul, London).Google Scholar
  • Edgeworth FY (1897) La teoria pura del monopolio. Giornale degli Economisti 15(Anno 8):13–31.Google Scholar
  • Ehrgott M (2010) Multicriteria Optimization, 2nd ed. (Springer, Berlin).Google Scholar
  • Fechner GT (1860) Elemente der Psychophysik (Part II) (Breitkopf & Härtel, Leipzig, Germany).Google Scholar
  • Gabow HN, Bentley JL, Tarjan RE (1984) Scaling and related techniques for geometry problems. Proc. 16th Annual ACM Sympos. Theory Comput. (Association for Computing Machinery, New York), 135–143.Google Scholar
  • Gibbs JW (1902) Elementary Principles in Statistical Mechanics (General Publishing, Toronto).Google Scholar
  • Goel A, Meyerson A, Weber TA (2009) Fair welfare maximization. Econom. Theory 41(3):465–494.CrossrefGoogle Scholar
  • Green PE, Srinivasan V (1990) Conjoint analysis in marketing: New developments with implications for research and practice. J. Marketing 54(4):3–19.CrossrefGoogle Scholar
  • Hall NG, Long DZ, Qi J, Sim M (2015) Managing underperformance risk in project portfolio selection. Oper. Res. 63(3):660–675.LinkGoogle Scholar
  • Han J, Weber TA (2023) Price discrimination with robust beliefs. Eur. J. Oper. Res. 306(2):795–809.CrossrefGoogle Scholar
  • Hardy GH, Littlewood JE, Pólya G (1934) Inequalities (Cambridge University Press, Cambridge, UK).Google Scholar
  • Harsanyi JC (1955) Cardinal welfare, individualistic ethics, and interpersonal comparisons of utility. J. Political Econom. 63(4):309–321.CrossrefGoogle Scholar
  • Hawkins TR, Gausen OM, Strømman AH (2012) Environmental impacts of hybrid and electric vehicles: A review. Internat. J. Life Cycle Assessment 17(8):997–1014.CrossrefGoogle Scholar
  • Jiang R, Chen Z (2020) A standard-based approach for multi-criteria performance evaluation of engineered systems. Reliability Engrg. System Safety 202(107001):1–12CrossrefGoogle Scholar
  • Kahneman D, Tversky A (1979) Prospect theory: An analysis of decision under risk. Econometrica 47(2):263–291.CrossrefGoogle Scholar
  • Kahneman D, Tversky A (1984) Choices, values, and frames. Amer. Psychologist 39(4):341–350.CrossrefGoogle Scholar
  • Keeney RL, Raiffa H (1993) Decisions with Multiple Objectives (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Kimya M (2018) Choice, consideration sets, and attribute filters. Amer. Econom. J. Microeconom. 10(4):223–247.CrossrefGoogle Scholar
  • Klarman HE, Francis JOS, Rosenthal GD (1968) Cost effectiveness analysis applied to the treatment of chronic renal disease. Medical Care 6(1):48–54.CrossrefGoogle Scholar
  • Kolmogorov AN (1930) Sur la notion de la moyenne. Atti della Accademia Nazionale dei Lincei, Classe di Scienze Fisiche, Matematiche e Naturali Rendiconti 12(9):388–391.Google Scholar
  • Kouvelis P, Yu G (1997) Robust Discrete Optimization and Its Applications (Springer, New York).CrossrefGoogle Scholar
  • Kung H, Luccio F, Preparata F (1975) On finding the maxima of a set of vectors. J. ACM 22(4):469–476.CrossrefGoogle Scholar
  • Lancaster KJ (1966) A new approach to consumer theory. J. Political Econom. 74(2):132–157.CrossrefGoogle Scholar
  • Leontief WW (1941) The Structure of American Economy 1919–1929: An Empirical Application of Equilibrium Analysis (Harvard University Press, Cambridge, MA).Google Scholar
  • Lo AW, Zhang R (2024) Quantifying the impact of impact investing. Management Sci. 70(10):7161–7186.LinkGoogle Scholar
  • Loewenstein GF, Thompson L, Bazerman MH (1989) Social utility and decision making in interpersonal contexts. J. Personality Soc. Psych. 57(3):426–441.CrossrefGoogle Scholar
  • Marshall A (1890) Principles of Economics (Macmillan, London).Google Scholar
  • Mas-Colell A, Whinston MD, Green JR (1995) Microeconomic Theory (Oxford University Press, Oxford, UK).Google Scholar
  • Masri H, Pérez-Gladish B, Zopounidis C, eds. (2018) Financial Decision Aid Using Multiple Criteria: Recent Models and Applications (Springer, New York).CrossrefGoogle Scholar
  • Miyamoto JM, Wakker PP, Bleichrodt H, Peters HJM (1998) The zero-condition: A simplifying assumption in QALY measurement and multiattribute utility. Management Sci. 44(6):839–849.LinkGoogle Scholar
  • Mohajerin Esfahani P, Kuhn D (2018) Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Programming 171(1–2):115–166.CrossrefGoogle Scholar
  • Nagumo M (1930) Über eine Klasse der Mittelwerte. Japanese J. Math. 7:71–79.CrossrefGoogle Scholar
  • Nathan BR, Lord RG (1983) Cognitive categorization and dimensional schemata: A process approach to the study of halo in performance ratings. J. Appl. Psychol. 68(1):102–114.Google Scholar
  • Nemhauser GL, Wolsey LA (1999) Integer and Combinatorial Optimization (Wiley Interscience, New York).Google Scholar
  • Newhouse JP (2021) An ounce of prevention. J. Econom. Perspect. 35(2):101–118.CrossrefGoogle Scholar
  • Pareto V (1894) Il massimo di utilità dato dalla libera concorrenza. Giornale degli Economisti 9(2):48–66.Google Scholar
  • Pareto V (1897) Cours d’Économie Politique, vol. 2 (F. Rouge, Lausanne, Switzerland).Google Scholar
  • Pareto V (1906) Manuale di Economia Politica (Società Editrice Libraria, Milan).Google Scholar
  • Pliskin JS (1974) The management of patients with end-stage renal failure: A decision-theoretic approach. Doctoral thesis, Harvard University, Boston.Google Scholar
  • Pliskin JS, Beck CH (1976) A health index for patient selection: A value function approach with application to chronic renal failure patients. Management Sci. 22(9):1009–1021.LinkGoogle Scholar
  • Ravindran AR, ed. (2016) Multiple Criteria Decision Making in Supply Chain Management (CRC Press, New York).CrossrefGoogle Scholar
  • Rockafellar RT (1970) Convex Analysis (Princeton University Press, Princeton, NJ).CrossrefGoogle Scholar
  • Rudin W (1976) Principles of Mathematical Analysis, 3rd ed. (McGraw-Hill, New York).Google Scholar
  • Savage LJ (1951) The theory of statistical decision. J. Amer. Statist. Assoc. 46(253):55–67.CrossrefGoogle Scholar
  • Schneider R (2014) Convex Bodies: The Brunn-Minkowski Theory, 2nd ed. (Cambridge University Press, Cambridge, UK).Google Scholar
  • Sleator DD, Tarjan RE (1985) Amortized efficiency of list update and paging rules. Comm. ACM 28(2):202–208.CrossrefGoogle Scholar
  • Steuer RE (1986) Multiple Criteria Optimization: Theory, Computation, and Application (Wiley, New York).Google Scholar
  • Stewart TJ (1991) A multi-criteria decision support system for R&D project selection. J. Oper. Res. Soc. 42(1):17–26.Google Scholar
  • Tilman D, Socolow R, Foley JA, Hill J, Larson E, Lynd L, Pacala S, et al. (2009) Beneficial biofuels: The food, energy, and environment trilemma. Science 325(5938):270–271.CrossrefGoogle Scholar
  • Tversky A (1972) Elimination by aspects: A theory of choice. Psych. Rev. 79(4):281–299.CrossrefGoogle Scholar
  • Villani C (2008) Optimal Transport: Old and New, Grundlehren der Mathematischen Wissenschaften, vol. 338 (Springer, Berlin).Google Scholar
  • Wald A (1945) Statistical decision functions which minimize the maximum risk. Ann. Math. 46(2):265–280.CrossrefGoogle Scholar
  • Wang T, Fu Y (2020) Constructing composite indicators with individual judgements and best-worst method. Soc. Indicators Res. 149(1):1–14.CrossrefGoogle Scholar
  • Watson B (1968) The Complete Works of Chuang Tzu (Columbia University Press, New York).Google Scholar
  • Weber EH (1846) Der Tastsinn und das Gemeingefühl. Wagner R, ed. Handwörterbuch der Physiologie mit Rücksicht auf die physiologische Pathologie, vol. 3, section 2 (Vieweg, Braunschweig, Germany), 481–588.Google Scholar
  • Weber TA (2014) On the (non-)equivalence of IRR and NPV. J. Math. Econom. 52:25–39.CrossrefGoogle Scholar
  • Weber TA (2023) Relatively robust decisions. Theory Decision 94(1):35–62.CrossrefGoogle Scholar
  • Weber TA (2024) Optimal depth of discharge for electric batteries with robust capacity-shrinkage estimator. Proc. 2024 4th Internat. Conf. Smart Grid Renewable Energy (SGRE) (IEEE, New York), 1–5.Google Scholar
  • Weber TA (2025) Monopoly pricing with unknown demand. Scandinavian J. Econom. 127(1):235–285.CrossrefGoogle Scholar
  • World Energy Council (2024) World Energy Trilemma 2024: Evolving with resilience and justice. Technical report, World Energy Council, London.Google Scholar
  • Yager RR (1988) On ordered weighted averaging aggregation operators in multi-criteria decision making. IEEE Trans. Systems Man Cybernetics 18(1):183–190.CrossrefGoogle Scholar
  • Yamamoto Y (2002) Optimization over the efficient set: Overview. J. Global Optim. 22(1–4):285–317.CrossrefGoogle Scholar
  • Zarghami M, Szidarovszky F (2010) On the relation between compromise programming and ordered weighted averaging operator. Inform. Sci. 180(11):2239–2248.CrossrefGoogle Scholar
  • Zeleny M (1974) A concept of compromise solutions and the method of the displaced ideal. Comput. Oper. Res. 1(3):479–496.CrossrefGoogle Scholar