Asymptotic Welfare Performance of Boston Assignment Algorithms

Published Online:https://doi.org/10.1287/stsy.2022.0104

Abstract

We make a detailed analysis in a special case of the Boston algorithm, which is widely used around the world to assign students to schools. We compute the limiting distribution in large random markets of both the utilitarian welfare and the order bias, a recently introduced average-case fairness measure. Our results show that the differences in utilitarian welfare between the Boston algorithms and the serial dictatorship (SD) algorithm are small and positive, whereas the differences in terms of order bias are large and positive. The naive implementation of the Boston algorithm beats its adaptive implementation on both utilitarian welfare and order bias, and both apparently beat SD on both criteria. In order to establish our results, we derive several basic results on the time evolution of the assignments made by the algorithms, which we expect to be useful for other applications. For example, we compute limiting distributions as a function of θ[0,1] of the exit time and preference rank obtained for an arbitrary agent whose initial relative position in the tiebreak order is θ.

1. Introduction

The school choice problem, involving matching students to places in schools, is of great theoretical and practical interest. The Boston mechanism is widely used in practice for school choice, despite criticism that it gives strategic incentives for agents (students and parents) to misrepresent their preferences. Its main rival is the (student-proposing) deferred acceptance (DA) mechanism, which is strategyproof; there is no incentive for students to misrepresent their preferences. However, it has been long known that DA pays a price in overall welfare for this. This important trade-off has been heavily studied.

Analysis of such mechanisms is not easy in full generality, and various simplifying assumptions are often made. As do many authors, we consider here the very special case in which each school has a single seat, schools have strict priorities given by a common tiebreak order over the students, and students have complete strict preferences over schools. This leads us to the house allocation (Hylland and Zeckhauser 1979) or one-sided matching problem, another classic problem in the area of allocation of indivisible goods. Note that this is different from the closely related housing market problem of Shapley and Scarf (1974) because in our case, agents do not start with an item and trade it but are simply matched to an item.

In the housing allocation framework, perhaps the most commonly seen mechanism in theory and practice is serial dictatorship (SD), which is strategyproof. The Boston algorithms are believed to generally outperform SD in overall welfare, but analytic results are currently lacking. The present paper derives precise asymptotic results to help fill this gap.

1.1. The Algorithms

In this paper, we consider three prominent algorithms: serial dictatorship and the naive and adaptive variants of the Boston mechanism. Each can be described (and may be implemented in practice) via a centralized procedure that takes an entire preference profile and outputs a matching of agents to items, but each is more easily and commonly interpreted as a dynamic process. This interpretation does not change the results because we care only about which agent ultimately obtains which item, not the process that delivers such an allocation. Each algorithm assumes an exogenous total order on agents, allowing them to take turns choosing or bidding for items.

SD is perhaps the most famous algorithm for housing allocation; each agent, in turn, chooses and is matched to the item he most prefers among those still available.

Naive Boston (NB) proceeds in rounds. In each round, some of the agents and items will be permanently matched, and the rest will be relegated to the following round. At round r (r=1,2), each remaining unmatched agent, in turn, bids for his rth choice among all the items and will be matched to that item if it is still available.

Adaptive Boston (AB) is similar but differs in the bidding. At each round of this algorithm, each remaining agent, in turn, bids for his most-preferred item among those still available at the start of the round. The adaptive Boston mechanism takes fewer rounds to finish than the naive version because agents do not waste time bidding for items that have already been assigned to someone else in a previous round. Naive and adaptive Boston are identical in the first round but different thereafter; in round r, each remaining agent bids for his sth preference, where sr and s depends on the agent (and the history of the process so far).

1.2. Performance Measures

Comparisons between mechanisms require a performance measure (sometimes called a figure of merit) to assess the value of the matchings achieved. If such a measure is to be derived from the agents’ satisfaction with the items they are assigned, then in the absence of other information, it can only be based on their ordinal preference ranks for those items. One commonly used way to quantify this is to devise a positional scoring rule giving the utility σ(s) derived by any agent from receiving his sth choice of item. In this paper, we consider two performance measures and two scoring rules. The normalized Borda score (also known as linear utility) for the sth preference among n items is σ(s)=nsn1. For many purposes, any decreasing linear function of s would be equivalent. The other scoring rule is k-approval, which has σ(s)=1 for sk and σ(s)=0 for s > k.

Utilitarian welfare judges the value of a matching to be the total utility derived by all agents from the items they are assigned. Order bias (Freeman et al. 2021), a more recently introduced measure, is the maximum difference in expected utility obtained by any two agents. This is an average-case measure; the use of expected utilities implies averaging over a probability distribution on profiles and their resulting outcomes, a point we discuss. Usually but not always, the two agents whose expected utilities are maximally different will be the first and last agents in the exogenous choosing order.

1.3. Random Markets

A baseline probability model for preferences is that each agent has a preference order chosen at random, independently of other agents, with all possible permutations of the items being equally likely. This is often called the random market model or (sometimes in the social choice literature) impartial culture. Although far from realistic, this model has the advantage of mathematical tractability; it is often possible to obtain explicit distributional results for the outcomes of social-choice mechanisms in the random-market case.

1.4. Assumptions on Agent Behavior

We make the further assumption that agents are sincere; they do not misrepresent their preferences in an attempt to obtain a better outcome for themselves. Both Boston mechanisms are susceptible to this kind of manipulation, which can lead to worse outcomes for less adept players.

As a justification for this assumption, we offer two arguments. First, the random market model provides a partial defense against manipulation. If each agent knows only the (uniform) distribution of preferences of the other agents (which seems realistic in many applications), then there is no incentive to deviate from sincerity; in other words, the sincere profile is a Bayesian Nash equilibrium for the game induced by the mechanism. Second, understanding the social choice rule underlying a given mechanism is a first important step when comparing mechanisms, and the approach is often used in the literature.

In any case, from now on we shall ignore any issues of strategic behavior by agents.

1.5. Description of Our Results

This paper reports results on the asymptotic performance for the housing allocation problem of the Boston mechanisms in large random markets, where the number n of agents (and of items) tends to .

A key general observation regarding this setting is that agents’ preferences are rather diverse, so much so that with high probability, it is possible to match most of the agents to one of their first few preferences—and both Boston algorithms will successfully do so. More precisely, a given agent’s preference rank R for the item he is assigned has a probability distribution that converges to a limit as n. This means, in particular, that the value of R an agent is likely to obtain does not increase with n but has much the same distribution for any sufficiently large n. For the Boston mechanisms (but not for serial dictatorship), this is even true of the very last agent in the choosing order. This paper’s main results describe explicitly the limiting distributions for the preference rank obtained by an agent in a given position θ[0,1] in the choosing order, as a function of θ. We then obtain consequent results for the order bias and utilitarian welfare. To our knowledge, these are the first detailed analytic results on the rank distribution of allocations given by the Boston algorithms.

Our results suggest that serial dictatorship is (slightly) inferior to the Boston algorithms in utilitarian welfare and (very much) inferior in order bias.

1.6. Literature Review

We give a brief summary of some key references, with no claim to be complete. The housing allocation problem was formally introduced by Hylland and Zeckhauser (1979), the school choice problem was introduced by Abdulkadiroğlu and Sönmez (2003), and the Boston mechanism was introduced by Abdulkadiroğlu et al. (2005). The adaptive Boston mechanism was formalized by Mennle and Seuken (2021).

A persistent theme of the research literature is the inevitable trade-off between strategy proofness and agent welfare, and there is still much to be learned about these issues. SD is strategyproof, whereas neither Boston mechanism is; AB gives less incentive to strategize than NB (Mennle and Seuken 2021). A comparison of Boston and other school choice mechanisms shows that welfare is generally higher under Boston (Calsamiglia et al. 2020). The Boston mechanism has been studied axiomatically (Kojima and Ünver 2014, Dur et al. 2018). Several arguments in favor of the Boston mechanism have been made, mainly on the basis of improved efficiency/welfare (Miralles 2009, Abdulkadiroğlu et al. 2011), and several against (Ergin and Sönmez 2006, Pathak and Sönmez 2008), mainly on the basis of unfairness to agents who strategize poorly.

Several authors have studied the behavior of prominent algorithms for house allocation in the large random market model (Knuth 1996, Nikzad 2022, Ortega and Klein 2022) (we say more about their results in Section 8.2). More general analysis of entire classes of algorithms was performed by Che and Tercieux (2018), showing that under rather general conditions on the utility functions of agents and even allowing for some correlation in preferences, all Pareto efficient mechanisms (a class that includes Boston and SD) asymptotically achieve the maximum possible utilitarian welfare. Pycia (2019) showed that, for a more general class of problems that includes house allocation, most common measures of performance are asymptotically the same for all Pareto efficient and strategyproof mechanisms.

1.7. Outline of the Remainder of the Paper

Each section deals first with average-case results for an arbitrary initial segment of agents in the choosing order and then, with the fate of an individual agent at an arbitrary position.

The core technical results for naive Boston are Theorem 7 (number of agents remaining at a given round) and Theorem 9 (exit time of a given agent). For adaptive Boston, we have Theorem 10 (number of agents remaining at a given round), Theorem 12 (bids and exits at a given round), and Theorem 13 (fate of a given agent). The results involve recursively defined quantities ωr,zr,yr,urs.

The results for serial dictatorship are straightforwardly derived, but the Boston algorithms require nontrivial analysis. Of those, naive Boston is much easier because the exit time of an agent equals the preference rank of the item obtained by the agent. However, in adaptive Boston, this is no longer true, which necessitates substantial extra technical work.

We apply the basic results to compute utilitarian welfare in Section 3.1, the key result being Theorem 2 and its corollaries, and order bias in Section 3.2, the main results being Theorems 46. Sections 46 present and prove the technical results needed to establish the results. We give all remaining proofs of the main results in Section 7. In Section 8, we discuss the limitations and implications of our results and point out opportunities for future work.

2. Preliminaries

In this section, we list more formally the basic assumptions that delineate our study in this paper.

2.1. The Model

We assume throughout that n is a positive integer. An instance of the housing allocation problem of size n is defined by a set An of n agents and a set In of n items. Each agent has a complete strict preference ordering (linear order) of items, and together, these form a preference profile. We assume sincere behavior by all agents throughout—their submitted preference orders are identical to their sincere orders.

In addition, there is an exogenous linear order ρ on An. All of the algorithms we consider will rely on this order when agents are required to take turns choosing or bidding. We shall consider the fortunes of agents as functions of their position in the order ρ.

Definition 1.

Define the relative position of an agent a in the order ρ to be the fraction of all the agents whose position in ρ is no worse than that of a (this includes a itself). Thus, the first agent in ρ has relative position 1/n, and the last has relative position 1. For 0θ1, let An(θ) denote the set of agents whose relative position is at most θ, and let an(θ) be the last agent in An(θ).

Remark 1.

For completeness, when θ<1/n we let an(θ) be the first agent in ρ. This exceptional definition will cause no trouble, as for θ>0 it applies to only finitely many n and so, does not affect asymptotic results, whereas for θ = 0, it allows us to say something about the first agent in ρ.

We shall assume throughout that we deal with a random market. This means that each agent’s preference order is sampled independently from the uniform distribution on all n! such orders of In. The order ρ on agents is not random but fixed. Our results deal exclusively with large random markets: that is, asymptotically as n.

2.2. Evolution of Assignments by the Algorithms

Each matching algorithm could be described and implemented as a two-step process. First, the agents generate their preference orders; then, a centralized procedure processes the preference profile and outputs a matching of agents to items. However, it is often more convenient to imagine the agents developing their preference orders as the algorithm proceeds rather than in advance. This interpretation does not change the result, only the process that delivers the final matching. The evolution of the assignments for the Boston algorithms can then be described by the following stochastic processes.

In the first round, the naive and adaptive Boston processes proceed identically; each agent randomly chooses one of the n items, independently of other agents and with uniform probabilities 1n, as his most preferred item for which to bid. Each item that is so chosen is assigned to the first (in the sense of the agent order ρ) agent who bids for it; items not chosen by any agent are relegated, along with the unsuccessful agents, to the next round.

In the rth round (r2), the naive algorithm causes each remaining agent to randomly choose his rth most-preferred item, independently of other agents and of his own previous choices, uniformly from the nr+1 items for which he has not previously bid. (Note that included among these are all the items still available in the current round.) Each item so chosen is assigned to the first agent who chose it; other items and unsuccessful agents are relegated to the next round.

In the rth round of the adaptive Boston algorithm, each agent repeatedly chooses his next most-preferred item by random sampling, without replacement, from the set of items he has not previously chosen until the item chosen is among those still available at this round. This item becomes his bid for the round. Bids are then resolved as in the naive mechanism. Each chosen item is assigned to the first agent who chose it; other items and unsuccessful agents are relegated to the next round.

Example 1.

Consider a situation with n = 4, agents 1,4 and objects a,b,c,d, and an exogenous order 1,2,3,4 on the agents. Suppose that agents 1, 3, 4 have a as their first choice, and agent 2 ranks b first. Under SD, agent 1 takes object a, and agent 2 takes b. We then proceed to agent 3, who must take his second choice—unless that is b, in which case he must be content with his third choice. Finally, agent 4 takes the last remaining object.

If the same four agents instead use NB, then in the first round, agent 1 takes a and agent 2 takes b. In the second round, agents 3 and 4 each bid for their second choice. Suppose these bids are b for agent 3 and c for agent 4; then, agent 4 takes c and agent 3 is relegated to the third round, in which he is the only participant. It is now inevitable that agent 3 will be matched to d, the only remaining item, but if we suppose that his third choice is c, he must first make a useless bid for that item before receiving d in the fourth round.

Should the same agents instead use AB, the first round is the same as for NB, but in the second round, agents 3 and 4 both bid for c, which is given to agent 3. The process terminates after the third round, in which agent 4 bids for and receives d.

Thus, if the partial profile looks like 1:a,2:b,3:abc,4:ac, then under SD, we obtain the assignment 1:a,2:b,3:c,4:d; under NB, 1:a,2:b,3:d,4:c; and under AB, 1:a,2:b,3:c,4:d. For each algorithm, every 1 of the 72 profiles consistent with this partial profile outputs the same assignment.

2.3. Performance Measures

To compare the performance of the algorithms, we impute utility to agents. This requires a means of converting ordinal preferences to numerical utility values.

Definition 2.

A positional scoring rule is given by a sequence (σn(s))s=1n of real numbers with 0σn(s)σn(s1)1 for 2sn.

Each positional scoring rule defines an induced rank utility function, common to all agents; an agent matched to his sth preference derives utility σn(s) therefrom. Commonly used scoring rules include k-approval defined by (1,1,,1,0,0,,0), where the number of ones is fixed at k independent of n; when k = 1, this is the usual plurality rule. Note that k-approval is coherent; for all n, the utility of a fixed rank object depends only on the rank and not on n. Another well-known rule is Borda defined by σn(s)=nsn1; Borda is not coherent. Borda utility is often used in the literature, sometimes under the name linear utilities.

The (utilitarian) welfare performance measure is defined as follows. Suppose that an assignment mechanism for n agents matches Sn(s,θ) of the agents with relative position at most θ to their sth preferences, for each s=1,2,. According to the utility function induced by the scoring rule (σn(s))s=1n, the welfare (total utility) of the agents with relative position at most θ is thus

Wn(θ)=s=1nσn(s)Sn(s,θ).(1)

Remark 2.

Note that the k-approval welfare is just the fraction of agents who obtain one of their top k choices. Also, the average rank of the item received by a random agent, another well used measure, equals 1+n(1u), where u is the expected Borda utility. Thus, u1 as n if and only if the expected rank is o(n).

We also consider the order bias, an average-case performance measure recently introduced in Freeman et al. (2021). The relevant definitions are recalled here for an arbitrary discrete assignment algorithm A that fixes an order on agents (such as the order ρ assumed in the present paper).

Definition 3.

The expected rank distribution under A is the mapping DA on {1,,n}×{1,,n} whose value at (r, j) is the probability, under the random-market assumption, that A assigns the rth agent his jth most-preferred item.

We usually represent this mapping as a matrix where the rows represent agents and the columns represent items.

Definition 4.

Let u be a common rank utility function for all agents; u(j) is the utility derived by an agent who obtains his jth preference. Define the order bias of A by

βn(A;u)=max1p,qn|U(p)U(q)|u(1)u(n),
where U(p)=j=1nDA(p,j)u(j), the expected utility of the item obtained by the pth agent.

It is often desirable that βn be as small as possible, out of fairness to each position in the order in the absence of any knowledge of the profile. In any case, knowledge of the order bias is important. For example, we may wish to apply “affirmative action,” as is done for example in many sports draft procedures.

In all three mechanisms in this paper (naive and adaptive Boston and serial dictatorship), the two agents whose expected utilities are maximally different are the first and last agents in ρ. The first agent in ρ always obtains his first-choice item, and so has the best possible expected utility. The last agent in ρ has the smallest expected utility; this is a consequence of the following result.

Theorem 1

(Earlier Positions Do Better on Average). Let a be an agent in an instance of the house allocation problem with random-market preferences. Let the random variable S be the preference rank of the item obtained by a. The naive and adaptive Boston mechanisms and serial dictatorship all have the property that for all s1,P(S>s) is monotone increasing in the relative position of a (i.e., greater for later agents in ρ).

Proof.

See Section 7. □

Remark 3.

Thus, the probability distribution of the preference rank of the item received by an agent stochastically dominates that for any later agent. For each common rank utility function u, the expected utility of agent a is u(1)+s=1n1(u(s+1)u(s))P(S>s), so Theorem 1 implies that the expected utility is monotone decreasing in the relative position of a. In particular, the first agent has the highest and the last agent the lowest expected utility.

3. Main Results on Agent Satisfaction

In this section, we state the main results of this paper. Proofs are deferred to Section 7.

3.1. Welfare

In this section, we state results on the utilitarian welfare achieved by the three mechanisms.

For each s=1,2,, we denote by Sn(s,θ) the number of agents with relative position at most θ who are matched to their sth preferences.

Theorem 2

(Asymptotic Welfare of the Mechanisms). Assume an assignment mechanism with

1nSn(s,θ)p0θqs(ϕ)dϕ as n, for each s=1,2,,
where s=1qs(θ)=1. Suppose the scoring rule (σn(s))s=1n satisfies
σn(s)λs as n, for each s=1,2,

Then, the welfare given by (1) satisfies

1nWn(θ)ps=1λs0θqs(ϕ)dϕ.

Corollary 1.

The average k-approval welfare over all agents satisfies

1nWn(1)p{1ωk+1for naive Boston(1e1){(r,s):rsk}e1rursfor adaptive Bostonkk+1for serial dictatorship.

The sequence ωk is defined by ω1=1 and the recursion ωk+1=ωkeωk for k1. The bivariate sequence urs is defined by the recursion u11=1,u1,s=0 for s > 1, urs = 0 for s < r, and

urs=e1rur1,s1+(1e1r)ur,s1.

The limiting values of k-approval welfare are given in some special cases in Table 1. Figure 1 shows the limiting values for the three-approval scoring rule. It shows that naive Boston outperforms the adaptive version, with much of the difference attributable to the last 20% of agents. Also, serial dictatorship is more favorable for agents in the middle of the order but much worse for those toward the end.

Table

Table 1. Limiting Values as n of k-Approval Welfare

Table 1. Limiting Values as n of k-Approval Welfare

Algorithmk = 1k = 2k = 3
Naive Boston1e10.6321e1ee10.7451e1ee1eee10.803
Adaptive Boston1e10.632(1e1)(1+e2)0.718(1e1)(1+2e2e3+e5)0.776
Serial dictatorship1/2=0.5002/30.6673/4=0.750
Figure 1. Limiting Values as n of Cumulative Three-Approval Welfare as a Function of θ
Corollary 2.

Let σ be a positional scoring rule such that for all but finitely many s, σn(s)=0 for all n. Then, the asymptotic utilitarian welfare with respect to σ is strictly greater under naive Boston than under serial dictatorship.

This last result is intuitively reasonable because naive Boston maximizes the number of agents receiving their first choice, then subject to that the number receiving their second choice, etc. Note that this is not a proof because it may be that NB makes choices that may prevent it beating SD in some lower ranks. Adaptive Boston apparently scores better than serial dictatorship for each k and worse than naive Boston, although we do not yet have a formal proof. Figure 2 illustrates this for 1k10. Already for k = 3, the algorithms give similar welfare results, and they each asymptotically approach one as k.

Figure 2. Limiting Values as n of k-Approval Welfare for 1k10
Note. (Top line) Naive Boston. (Middle line) Adaptive Boston. (Bottom line) Serial dictatorship.

In contrast to the results for k-approval, the Borda welfare does not distinguish asymptotically between the three algorithms.

Corollary 3.

For an assignment mechanism as in Theorem 2, the Borda welfare satisfies

1nWn(θ)pθ.

Corollary 4.

For each of naive Boston, adaptive Boston, and serial dictatorship, the average normalized Borda welfare over all agents is asymptotically equal to one, and hence, the average rank obtained by an agent is o(n).

Remark 4.

Note that the Borda utility of a fixed preference rank s has the limit λs=1, meaning that, in the asymptotic limit as n, agents value the sth preference (of n) just as highly as the first preference. Consequently, mechanisms such as serial dictatorship or the Boston algorithms, which in a random market, are able to give most agents one of their first few preferences, achieve the same asymptotic Borda welfare as if every agent was matched to his first preference. This behavior is really a consequence of the normalization of the Borda utilities σn(s)=nsn1 to the interval [0,1]; the first few preferences all have utility close to one.

A striking feature of our welfare results is that although the Boston algorithms have a utilitarian welfare advantage over serial dictatorship, the advantage is rather small. Asymptotically, both achieve maximum possible welfare under the Borda scoring rule. Under k-approval for fixed k, there is an advantage to Boston that of course decreases with k. These results are consistent with those of Che and Tercieux (2018), which state that under rather weak conditions, Pareto efficient mechanisms all asymptotically attain the maximum possible utilitarian welfare (normalized to one in our results). This is because the k-approval utility function does not satisfy the hypotheses of the results in Che and Tercieux (2018), and although neither does Borda, that case is similar to the special case with U(x,y)=y; the utilities are chosen independently and uniformly distributed on [0,1], so that the utility of the kth preference will be approximately 1kn if n is large.

3.2. Order Bias

In this section, we state results on the order bias achieved by the three mechanisms. Proofs are mostly deferred to Section 7.

Theorem 3.

For each fixed k, the k-approval order bias of naive Boston is asymptotically j=2k+1(1ωj), where (ωj) is as given in Definition 5.

Theorem 4.

For each fixed k, the k-approval order bias of adaptive Boston is asymptotically

1e1{(r,s):rsk}(1e1)r1urs.

(urs) is as given in (17).

Theorem 5.

The Borda order bias of each Boston mechanism is asymptotically zero.

The order bias of serial dictatorship is easy to analyze; we include the results here for comparison with the Boston mechanisms.

Theorem 6.

Fix k1 and n1.

  1. The k-approval order bias for serial dictatorship equals 1kn.

  2. The Borda order bias for serial dictatorship equals 1/2.

Corollary 5.

For each fixed k, the k-approval order bias of SD is asymptotically equal to 1, and the Borda order bias is asymptotically equal to 1/2.

Figure 3 compares the asymptotic k-approval order bias for 1k10. Table 2 gives some numerical values.

Figure 3. Limiting Values as n of k-Approval Order Bias for 1k10
Note. (Top line) Serial dictatorship. (Middle line) Adaptive Boston. (Bottom line) Naive Boston.
Table

Table 2. Limiting Quantities for k-Approval Order Bias

Table 2. Limiting Quantities for k-Approval Order Bias

Algorithmk = 1k = 2k = 3
NB1e10.632(1e1)(1e1ee1)0.471(1e1)(1e1ee1)(1e1ee1eee1)0.378
AB1e10.632(1e1)(1e2)0.547(1e1)(1e2)(1e1)2(e2+e4)0.485
SD111

There is a large difference between mechanisms in their order bias. For k-approval, serial dictatorship is asymptotically as biased as it could be, whereas the Boston algorithms have much lower bias. The Boston algorithms are asymptotically unbiased with respect to our normalized Borda utilities, whereas SD has bias 1/2 for every n. In other words, when n is large then under SD, the last agent receives a random rank item, whereas under the Boston algorithms, this agent is quite likely to get one of his top few choices.

4. Technical Results and Proofs for Naive Boston

Theorem 2 involves a random quantity, which in the asymptotic limit as n, approximates a constant (nonrandom) fraction of n. There are several more results of this kind in this section and the next one. A useful first move in approaching such results is to replace the random quantity with its expectation.

Lemma 1.

Let (Xn) be a sequence of nonnegative random variables with Var(Xn)E[Xn] and 1nE[Xn]c as n, for some constant c. Then, 1nXnpc as n (convergence in probability).

Remark 5.

The bounding of the variance of a random variable by its mean implies a distribution with relatively little variation about the mean when the mean is large. Lemma 1 makes use of this behavior to provide us with a stepping stone to establish results like Theorem 2 by performing an average-case analysis only. The same idea works if the averages are conditional expectations, as in the following lemma.

Lemma 2.

Let (Xn) be a sequence of nonnegative random variables and (Fn) a sequence of σ-fields, with Var(Xn|Fn)E[Xn|Fn] and 1nE[Xn|Fn]pc as n. Then, 1nXnpc as n.

Proof of Lemmas 1 and 2.

Lemma 1 is just the special case of Lemma 2, in which all the σ-fields Fn are trivial. For a proof of Lemma 2, it suffices to show that 1n(XnE[Xn|Fn])p0. For any ϵ>0, we have by Chebyshev’s inequality (Durrett 2019)

P(|XnE[Xn|Fn]|>ϵn|Fn)(ϵn)2Var(Xn|Fn)(ϵn)2E[Xn|Fn].

Because n2E[Xn|Fn]p0, it follows that P(|XnE[Xn|Fn]|>ϵn|Fn)p0. As these conditional probabilities are a bounded (and thus, uniformly integrable) sequence, the convergence is also in L1 (theorem 4.6.3 in Durrett 2019), and so,

P(1n|XnE[Xn|Fn]|>ϵ)=E[P(|XnE[Xn|Fn]|>ϵn|Fn)]0,
giving the required convergence in probability. □

In the present analysis, the random variables will mostly be numbers of agents who are successful (or not) at a given round of one of the Boston mechanisms. The next lemma provides the necessary variance bounds in such cases.

Lemma 3.

Suppose we have m items (m2), of which  are blue. A sequence of agents (Agent 1, Agent 2, ) each randomly (independently and uniformly) chooses an item. Let AN be a subset of the agents, and let CA be the number of blue items first chosen by a member of A. Equivalently, CA is the number of members of A who choose a blue item that no previous agent has chosen: that is, the cardinality of the set

{aA:Qa is blue and QbQab<a},
where Qa denotes the item chosen by a. Then,
Var(CA)E[CA]=maA(11m)a1.

Remark 6.

Lemma 3 describes a situation congruent to the way bids are resolved in a round of the naive Boston mechanism. The “blue” items correspond to those still available at the start of the round, and CA is the number of members of A who successfully obtain an item during the round. In the actual naive Boston algorithm, the set of unavailable items that an agent may still bid for will typically be different for different agents, but the number of them (m) is the same for all agents, which is all that matters for our purposes.

Lemma 3 is also applicable to the adaptive Boston mechanism. In this case, =m; agents bid only for items still available at the start of the round.

Proof of Lemma 3.

Let Fi denote the agent who is first to choose item I and Xia the indicator of the event {Fi=a}. That is, Xia = 1 if and only if Fi = a. We have P(Fi=a)=1m(11m)a1; agent a must choose i, whereas all previous agents choose items other than i. Let B be the set of blue items. Then, CA=iBaAXia, so

E[CA]=iBaAP(Fi=a)=iBaA1m(11m)a1=maA(11m)a1,
as claimed. Also,
E[CA2]=iBjBaAbAE[XiaXjb].(2)

For ab, these summands are identical for all ij (and zero for i = j); for a = b, they are identical for all i = j (and zero for ij). Thus, (2) reduces to

E[CA2]=(1)a,bA;abE[X1,aX2,b]+aAE[X1,a].(3)

The second term of (3) is E[CA] again. For a < b and ij, we have

E[XiaXjb]=P(Fi=a and Fj=b)=(12m)a11m(11m)ba11m.

(Agents prior to a must choose neither i nor j, a must choose i, agents between a and b must choose items other than j, and b must choose j.) Because 12m<(11m)2, this gives

E[XiaXjb]1m2(11m)a+b3.(4)

As this last expression is symmetric in a and b, (4) also holds for a > b. Hence,

E[CA2]E[CA]+(1)m2a,bA;ab(11m)a+b3.

We have (1)m2=2m2(11)2m2(11m) because m. This gives

E[CA2]E[CA]+2m2a,bA;ab(11m)a+b2,
enabling us to bound the variance as required; Var(CA)=E[CA2]E[CA]2, and so,
Var(CA)E[CA]2m2a,bA;ab(11m)a+b2(maA(11m)a1)2=2m2(a,bA;ab(11m)a+b2a,bA(11m)a+b2)0.

The limiting constants for the naive Boston algorithm will often involve terms of the following sequence.

Definition 5.

The sequence (ωr)r=1 is defined by ω1=1 and the recursion ωr+1=ωreωr for r1.

Thus, for example, ω1=1,ω2=e1,ω3=e1ee1. The value of ωr approximates r1, a relationship made more precise in the following result.

Lemma 4.

For all r3,

1r+logr<ωr<1r.

Proof.

For 3r8, the inequalities can be verified by direct calculation. Beyond this, we rely on induction; assume the result for a given r8, and consider ωr+1. Observe that the function xxex is monotone increasing on [0,1]; this gives us

ωr+1=ωreωr<e1/rr=1r+1exp(log(1+1r)1r)1r+1
via the well-known inequality log(1+x)x. Also,
ωr+1=ωreωr>e1/(r+logr)r+logr1r+1+log(r+1)(1+1+log(r+1)logrr+logr)(11r+logr)
via the well-known inequality ex1x. Thus,
ωr+1>1r+1+log(r+1)(1+(r1+logr)(log(r+1)logr)1(r+logr)2)>1r+1+log(r+1)(1+2+logr(r+1)(r+logr)2)
because
log(r+1)logr=rr+1t1dt>1r+1.

For r8, we have logr>2, and so, the result follows. □

4.1. Groups of Agents

In this subsection, we present results concerning the fortunes of those agents whose relative positions fall in an interval.

Theorem 7

(Number of Agents Remaining). Consider the naive Boston algorithm. Fix r1 and a relative position θ[0,1]. Then, the number Nn(r,θ) of members of An(θ) present at round r satisfies

1nNn(r,θ)pzr(θ).
where z1(θ)=θ and
zr+1(θ)=zr(θ)(1ezr(θ))ωr for r1.(5)

In particular, the total number of agents (and of items) present at round r satisfies

1nNn(r,1)pωr.

Proof.

Induct on r. For r = 1, the result is immediate because Nn(1,θ)=nθ. Now, fix r1, and assume the result for round r. Let Fr be the σ-field generated by events prior to round r. Conditional on Fr, we have the situation of Lemma 3; there are Nn(r) available items and Nn(r,θ) agents of An(θ) who will be the first to attempt to claim them, with the agents’ bids chosen independently and uniformly from a larger pool of nr+1 items. Letting Sn denote the number of these agents whose bids are successful, Lemma 3 gives

Var(Sn|Fr)E[Sn|Fr]=Nn(r)nr+1a=1Nn(r,θ)(11nr+1)a1.

Summing the geometric series,

E[Sn|Fr]=Nn(r)(1(11nr+1)Nn(r,θ)).

It then follows by the inductive hypothesis that

1nE[Sn|Fr]pωr(1ezr(θ)) as n.

By Lemma 2,

1nSnpωr(1ezr(θ)).

We have Nn(r+1,θ)=Nn(r,θ)Sn and so, obtain

1nNn(r+1,θ)pzr(θ)ωr(1ezr(θ))=zr+1(θ).

The result follows. □

Some of the functions zr(θ) are illustrated in Figure 4. Note that agents with an earlier position in ρ are more likely to exit in the early rounds. A consequence is that the position of an unsuccessful agent relative to other unsuccessful agents tends to improve each time he fails to claim an item.

Figure 4. The Limiting Fraction of the Agents Who Have Relative Position θ or Better and Survive to Participate in the rth Round
Corollary 6

(Limiting Distribution of Exit Time/Preference Rank Obtained). The number Sn(s,θ) of members of An(θ) matched to their sth preference satisfies

1nSn(s,θ)p0θqs(ϕ)dϕ.
where
qs(θ)=zs(θ)zs+1(θ)=zs(θ)ωsezs(θ).

In particular, the fraction of agents who exit at round s satisfies

1nSn(s,1)pωsωs+1.

Proof.

An agent is matched to his sth preference if, and only if, he is present at round s but not at round s + 1. The result follows by Theorem 7. □

The limiting functions qs(θ) are illustrated in Figure 5 (note that the vertical scale is logarithmic).

Figure 5. The Limiting Probability That an Agent Exits the Naive Boston Mechanism at the rth Round (and So, Obtains His rth Preference) as a Function of the Agent’s Initial Relative Position θ
Note. Logarithmic scale on the vertical axis.

A better understanding of the functions zr(θ) is given by the following result.

Theorem 8.

The functions zr(θ) satisfy zr(θ)=0θzr(ϕ)dϕ, where

zr(θ)=k=1r1fk(θ) for r2, and z1(θ)=1(6)
fr(θ)=1ωr exp(zr(θ)).(7)

In particular,

zr(1)=k=2r(1ωk)(8)
fr(1)=ωr+1.(9)

Proof.

Differentiate (5) with respect to θ. Alternatively, integrate (6) by parts. □

The quantity fr(θ) can be interpreted (in a sense to be made precise later) as the conditional probability that an agent with relative position θ, if present at round r, is unmatched at that round. The quantity zr(θ) can then be interpreted as the probability that an agent with relative position θ is still unmatched at the beginning of round r. Other quantities for the first few rounds are shown in Table 3.

Table

Table 3. Limiting Quantities as n for the Early Rounds of the Naive Boston Algorithm

Table 3. Limiting Quantities as n for the Early Rounds of the Naive Boston Algorithm

Meaning at round rQuantityr = 1r = 2r = 3
Fraction of all agents
 Presentωr1e10.3679exp(1e1)0.2546
 In
An(θ) and present
zr(θ)θθ+eθ1θ+eθ1e1+exp(θeθ)
For an agent with relative position θ
 P(present)zr(θ)11eθ(1eθ)(1exp(θeθ))
 P(unmatched—present)fr(θ)1eθ1exp(θeθ)1exp(θeθexp(θeθ))

4.2. Individual Agents

Theorem 7 and Corollary 6 are concerned with the outcomes achieved by the agent population collectively and will yield the results in Section 3.1 on utilitarian welfare.

Suppose, however, that our interest lies with individual agents. It is tempting to informally “differentiate” the result of Theorem 7 with respect to θ and thereby, draw conclusions about the fate of a single agent. The following result puts those conclusions on a sound footing.

Theorem 9

(Exit Time of Individual Agent). Consider the naive Boston algorithm. Fix r1 and a relative position θ[0,1]. Let Rn(θ) denote the round number at which the agent an(θ) (the last agent with relative position at most θ) is matched. Equivalently, Rn(θ) is the preference rank of the item obtained by this agent. Then,

P(Rn(θ)r)zr(θ)as n.

Proof.

The result is trivial for r = 1. Assume the result for a given value of r, and let Fr be the σ-field generated by events prior to round r. Conditional on Fr, we can apply Lemma 3 to the single agent an(θ) to obtain

P(Rn(θ)r+1)=E[P(Rn(θ)r+1|Fr)]=E[1Rn(θ)rYn],(10)
where
Yn=1(11nr+1)Nn(r,θ)1(Nn(r)nr+1).

Observe that Ynp1ωrezr(θ)=fr(θ) from Theorem 7. Equation (10) gives

P(Rn(θ)r+1)zr+1(θ)=E[1Rn(θ)r(Ynfr(θ))]+E[1Rn(θ)rzr(θ)]fr(θ).

The second term converges to zero as n by the inductive hypothesis. For the first term, note that the convergence Ynfr(θ)p0 is also convergence in L1 by theorem 4.6.3 in Durrett (2019) and so, 1Rn(θ)r(Ynfr(θ))0 in L1 also. □

Remark 7.

The result of Theorem 9 could equivalently be stated as

P(Rn(θ)=r)qr(θ),
where qr(θ) is as in Corollary 6. Note that r=1qr(θ)=1, consistent with the role of qr(θ) as an asymptotic probability.

Remark 8.

Theorem 9 tells us that an agent with fixed relative position θ has a good chance of obtaining one of his first few preferences, even if n is large. This is even true of the very last agent (θ = 1). For example, the first agent is guaranteed his first choice, and an agent at relative position 1/2 has probability over 78%, whereas the last agent has corresponding probability just under 37%.

5. Technical Results and Proofs for Adaptive Boston

We again begin with results about initial segments of the queue of agents and follow up with results about individual agents. More work is required than in the naive case because the rank of the object obtained is not the same as the round in which it is obtained.

5.1. Groups of Agents

A simple stochastic model of bidding for the adaptive Boston mechanism can be similar to the naive case. At the beginning of the rth round, each remaining agent randomly chooses an item as his next preference for which to bid; the bid is successful and the agent is matched to that item if no other agent with an earlier position in the order ρ bids for the same item. However, whereas a naïve Boston participant chooses from the set of nr+1 items for which he has not already bid, the adaptive Boston participant chooses from a smaller set: the Nn(r) items actually still available at the beginning of the round. This model allows a result analogous to Theorem 7.

Theorem 10

(Number of Agents Remaining). Consider the adaptive Boston algorithm. Fix r1 and a relative position θ[0,1]. Then, the number Nn(r,θ) of members of An(θ) present at round r satisfies

1nNn(r,θ)pyr(θ).
where y1(θ)=θ and
yr+1(θ)=yr(θ)e1r(1exp(er1yr(θ))) for r1.(11)

In particular, the total number Nn(r) of agents (and of items) present at round r satisfies

1nNn(r)pyr(1)=e1r.

Proof.

Induct on r. For r = 1, we have Nn(1,θ)=nθ; the result follows immediately. Now, suppose the result for a given value of r, and consider r + 1. Let Tn be the number of agents of An(θ) matched at round r. Conditioning on the σ-field Fr generated by events prior to round r, we have the situation of Lemma 3 (with =m); there are Nn(r) available items and Nn(r,θ) agents of An(θ) who will be the first to attempt to claim them, with each such agent bidding for one of the available items, chosen uniformly at random independently of other agents. Lemma 3 gives us Var(Tn|Fr)E[Tn|Fr] and

E[Tn|Fr]=a=1Nn(r,θ)(11Nn(r))a1=Nn(r)(1(11Nn(r))Nn(r,θ)).

By the inductive hypothesis,

Nn(r)npe1r and (11Nn(r))Nn(r,θ)pexp(er1yr(θ)).

This gives us

1nE[Tn|Fr]pe1r(1exp(er1yr(θ)))=yr(θ)yr+1(θ).

By Lemma 2, then

1nTnpyr(θ)yr+1(θ).

Because Tn=Nn(r,θ)Nn(r+1,θ), it follows that 1nNn(r+1,θ)pyr+1(θ) and hence, the result. □

Remark 9.

Some of the functions yr(θ) are illustrated in Figure 4. It is apparent that the adaptive Boston mechanism proceeds more quickly than naive Boston; e1r decays much more quickly than ωr as r. Also, the tendency of advantageously ranked agents to be matched in relatively early rounds is even greater for the adaptive version of the algorithm. In an adaptive Boston assignment of a large number of items to agents with random-market preferences, under 2% of the agents will be unmatched after four rounds (versus 16% for naive Boston), and most of these (about 2/3) will be among the last 10% of agents in the original agent order.

A better understanding of the functions yr(θ) is given by the following result, which is analogous to Theorem 8.

Theorem 11.

The functions yr(θ) satisfy yr(θ)=0θyr(ϕ)dϕ, where

yr(θ)=k=1r1gk(θ) for r2, and y1(θ)=1(12)
gr(θ)=1exp(er1yr(θ)).(13)

In particular,

yr(1)=(1e1)r1(14)
gr(1)=1e1.(15)

Proof.

Differentiate (11) with respect to θ. Alternatively, integrate (12) by parts. □

Remark 10.

The quantity gr(θ) is analogous to fr(θ) in the naive case and can be interpreted (in a sense to be made precise later) as the conditional probability that an agent with relative position θ, if present at round r, is unmatched at that round. The quantity yr(θ), analogous to zr(θ) in the naive case, can then be interpreted as the probability that an agent with relative position θ is still unmatched at the beginning of round r. Other quantities for the first two rounds are shown in Table 4.

Table

Table 4. Limiting Quantities for the Early Rounds of the Adaptive Boston Algorithm

Table 4. Limiting Quantities for the Early Rounds of the Adaptive Boston Algorithm

Meaning at round rQuantityr = 1r = 2
Fraction of all agents
 Presente1r1e10.3679
 In An(θ) and presentyr(θ)θθ+eθ1
For an agent with relative position θ
 P(present)yr(θ)11eθ
 P(unmatched—present)gr(θ)1eθ1exp(e(θ+eθ1))
 P(bids for sth preference—present)urs1s=1e1(1e1)s21s2

5.1.1. The Rank of the Item Received.

Theorem 10 is less satisfying than Theorem 7. The naive Boston mechanism has a key simplifying feature; the rank of an item within its assigned agent’s preference order is equal to the round number in which it was matched. This means that Theorem 7 already enables some conclusions about agents’ satisfaction with the outcome of the process (see Corollary 6). However, in the adaptive case, we know only that an item matched at round r > 1 will be no better (and could be worse) than its assigned agent’s rth preference.

To do better, we need a more detailed stochastic bidding model. An agent a still present at the beginning of the rth round will have thus far determined an initial subsequence of his preference order comprising some number Fa,r1 of most-preferred items and failed to obtain any of them. He thus has a pool of nFa,r1 previously unconsidered items from which to choose, of which the Nn(r) items actually still available are a subset. In accordance with the random market model, let us imagine that he now generates further preferences by repeated random sampling without replacement from the previously unconsidered items, until one of the available items is sampled; this item becomes his bid in the current round. Denote by Gar the number of items sampled to construct this bid; thus, Far=j=1rGaj and Ga,1=1. If the bid is successful, the agent will be matched to his Farth preference.

Note that although the simple bidding model used in Theorem 10 provides enough information to determine the matching of items to agents (along with the round numbers at which the items are matched), it does not completely determine the agents’ preference orders. In particular, it does not determine the agents’ preference ranks for the items they are assigned. The random variables Gar provide additional information sufficient to determine this interesting feature of the outcome.

It is convenient to think of the Gar and Far as being determined by an auxiliary process that runs after the simple bidding model has been run and the matching of agents to items determined. This auxiliary process can be described in the following way. Fix integers n1>n2>>nr>0.

  • Place n1 balls, numbered from one to n1, in an urn.

  • For i=1,,r,

    • deem the ni lowest-numbered balls remaining in the urn “good,” and

    • draw balls at random from the urn, without replacement, until a good ball is drawn.

Let H(n1,,nr) be the probability distribution of the total number of balls drawn and q(s;n1,,nr)=P(X=s), where XH(n1,,nr).

Denote by M the σ-field generated by the simple bidding model, including the items on which each agent bids and the resulting matching. Conditional on M, the random variable Far for an agent a still present at round r has the H(n,Nn(2),,Nn(r)) distribution. That is,

P(Far=s|M)=q(s;n,Nn(2),,Nn(r)).(16)

Also, the {Far:a present at round r} are conditionally independent given M.

Lemma 5.

q(1;n1)=1, q(s;n1)=0 for s > 1, and q(s;n1,,nr)=0 for s < r or s>n1nr+1. The H(n1,,nr) distributions other probabilities are given by the recurrence

q(s;n1,,nr)=t=r1s1q(t;n1,,nr1)(nrn1s+1)0i<st1(1nrn1ti).

Proof.

Let N be the number of balls drawn in the first r – 1 iterations of the process and M the number drawn in the final iteration. Then, P(N+M=s)=t=r1s1P(N=t)P(M=st|N=t), and we have

P(M=st|N=t)=(nrn1s+1)0i<st1(1nrn1ti).

(The final iteration must first sample st1 consecutive nongood balls; the probabilities of achieving this are 1nrn1t for the first, 1nrn1t1 for the second, 1nrn1s+2 for the last. At last, a good ball must be drawn; the probability of this is nrn1s+1.) The result follows. □

Our interest in the H(n1,,nr) distribution mostly concerns its asymptotic limits as the numbers of balls become large, and the “without replacement” stipulation becomes unimportant. To this end, fix p1,,pr(0,1], and let u(s;p1,,pr)=P(r+i=1rGi=s), where G1,,Gr are independent random variables with geometric distributions: P(Gi=x)=pi(1pi)x for x=0,1,.

Lemma 6.

u(s;p)=p(1p)s1, u(s;p1,,pr)=0 for s < r, and

u(s;p1,,pr)=t=r1s1u(t;p1,,pr)pr(1pr)st1.

Proof.

P(r+i=1rGi=s)=t=r1s1P(r1+i=1r1Gi=t)P(1+Gr=st).

Lemma 7.

q(s;n1,,nr)u(s;p1,,pr)as n1,,nr with nin1pi.

Proof.

Take limits in Lemma 5; compare Lemma 6. □

Corollary 7.

Consider the adaptive Boston mechanism, and fix s. We have

q(s;n,Nn(2),,Nn(r))pu(s;1,e1,,e1r)as n.

Proof.

Use the convergence of 1nNn(i) given by Theorem 10. □

Corollary 7 and (16) give us an asymptotic limit for the distribution, conditional on M, of Far, the preference rank of the bid made at round r by an agent still present at that round. To condense notation, we will denote the limit u(s;1,e1,,e1r) by urs. That is,

P(Far=s|M)purs.

Note that the limit urs does not depend on the position of the agent a in the choosing order. It is fairly clear why this should be so; all remaining agents must enter their bids at the beginning of the round, before any other agent has bid, and so, the bidding process, at least, treats them symmetrically. The advantage arising from a favorable position lies in a higher probability of obtaining the item bid for, not in constructing the bid itself.

We make use of the following simplified recurrence.

Lemma 8.

u(s;p)=p(1p)s1 and u(s;p1,,pr)=0 for s < r; other values are given by the recurrence

u(s;p1,,pr)=pru(s1;p1,,pr1)+(1pr)u(s1;p1,,pr).

In particular, u11=1,u1,s=0 for s > 1, urs = 0 for s < r, and

urs=e1rur1,s1+(1e1r)ur,s1.(17)

Proof.

The recurrence in (17) has a unique solution, as does the one in Lemma 6. It is easy to check that either solution also satisfies the other recurrence. □

Remark 11.

It follows directly from (17) that the bivariate generating function F(x,y)=r,sursxrys satisfies the defining equation F(x,y)(1y)=xy+F(x/e,y)(xe). It follows directly (from substituting y = 1) that s=rurs=1, consistent with its role as a probability distribution. We have not found a nicer explicit formula for urs.

We can now state a more detailed version of Theorem 10.

Theorem 12

(The Bidding Process at a Given Round). Consider the adaptive Boston algorithm. Fix sr1 and a relative position θ[0,1]. Let yr(θ) be as in Theorem 10 and urs be as in Lemma 8.

  1. The number Nn(r,s,θ) of members of An(θ) making a bid for their sth preference at round r satisfies

    1nNn(r,s,θ)pursyr(θ).

  2. The number Un(r,s,θ) of members of An(θ) making an unsuccessful bid for their sth preference at round r satisfies

    1nUn(r,s,θ)pursyr+1(θ).

  3. The number Sn(r,s,θ) of members of An(θ) making a successful bid for their sth preference at round r satisfies

    1nSn(r,s,θ)purs(yr(θ)yr+1(θ)).

Proof.

Conditional on the σ-field M, each agent a participating in round r enters a bid for his Farth preference; the Far for this group of agents is conditionally independent given M. Thus, the conditional distribution of N(r,s,θ) given M is the binomial distribution with Nn(r,θ) trials and success probability P(Far=s|M) given by (16). The variance of a binomial distribution never exceeds its mean (Feller 1970), so Lemma 2 applies. We will thus obtain part (i) of the theorem if we can merely show that 1nE[Nn(r,s,θ)|M]pursyr(θ); that is,

1nNn(r,θ)q(s;n,Nn(2),,Nn(r))pursyr(θ).(18)

Theorem 10 gives 1nNn(r,θ)pyr(θ), and Corollary 7 gives q(s;n,Nn(2),,Nn(r))purs. Part (i) follows.

The proof of part (ii) is very similar; the conditional distribution of U(r,s,θ) given M is the binomial distribution with Nn(r+1,θ) trials and success probability P(Far=s|M) given by (16). Part (iii) follows from parts (i) and (ii). □

We now have the analog for adaptive Boston of Corollary 6.

Corollary 8

(Limiting Distribution of Preference Rank Obtained). The number Sn(s,θ) of members of An(θ) matched to their sth preference satisfies

1nSn(s,θ)p0θqs(ϕ)dϕ.
where
qs(θ)=r=1surs(yr(θ)yr+1(θ))=r=1sursyr(θ)exp(er1yr(θ)).

In particular,

1nSn(s,1)p(1e1)r=1se1rurs.

Proof.

This is an immediate consequence of part (iii) of Theorem 12, with Theorem 11 providing the integral form of the limit. □

The functions qs(θ) are illustrated in Figure 6. Figure 7 shows for the last agent (θ = 1) the distribution of the rank of the item bid for and the item obtained at the second round.

Figure 6. The Limiting Probability qs(θ) That an Agent Obtains His sth Preference via the Adaptive Boston Mechanism as a Function of the Agent’s Initial Relative Position θ
Note. Logarithmic scale on the vertical axis.
Figure 7. Distribution of Rank of the Item for Which the Last Agent Bids (Upper Dots) and Successfully Bids (Lower Dots) in Round 2, Adaptive Boston
Remark 12.

It is clear from the definition (and Remark 11) that s=1qs(θ)=1. This is consistent with the implied role of qs(θ) as a probability distribution: the limiting probability that an agent in position θ obtains his sth preference. See also Theorem 13, part (iv).

5.2. Individual Agents

If we wish to follow the fate of a single agent in the adaptive Boston mechanism, we need limits analogous to that of Theorem 9. These are provided by the following result.

Theorem 13

(Exit Time and Rank Obtained for Individual Agent). Consider the adaptive Boston algorithm. Fix sr1 and a relative position θ[0,1]. Let Vn(r,θ) denote the preference rank of the item for which the agent an(θ) (the last agent with relative position at most θ) bids at round r. (For completeness, set Vn(r,θ)=0 whenever an(θ) is not present at round r.) Let Rn(θ) denote the round number at which an(θ) is matched.

  1. (Agent present at round r.)

    P(Rn(θ)r)yr(θ)as n.

  2. (Agent bids for sth preference at round r.)

    P(Vn(r,θ)=s)yr(θ)ursas n.

  3. (Agent matched to sth preference at round r.)

    P(Rn(θ)=r and Vn(r,θ)=s)yr(θ)urs(1gr(θ))as n.

  4. (Agent matched to sth preference.)

    P(Vn(Rn(θ),θ)=s)qs(θ)as n.

The limiting quantities yr(θ),gr(θ), urs, and qs(θ) are as defined in Theorem 11, Lemma 8, and Corollary 8.

Proof.

Part (i) is proved in a similar way to Theorem 9. The result is trivial for r = 1. Assume the result for a given value of r, and let Fr be the σ-field generated by events prior to round r. Then,

P(Rn(θ)r+1)=E[P(Rn(θ)r+1|Fr)]=E[1Rn(θ)rYn],(19)
where (by applying Lemma 3 (with =m) to the single agent an(θ))
Yn=1(11Nn(r))Nn(r,θ)1.

Observe that Ynp1exp(er1yr(θ))=gr(θ) by Theorem 10. Equation (19) gives

P(Rn(θ)r+1)yr+1(θ)=E[1Rn(θ)r(Yngr(θ))]+E[1Rn(θ)ryr(θ)]gr(θ).

The second term converges to zero as n by the inductive hypothesis. For the first term, note that the convergence Yngr(θ)p0 is also convergence in L1 by theorem 4.6.3 in Durrett (2019) and so, 1Rn(θ)r(Yngr(θ))0 in L1 also. Part (i) follows.

For part (ii), we have

P(Vn(r,θ)=s)=E[1Rn(θ)rP(Fan(θ),r=s|M)]=E[1Rn(θ)rq(s;n,Nn(2),,Nn(r))].

Hence,

P(Vn(r,θ)=s)ursyr(θ)=E[1Rn(θ)r(q(s;n,Nn(2),,Nn(r))urs)]+urs(P(Rn(θ)r)yr(θ)).(20)

Both terms converge in probability to zero. For the second term, the convergence is given by part (i). For the first term, it is a consequence of Corollary 7: q(s;n,Nn(2),,Nn(r))purs, which is also convergence in L1 by theorem 4.6.3 in Durrett (2019). Part (ii) follows.

The proof of part (iii) is very similar to that of part (ii); just replace 1Rn(θ)r by 1Rn(θ)=r and yr(θ) by yr(θ)yr+1(θ).

Part (iv) is obtained from part (iii) by summation over r. □

6. Technical Results and Proofs for Serial Dictatorship

Unlike the Boston algorithms, SD is strategyproof, but it is known to behave worse in welfare and fairness. However, we are not aware of detailed quantitative comparisons in the literature. The analysis for SD is very much simpler than for the Boston algorithms. In particular, the exit time is not interesting.

6.1. Groups of Agents

Analysis in this case is much simpler than for the Boston algorithms. Results analogous to those in Sections 4 and 5 are obtainable from the following explicit formula.

Theorem 14.

The probability that the kth agent obtains his sth preference is (nsks)/(nk1) for s=1,,k and zero for other values of s.

Proof.

By the time agent k gets an item, a random subset T of k – 1 of the n items is already taken. This agent’s sth preference will be the best one left if and only if T includes his first s – 1 preferences but not the sth preference. Of the (nk1) equally probable subsets T, the number satisfying this condition is (nsks); the remaining ks items in T must be chosen from ns possibilities. □

In particular, the nth and last agent is equally likely to get each possible item.

Corollary 9

(Preference Rank Obtained). Consider the serial dictatorship algorithm. Fix s1 and a relative position θ[0,1]. The number Sn(s,θ) of members of An(θ) matched to their sth preference satisfies

1nSn(s,θ)p0θqs(ϕ)dϕ,
where qs(θ)=θs1(1θ).

Proof.

Let pkn=(nsks)/(nk1). Let Xkn be the indicator of the event that the kth agent (of n) is matched to his sth preference; thus, E[Xkn]=pkn and Var(Xkn)=pkn(1pkn). The impartial culture model requires agents to choose their preferences independently; thus, the random variables (Xkn)k=1n are independent. We have

Sn(s,θ)=k=snθXkn,
and so, E[Sn(s,θ)]=k=snθpkn and Var(Sn(s,θ))=k=snθpkn(1pkn). Hence, Var(Sn(s,θ))E[Sn(s,θ)], and Lemma 1 applies. It now remains only to show that 1nE[Sn(s,θ)]0θqs(ϕ)dϕ.

Note that

pkn=(nk+1)·(k1)(k2)(ks+1)n(n1)(ns+1)=(1k1n)j=1s1(kjnj).

Hence,

1nE[Sn(s,θ)]=1nk=snθ(1k1n)j=1s1(kjnj)=0θfn(ϕ)dϕ,
where
fn(ϕ)={(1k1n)j=1s1(kjnj)for k1nϕ<kn,k=s,,nθ0otherwise.

As n,fn(ϕ)(1ϕ)ϕs1 pointwise; because we also have 0fn(ϕ)1, the dominated convergence theorem (Durrett 2019) ensures that 0θfn(ϕ)dϕ0θ(1ϕ)ϕs1dϕ. □

6.2. Individual Agents

For individual agents, we have the following analogous result.

Theorem 15

(Preference Rank Obtained). Consider the serial dictatorship algorithm. Fix s1 and a relative position θ[0,1]. The probability that agent an(θ) (the last with relative position at most θ) is matched to his sth preference converges to qs(θ)=θs1(1θ) as n.

Proof.

From Theorem 14, this probability is

(1nθ1n)j=1s1(nθjnj).

The result follows immediately. □

Figure 8 compares the limiting values of Sn(3,θ) for our three algorithms. There is an interesting relationship between the curves. For small θ, AB is most likely to get its third choice, followed by NB and then SD. For most of the range, SD is the most likely, and NB overtakes AB only around θ=0.65. Near θ = 1, SD drops dramatically, consistent with what we already know about how badly the last 10%–20% of agents are treated under SD.

Figure 8. Limiting Values as n of the Probability of Getting One’s Third Choice as a Function of θ

7. Remaining Proofs of Theorems

In this section, we provide proofs of results not provided earlier in the paper.

Proof of Theorem 1.

Let a1 and a2 be consecutive agents, with a2 immediately after a1 in ρ. Let S1 and S2 be the preference ranks of the items obtained by a1 and a2. It will suffice to show that P(S1>s)P(S2>s). To this end, consider an alternative instance of the problem in which a1 and a2 exchange preference orders before the allocation mechanism is applied. We will refer to this instance and the original one as the “exchanged” and “nonexchanged” processes, respectively. Denote by S1 and S2 the preference ranks of the items obtained by a1 and a2 in the exchanged process. Because the exchanged process is also a random market, S1 and S1 have the same probability distribution and similarly, S2 and S2.

We now show that all three of our allocation mechanisms have the property that S1S2. From this, the result will follow because S1S2P(S1>s)P(S2>s)=P(S2>s).

For serial dictatorship, the exchanged and nonexchanged processes evolve identically for agents preceding a1 and a2. In the nonexchanged process, agent a1 then finds that his first S11 preferences are already taken; in the exchanged process, these same items are the first S11 preferences of a2. Hence, S2S1.

For the Boston mechanisms, let R be the number of unsuccessful bids made by a1 in the nonexchanged process. Then, the exchanged and nonexchanged processes evolve identically for the first R rounds, except that the bids of a1 and a2 are made in reversed order; this reversal has no effect on the availability of items to other agents. After these R rounds, a1 (in the nonexchanged process) and a2 (in the exchanged process) have reached the same point in their common preference order; in the next round, both will bid for the S1th preference in this order. Hence, S2S1. □

Proof of Theorem 2.

For convenience, define σn(s)=0 when n < s; this allows us to write Wn(θ)=s=1σn(s)Sn(s,θ). For any fixed s, the finite sum Yn(s) defined by

Yn(s)=s=1s(σn(s)Sn(s,θ)nλs0θqs(ϕ)dϕ)
has Yn(s)p0 as n. We have
Wn(θ)ns=1λs0θqs(ϕ)dϕ=Yn(s)+s>sσn(s)Sn(s,θ)ns>sλs0θqs(ϕ)dϕ,
and so,
|Wn(θ)ns=1λs0θqs(ϕ)dϕ||Yn(s)|+s>sSn(s,θ)n+s>s0θqs(ϕ)dϕ(21)
(because 0σn(s)1). Note also that s=1sSn(s,θ)nps=1s0θqs(ϕ)dϕ, whereas
s=1Sn(s,θ)n=nθnθ=s=10θqs(ϕ)dϕ,
and so,
s>sSn(s,θ)nps>s0θqs(ϕ)dϕ.

We can now establish the required convergence in probability. Let ϵ>0, and choose s so that s>s0θqs(ϕ)dϕ<ϵ/3. Then, (21) gives

P(|Wn(θ)ns=1λs0θqs(ϕ)dϕ|>ϵ)P(|Yn(s)|>ϵ/3)+P(s>sSn(s,θ)n>ϵ/3)0
as n. □

Proof of Corollary 1.

Combine Theorem 2 with Corollary 6 (naive Boston), Corollary 8 (adaptive Boston), and Corollary 9 (serial dictatorship). □

Proof of Corollary 2.

Each such σ can be written as a finite nonnegative linear combination of k-approval rules. For each k-approval rule, Corollary 1 and Lemma 4 show that NB strictly beats SD. □

Proof of Corollary 3.

Combine Theorem 2 with Corollary 6 (naive Boston), Corollary 8 (adaptive Boston), and Corollary 9 (serial dictatorship). □

Proof of Corollary 4.

This follows directly from Corollary 3 and Remark 2. □

Proof of Theorem 3.

By Theorem 9, the asymptotic probability that the last agent obtains one of his first k choices is

P(Rn(1)k)1zk+1(1).

The corresponding probability for the first agent is one. For other agents, this probability falls between the values for the first and last agents (Theorem 1). Hence, the asymptotic order bias is zk+1(1), which according to Theorem 8, can also be written j=2k+1(1ωj). □

Proof of Theorem 4.

The probability that the last agent in ρ is matched to one of his first k preferences is s=1kDA(n,s). According to Theorem 13, part (iv), the asymptotic limit of this quantity is s=1kqs(1), where

qs(1)=r=1surs(yr(1)yr+1(1)).

The asymptotic order bias is thus

limn(1s=1kDA(n,s))=1s=1kr=1surs(yr(1)yr+1(1)).

As noted in Remark 10, we have yr(1)=(1e1)r1. The result follows. □

Proof of Theorem 5.

Let n denote the expected Borda utility of the last agent in ρ: that is,

n=s=1n(nsn1)DA(n,s).

Then, for any s0,

liminfnnliminfn(ns0n1)s=1s0DA(n,s)=s=1s0qs(1),
where qs(1)=limnDA(n,s), as given by Theorem 9 (naive Boston) and Theorem 13 (adaptive Boston). Because s=1qs(1)=1 (see Remarks 7 and 12) and s0 was arbitrary, we obtain limnn=1. The order bias is 1n and hence, the result. □

Proof of Theorem 6.

From Theorem 14, we see that the first agent always gets his first choice, whereas the last agent gets each rank with probability 1/n (this special case is obvious and does not require Theorem 14). Hence, the expected utility under k-approval for the last agent is k/n, and the expected utility under Borda for that agent is

1nj=1nnjn1=1n(n1)j=0n1j=12.

The result follows immediately by subtraction. □

Proof of Corollary 5.

This follows immediately from Theorem 6 because k is fixed and n. □

8. Conclusion

We conclude this paper with discussion of possible extensions to the work herein.

8.1. Robustness to the Model Assumptions

If we relax the strict random-market assumption on preferences, we should expect different results, although the relative performance of the three algorithms will likely not vary. For example, simulations (Freeman et al. 2021) with preferences drawn from the Mallows distribution show that for small values of the Mallows dispersion parameter, it is much harder to satisfy all agents or keep order bias or average rank low, but nevertheless, NB beats AB, which beats SD, over the entire range of parameters. It is also clear that the Boston mechanisms beat SD in egalitarian welfare; the last agent in particular fares considerably better.

We have studied only sincere behavior by agents. Strategic behavior under the Boston mechanisms does occur in practice and does cause welfare loss, but the social welfare cost of adopting a strategyproof alternative such as (random) serial dictatorship is often substantial, as shown, for example, in analysis of Harvard course matching (Budish and Cantillon 2012). Note that for symmetric heterogeneous preferences such as those provided by our random markets assumption, there is no incentive for agents to report their preferences untruthfully. Even if strategic behavior is a serious consideration in some situations, it is still important to analyze and compare the underlying algorithms thoroughly.

The k-approval utilities we have used here are widely used in assignment applications. For example, statistics, such as the fraction of school choice students obtaining one of their top three choices or their one favorite course, are commonly discussed. This makes sense in many practical situations because eliciting complete agent preferences over n items for large n is infeasible, and many agents will have only a few items they consider acceptable. Furthermore, we have seen that in a random market, a large fraction of agents obtains one of their first few choices. We have also seen that Borda utilities yield different results from the k-approval case, as would any utility function satisfying the conditions of Che and Tercieux (2018). However, we do not expect that the relative order of the algorithms with respect to utilitarian welfare (naive Boston beats adaptive Boston, which beats serial dictatorship) will change.

8.2. Ideas for Future Work

An obvious question that we did not answer here is what the expected rank of the item gained by a random agent is. For SD, the asymptotic answer Θ(logn) was derived by Frieze and Pittel (1995), and this was refined to an exact formula ((n+1)Hnn)/nlogn by Knuth (1996). For the Boston mechanisms, Corollary 4 shows that the rank is o(n), but we suspect that it is much smaller. It is known (Nikzad 2022, Ortega and Klein 2022) that for the rank-maximizing mechanism RM (Featherstone 2020), which maximizes the number of agents receiving their first choice, then subject to that the number of agents receiving their second choice, etc., the expected average rank in a random market is asymptotically constant. Although similar to RM at first sight, naive Boston is not as strict because it makes a choice based on tiebreaking at the first round and hence, may diverge from RM even at the second round. Based on numerical simulation, we conjecture that the expected average ranks for naive Boston and adaptive Boston are each in fact Θ(logn) but smaller by a constant factor than that for SD.

Another important question concerns the expectation of the largest rank obtained by some agent (the egalitarian welfare). This coincides for NB with the number of rounds of the algorithm. The techniques of this paper do not allow us to answer this question. We conjecture based on the size of ωr that for NB, the expected egalitarian welfare is Θ(n). Note that the analogous quantities for SD and RM are at least n/2 and about log2n, respectively (Ortega and Klein 2022).

A simple idea that will reduce order bias is to reverse the order in which agents choose at each round (or just at the second round). Quantifying the improvement via an analysis analogous to that in this paper is not easy because it is no longer clear that the worse-off agent will be the last one in the initial choosing order; indeed, preliminary analysis shows it is not. We leave further exploration of this variant for future work.

The Boston algorithms discussed here are specializations of algorithms used for school choice to the case where each school has a single seat and schools have a common preference order over applicants. Further analysis of school choice mechanisms in the general case, from the viewpoint of welfare and order bias, would be very desirable. It would also be interesting to study welfare and order bias in the multiunit assignment model used by Budish and Cantillon (2012).

The basic results on distribution of exit time and rank of the item received by an agent may be useful in other contexts. As noted by Knuth (1996), there is direct connection between the analysis here of SD and that of uniform hashing. Naive Boston corresponds to a form of hashing where if a slot is full, the item must go to the back of the queue and wait its turn to be rehashed.

The order bias of the Boston algorithms, although smaller than that of SD, is still rather large. Thus, if this fairness criterion is important, it makes sense to use a mechanism like top trading cycles, which is strategyproof and has zero order bias in this situation with respect to every scoring rule (Freeman et al. 2021). Note that because the top trading cycles mechanism (TTC) (with a randomly chosen endowment) is equivalent to the randomized version of SD, RSD (Abdulkadiroğlu and Sönmez 1998), and SD does not give up much in utilitarian welfare to NB, TTC may be a good choice if preferences of agents are well described by IC.

Acknowledgments

The authors acknowledge useful feedback by Nick Arnosti and Rupert Freeman.

References

  • Abdulkadiroğlu A, Sönmez T (1998) Random serial dictatorship and the core from random endowments in house allocation problems. Econometrica 66(3):689–701.Google Scholar
  • Abdulkadiroğlu A, Sönmez T (2003) School choice: A mechanism design approach. Amer. Econom. Rev. 93(3):729–747.Google Scholar
  • Abdulkadiroğlu A, Che YK, Yasuda Y (2011) Resolving conflicting preferences in school choice: The “Boston mechanism” reconsidered. Amer. Econom. Rev. 101(1):399–410.Google Scholar
  • Abdulkadiroğlu A, Pathak PA, Roth AE, Sönmez T (2005) The Boston public school match. Amer. Econom. Rev. 95(2):368–371.Google Scholar
  • Budish E, Cantillon E (2012) The multi-unit assignment problem: Theory and evidence from course allocation at Harvard. Amer. Econom. Rev. 102(5):2237–2271.Google Scholar
  • Calsamiglia C, Fu C, Güell M (2020) Structural estimation of a model of school choices: The Boston mechanism vs. its alternatives. J. Polital Econom. 128(2):642–680.Google Scholar
  • Che YK, Tercieux O (2018) Payoff equivalence of efficient mechanisms in large matching markets. Theoret. Econom. 13(1):239–271.Google Scholar
  • Dur U, Mennle T, Seuken S (2018) First-choice maximal and first-choice stable school choice mechanisms. Tardos E, Elkind E, Vohra R, eds. Proc. 2018 ACM Conf. Econom. Comput. (ACM, New York), 251–268.Google Scholar
  • Durrett R (2019) Probability: Theory and Examples, Cambridge Series in Statistical and Probabilistic Mathematics, 5th ed. (Cambridge University Press, Cambridge, UK).Google Scholar
  • Ergin H, Sönmez T (2006) Games of school choice under the Boston mechanism. J. Public Econom. 90(1–2):215–237.Google Scholar
  • Featherstone C (2020) Rank efficiency: Modeling a common policymaker objective. Working paper, Baylor University, Waco, TX., https://clayton-featherstone.github.io/.Google Scholar
  • Feller W (1970) An Introduction to Probability Theory and Its Applications, vol. 1, 3rd ed. (Wiley, New York).Google Scholar
  • Freeman R, Pritchard G, Wilson MC (2021) Order symmetry: A new fairness criterion for assignment mechanisms. Preprint, submitted July 20, https://doi.org/10.31235/osf.io/xt37c.Google Scholar
  • Frieze A, Pittel BG (1995) Probabilistic analysis of an algorithm in the theory of markets in indivisible goods. Ann. Appl. Probab. 5(3):768–808.Google Scholar
  • Hylland A, Zeckhauser R (1979) The efficient allocation of individuals to positions. J. Political Econom. 87(2):293–314.Google Scholar
  • Knuth DE (1996) An exact analysis of stable allocation. J. Algorithms 20(2):431–442.Google Scholar
  • Kojima F, Ünver MU (2014) The “Boston” school-choice mechanism: An axiomatic approach. Econom. Theory 55(3):515–544.Google Scholar
  • Mennle T, Seuken S (2021) Partial strategyproofness: Relaxing strategyproofness for the random assignment problem. J. Econom. Theory 191(2021):105144.Google Scholar
  • Miralles A (2009) School choice: The case for the Boston mechanism. Internat. Conf. Auctions Market Mechanisms Their Appl. (Springer, Berlin), 58–60.Google Scholar
  • Nikzad A (2022) Rank-optimal assignments in uniform markets. Theoret. Econom. 17(1):25–55.Google Scholar
  • Ortega J, Klein T (2022) A more efficient and egalitarian mechanism for school choice. Preprint, submitted July 15, https://arxiv.org/abs/2204.07255.Google Scholar
  • Pathak PA, Sönmez T (2008) Leveling the playing field: Sincere and sophisticated players in the Boston mechanism. Amer. Econom. Rev. 98(4):1636–1652.Google Scholar
  • Pycia M (2019) Evaluating with statistics: Which outcome measures differentiate among matching mechanisms? Working paper, University of Zurich, Zurich.Google Scholar
  • Shapley L, Scarf H (1974) On cores and indivisibility. J. Math. Econom. 1(1):23–37.Google Scholar