Open Access

Queuing Uncertainty of Limit Orders

Bart Zhou Yueshen
[email protected]
https://orcid.org/0000-0002-4326-3658
Lee Kong Chian School of Business, Singapore Management University, Singapore 178899

Published Online: 17 Sep 2025

https://doi.org/10.1287/mnsc.2023.03371

Abstract

Limit orders submitted around the same time are subject to random latencies and will be queued accordingly. In equilibrium, end-of-queue limit orders always lose money—the liquidity supply appears excessive. The model generates empirical predictions regarding such “overshooting” liquidity: (i) new limit orders appear fleeting—clustered submissions are followed by immediate cancellations, (ii) the resulting cancel-to-add count ratio reflects adverse selection, and (iii) the cancel-to-add size ratio measures high-frequency market-making activity. Welfare can be hurt by the overshooting liquidity if it induces excessive speculation. Overall, the model contributes to a more comprehensive understanding and better utilization of order book data.

This paper was accepted by Agostino Capponi, finance.

Supplemental Material: The supplementary appendix and data files are available at https://doi.org/10.1287/mnsc.2023.03371.

1. Introduction

This paper studies the liquidity provision in limit order markets, the most prevalent market structures of financial securities trading. The key underlying premise, also the main departure from the canonical models, is the impossibility of perfectly timing one’s action.

This premise is well grounded in the ultrafast modern financial markets. Technologies have remarkably reduced latencies from minutes (Biais et al. 1995) to milliseconds (Hasbrouck and Saar 2013), with the latest timestamps in nanoseconds (e.g., the Daily TAQ data set since mid-2016). Attempting to perfectly time one’s actions in such a low-latency environment would be an arduous endeavor. The observed snapshots of order books do not reflect the real-time supply and demand: it takes time—even just a few milli- or microseconds—for electronic messages to travel from the data feed to the trader, and during this split second, the order book might have evolved already (Ding et al. 2014).

For liquidity suppliers, such timing difficulty implies a specific form of uncertainty for their limit orders: a supplier does not know in what sequence, relative to competitors’ orders, the exchange will process their submitted limit orders. Such sequencing matters because of the time priority rule¹ and because of the increasing adverse selection (see, e.g., the survey by Parlour and Seppi 2008): under time priority, a limit order queued far back will be executed only after all the preceding orders have been executed. This requires a large (combination of) market order(s), which is indicative of significant adverse information. Therefore, the profitability of the same limit order will be high if it is first in the queue and low, even negative, if it is last. But at the time of order submission, the exact queue position cannot be known; there is only queuing uncertainty.

The prevalent market fragmentation makes matters worse. For example, suppose the aggregate depth at the national best ask is two units, each from a different exchange. Both of these limit orders are the sole top orders (hence, no queuing uncertainty) within the respective exchanges. But their aggregate queue positions are still uncertain, as a market(able) order might arrive first at either venue. That is, even if a trader knows exactly their limit order’s place in one exchange’s order book, they still do not know its queue position across all venues.

This paper develops a model to study the liquidity supply process in view of such queuing uncertainty. The notion of liquidity in the framework boils down to the amount of depth posted at the top of the (aggregate) order book, that is, at the best bid and ask. The focus on depth is for two reasons. First, in reality, price priority precedes time priority in (and across) all exchanges (by the no-trade-through rule). That is, only when two orders have the same limit price, possibly on different exchanges, will queuing uncertainty arise. Otherwise, the order with the higher bid or the lower ask always queues before the other.

Second, more importantly, changes of (top-of-)book depth actually constitute the lion’s share of limit order market activities and, yet, have been subject to relatively little scrutiny in the literature. Consider, for example, the TAQ data set, which is arguably the most common and heavily used data of U.S. equity trading. Figure 1 dissects all the TAQ messages of 400 randomly selected stocks during normal trading hours of all trading days in 2018, about 25.4 billion messages in total. The single largest category, close to 60%, is “NBBO price unchanged,” constituting all quote messages that affect the aggregate (displayed) depth at the NBBO (national best bid or offer) but not the NBBO prices. Such a large chunk of the TAQ data lies “dormant” in most empirical analyses (which, instead, focus on prices and trades). Turning to depth, therefore, this paper helps understand why there are so many top-of-book depth changes, what market quality they reflect, and how to “wake them up” for further research.

Figure 1. TAQ Messages by Type
*Notes*. This figure decomposes the TAQ data set according to the types of messages. The sample is based on all the messages during trading hours (9:30 a.m. to 4:00 p.m. Eastern time) of 400 randomly selected stocks across all trading days in 2018. There are in total 25.39 billion such messages, which are then classified according to the respective labels. The percentages are computed as the total number of the respective message type divided by 25.39 billion. For example, there are, in total, 1.44 billion trade messages and, hence, $5.7\% = 1.44 / 25.39$.

Section 2 sets up the model. A number of liquidity suppliers simultaneously submit limit orders to a one-tick empty limit order book.² These limit orders are then randomly queued. To study depth dynamics, the liquidity suppliers are allowed, with some probability, to observe their orders’ queue positions and to revise them. Finally, an investor arrives and endogenously chooses an optimal market order to trade against the randomly queued limit orders. The investor demands liquidity for two reasons: they need to hedge an endowment shock and want to speculate on a privately observed signal about the asset’s fundamentals.

Section 3 then solves the equilibrium. Section 3.1 derives how the investor combines their two trading motives into the optimal market order size, taking as given the order book depth. In particular, their hedging motive is the source of limit orders’ profit, whereas their speculation creates adverse selection that makes limit orders lose money. Section 3.2 shows that as the optimal market order becomes larger, the adverse selection component intensifies, and, consequently, front-of-queue limit orders expect higher profit than end-of-queue orders, which, in fact, lose money.

The literature has examined the order book equilibrium in similar settings (as surveyed by Parlour and Seppi 2008), and the equilibrium depth has been defined to be such that the end-of-queue marginal limit order breaks even. However, Section 3.3 highlights that this is no longer the case with queuing uncertainty, which instead makes the end-of-queue limit order always lose money. That is, the equilibrium depth always overshoots—beyond the marginally break-even level.

To see why, consider, first, a benchmark where the limit order queue is deterministic. In this case, knowing that their order will be first in queue, a liquidity supplier will occupy all profitable depth in the order book, as if they were a monopolist. That is, the last unit of their limit order must break even, and so, no further limit orders down the queue will be profitable. All other liquidity suppliers are crowded out by the first-in-queue monopolist, and there is effectively no competition.

Queuing uncertainty brings competition back to the game: all suppliers, whose orders might queue in the front probabilistically, will have incentive to compete for such front-of-queue profit. The increased competition pushes the equilibrium book depth to always exceed the monopolist (marginally break-even) level. In other words, the last unit of liquidity supply always “overshoots,” losing money in expectation. In fact, the severity of queuing uncertainty becomes a synonym of competition, bridging monopoly (no queuing uncertainty) with perfect competition (maximum queuing uncertainty).

Section 3.4 then turns to order book depth dynamics: if a liquidity supplier realizes that their submitted limit order is (likely to be) at the end of the queue, they will have an incentive to (partially) cancel it. This leads to a stylized pattern in which the book depth initially overshoots but then quickly reverts. The short-lived overshooting recalls the phenomenon of “ghost” or “phantom” liquidity, which often has the negative connotation of being elusive, intangible, and possibly associated with ill-purposed strategies such as quote stuffing. Instead, joining Baruch and Glosten (2013), this paper argues that at least some of such fleeting, short-lived liquidity is a natural equilibrium manifestation of liquidity suppliers’ competition.

The depth dynamics generate two additional empirical predictions. First, if liquidity suppliers turn more aggressive, exacerbating the overshooting, then there will be more cancellations. Among other factors, adverse selection is arguably a leading deterrent to liquidity suppliers’ aggressiveness. The model therefore predicts that limit orders’ cancellations (relative to submissions) should be negatively correlated with order flow toxicity.

Second, note that a faster liquidity supplier, knowing that their order is more likely to be at the top of the queue, will play more aggressively (i.e., submit larger orders) than their slower competitors. In equilibrium, therefore, larger orders tend to be faster and more likely to concentrate in the front of the queue, with smaller ones slower and in the rear. Thus, cancellations (mostly at the rear of the queue) are bound to be smaller than additions (mostly at the front). Because of this mechanism, the model predicts that a smaller cancellation size relative to addition size indicates a greater presence of faster liquidity suppliers (e.g., high-frequency market makers).

Section 4 examines how queuing uncertainty affects welfare in terms of total gains from trade. The analysis shows that the liquidity overshoot resulting from queuing uncertainty initially improves welfare but might eventually hurt it. This happens because the investor in the model is the inefficient holder of the asset compared with liquidity suppliers (because of their private costs, such as their inventory and/or risk aversion). And yet, the investor might speculate, based on their private information, to buy too much of the asset. A social planner disapproves of such inefficient overbuying, but the overshooting liquidity that queuing uncertainty creates actually indulges it.

This novel, nonmonotonic link between market liquidity and welfare sheds new light on market design issues. Notably, many protocols aiming at taking away speed advantages, such as speed bumps and latency floors, can have potential detrimental welfare effects: leveling everyone’s speed, they essentially increase limit orders’ queuing uncertainty, intensify competition, and, consequently, might induce too much inefficient speculative trading.

1.1. Contributions and Related Literature

The paper primarily contributes to the literature of limit order book models. The review below explains how the key friction, the equilibrium characterization, and the applications of the current paper relate to other works in this literature. The key model friction, queuing uncertainty, relates to the literature studying smart order routing in settings with fragmented trading venues, as in Foucault and Menkveld (2008). In these models, because of the random types of market orders, a limit order could be executed either in the front of the aggregate queue (against a local market order) or in the back (against a smart order). That is, the uncertain market order types can be a particular source of limit orders’ queuing uncertainty. This paper instead shows that queuing uncertainty can still emerge even without random types of market orders. Additionally, this paper adds novel empirical predictions about order book depth as well as welfare implications of liquidity overshooting.

More broadly speaking, the friction of queuing uncertainty relates to the “pick-off risk” that arises when news suddenly arrives and liquidity suppliers cannot timely cancel their stale limit orders; see, among many others, Menkveld and Zoican (2017), for example. In these models, there is queuing uncertainty between the cancellations of limit orders and the arrivals of (news-driven) market orders. Instead, this paper focuses on the queuing uncertainty among limit orders, with the possibly informed market orders arriving at the end. This paper thus extends the idea of imperfect timing of liquidity taking (pick-off risks) to also liquidity supply.

In terms of equilibrium characterization, the most relevant works are those with nontrivial depth. There are two different definitions of equilibrium depths in this literature. First, many authors require that the marginal depth break even in equilibrium, such as Seppi (1997) and Sandås (2001) (see the review in Parlour and Seppi 2008, section 2). Yet another definition is to require that liquidity suppliers earn zero profit altogether, as in Glosten (1994), for example. Under the former definition, liquidity suppliers together earn the monopolist profit, whereas under the latter, they perfectly compete to earn zero. This paper bridges these two polar cases with queuing uncertainty: with maximum queuing uncertainty, competition becomes so fierce and book depth overshoots so much that liquidity suppliers all earn zero, and without queuing uncertainty, the fastest liquidity supplier is effectively a monopolist, filling the depth until their marginal profit is zero.

In terms of the applications, the paper’s focus on book depth differentiates it from two large classes of limit order book models. First, the static equilibrium has also been characterized in an order book with continuous prices, as in Glosten (1989, 1994), Biais et al. (2000, 2013), and Back and Baruch (2013). Whereas price continuity helps characterize the shape of the equilibrium order book elegantly, it makes queuing uncertainty, the focus of this paper, difficult to conceptualize.³ Second, for tractability, it has become standard to restrict order sizes to be one unit, following Glosten and Milgrom (1985) and Easley and O’Hara (1987, 1992). The book depth then, uninterestingly, degenerates to a constant of one unit. In these two classes of models, instead of depth, the focal liquidity measures are price based, such as bid-ask spread and price impact.

The model prediction that the overshooting liquidity is immediately canceled adds to the literature documenting and explaining so-called short-lived limit orders. For example, Hasbrouck and Saar (2009) find evidence consistent with traders using “fleeting orders” to chase price trends or search for latent liquidity. Egginton et al. (2016) document that the phenomenon of “quote stuffing” is pervasive in equity markets. Hasbrouck (2018) examines quote volatility at high frequency and argues that the patterns more likely result from recurrent cycles of undercutting. Degryse et al. (2024) show that orders duplicated on competing exchanges are swiftly canceled as soon as one of the duplicates is executed. Dahlström et al. (2024) study the cancellations of limit orders in general. From the theory side, both Baruch and Glosten (2013) and Bhattacharya and Saar (2021) predict that short-lived quotes can arise naturally in equilibrium, and this paper joins such an innocuous view to explain short-lived limit orders but from the novel angle of their queuing uncertainty.

The literature focusing on order book depth remains limited. Kavajecz (1999) studies how specialists contribute to depths, focusing on adverse selection and inventory management. Kavajecz and Odders-White (2001) analyze a structural model of specialists’ price schedules in terms of both quotes and depths. Kavajecz and Odders-White (2004) show that various technical analyses predict depth patterns in order books. Over the decades, the market structure has changed drastically, with fragmented electronic limit order books taking over specialist markets. Providing a theoretical framework (and novel predictions) to examine the process of limit order submission, this paper hopes to stimulate new research to analyze book depth empirically, waking up the dormant lion’s share of the widely available data of equity trading (Figure 1).

2. Model Setup

2.1. Overview

The model concerns the trading in a stylized limit order book market. Figure 2 illustrates the timeline. A number of market makers first submit limit orders, which are randomly queued. Then, with some probability, the queuing uncertainty is resolved—all market makers observe the queue positions of their orders and can choose to revise them. After revisions, a liquidity-demanding investor arrives and submits a market order to trade against the standing limit orders. Payoffs then realize, and the game ends. The model belongs to the category of “static equilibrium models” as surveyed by Parlour and Seppi (2008). The new feature is the queuing uncertainty of limit orders.

Figure 2. Timeline
*Notes*. This figure illustrates the timing of the events. At $t = 0$ , market makers submit new limit orders, which are then randomly queued. Queuing uncertainty is resolved only with probability $η$ . At $t = 1$ , market makers can revise their limit orders. Finally, an investor arrives and submits a market order.

2.2. Asset

Each unit of the traded asset pays a random V units of the numéraire good after trading.

2.3. Limit Order Book

The limit order book has one tick, that is, only two prices, the best bid b and the best ask a, where $a > E [V] > b$ , and it is initially empty. By symmetry, only the ask side will be analyzed.

2.4. Liquidity Supply

There are n risk-neutral limit order traders indexed by $i \in \{1, \dots, n\}$ , where n is a positive integer. These agents are referred to as “market makers” because their limit orders provide liquidity, although in practice, they can be any traders who use limit orders. They arrive at $t = 0$ , and each submits a limit order of size $q_{i 0}$ , asking at a, to maximize expected trading profit.

2.5. Queuing Uncertainty of Limit Orders

The n limit orders are randomly queued, and the realization is written as a vector $k$ of length n, where the i-th value $k_{i} \in \{1, \dots, n\}$ indicates the queue position of the limit order by market maker i. For example, if $n = 3$ , a queue realization $k = [2, 3, 1]$ means that market maker $i = 1$ is the second in the queue, $i = 2$ the third, and $i = 3$ the first. (Ties are ruled out.) Ex ante, $k$ is a random vector whose distribution is known to all and is independent of market makers’ order sizes $\{q_{i 0}\}_{i = 1}^{n}$ . Speed heterogeneity is allowed. For example, if $n = 2$ , market maker $i = 1$ is faster than $i = 2$ if $P [k = [1, 2]] > P [k = [2, 1]]$ . Supplementary Appendix S.2 proposes a microfoundation to characterize the distribution of $k$ . The uncertainty of $k$ is resolved by $t = 1$ , meaning that the market makers observe the realization of $k$ , only with probability $η \in [0, 1]$ .
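As a minimal sketch of how a random queue $k$ with speed heterogeneity could be simulated, the Python code below assumes that each market maker's latency is an independent exponential draw and that orders are queued by arrival time. This latency model and the names `sample_queue`, `estimate_queue_probabilities`, and `rates` are illustrative assumptions; they are not the microfoundation of Supplementary Appendix S.2. The code uses 0-based list indices, so `k[0]` holds the queue position of market maker $i = 1$.

```python
import random


def sample_queue(rates, rng):
    """Draw one queue realization k, where k[i] is maker i's position (1 = front).

    Assumption (not from the paper): maker i's latency is exponential with
    rate rates[i], so a higher rate means a faster maker. Orders are queued
    by arrival time; ties occur with probability zero.
    """
    latencies = [rng.expovariate(rate) for rate in rates]
    # Makers sorted from earliest to latest arrival.
    arrival_order = sorted(range(len(rates)), key=lambda i: latencies[i])
    k = [0] * len(rates)
    for position, maker in enumerate(arrival_order, start=1):
        k[maker] = position
    return k


def estimate_queue_probabilities(rates, draws=20000, seed=0):
    """Monte Carlo estimate of P[k = realization] for each observed realization."""
    rng = random.Random(seed)
    counts = {}
    for _ in range(draws):
        k = tuple(sample_queue(rates, rng))
        counts[k] = counts.get(k, 0) + 1
    return {k: count / draws for k, count in counts.items()}


if __name__ == "__main__":
    # Two makers; the first is assumed to be three times as fast on average.
    for k, prob in sorted(estimate_queue_probabilities([3.0, 1.0]).items()):
        print(list(k), round(prob, 3))
```

With unequal rates, the estimated probability of $k = [1, 2]$ exceeds that of $k = [2, 1]$, which matches the definition of market maker $i = 1$ being faster in the $n = 2$ example.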

2.6. Order Revision

A market maker who chooses to revise their order at $t = 1$ can modify its size to $q_{i 1} \in [0, q_{i 0}]$ . That is, they cannot revise the order size up: in reality, platforms allow limit order sizes to be revised only downward, which prevents a trader from swelling an existing order to push back other orders. The revisions do not affect the orders’ queue $k$ (which is formed before $t = 1$ ).

2.7. Liquidity Demand

A market order trader, called “the investor,” arrives at the end of $t = 1$ . The investor is risk neutral but incurs a quadratic inventory cost; that is, their after-trading payoff from holding x units of the asset is $V x - (ρ / 2) x^{2}$ , where $ρ$ ( $> 0$ ) measures the severity of the inventory cost. The investor privately observes a noisy signal $S = V + ε$ . Their initial inventory is zero, but they suffer an endowment shock of U units of the asset. Neither S nor U is known to the market makers. Observing the order book depth, the investor chooses the market order size x to consume liquidity.

2.8. Distribution Assumptions

The fundamental sources of randomness in the model are (i) limit orders’ queuing $k$ , (ii) a Bernoulli draw of whether the queuing uncertainty is resolved by $t = 1$ , and (iii) investor and asset characteristics V, $ε$ , and U. They are assumed to be all independent of each other. In particular, $\{V, ε, U\}$ are jointly normal with $E [V] = E [ε] = E [U] = 0$ and respective variances $\{τ_{V}^{- 1}, τ_{ε}^{- 1}, τ_{U}^{- 1}\}$ .

2.9. Model Discussions

Remark 1

(One-Tick, Empty Order Book). The limit order book is extremely stylized. The one-tick assumption is motivated by the empirical fact that most of the order book activity concentrates on the best prices (Figure 1). The empty-book assumption is also realistic, as empty best-price levels appear frequently throughout trading hours. For example, after a large “sweeping” order or when significant news arrives, market makers compete to provide liquidity at the new, empty price level.

Remark 2

(Limit Orders’ Queuing Uncertainty). The queuing uncertainty of limit orders arises from various sources. First, agents’ reaction speeds differ because of, for example, their infrastructure investments and distances from the exchanges. Second, even if two market makers react exactly at the same time, transmission latencies are random in nature. Third, market fragmentation and order (re)routing play an important role. Even if one order is top of queue in, for example, NYSE, the incoming market order might execute first elsewhere. This top order in NYSE is executed only after the depth on, for example, NASDAQ is depleted and if the remainder of the market order is routed to NYSE. That is, the NYSE top order might still be queued behind NASDAQ orders. Fourth, exchanges’ designs matter. Notably, various forms of “speed bumps” can affect the queue realization, as discussed in Section 4.2.

Note also that the queuing uncertainty only pertains to the n limit orders, with the market order always arriving last. This is an intentional modeling choice so as to be consistent and more comparable with existing static limit order book models. Blending limit and market orders with queuing uncertainty is similar to models featuring “pick-off risks” such as Menkveld and Zoican (2017). In these models, after sudden news, market orders rush to pick off stale limit orders, which are canceled in time only probabilistically: there is queuing uncertainty between market order arrival and limit order cancellation. This paper extends the idea of imperfect timing also to, and focuses on, liquidity supply. Importantly, there is no fundamental news in the current model, and hence, the equilibrium order revisions are driven only by the resolution of queuing uncertainty.

Remark 3

(Resolution of Queuing Uncertainty). Whereas ex ante—before submitting their limit orders—market makers always face queuing uncertainty, ex post—after submission—such uncertainty can be resolved. After receiving an order, some platforms return a confirmation of receipt with the order’s queue position to the submitter. Even without such direct reports, still, a market maker can (imperfectly) infer their queue position by comparing the timestamps of the confirmation and the recent book updates from various data feeds. It is also common practice for traders to frequently ping exchanges (sending irrelevant limit orders, e.g., deep in the book), only to receive timestamps that can be used to construct empirical latency distributions, helping infer the relevant orders’ queue positions.

There are also practical reasons why queuing uncertainty might not be resolved (in time). First, the market order might arrive too soon, before the market makers can parse the queue information and react accordingly.⁴ Second, even when there is sufficient time before the market order arrives, limit order traders do not always perfectly learn their queue positions: (i) not all platforms disseminate queue positions, or they do not always do so in a timely manner; (ii) without sufficient historical latency data, the inferred queue position might not be reliable; and (iii) even if an order’s exact queue position is known within an exchange, its queue position across all exchanges is still uncertain. The parameter $η$ reflects how plausible it is to resolve queuing uncertainty.

Remark 4

(Liquidity Demand). The liquidity demand can be modeled differently; see the examples in Supplementary Appendix S.3. These alternative setups do not qualitatively affect the findings in Section 3. The above specification is chosen mainly for the welfare analysis in Section 4, where the aggregate gains from trade can be computed because all agents are fully rational.

Remark 5

(Other Omitted Features of Limit Order Markets). The model abstracts from many other realistic and important features of limit order markets: the interaction of size and price discreteness (Li and Ye 2023), fee structures (Colliard and Foucault 2012, Riccó et al. 2021), alternative priority rules (Aspris et al. 2015, Degryse and Karagiannis 2022), and queue rationing and jumping (Buti et al. 2015, Yao and Ye 2018). It is the hope of this paper that the novel mechanism of queuing uncertainty and the resulting overshooting book depth will interact with and enrich future analyses of these aspects of limit order markets.

3. Equilibrium

This section solves the model and derives empirical predictions. The equilibrium is analyzed backward by first deriving the investor’s optimal market order, then characterizing market makers’ limit order profitability, and, finally, examining the limit order submission and revision strategies.

3.1. The Investor’s Optimal Market Order

Arriving in period $t = 1$ , the investor observes the depth y ( $\geq 0$ ) at the ask price a. (For now, y is taken as given, though later in equilibrium, y will be endogenously determined as the sum of the limit order sizes, i.e., $y = \sum_{i = 1}^{n} q_{i t}$ .) Conditional on their private information $\{S, U\}$ , the investor then chooses the market order size x ( $\geq 0$ , to buy) to maximize their expected trading gain (the expected payoff after trading minus that without trading):

$$
\begin{aligned}
& \max_{0 \leq x \leq y} E \left[ U V + (V - a) x - \frac{ρ}{2} (U + x)^{2} \mid S, U \right] - E \left[ U V - \frac{ρ}{2} U^{2} \mid S, U \right] \\
\Leftrightarrow {} & \max_{0 \leq x \leq y} (E [V \mid S] - ρ U - a) x - \frac{ρ}{2} x^{2} .
\end{aligned} \tag{1}
$$

For notation simplicity, define z as the investor’s pretrade marginal valuation for the asset; that is,

$$
z ≔ E [V \mid S] - ρ U = E [V] + θ \cdot (S - E [V]) - ρ U = θ S - ρ U, \quad \text{where } θ ≔ \frac{τ_{ε}}{τ_{V} + τ_{ε}} \in (0, 1) . \tag{2}
$$

The quadratic optimization problem (1) has a possibly cornered solution:

$$
x (z; y) = \min \left\{ y, \max \left\{ 0, \frac{z - a}{ρ} \right\} \right\} . \tag{3}
$$

That is, the investor submits a market (buy) order only if their pretrade marginal valuation z is above the ask price a, and their order size is capped by the depth y if $z \geq a + ρ y$ .
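The cornered solution (3) is the unconstrained optimum $(z - a) / ρ$ of the quadratic objective in (1), clipped to the interval $[0, y]$. A minimal Python sketch follows; the function names and the numerical inputs in the usage lines are illustrative assumptions, with $a = 5.0$ and $ρ = 1.0$ borrowed from the notes to Figure 3.

```python
def trading_gain(x, z, a, rho):
    """Objective in (1) after simplification: (z - a) * x - (rho / 2) * x^2."""
    return (z - a) * x - 0.5 * rho * x * x


def optimal_market_order(z, y, a, rho):
    """Equation (3): the unconstrained optimum (z - a) / rho clipped to [0, y]."""
    return min(y, max(0.0, (z - a) / rho))


if __name__ == "__main__":
    a, rho, depth = 5.0, 1.0, 3.0
    for z in (4.0, 6.5, 10.0):  # hypothetical pretrade marginal valuations
        x = optimal_market_order(z, depth, a, rho)
        print(z, x, trading_gain(x, z, a, rho))
```

The three valuations cover the three regions of (3): no trade when $z \leq a$, an interior order when $a < z < a + ρ y$, and an order capped by the depth when $z \geq a + ρ y$.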

3.2. Limit Orders’ Profitability

Given the optimal market order (3), how profitable are the limit orders? Let $\dot{π} (y)$ be the marginal profit of the y-th unit of limit order at the best ask a. Then, the profit of the first y units of limit order is $π (y) = \int_{0}^{y} \dot{π} (\tilde{y}) d \tilde{y}$ . As such, for example, a first limit order of size three expects $π (3)$ , and if another limit order of size 10 is appended behind, it expects $\int_{3}^{3 + 10} \dot{π} (\tilde{y}) d \tilde{y} = π (13) - π (3)$ .

The exact functional form of $\dot{π} (y)$ can be derived using (3). In particular, the y-th unit of the limit order is executed if and only if $x (z; y) \geq y$ . Therefore, the expected marginal profit is

$$
\dot{π} (y) = P [x (z; y) \geq y] \, (a - E [V \mid x (z; y) \geq y]) = \int_{a + ρ y}^{\infty} (a - ϕ z) f (z) \, d z, \tag{4}
$$

where the second equality writes $f (z)$ as the unconditional density of z and uses Bayesian updating to obtain $E [V \mid z] = E [V] + ϕ \cdot (z - E [V]) = ϕ z$ , with $ϕ ≔ θ τ_{U} / (θ τ_{U} + ρ^{2} τ_{V}) \in (0, 1)$ reflecting the severity of adverse selection. Accordingly, the total profit $π (y)$ can be pinned down by integrating $\dot{π} (y)$ , subject to the terminal condition $π (0) = 0$ .
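The expression for $ϕ$ can be checked with the normal projection formula. As a sketch, using only $z = θ S - ρ U$ from (2) and the independence and zero-mean assumptions of Section 2.8:

$$
\begin{aligned}
\operatorname{Cov} (V, z) &= θ \operatorname{Var} (V) = \frac{θ}{τ_{V}}, \\
\operatorname{Var} (z) &= θ^{2} \left( τ_{V}^{- 1} + τ_{ε}^{- 1} \right) + ρ^{2} τ_{U}^{- 1} = \frac{θ}{τ_{V}} + \frac{ρ^{2}}{τ_{U}}, \\
ϕ &= \frac{\operatorname{Cov} (V, z)}{\operatorname{Var} (z)} = \frac{θ / τ_{V}}{θ / τ_{V} + ρ^{2} / τ_{U}} = \frac{θ τ_{U}}{θ τ_{U} + ρ^{2} τ_{V}} .
\end{aligned}
$$

The second line uses $θ^{2} (τ_{V}^{- 1} + τ_{ε}^{- 1}) = θ / τ_{V}$ , which follows from the definition of $θ$ in (2).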

Figure 3(a) illustrates the marginal profit function $\dot{π} (y)$ , and several properties are worth highlighting. Notably, $\dot{π} (y)$ is initially positive but eventually negative, crossing zero once and only once at $y = \bar{y}$ . That is, if the market makers’ limit orders form a queue, those at the front expect positive profit, whereas those at the rear expect losses. This is the “top-of-queue advantage” of limit orders: end-of-queue limit orders are (i) less likely to be executed, and (ii) conditional on being executed, they are more likely adversely selected by a more informed market order. Figure 3(b) plots the total profit $π (y)$ . Consistent with (a), $π (y)$ initially increases, peaks at $\bar{y}$ , and then drops, eventually becoming negative. The following lemma characterizes these features formally.

Figure 3. (Color online) Illustration of Limit Order Profit
*Notes*. (a) and (b) plot, respectively, the marginal profit $\dot{π} (y)$ and the total profit $π (y)$ of limit orders. The threshold $\bar{y}$ is where the marginal profit $\dot{π} (y) = 0$ , thus maximizing the total profit $π (y)$ . The threshold $\bar{\bar{y}}$ is where $π (y) = 0$ . For this illustration, the parameters are set as $a = 5.0$ , $τ_{V} = 0.3$ , $τ_{U} = 2.0$ , $τ_{ε} = 1.1$ , and $ρ = 1.0$ .
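The shapes described above can be reproduced numerically. The Python sketch below evaluates (4) in closed form under the parameter values in the notes to Figure 3, using the fact that z is normal with mean zero. The function names, the trapezoid rule, and the bisection bracket are illustrative choices made here, not part of the paper.

```python
import math

# Parameter values taken from the notes to Figure 3.
A, TAU_V, TAU_U, TAU_EPS, RHO = 5.0, 0.3, 2.0, 1.1, 1.0

THETA = TAU_EPS / (TAU_V + TAU_EPS)                       # equation (2)
PHI = THETA * TAU_U / (THETA * TAU_U + RHO ** 2 * TAU_V)  # weight in E[V | z]
# Standard deviation of z = THETA * S - RHO * U under the independence assumptions.
SIGMA_Z = math.sqrt(THETA / TAU_V + RHO ** 2 / TAU_U)


def normal_pdf(u):
    """Standard normal density."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def normal_sf(u):
    """Upper tail probability of a standard normal."""
    return 0.5 * math.erfc(u / math.sqrt(2.0))


def marginal_profit(y):
    """Equation (4): integral of (A - PHI * z) f(z) over z >= A + RHO * y."""
    u = (A + RHO * y) / SIGMA_Z
    return A * normal_sf(u) - PHI * SIGMA_Z * normal_pdf(u)


def profit_between(lo, hi, steps=2000):
    """Expected profit of a limit order occupying depth [lo, hi] (trapezoid rule)."""
    h = (hi - lo) / steps
    inner = sum(marginal_profit(lo + i * h) for i in range(1, steps))
    return h * (0.5 * marginal_profit(lo) + inner + 0.5 * marginal_profit(hi))


def break_even_depth(lo=0.0, hi=5.0, tol=1e-10):
    """Bisection for y_bar, assuming the marginal profit changes sign once on [lo, hi]."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if marginal_profit(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    y_bar = break_even_depth()
    print("y_bar:", y_bar)
    print("pi(y_bar):", profit_between(0.0, y_bar))
    print("pi(13) - pi(3):", profit_between(3.0, 13.0))
```

Here `profit_between(0.0, y)` plays the role of $π (y)$ , and `profit_between(3.0, 13.0)` corresponds to the earlier example of a size-10 order appended behind a size-three order.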

Lemma 1

(Top-of-Queue Advantage of Limit Orders). There exist two exogenous thresholds $a^{*}$ ( $> E [V] = 0$ ) and $τ_{U}^{*}$ ( $> 0$ ), as determined by (A.1) and (A.2) in the proof, respectively. Suppose $a > a^{*}$ and $τ_{U} > τ_{U}^{*}$ . Then,

(i) the marginal profit $\dot{π} (\cdot)$ satisfies $\dot{π} (0) > 0$ , crosses zero once and only once at $y = \bar{y} > 0$ , and is strictly decreasing on $y \in (0, \bar{y}]$ ;

(ii) the total profit $π (\cdot)$ equals zero only at $y = 0$ and at $y = \bar{\bar{y}} > \bar{y}$ , is strictly positive when $y \in (0, \bar{\bar{y}})$ , peaks at $y = \bar{y}$ , and is strictly negative when $y > \bar{\bar{y}}$ .

From here on, the analysis shall always assume that $a > a^{*}$ and $τ_{U} > τ_{U}^{*}$ . Three comments are in order. First, a sufficiently high ask price a ( $> a^{*}$ ) is required to ensure $\dot{π} (y) > 0$ at least for some $y > 0$ . Intuitively, if an ask price is too low, insufficient to compensate for adverse selection, then no limit order will be posted there.⁵ In other words, with a more elaborate order book of multiple ticks, market makers will always find a sufficiently high ask price that is profitable enough to provide liquidity.

Second, a sufficiently large $τ_{U}$ makes market makers’ adverse selection sufficiently severe, which is a commonly needed assumption for limit order book models (see, e.g., Back and Baruch 2013). Intuitively, a large $τ_{U}$ means more precise hedging motive and, so, a more informed market order from the investor. Specifically, $τ_{U} > τ_{U}^{*}$ ensures the existence of the zero-profit depth, that is, $\bar{\bar{y}} < \infty$ , or equivalently, $\lim_{y \to \infty} π (y) < 0$ , so that the aggregate depth, that is, $\sum_{i = 1}^{n} q_{i 0}$ , is always bounded in equilibrium. The same can be achieved by assuming, instead, a sufficiently large $τ_{ε}$ .

Third, it is worth emphasizing that the above top-of-queue property is more general than the current specific setup. Supplementary Appendix S.3 demonstrates that the above features of $\dot{π} (\cdot)$ and $π (\cdot)$ also naturally emerge in other commonly used microstructure frameworks. In fact, the results in Sections 3.3 and 3.4 only require the features of $π (\cdot)$ as characterized in Lemma 1. The specific microfoundation (the way the market order is endogenized) is useful for the welfare analysis in Section 4.

3.3. Without Resolution of Queuing Uncertainty

Consider first a benchmark without the resolution of queuing uncertainty, that is, setting $η = 0$ . Because there is no new information about the random queue $k$ , market makers will not revise their orders in $t = 1$ , even if they can do so. In this sense, in the upper branch of Figure 2, the event of “market makers revise limit orders” can be omitted.

3.3.1. Equilibrium Limit Order Sizes in $t = 0$ .

Because there will be no revision in $t = 1$ , it only remains to determine market makers’ optimal limit order sizes $\{q_{i 0}\}$ in $t = 0$ . Consider market maker i, whose limit order’s queue position is written as $k_{i}$ . Then, the aggregate size of the orders that are queued before i’s order is

Q_{i}^{≺} (k) ≔ \sum_{j = 1}^{n} q_{j 0} 𝟙_{{k_{j} < k_{i}}},

(5)

where the superscript “ $≺$ ” emphasizes that $Q_{i}^{≺}$ only counts the limit orders strictly before i’s order. The market maker cares about the size of this $Q_{i}^{≺}$ , but not the orders queued behind theirs.

Given others’ order sizes $q_{j 0}$ (where $j \neq i$ ), market maker i then chooses their own order size $q_{i 0}$ to maximize their expected profit, where the expectation is taken over the random queue $k$ :

q_{i 0} \in \arg \max_{q_{i 0}} E [π (Q_{i}^{≺} (k) + q_{i 0}) - π (Q_{i}^{≺} (k))] .

(6)

A (pure-strategy) Nash equilibrium is a set $\{q_{10}, \dots, q_{n 0}\}$ that solves (6) for all $i \in \{1, \dots, n\}$ .

Lemma 2

(Nash Equilibrium). If there is no resolution of queuing uncertainty, then there exists a pure-strategy Nash equilibrium $\{q_{10}, \dots, q_{n 0}\} \in [0, \bar{y}]^{n}$ , where $\bar{y}$ is the unique solution to $\dot{π} (y) = 0$ as given in Lemma 1. In such an equilibrium, the first-order condition

E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] = 0,

(7)

holds at least for one market maker i.

Note that Lemma 2 gives an upper bound, $\bar{y}$ , for each individual’s equilibrium limit order size. This is because of the top-of-queue advantage: no market maker will post more than $\bar{y}$ units, for any depth beyond this marginally break-even point always loses, regardless of the queue realization.⁶

3.3.2. Liquidity Overshoot.

The equilibrium depth always satisfies the following property.

Proposition 1

(Liquidity Overshoot). If there is no resolution of queuing uncertainty, then the equilibrium depth $\hat{y} ≔ \sum_{i} q_{i 0} \geq \bar{y}$ . That is, liquidity overshoots in the sense that $\dot{π} (\hat{y}) \leq 0$ . The inequalities are strict if and only if no market maker is almost surely the fastest, that is, if and only if $P [k_{i} = 1] < 1$ for all i.

To shed some light, consider the following example, which gives an analytical solution.

Example 1

(Two Market Makers, Linear Marginal Profit). Suppose that there are $n = 2$ market makers, i and j. Write $β ≔ P [k = [i, j]] \in [0, 1]$ , that is, the probability for i’s order to queue before j’s. Further, to obtain an analytical solution, take the linear approximation for the marginal profit function: $\dot{π} (y) \approx \dot{π} (0) + \ddot{π} (0) y$ so that the marginally break-even depth is $\bar{y} \approx - \frac{\dot{π} (0)}{\ddot{π} (0)}$ . (Note from Lemma 1 that $\dot{π} (0) > 0$ and $\ddot{π} (0) < 0$ , and so, $\bar{y} > 0$ .) The linearization ensures the strict second-order condition, and so, the First-Order Condition (7) suffices for the best response. Then, taking as given each other’s limit order, the two market makers solve

\left. \begin{array}{l} β \dot{π} (q_{i 0}) + (1 - β) \dot{π} (q_{j 0} + q_{i 0}) \approx \dot{π} (0) - \frac{\dot{π} (0)}{\bar{y}} (q_{i 0} + (1 - β) q_{j 0}) = 0 \\ β \dot{π} (q_{i 0} + q_{j 0}) + (1 - β) \dot{π} (q_{j 0}) \approx \dot{π} (0) - \frac{\dot{π} (0)}{\bar{y}} (β q_{i 0} + q_{j 0}) = 0 \end{array} \right\} \Rightarrow q_{i 0} = \frac{β \bar{y}}{1 - (1 - β) β} , \quad q_{j 0} = \frac{(1 - β) \bar{y}}{1 - (1 - β) β},

(8)

which is the unique equilibrium solution. Indeed, the equilibrium depth is $\hat{y} = q_{i 0} + q_{j 0} = \bar{y} / (1 - (1 - β) β) \geq \bar{y}$ , where the overshoot is strict if and only if $(1 - β) β > 0 \Leftrightarrow 0 < β < 1$ ; that is, $P [k_{i} = 1] < 1$ , and $P [k_{j} = 1] = 1 - P [k_{i} = 1] < 1$ .
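As a numerical companion to Example 1, the following Python sketch solves the two linearized first-order conditions exactly with rational arithmetic. The function name `linear_equilibrium` and the normalization $\bar{y} = 1$ are illustrative assumptions, not part of the original analysis.

```python
from fractions import Fraction

def linear_equilibrium(beta, y_bar=Fraction(1)):
    """Solve the two linearized first-order conditions of Example 1.

    q_i + (1 - beta) * q_j = y_bar   (market maker i)
    beta * q_i + q_j       = y_bar   (market maker j)
    """
    beta = Fraction(beta)
    det = 1 - (1 - beta) * beta                 # determinant of the 2x2 system
    q_i = (y_bar - (1 - beta) * y_bar) / det    # Cramer's rule
    q_j = (y_bar - beta * y_bar) / det
    return q_i, q_j

# Depth relative to y_bar for a few values of beta (y_bar normalized to 1)
for beta in (Fraction(0), Fraction(1, 4), Fraction(1, 2), Fraction(1)):
    q_i, q_j = linear_equilibrium(beta)
    print(beta, q_i, q_j, q_i + q_j)
```

Under these assumptions the total depth equals $\bar{y}$ only at $β \in \{0, 1\}$ and is largest at $β = \frac{1}{2}$ , where each market maker posts $\frac{2}{3} \bar{y}$ .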

3.3.2.1. Intuition: Queuing Uncertainty Softens Strategic Substitution.

A market maker i’s first-order condition implies $q_{i 0}$ as a function of the other market maker’s order size $q_{j 0}$ . How does $q_{j 0}$ affect $q_{i 0}$ ? By the implicit function theorem,

\frac{d q_{i 0}}{d q_{j 0}} = - \frac{(1 - β) \ddot{π} (q_{j 0} + q_{i 0})}{β \ddot{π} (q_{i 0}) + (1 - β) \ddot{π} (q_{j 0} + q_{i 0})} \approx - (1 - β) \in [- 1, 0],

where the approximation follows Example 1 above. That $\frac{d q_{i 0}}{d q_{j 0}} \leq 0$ means that $q_{i 0}$ and $q_{j 0}$ are strategic substitutes: whenever j increases their order size, j “threatens” to capture i’s profit, and this happens probabilistically if j’s order queues before i’s.

More importantly, the substitution rate is bounded by unity because of queuing uncertainty: market maker i “downplays” such a threat from j, as there is nonzero probability for j’s order to queue behind i’s, in which case the threat is ineffective—market maker i could not care less about those orders behind theirs. As a result, downplaying each other’s threats, the market makers engage in fierce competition, resulting in liquidity overshoot.

Instead, without queuing uncertainty, for example, when $β = P [k_{i} = 1] = 0$ , market maker i will never downplay the threat from j’s $q_{j 0}$ because that competitor’s order is always first in queue, resulting in $\frac{\partial q_{i 0}}{\partial q_{j 0}} = - 1$ . That is, in this case, j’s order fully substitutes—or crowds out—i’s limit order. As a result, $q_{i 0} = 0$ , $q_{j 0} = \bar{y}$ , and depth $\hat{y} = \bar{y}$ —no more overshoot.

3.3.3. Enriching the Notion of Equilibrium Depth.

The above discussion alludes to a connection between queuing uncertainty and market makers’ competition: on the one hand, when there is no queuing uncertainty, for example, when $β = 0$ , market maker j is effectively a monopolist—effectively no competition. On the other hand, when $β = \frac{1}{2}$ , which maximizes queuing uncertainty $var [k_{i}] = var [k_{j}] = (1 - β) β$ , the equilibrium depth is also maximized $\hat{y} = \frac{4}{3} \bar{y}$ —maximum competition between i and j.

This competition view of queuing uncertainty bridges two different definitions of equilibrium order book depth in the literature:

Zero-profit equilibrium: Glosten (1994, p. 1139, proposition 2(iii)) defines the equilibrium depth to be “the solution to the zero-profit condition,” that is, $\bar{\bar{y}}$ in the current model.
Marginally break-even equilibrium: Seppi (1997, p. 112, definition 1) defines the equilibrium depth to be “such that the marginal expected profit [is zero],” that is, $\bar{y}$ in the current model. Similarly, in Sandås (2001, p. 716), the equilibrium depth is such that “the quantity offered […] must be such that the last unit breaks even.”⁷

To see how, consider the following special case:

Example 2

(Homogeneous Market Makers). Each market maker has probability $\frac{1}{n}$ to be in any queue position. That is, each $k_{i}$ is independent and identically distributed (i.i.d.) from the discrete uniform distribution on $\{1, 2, \dots, n\}$ . The amount of queuing uncertainty is then characterized by a single parameter, n. The larger n is, the more queuing uncertainty each of them faces.⁸

On one extreme, if $n = 1$ , that is, without any queuing uncertainty, then this monopolist market maker always posts $q_{i 0}$ according to the First-Order Condition (7), which becomes $\dot{π} (q_{i 0}) = 0$ , implying $q_{i 0} = \hat{y} = \bar{y}$ , the marginally break-even depth. That is, without queuing uncertainty, the equilibrium conforms with the definition in Seppi (1997) and Sandås (2001).

On the other extreme, in the limit of $n \to \infty$ , the First-Order Condition (7) becomes

\lim_{n \to \infty} E \left[ \dot{π} \left( Q_{i}^{≺} (k) + \frac{\hat{y}}{n} \right) \right] = \lim_{n \to \infty} \sum_{k = 1}^{n} \frac{1}{n} \dot{π} \left( \frac{k}{n} \hat{y} \right) = \frac{1}{\hat{y}} \int_{0}^{\hat{y}} \dot{π} (y) d y = 0 \Leftrightarrow π (\hat{y}) = 0 \text{ and } \hat{y} = \bar{\bar{y}},

where the first equality simply expands the expectation, and the second equality follows the definition of the definite integral. That is, in this other polar case with maximum queuing uncertainty, the equilibrium converges to the zero-profit one in Glosten (1994), where the equilibrium depth $\hat{y}$ is the zero-profit $\bar{\bar{y}}$ .

In between, therefore, $n \in (1, \infty)$ captures both the severity of queuing uncertainty and market makers’ competition. The current model thus bridges these two views via the notion of queuing uncertainty, enriching the equilibrium notion of limit order book depths.
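To make the bridge between the two polar cases concrete, the sketch below evaluates the symmetric first-order condition for finite n. It borrows the linear marginal profit of Example 1, $\dot{π} (y) = \dot{π} (0) (1 - y / \bar{y})$ , which is an assumption made here only for tractability; the function names and the normalization $\bar{y} = 1$ are likewise illustrative.

```python
from fractions import Fraction

def symmetric_depth(n, y_bar=Fraction(1)):
    """Depth y_hat solving (1/n) * sum_k pi_dot(k * y_hat / n) = 0
    under the assumed linear pi_dot(y) = pi_dot(0) * (1 - y / y_bar)."""
    # The mean of k/n over k = 1..n is (n + 1) / (2n),
    # so the condition reads 1 - (y_hat / y_bar) * (n + 1) / (2n) = 0.
    return Fraction(2 * n, n + 1) * y_bar

def foc_residual(n, y_hat, y_bar=Fraction(1)):
    """Average marginal profit over the n equally likely queue positions
    (pi_dot(0) normalized to 1)."""
    return sum(1 - Fraction(k, n) * y_hat / y_bar for k in range(1, n + 1)) / n

for n in (1, 2, 5, 10, 1000):
    y_hat = symmetric_depth(n)
    print(n, y_hat, foc_residual(n, y_hat))
```

With this linear specification the total profit is $π (y) = \dot{π} (0) (y - y^{2} / (2 \bar{y}))$ , so the zero-profit depth is $2 \bar{y}$ . The computed depth starts at $\bar{y}$ for $n = 1$ and approaches $2 \bar{y}$ as n grows, in line with the two polar cases discussed above.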

3.4. With Resolution of Queuing Uncertainty

This subsection switches back on the possibility that queuing uncertainty might be resolved in $t = 1$ (with probability $η$ ). In such a case, market makers will likely revise their limit orders:

Example 1

(continued). Recall the earlier example where $n = 2$ market makers face a linear marginal profit $\dot{π} (y) \approx \dot{π} (0) - \frac{\dot{π} (0)}{\bar{y}} y$ . For simplicity, consider also the case of equally fast market makers; that is, $β ≔ P [k = [1, 2]] = \frac{1}{2}$ . Without resolution of queuing uncertainty, following (8), the symmetric equilibrium order submission strategy is $q_{10} = q_{20} = \frac{2}{3} \bar{y}$ . The total depth at $t = 0$ is therefore $\hat{y} = \frac{4}{3} \bar{y}$ .

Putatively, suppose the market makers have submitted the above orders in $t = 0$ and then observed their queue positions in $t = 1$ . Then, the second in queue will want to revise their order: the break-even depth is $\bar{y}$ , and their contribution to the depth starts from $\frac{2}{3} \bar{y}$ and ends at $\frac{4}{3} \bar{y}$ . The latter half of this second-in-queue order, from $\bar{y}$ to $\frac{4}{3} \bar{y}$ , loses money in expectation and therefore will be canceled.

However, the above is not an equilibrium because knowing such revision opportunity in $t = 1$ , the two market makers would play more aggressively in $t = 0$ . The rest of this subsection first solves the equilibrium backward from $t = 1$ to $t = 0$ and then examines model implications.

3.4.1. Equilibrium

3.4.1.1. The Optimal Revisions at $t = 1$ .

If queuing uncertainty is not resolved, there is no revision. If resolved, a market maker i knows that there are $Q_{i}^{≺} (k)$ units of depth queued before their own order, which is of size $q_{i 0}$ . Because all depth beyond $\bar{y}$ loses money, the market maker then revises their order according to

q_{i 1} (q_{i 0}, k) = \begin{cases} q_{i 0} \text{ (no revision)}, & \text{if } Q_{i}^{≺} (k) + q_{i 0} < \bar{y}; \\ \bar{y} - Q_{i}^{≺} (k) \text{ (partial cancellation)}, & \text{if } Q_{i}^{≺} (k) \leq \bar{y} < Q_{i}^{≺} (k) + q_{i 0}; \\ 0 \text{ (full cancellation)}, & \text{if } Q_{i}^{≺} (k) > \bar{y} . \end{cases}

(9)

For convenience, the three scenarios will be referred to as the order i being “in the money” ( ${ITM}_{i}$ ), “at the money” ( ${ATM}_{i}$ ), and “out of the money” ( ${OTM}_{i}$ ), respectively.
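The revision rule (9) can be sketched in a few lines of Python. The helper names `revise` and `revise_queue`, the treatment of the knife-edge case $Q_{i}^{≺} (k) + q_{i 0} = \bar{y}$ as "no revision," and the numbers in the example (five equal orders of size $\frac{2}{5}$ against $\bar{y} = 1$ ) are all assumptions chosen for illustration.

```python
from fractions import Fraction

def revise(q0, depth_ahead, y_bar):
    """Revised size and label for one order, following rule (9)."""
    if depth_ahead > y_bar:
        return 0, "OTM"                       # full cancellation
    if depth_ahead + q0 > y_bar:
        return y_bar - depth_ahead, "ATM"     # partial cancellation
    return q0, "ITM"                          # no revision (ties kept: an assumption)

def revise_queue(sizes, y_bar):
    """Apply the rule down a realized queue; depth ahead uses the t = 0 sizes."""
    revised, ahead = [], 0
    for q0 in sizes:
        revised.append(revise(q0, ahead, y_bar))
        ahead += q0
    return revised

# Hypothetical numbers: five equal orders of 2/5 against y_bar = 1
book = revise_queue([Fraction(2, 5)] * 5, Fraction(1))
labels = [label for _, label in book]
depth_after = sum(size for size, _ in book)
print(labels, depth_after)
```

In this hypothetical queue the first two orders stay in full, the third is cut back to $\frac{1}{5}$ , and the last two are canceled, so the depth after revision is exactly $\bar{y}$ .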

For now, it is a conjecture (verified below in Proposition 2) that $\hat{y} = \sum_{i = 1}^{n} q_{i 0} \geq \bar{y}$ . Therefore, no market maker will submit new orders at $t = 1$ . Neither can they revise up their existing order sizes (see Section 2.6). The only equilibrium actions in $t = 1$ are the above cancellations by those whose orders are either ATM or OTM, and the after-revision book depth is always $\bar{y}$ , breaking even on the margin. Note that such cancellations are only triggered by the resolution of queuing uncertainty. This differs from the order revisions driven by fundamental information, for example, in Budish et al. (2015), Dugast (2018), and Bhattacharya and Saar (2021).

3.4.1.2. The Optimal Order Sizes at $t = 0$ .

Knowing their Optimal Revision Strategy (9) and taking all others’ initial order sizes $\{q_{j 0}\}_{j \neq i}$ as given, a market maker i chooses $q_{i 0}$ to solve

\max_{q_{i 0}} (1 - η) E [π (Q_{i}^{≺} (k) + q_{i 0}) - π (Q_{i}^{≺} (k))] + η E [π (Q_{i}^{≺} (k) + q_{i 1}) - π (Q_{i}^{≺} (k))] .

A pure-strategy Nash equilibrium exists, by the same argument as in the proof of Lemma 2 in the appendix.

3.4.1.3. Overshooting Is Exacerbated.

The revision opportunities push market makers to play even more aggressively at $t = 0$ :

Proposition 2

(Liquidity Overshoot with Revision). The $t = 0$ liquidity overshoots, that is, $\hat{y} = \sum_{i = 1}^{n} q_{i 0} \geq \bar{y}$ , and more so when the revision opportunity is larger: $\frac{d \hat{y}}{d η} \geq 0$ . The inequalities are strict if $P [k_{i} = 1] < 1$ for all i.

As in Proposition 1, the overshoot is strict if and only if there is no market maker who is always the first in queue. To see why the overshoot exacerbates, consider a market maker’s first-order condition, noting that $\frac{d q_{i 1}}{d q_{i 0}} = 𝟙_{{{ITM}_{i}}}$ :

\begin{array}{l} (1 - η) E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] + η E [\dot{π} (Q_{i}^{≺} (k) + q_{i 1}) \frac{d q_{i 1}}{d q_{i 0}}] = 0 \\ \Leftrightarrow (1 - η) E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] + η P [{ITM}_{i}] E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) | {ITM}_{i}] = 0 . \end{array}

(10)

Compared with the No-Resolution Benchmark (7), a new term of $η P [{ITM}_{i}] E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) | {ITM}_{i}]$ arises. It is strictly positive as long as i has nonzero probability of being ITM, thus offsetting some of the losses that would occur without resolution of queuing uncertainty. Larger $η$ means larger chances of offsetting losses, giving rise to more fierce competition and hence more severe overshoot.

Example 1

(continued). Continue with the earlier example, where $n = 2$ equally fast market makers face a linear marginal profit $\dot{π} (y) \approx \dot{π} (0) - \frac{\dot{π} (0)}{\bar{y}} y$ . The equilibrium limit order sizes are determined by the two first-order conditions following (10):

\left. \begin{array}{l} (1 - η) [\frac{1}{2} \dot{π} (q_{10}) + \frac{1}{2} \dot{π} (q_{20} + q_{10})] + η \frac{1}{2} \dot{π} (q_{10}) = 0 \\ (1 - η) [\frac{1}{2} \dot{π} (q_{20}) + \frac{1}{2} \dot{π} (q_{10} + q_{20})] + η \frac{1}{2} \dot{π} (q_{20}) = 0 \end{array} \right\} \Rightarrow q_{10} = q_{20} = \frac{2 - η}{3 - 2 η} \bar{y},

which is indeed monotone increasing in the revision opportunity $η$ . In particular, if $η ↑ 1$ , that is, if queuing uncertainty will be resolved for sure, each market maker plays $q_{i 0} = \bar{y}$ , just like a monopolist: whoever becomes first in queue takes all the profitable depth, and the second in queue cancels in full.
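The closed form in this example can be cross-checked numerically. The sketch below solves the symmetric version of the two first-order conditions by bisection, with $\dot{π} (0)$ and $\bar{y}$ both normalized to one; the function names and the tolerance are illustrative assumptions.

```python
def foc(q, eta, y_bar=1.0):
    """Symmetric first-order condition of the two-market-maker example with
    revision; pi_dot(0) is normalized to 1 and q is each maker's t = 0 size."""
    pi_dot = lambda y: 1.0 - y / y_bar
    no_resolution = 0.5 * (pi_dot(q) + pi_dot(2 * q))   # first or second in queue
    in_the_money = 0.5 * pi_dot(q)                      # only the first in queue is ITM
    return (1 - eta) * no_resolution + eta * in_the_money

def solve_q0(eta, y_bar=1.0, tol=1e-12):
    """Bisection on [y_bar / 2, y_bar], where the condition changes sign."""
    lo, hi = 0.5 * y_bar, y_bar   # foc(lo) = 1/4 > 0 and foc(hi) = -(1 - eta)/2 <= 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if foc(mid, eta, y_bar) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for eta in (0.0, 0.1, 0.5, 0.9, 1.0):
    print(eta, solve_q0(eta), (2 - eta) / (3 - 2 * eta))
```

The bracket starts at $\bar{y} / 2$ because the in-the-money term used here presumes that the two orders together exceed $\bar{y}$ , which is the overshooting case the example describes.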

3.4.2. Prediction 1: Depth Dynamics, Overshooting, Then Correction.

Proposition 2 predicts a very stylized pattern of order book depth dynamics: it overshoots and then quickly reverts. Figure 4 illustrates the pattern with $n = 5$ equally fast market makers. They each submit the same $q_{i 0} = q_{0}$ at $t = 0$ . Suppose only the first two fastest limit orders will be ITM, the third ATM, and the last two OTM. Then, at $t = 1$ , there will be three modifications, one partial and two full cancellations. These events—five additions and three cancellations—are plotted sequentially.

Figure 4. (Color online) Overshoot and Then Correction
*Notes*. This figure illustrates the book depth dynamics, initial overshooting followed by immediate correction, with $n = 5$ homogeneous market makers, each submitting at $t = 0$ a limit order of size $q_{0}$ . These orders are then randomly queued and arrive sequentially. The resulting depth at the end of the queue, beyond the marginally break-even $\bar{y}$ , is then canceled.

The pattern—clustered submissions followed by immediate cancellations—is similar in appearance to what is known as “ghost liquidity,” “fleeting orders,” and the ill-purposed “quote stuffing.” For example, in a sample of NASDAQ stocks traded in October 2004, Hasbrouck and Saar (2009) refer to those nonmarketable limit orders as “fleeting” if they are canceled within two seconds. Degryse et al. (2024) consider a limit order as a “ghost” if it is a duplicate of liquidity provision in other venues and is intended to be canceled as soon as any other duplicate is executed. Egginton et al. (2016) refer to quote stuffing as “a practice in which a large number of orders to buy or sell securities are placed and then canceled almost immediately.” Hasbrouck (2018) examines the flickering of bid-and-ask prices (but not depth) and studies short-run volatility.

This paper offers a rather innocuous explanation to such transient limit orders: they may be an equilibrium outcome because of queuing uncertainty. This innocuous view seems to echo Baruch and Glosten (2013) and Bhattacharya and Saar (2021), both arguing that “fleeting” limit orders are a natural equilibrium phenomenon. The mechanisms are, however, different. The agents coordinate to play a mixed-strategy stage game repeatedly in Baruch and Glosten (2013). The order book reshuffles upon arrivals of possibly informed orders in Bhattacharya and Saar (2021). Here, because of overshooting, cancellations spontaneously arise and concentrate only on the tails of newly formed limit order queues. A similar idea is seen in, for example, Menkveld and Zoican (2017) and Baldauf and Mollner (2022), where every limit order, except the first in queue, is immediately canceled. The current model enriches the depth dynamics by allowing endogenous limit order sizes and potential cancellations ( $η < 1$ ). Importantly, the current model does not resort to news about asset fundamentals.

3.4.3. Prediction 2: Cancellation vs. Addition Message Counts.

The with-resolution extension speaks to the cancel-to-add count ratio of limit orders. How much of all the order book messages is about cancellation? What does the proportion of cancellations reflect, and what affects it? As shown in Figure 1, the answers to these questions relate to about 60% of the TAQ data, providing a more comprehensive understanding of order book messages.

3.4.3.1. The Equilibrium Cancel-to-Add Count Ratio.

Because depth always overshoots in equilibrium (Proposition 2, assuming $P [k_{i} = 1] < 1$ $\forall i$ ), there is always an ATM limit order. Denote this ATM order’s queue position by $\bar{k}$ so that $\sum_{i} q_{i 0} 𝟙_{{k_{i} < \bar{k}}} \leq \bar{y} < \sum_{i} q_{i 0} 𝟙_{{k_{i} \leq \bar{k}}}$ . At $t = 1$ , this $\bar{k}$ -th order is partially canceled, and all those behind it are canceled in full. Therefore, the total number of cancellations is $n - \bar{k} + 1$ , and the cancel-to-add count ratio is

c / a = \frac{n - \bar{k} + 1}{n} .

(11)

In general, $\bar{k}$ is random, depending on the queue realization, and the $c / a$ ratio above is therefore also random. Such randomness makes it difficult to characterize the above ratio.

To obtain useful insights, consider a large sector of homogeneous market makers who are equally fast; that is, every limit order is equally likely to be anywhere in the queue, as in Example 2. The homogeneity allows the analysis to focus on a symmetric-strategy equilibrium, where each market maker posts the same $q_{i 0} = q_{0}$ at $t = 0$ . A “large sector” turns the attention to the limit of $n \to \infty$ . This is to circumvent a technicality: there always is an ATM limit order straddling the break-even threshold $\bar{y}$ . Its partial cancellation, as in (9), introduces an unfriendly “kink.” As $n \to \infty$ , this single ATM limit order becomes inconsequential, thus simplifying the analysis.

Lemma 3

(Cancel-to-Add Count Ratio). With a large sector of homogeneous market makers, the limit orders’ cancel-to-add count ratio

c / a \overset{a.s.}{\to} \frac{α}{1 + α} < 1, \quad \text{where } α ≔ \frac{\hat{y}}{\bar{y}} - 1 > 0 .

(12)

The endogenous $α$ reflects market makers’ liquidity supply aggressiveness. To see why, note that by symmetry, the individual equilibrium order size can be written as $q_{0} = \hat{y} / n = (1 + α) \bar{y} / n$ . If the market makers were to coordinate, they could each submit an order of size $\bar{y} / n$ to maximize their aggregate profit, or $α = 0$ , not aggressive at all. However, in equilibrium, they compete to submit larger orders, resulting in higher aggressiveness $α > 0$ .

Note that $α$ also measures the severity of liquidity overshooting. Indeed, the overshooting will be more excessive if the market makers are more aggressive. Unsurprisingly, the limit $c / a$ ratio given by the lemma is monotone in $α$ : if market makers post orders more aggressively, the more excessive liquidity overshooting will be followed by more cancellations.
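A small Python sketch illustrates the limit in Lemma 3. It takes the aggressiveness $α$ as given (in the model it is endogenous), normalizes $\bar{y} = 1$ , and evaluates the count ratio (11) when n equal-sized orders are queued; the function name is hypothetical.

```python
from fractions import Fraction

def cancel_to_add_count(n, alpha):
    """c/a of (11) when n orders of equal size q0 = (1 + alpha) * y_bar / n
    are queued; alpha > 0 is treated as an exogenous input here."""
    y_bar = Fraction(1)
    q0 = (1 + Fraction(alpha)) * y_bar / n
    # ATM position k_bar satisfies (k_bar - 1) * q0 <= y_bar < k_bar * q0
    k_bar = y_bar // q0 + 1
    return Fraction(n - k_bar + 1, n)

alpha = Fraction(1, 2)
limit = alpha / (1 + alpha)
for n in (5, 50, 5000, 10**6):
    ratio = cancel_to_add_count(n, alpha)
    print(n, ratio, float(ratio - limit))
```

For finite n the ratio differs from $\frac{α}{1 + α}$ only through the rounding induced by the single ATM order, and that gap shrinks at rate $1 / n$ .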

3.4.3.2. Comparative Statics.

To generate testable predictions regarding the $c / a$ ratio, it is useful to ask what market conditions or asset characteristics might affect it, and how. That is, given a relevant empirical measure $ζ$ , what is the model prediction of $\frac{d}{d ζ} (c / a)$ ? The following proposition states the result.

Proposition 3

(Comparative Statics of the $c / a$ Ratio). Write the aggregate expected profit of the market-making sector as $Π$ , and let it be parameterized by $ζ$ , an exogenous variable of interest. Then, the $c / a$ ratio increases with $ζ$ if and only if the partial derivative $\frac{\partial Π}{\partial ζ} > 0$ .

Intuitively, the proposition says there are more limit order cancellations (relative to additions) if and only if the market-making sector becomes more profitable; that is, $\frac{\partial Π}{\partial ζ} > 0$ . Such higher profitability induces market makers to play more aggressively (higher $α$ ), and as a result, there is more overshooting and also more correction.

3.4.3.3. Adverse Selection as an Example.

More severe adverse selection should reduce market-making profitability $Π$ , thus resulting in a lower cancel-to-add count ratio $c / a$ according to Proposition 3. Figure 5(a) graphically illustrates the model predictions. The (blue) solid line (left axis) shows that the equilibrium $c / a$ ratio indeed decreases as adverse selection—the investor’s private signal precision $τ_{ε}$ —increases (horizontal axis). The (red) dashed line (right axis) shows that although $\frac{\partial Π}{\partial τ_{ε}}$ increases with $τ_{ε}$ , it remains negative throughout, ranging from $- 2.8$ to $- 1.5$ .

Figure 5. (Color online) Adverse Selection and the Cancel-to-Add Count Ratio, $c / a$
*Notes*. This figure illustrates how adverse selection, represented by the investor’s private signal precision $τ_{ε} \in [1.1, 1.3]$ (horizontal axis), affects the equilibrium $c / a$ ratio (the solid line, left axis). Also plotted is the partial derivative $\frac{\partial Π}{\partial τ_{ε}}$ , which is negative throughout (the dashed line, right axis). The other parameters are set at $a = 5.0$ , $τ_{V} = 0.3$ , $τ_{U} = 2.0$ , $ρ = 1.0$ , and $η = 0.1$ .

3.4.4. Prediction 3: Cancellation Size vs. Addition Size.

One might also be interested in the size of limit order cancellations. Are they larger or smaller than the additions? What does the size difference imply? What market conditions drive the differences?

3.4.4.1. The Equilibrium Cancel-to-Add Size Ratio.

The submissions at $t = 0$ result in an aggregate depth (before revision) of $\hat{y}$ . Because there are n market makers, the average addition size is simply $\hat{y} / n$ . Recall from Section 3.4.3 that the number of cancellations, partial or full, is $n - \bar{k} + 1$ , where $\bar{k}$ is the queue position of the ATM order. The total depth canceled is $\hat{y} - \bar{y}$ , and the average per-cancellation size is $(\hat{y} - \bar{y}) / (n - \bar{k} + 1)$ . Therefore, the cancel-to-add size ratio $C / A$ —not to be confused with the lower case $c / a$ count ratio—is

C / A = \frac{(\hat{y} - \bar{y}) / (n - \bar{k} + 1)}{\hat{y} / n} = (1 - \frac{\bar{y}}{\hat{y}}) \frac{n}{n - \bar{k} + 1} = \frac{α}{1 + α} \frac{1}{c / a},

(13)

where the last equality uses the definitions of the $c / a$ ratio as in (11) and the aggressiveness $α$ as in (12).

As a quick check, consider the special case of a large sector of homogeneous market makers. As $n \to \infty$ , the count ratio $c / a \to \frac{α}{1 + α}$ by Lemma 3 and, therefore, the size ratio $C / A \to 1$ . Indeed, as the homogeneous market makers all submit the same $q_{i 0}$ at $t = 0$ , the cancellation size should also be $q_{i 0}$ except for the partial cancellation of the ATM order, which becomes inconsequential in a large market-making sector.
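The same quick check can be run numerically. The sketch below evaluates the first expression in (13) for n equal-sized orders, again treating $α$ as an exogenous input and normalizing $\bar{y} = 1$ ; the function name is an illustrative assumption.

```python
from fractions import Fraction

def cancel_to_add_size(n, alpha):
    """C/A of (13) for n equal-sized orders with total depth (1 + alpha) * y_bar."""
    y_bar = Fraction(1)
    y_hat = (1 + Fraction(alpha)) * y_bar
    q0 = y_hat / n
    k_bar = y_bar // q0 + 1                   # queue position of the ATM order
    n_cancel = n - k_bar + 1                  # one partial plus the full cancellations
    avg_cancel = (y_hat - y_bar) / n_cancel   # average size per cancellation
    avg_add = y_hat / n                       # average size per addition
    return avg_cancel / avg_add

for n in (5, 50, 5000, 10**6):
    print(n, float(cancel_to_add_size(n, Fraction(1, 2))))
```

With $α = \frac{1}{2}$ and $n = 5$ , the partial cancellation of the ATM order pulls the average cancellation size down and the ratio is $\frac{5}{6}$ ; as n grows, the ratio approaches one from below.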

3.4.4.2. Speed Heterogeneity and the Cancel-to-Add Size Ratio.

In reality, however, not all market makers are identical. An important dimension, in the context of queuing uncertainty, is their speed. To reflect such differences and enrich the model predictions regarding the $C / A$ ratio, consider two groups of market makers, “F” for fast and “S” for slow. Specifically, there are $ψ_{F} n$ fast market makers and $ψ_{S} n$ slow ones, where $ψ_{F} = 1 - ψ_{S} \in (0, 1)$ . Market makers within each group are homogeneous (equally fast) and have the same latency distribution. That is, a limit order submitted by market maker i from group $j \in \{F, S\}$ is processed by the exchange with latency $t_{i}$ , which is drawn i.i.d. from the cumulative distribution function (c.d.f.) $G_{j} (t_{i})$ . For the labels of “F” and “S” to be meaningful, let $G_{S} (\cdot)$ first-order stochastically dominate $G_{F} (\cdot)$ . The stochastic dominance means that a fast market maker is strictly more likely to draw a lower latency than a slow one (see Supplementary Appendix S.2). Assume that both $G_{j} (\cdot)$ are everywhere differentiable so that there are almost surely no ties. Normalize the supports for both $G_{j} (\cdot)$ to be $t \in (0, 1)$ . This way, all limit orders submitted at $t = 0$ will have been processed by $t = 1$ , consistent with the timeline sketched in Figure 2. The analysis shall focus on symmetric strategies; that is, all market makers within the same group j submit limit orders of the same size $q_{j}$ at $t = 0$ .

Proposition 4

(Cancel-to-Add Size Ratio). With a large sector of fast and slow market makers, the $C / A$ ratio (i) is below one and (ii) is higher with more fast market makers; that is, $\frac{d}{d ψ_{F}} (C / A) > 0$ .

The intuition is as follows. Because of the top-of-queue advantage, the fast market makers will play more aggressively than the slow ones. That is, the faster limit orders are not only more likely to be at the top of the queue but also larger in size. Equivalently, the slow orders are smaller, concentrate in the rear of the queue, and, therefore, constitute the bulk of the cancellations. Therefore, the cancel-to-add size ratio is lower than one.

When there are more fast market makers (larger $ψ_{F}$ ), they compete more fiercely within group F, and every fast limit order becomes smaller. On the other hand, the fewer slow market makers compete less, and every slow order becomes larger. The size ratio of slow orders over fast ones then increases with $ψ_{F}$ . In other words, most of the canceled orders (the slow ones) become relatively larger.

3.4.4.3. Measuring Market-Making Activeness.

Proposition 4 suggests that the $C / A$ ratio could serve as a proxy for the “activeness” of the fast or high-frequency market makers. Figure 6 illustrates this prediction. As the fraction of fast market makers $ψ_{F}$ increases, the $C / A$ ratio monotonically increases (solid line, left axis), whereas the average limit order latency $E [t_{i}]$ drops (dashed line, right axis).

Figure 6. (Color online) Speed and the Cancel-to-Add Size Ratio, $C / A$
*Notes*. This figure illustrates Proposition 4 by varying the fraction of fast market makers $ψ_{F} \in (0, 1)$ on the horizontal axis. The left axis plots the equilibrium $C / A$ ratio (the solid line), whereas the right axis plots the average limit order latency $E [t_{i}]$ (the dashed line). The other parameters used in this illustration are $a = 5.0$ , $τ_{V} = 0.3$ , $τ_{U} = 2.0$ , $τ_{ε} = 1.1$ , $ρ = 1.0$ , and $η = 0.1$ . The latency distributions are $G_{F} (t) = \sqrt{t}$ and $G_{S} (t) = t$ , where $t \in (0, 1)$ .
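To see what the speed difference means in terms of queue positions, the sketch below takes the fast group’s latency c.d.f. to be $\sqrt{t}$ and the slow group’s to be $t$ on $(0, 1)$ , an assumption consistent with the required stochastic dominance. It computes by a midpoint rule the probability that a fast order is processed before a slow one and the two mean latencies; the helper names are hypothetical, and the sketch does not solve for the equilibrium $C / A$ ratio itself.

```python
def integrate(f, n=100_000):
    """Midpoint rule on (0, 1)."""
    h = 1.0 / n
    return sum(f((i + 0.5) * h) for i in range(n)) * h

G_F = lambda t: t ** 0.5   # assumed latency c.d.f. of the fast group
G_S = lambda t: t          # assumed latency c.d.f. of the slow group

# P[fast order processed before slow order] = integral of G_F(t) dG_S(t),
# and dG_S(t) = dt for the uniform slow group.
p_fast_first = integrate(G_F)

# For a latency supported on (0, 1), E[t] = integral of (1 - G(t)) dt.
mean_fast = integrate(lambda t: 1 - G_F(t))
mean_slow = integrate(lambda t: 1 - G_S(t))

def mean_latency(psi_F):
    """Average latency across all limit orders when a share psi_F is fast."""
    return psi_F * mean_fast + (1 - psi_F) * mean_slow

print(p_fast_first, mean_fast, mean_slow)
print([round(mean_latency(p), 4) for p in (0.1, 0.5, 0.9)])
```

Under these assumed distributions a fast order beats a slow one with probability $\frac{2}{3}$ , and the average latency falls linearly from $\frac{1}{2}$ toward $\frac{1}{3}$ as $ψ_{F}$ rises, matching the downward-sloping latency line described for Figure 6.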

4. Welfare

Is the liquidity overshoot generated by queuing uncertainty socially good or bad? This section examines the model implications for welfare and discusses the related market design implications.

4.1. Welfare, Liquidity Supply, and Queuing Uncertainty

Welfare is the sum of (i) the investor’s expected trading gain, and (ii) the total expected profit of the n market makers. To derive (i), note that for any given depth y, the investor expects⁹

g (y) ≔ E [(z - a) x (z; y) - \frac{ρ}{2} x {(z; y)}^{2}],

(14)

where $x (z; y)$ is the investor’s optimal market order size solved in (3). For (ii), the aggregate market-making profit is $π (y)$ and can be evaluated by integrating over $\dot{π} (y)$ , or more directly,

π (y) = E [(a - V) x (z; y)] .

(15)

In equilibrium, the depth is either the endogenously submitted $\hat{y}$ when, with probability $1 - η$ , there is no resolution of queuing uncertainty, or the marginally break-even $\bar{y}$ when, with probability $η$ , there is resolution. Therefore, welfare can be written as

\begin{array}{rl} w (\hat{y}) & = (1 - η) (g (\hat{y}) + π (\hat{y})) + η (g (\bar{y}) + π (\bar{y})) \\ & = (1 - η) E [(z - V) x (z; \hat{y}) - \frac{ρ}{2} x (z; \hat{y})^{2}] + η E [(z - V) x (z; \bar{y}) - \frac{ρ}{2} x (z; \bar{y})^{2}] . \end{array}

(16)

Note that welfare w is written as a function of the endogenous no-resolution depth $\hat{y}$ . This is because queuing uncertainty only affects $\hat{y}$ —the more severe is queuing uncertainty, the larger is $\hat{y}$ (as illustrated in Examples 1 and 2). Therefore, to explore the effect of queuing uncertainty on welfare, the analysis below simply varies $\hat{y}$ ( $\geq \bar{y}$ ) as if it is an exogenous variable.¹⁰

Proposition 5

(Welfare as a Function of the Aggregate Depth). As $\hat{y}$ increases from $\bar{y}$ , welfare $w (\cdot)$ is initially increasing, reaching its maximum at some $y^{*}$ ( $> \bar{y}$ ), and eventually decreasing.

Proposition 5 notes that there might be times when too much liquidity—order book depth—hurts welfare ( $\dot{w} < 0$ locally). Figure 7(a) illustrates an example of such inefficient liquidity overshoot (when $\hat{y} > y^{*} > \bar{y}$ ). The discussion below explains this perhaps surprising result in two steps: the source of the potential welfare loss and how that relates to queuing uncertainty.

Figure 7. (Color online) Queuing Uncertainty, Liquidity (Depth), and Welfare
*Notes*. (a) The shape of welfare w (solid line) and the aggregate market-making profit $π$ (dashed line) as functions of the no-revision depth $\hat{y}$ (which is varied by exogenously changing the severity of queuing uncertainty). (b) How, in equilibrium, the no-revision depth $\hat{y}$ and welfare w respond to the amount of queuing uncertainty. For both panels, the parameters are set at $a = 5.0$ , $τ_{V} = 0.3$ , $τ_{U} = 2.0$ , $τ_{ε} = 1.1$ , $ρ = 1.0$ , and $η = 0.1$ . In (b), the severity of queuing uncertainty is captured by the number $n \in \{1, \dots, 10\}$ of equally fast market makers (as in Example 2).

4.1.1. How Welfare Loss Arises.

Only the investor’s private-value trading motive, which is to hedge against the inventory shock U, possibly creates welfare gain.¹¹ If a social planner observes U and can prescribe the market order size x, she would minimize the investor’s inventory cost $(ρ / 2) {(U + x)}^{2}$ by setting $x = - U$ , subject to $x \geq 0$ (because the model focuses on the ask side). That is, the planner would like the investor to buy only $- U$ units from the market makers.

But the investor has, in addition to hedging, another motive: to speculate on their private signal S. (Indeed, as shown in (2), the investor’s marginal valuation is $z = θ S - ρ U$ .) As such, when they buy, the combined motive might push them to buy too much so that $x > - U$ . This happens in particular when S is very large. The resulting inventory cost then turns into welfare loss.

4.1.2. The Role of Queuing Uncertainty.

For the investor to buy “too much,” the depth $\hat{y}$ must be sufficiently large. This is where queuing uncertainty comes in: it encourages market makers’ competition and results in liquidity overshoot (Proposition 1). Figure 7(b) illustrates how more severe queuing uncertainty (larger n, as in Example 2) exacerbates liquidity overshoot (the dashed line, right axis) and possibly hurts welfare (the solid line, left axis).
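The interaction between speculation and depth can be illustrated with a minimal sketch. It assumes, purely for illustration, that the investor buys at a single ask price until the marginal valuation $z - ρ x$ falls to the ask or the available depth runs out; this demand rule is inferred from the inventory cost and the integration thresholds used in the appendix, and the function names and all numerical values are hypothetical.

```python
def planner_order(u):
    """Planner's hedging-only purchase: x = -U, subject to x >= 0."""
    return max(-u, 0.0)


def investor_order(s, u, depth, theta=0.5, rho=1.0, ask=1.0):
    """Assumed demand rule: buy until z - rho * x = ask, capped by the depth.

    theta, rho, and ask are hypothetical values chosen for illustration.
    """
    z = theta * s - rho * u  # marginal valuation z = theta * S - rho * U
    desired = max((z - ask) / rho, 0.0)
    return min(desired, depth)


u, s = -1.0, 5.0  # inventory shock (hedging need of 1 unit) and a large signal
for depth in (1.0, 2.0):
    x = investor_order(s, u, depth)
    print(depth, x, planner_order(u), x > planner_order(u))
```

With the shallow book the investor's purchase coincides with the planner's hedging quantity, whereas the deeper book lets the speculative motive push the purchase beyond $- U$ .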

In particular, note that Proposition 5 emphasizes that the welfare-maximizing depth $y^{*}$ is larger than market makers’ marginally break-even $\bar{y}$ . Hence, for welfare losses to occur ( $\dot{w} < 0$ ), market makers must be losing money ( $\dot{π} < 0$ ), which must be because of severe adverse selection. This observation first echoes the above discussion that the investor must be speculating on S when welfare losses occur. Second, it underscores why queuing uncertainty is necessary: when queuing is certain, that is, when $P [k_{i} = 1] = 1$ for some i, the equilibrium depth is $\hat{y} = \bar{y}$ (Propositions 1 and 2), and welfare loss cannot arise at all in this case.

To sum up, the model reveals a novel detrimental effect of too much liquidity (depth), driven by queuing uncertainty: it creates liquidity overshoot, which, if severe enough, allows the investor to speculate too much. Such speculation-driven transfers can be inefficient from a social planner’s point of view.

4.2. Market Design Implications

Many real-world market designs affect queuing uncertainty and, hence, the equilibrium liquidity supply and welfare. Notably, various forms of “speed bumps” have emerged in recent years. Whereas the exact implementations differ, such “sand in the wheels” has often been viewed as a deterrent against excessively fast trading (and the arms race behind it; see, e.g., Baldauf and Mollner 2020, Brolley and Cimon 2020, Khapko and Zoican 2021). The novel welfare mechanism above sheds new light on such market designs.

4.2.1. Speed Bumps Targeting Limit Orders.

Not all speed bumps are the same. Consider first the effect of those targeting limit orders, directly affecting their queuing. For example, in foreign exchange markets, EBS implements a “latency floor” that randomizes the sequences of orders by different participants (CME Group 2023). The aim is to curb the ultrafast participants’ speed advantages, effectively increasing queuing uncertainty.

The current model cautions that such queue-randomizing speed bumps must be calibrated with great care. As seen from Example 1, the equilibrium depth increases with queuing uncertainty, but a deeper order book does not necessarily mean better welfare, as seen in Proposition 5. In particular, if the initial level of queuing uncertainty is already too high, that is, if the order book depth is already very large ( $\hat{y} \geq y^{*}$ ), then queue randomization further increases $\hat{y}$ and hurts welfare. Even if there is room for welfare improvement (if $\hat{y} < y^{*}$ ), injecting too much queuing uncertainty can still hurt because the excessive liquidity overshoot might indulge too much risk taking by speculative traders. Figure 7(b) illustrates such scenarios: whereas the equilibrium depth (right axis) always increases, welfare (left axis) is hump-shaped, peaking at some modest amount of queuing uncertainty.

4.2.2. Speed Bumps Targeting Market Orders.

Many marketplaces impose speed bumps that only slow down market orders (see, e.g., Khapko and Zoican 2021, table 1). Such speed bumps effectively give market makers more time to revise their orders, that is, a higher $η$ . How is market quality, and welfare in particular, affected?

The current model framework can shed light on this question. From (16), it can be seen that an increase in $η$ (a slowdown of market orders) has two effects on welfare:

\underbrace{[(g (\bar{y}) + π (\bar{y})) - (g (\hat{y}) + π (\hat{y}))]}_{\text{(1) direct effect}} + \underbrace{(1 - η) [\dot{g} (\hat{y}) + \dot{π} (\hat{y})] \frac{d \hat{y}}{d η}}_{\text{(2) indirect effect}} .

With a larger $η$ , market orders arrive more slowly, and there is more time to cancel the overshooting limit orders. The direct welfare effect reflects the reduction of depth from the overshooting $\hat{y}$ to the monopolist level $\bar{y}$ . Intuitively, this direct effect is negative because the minimum depth $\bar{y}$ is as if supplied by a monopolist, constraining the possible gains from trade.

The indirect effect arises from how $η$ boosts the no-revision depth $\hat{y}$ : $\frac{d \hat{y}}{d η} > 0$ (Proposition 2). Such a boost happens only with probability $(1 - η)$ and improves welfare only if $\dot{w} (\hat{y}) = \dot{g} (\hat{y}) + \dot{π} (\hat{y}) > 0$ . However, for $η$ large enough, this indirect effect becomes negligible (as $1 - η \to 0$ ). Then, the detrimental direct effect dominates.
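The two effects can be evaluated numerically in a rough sketch. It uses the closed forms for $\dot{π}$ and $\dot{w}$ under a normally distributed z that appear in the appendix, but every parameter value, the overshooting depth `Y_HAT`, and the slope `DY_DETA` are hypothetical inputs held fixed as $η$ varies, which the equilibrium would not do. It is therefore only an illustration of how the indirect effect fades as $η \to 1$ .

```python
import math

# Hypothetical inputs: none of these values come from the paper.
A, RHO, PHI, VAR = 1.0, 1.0, 0.6, 1.0
Y_HAT, DY_DETA = 0.6, 3.0  # assumed overshooting depth and slope d y_hat / d eta


def tail(c):
    """Return (1 - F(c), f(c)) for z ~ N(0, VAR)."""
    survival = 0.5 * math.erfc(c / math.sqrt(2.0 * VAR))
    density = math.exp(-c * c / (2.0 * VAR)) / math.sqrt(2.0 * math.pi * VAR)
    return survival, density


def pi_dot(y):
    """Marginal profit: a (1 - F(c)) - phi var[z] f(c), with c = a + rho y."""
    survival, density = tail(A + RHO * y)
    return A * survival - PHI * VAR * density


def w_dot(y):
    """Marginal welfare: (1 - phi) var[z] f(c) - rho y (1 - F(c))."""
    survival, density = tail(A + RHO * y)
    return (1.0 - PHI) * VAR * density - RHO * y * survival


def simpson(fn, lo, hi, steps=1000):
    """Composite Simpson rule on [lo, hi]; steps must be even."""
    h = (hi - lo) / steps
    total = fn(lo) + fn(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * fn(lo + i * h)
    return total * h / 3.0


# Break-even depth y_bar solves pi_dot(y_bar) = 0 (bisection).
lo, hi = 0.0, 10.0
while hi - lo > 1e-12:
    mid = 0.5 * (lo + hi)
    lo, hi = (mid, hi) if pi_dot(mid) > 0 else (lo, mid)
y_bar = 0.5 * (lo + hi)

# Direct effect: welfare at y_bar minus welfare at y_hat.
direct = -simpson(w_dot, y_bar, Y_HAT)


def indirect(eta):
    """Indirect effect: (1 - eta) * w_dot(y_hat) * d y_hat / d eta."""
    return (1.0 - eta) * w_dot(Y_HAT) * DY_DETA


for eta in (0.0, 0.5, 0.9, 0.99):
    print(eta, direct, indirect(eta), direct + indirect(eta))
```

Under these assumed inputs the direct effect is negative for every $η$ , while the indirect effect shrinks linearly in $1 - η$ , so the sum turns negative once $η$ is large enough.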

Corollary 1

(Market-Order-Only Speed Bumps). A speed bump slowing down market orders hurts welfare if it raises $η$ , thus allowing limit order revision too often. That is, there exists an $η^{*} \in [0, 1)$ such that the Welfare Expression (16) monotonically decreases with $η$ if $η > η^{*}$ .

The corollary cautions against such market-order speed bumps. The conventional wisdom suggests that with market orders slowing down, market makers face less adverse selection and will compete to narrow bid-ask spreads—improving liquidity. This has been a focal point of many empirical studies, such as Chen et al. (2017) and Hu (2019). Such a channel is switched off in the current model, as $η$ does not affect limit order profitability $π (\cdot)$ . Instead, complementing this liquidity-enhancing (spread-narrowing) view of speed bumps, Corollary 1 unveils a novel negative liquidity channel of depth reduction: they give more time to revise and cancel the overshooting limit orders. The current model, therefore, emphasizes the importance of empirically examining, and distinguishing, the different measures of liquidity (spread versus depth) when studying speed bumps.

5. Conclusion

This paper introduces queuing uncertainty to a standard limit order book model. When submitting limit orders, market makers face queuing uncertainty, which strengthens their competition and results in liquidity—book depth—overshoot in equilibrium.

The model enriches the understanding of order book depth dynamics. Specifically, the model predicts a stylized pattern: the book depth first shoots up and, after peaking, quickly lowers to the marginally break-even level. Further, the model predicts that the cancel-to-add count ratio and the size ratio relate to, respectively, the severity of adverse selection and the amount of high-frequency market-making activity. To this extent, this paper answers the call from O’Hara (2015) and pushes forward the understanding and utilization of modern microstructure data.

Further, the model shows that the overshooting liquidity might hurt welfare by allowing the liquidity demander to engage in excessive risky speculation. Queuing uncertainty thus underscores a novel channel through which limit order book depths affect welfare nonmonotonically. This novel angle sheds new light on market design issues such as speed bumps.

Acknowledgments

The author thanks the editor (Agostino Capponi), an associate editor, and two referees for their constructive feedback and comments. Additionally, this paper benefits tremendously from very helpful discussions with Shmuel Baruch, Hans Degryse, Jérôme Dugast, Thierry Foucault, Sergei Glebkin, Terrence Hendershott, Andrei Kirilenko, Roman Kozhan, John Kuong, Katya Malinova, Albert Menkveld, Sophie Moinas, Artem Neklyudov, Andreas Park, Christine Parlour, Barbara Rindi, Ioanid Roşu, Duane Seppi, Mark van Achter, Vincent van Kervel, Kumar Venkataraman, Chen Yao, Mao Ye, Haoxiang Zhu, and the seminar and conference participants at the University of Gothenburg, Rotterdam School of Management, Aalto University Business School, INSEAD, IHS Vienna, Paris-Dauphine, China International Conference in Finance (Chengdu), Western Finance Association meeting (Monterey), Frontiers of Finance (Warwick), 17th SGF (Zürich), 11th Paris Finance Meeting, and 2013 NBER Market Microstructure Meeting. There are no competing financial interests that might be perceived to influence the analysis, the discussion, and/or the results of this article.

Appendix. Proofs

Proof of Lemma 1.

From (4), at $y = 0$ , $\dot{π} (0) = \int_{a}^{\infty} (a - ϕ z) f (z) d z = P [z > a] (a - ϕ E [z | z > a]) = (1 - F (a)) a - ϕ var [z] f (a)$ , where the last equality follows from the mean of a truncated normal distribution. Hence, $sign [\dot{π} (0)] = sign [a r (a) - ϕ var [z]]$ , where $r (a) ≔ (1 - F (a)) / f (a)$ is the Mills ratio (or the reciprocal of the hazard rate). It is a known property that the product $a r (a)$ is monotone increasing in $a \in [0, \infty)$ from zero to $var [z]$ (e.g., Pinelis 2002, proposition 1.2). Therefore, there exists a unique threshold $a^{*} > 0$ defined by (noting that $ϕ var [z] = cov [V, z] = θ / τ_{V}$ )

\frac{1 - F (a^{*})}{f (a^{*})} a^{*} = \frac{θ}{τ_{V}},

(A.1)

such that $\dot{π} (0) > 0$ for all $a > a^{*}$ .
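As a numerical illustration of (A.1), the threshold $a^{*}$ can be found by bisection, because $a r (a)$ is monotone increasing. The sketch below assumes $z \sim N (0, var [z])$ and takes hypothetical values for $ϕ$ and $var [z]$ (so that the right-hand side $θ / τ_{V} = ϕ var [z]$ is 0.6); the function names are illustrative only.

```python
import math


def mills_ratio(a, var):
    """r(a) = (1 - F(a)) / f(a) for z ~ N(0, var)."""
    sd = math.sqrt(var)
    survival = 0.5 * math.erfc(a / (sd * math.sqrt(2.0)))
    density = math.exp(-a * a / (2.0 * var)) / (sd * math.sqrt(2.0 * math.pi))
    return survival / density


def ask_threshold(phi, var, tol=1e-10):
    """Bisection for a* solving a * r(a) = phi * var.

    The search is limited to [0, 10 sd], which covers phi up to about 0.99
    and avoids floating-point underflow in the normal tail.
    """
    target = phi * var
    lo, hi = 0.0, 10.0 * math.sqrt(var)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mid * mills_ratio(mid, var) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical values: phi = 0.6 and var[z] = 1.
a_star = ask_threshold(phi=0.6, var=1.0)
print(a_star, a_star * mills_ratio(a_star, 1.0))
```

Any ask price above the returned value then satisfies $\dot{π} (0) > 0$ for these assumed inputs.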

Next, consider the existence and the uniqueness of $\bar{y}$ that satisfies $\dot{π} (\bar{y}) = 0$ . Note that the integrand $(a - ϕ z) f (z)$ in $\dot{π} (\cdot)$ drops below zero and stays strictly negative for sufficiently large z. Therefore, for sufficiently large y, $\dot{π} (y) < 0$ . Because $\dot{π} (0) > 0$ for $a > a^{*}$ , there exists at least one $\bar{y} > 0$ . To see the uniqueness, note that $0 = \dot{π} (\bar{y}) = \int_{a + ρ \bar{y}}^{\infty} (a - ϕ z) f (z) d z < ((1 - ϕ) a - ϕ ρ \bar{y}) (1 - F (a + ρ \bar{y}))$ , where the inequality follows because in the range of integration, $z > a + ρ \bar{y}$ . This implies that $(1 - ϕ) a - ϕ ρ \bar{y} > 0$ . Note that $\ddot{π} (y) = - ρ \cdot ((1 - ϕ) a - ϕ ρ y) f (a + ρ y)$ . Hence, $sign [\ddot{π} (\bar{y})] = - sign [(1 - ϕ) a - ϕ ρ \bar{y}] < 0$ . That is, at any $y = \bar{y}$ , $π (y)$ is strictly concave (equivalently, $\dot{π} (y)$ is strictly decreasing), and therefore, $\dot{π} (y)$ crosses zero once and only once (from above).

Note from the above that $sign [\ddot{π} (y)] = - sign [(1 - ϕ) a - ϕ ρ y]$ . The above has established that, for all $y \in [0, \bar{y}]$ , $0 \leq \dot{π} (y) < ((1 - ϕ) a - ϕ ρ y) (1 - F (a + ρ y))$ . Hence, on this range, $\ddot{π} (y) < 0$ .

Finally, consider the limit of the total profit $π (y)$ : $π_{\infty} ≔ \lim_{y \to \infty} π (y) = \int_{a}^{\infty} - \frac{1}{ρ} (ϕ z - a) (z - a) f (z) d z$ . Recall that $ϕ = \frac{θ}{τ_{V} var [z]}$ and $f (z) = \frac{1}{\sqrt{2 π var [z]}} e^{- z^{2} / (2 var [z])}$ , where $var [z] = \frac{θ}{τ_{V}} + \frac{ρ^{2}}{τ_{U}}$ . That is, $π_{\infty}$ is a function of $τ_{U}$ and satisfies $\frac{d π_{\infty}}{d τ_{U}} = - \frac{(1 - ϕ) ρ a}{2 τ_{U}^{2}} f (a) < 0$ . That is, $π_{\infty}$ is monotone decreasing in $τ_{U}$ . Also,

\lim_{τ_{U} \to 0} π_{\infty} = \int_{a}^{\infty} \frac{a}{ρ} (z - a) f (z) d z > 0 > \lim_{τ_{U} \to \infty} π_{\infty} = \int_{a}^{\infty} - \frac{1}{ρ} {(z - a)}^{2} f (z) d z .

Therefore, there exists a unique $τ_{U}^{*} \in (0, \infty)$ that solves

\lim_{y \to \infty} π (y) = \int_{a}^{\infty} - \frac{1}{ρ} (ϕ (τ_{U}) z - a) (z - a) f (z; τ_{U}) d z = 0,

(A.2)

so that $π_{\infty} < 0$ if and only if $τ_{U} > τ_{U}^{*}$ . Therefore, as long as $τ_{U} > τ_{U}^{*}$ , there exists $\bar{\bar{y}} \in (0, \infty)$ such that $π (\bar{\bar{y}}) = 0$ . Also, because $\dot{π} (\cdot)$ crosses zero from above only once, $π (\cdot)$ peaks at $\bar{y}$ , and so, $\bar{\bar{y}} > \bar{y}$ . □

Proof of Lemma 2.

This proof applies to the general case with $η \in [0, 1]$ . First, recall from Lemma 1 that there exists a unique $\bar{y} > 0$ such that $\dot{π} (\bar{y}) = 0$ ; that is, the $\bar{y}$ -th marginal order breaks even. Second, no market maker will submit a limit order larger than $\bar{y}$ because the part exceeding $\bar{y}$ always loses in expectation ( $\dot{π} (y) < 0$ for all $y > \bar{y}$ ). This confines each market maker’s strategy space to $[0, \bar{y}]$ . Therefore, all best response correspondences can be summarized as a vector-valued function ${[0, \bar{y}]}^{(n - 1)} \mapsto {[0, \bar{y}]}^{n}$ , which is nonempty and compact. The convexity of the value set follows the differentiability of $π (\cdot)$ . Then, by Kakutani’s fixed-point theorem, there exists at least one fixed point that solves all market makers’ optimization problems.

Now, consider the First-Order Condition (7). In equilibrium, its left-hand side, which is the first-order derivative of market maker i, must be nonpositive. If it were strictly negative for all i, then $q_{i 0} = 0$ for all i, which cannot be the case in equilibrium because $\dot{π} (0) > 0$ following Lemma 1. □

Proof of Proposition 1.

The proof proceeds in three steps: (1) to prove that $\hat{y} \geq \bar{y}$ always holds in equilibrium; (2) to prove that if the inequality is strict, then no market maker can be almost surely first in queue; and (3) to prove that if no market maker is almost surely first in queue, then the inequality is strict.

To prove $\hat{y} \geq \bar{y}$ : Suppose the opposite holds; that is, $\hat{y} = \sum_{i} q_{i 0} < \bar{y}$ . Recall from Lemma 1 that $\dot{π} (y) > 0 = \dot{π} (\bar{y})$ for all $y \in [0, \hat{y})$ , and $\dot{π} (y)$ is strictly decreasing on $y \in (0, \hat{y}]$ . Note that for any queue realization $k$ , $Q_{i}^{≺} (k) + q_{i 0} = \sum_{j = 1}^{n} q_{j 0} 𝟙_{{k_{j} < k_{i}}} \leq \sum_{j}^{n} q_{j 0} = \hat{y}$ . Therefore, $\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) > 0$ for any realized $k$ , and so, any market maker i’s first-order derivative $E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] > 0$ . They can therefore always improve their expected profit by raising $q_{i 0}$ , making it no longer an equilibrium. Thus, the opposite must be true; that is, $\hat{y} \geq \bar{y}$ , and $\dot{π} (\hat{y}) \leq 0$ .
To prove $\hat{y} > \bar{y}$ $\Rightarrow$ $P [k_{i} = 1] < 1$ for all i: If any market maker i is almost surely first in queue, that is, if $P [k_{i} = 1] = 1$ for some i, their first-order derivative becomes $E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] = \dot{π} (q_{i 0})$ , and Lemma 1 ensures that they always choose $q_{i 0} = \bar{y}$ . Knowing this, all other market makers choose $q_{j 0} = 0$ for $j \neq i$ . This leads to $\hat{y} = \bar{y}$ , contradicting $\hat{y} > \bar{y}$ . Therefore, given that $\hat{y} > \bar{y}$ , there can be no $P [k_{i} = 1] = 1$ for any i, or equivalently, $P [k_{i} = 1] < 1$ for all i.
To prove the other direction $P [k_{i} = 1] < 1$ for all i $\Rightarrow$ $\hat{y} > \bar{y}$ : Note that given $P [k_{i} = 1] < 1$ , no market maker will post $q_{i 0} = \bar{y}$ . If i did so, their first-order condition $E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] = 0$ would hold only if all others $j \neq i$ post $q_{j 0} = 0$ . But that cannot be true because if it were true, for any $j \neq i$ , their first-order derivative would be
$E [\dot{π} (Q_{j}^{≺} (k) + q_{j 0})] = \underbrace{P [k_{i} = 1] E [\dot{π} (\bar{y}) | k_{j} = 1]}_{= 0} + \underbrace{(1 - P [k_{i} = 1]) E [\dot{π} (0) | k_{j} > 1]}_{> 0} > 0,$

disproving the equilibrium. Therefore, when $P [k_{i} = 1] < 1$ for all i, $q_{i 0} < \bar{y}$ for all i.

Then, let us assume that, opposite to the statement, $\hat{y} \leq \bar{y}$ . That is, for any queue realization $k$ , $Q_{i}^{≺} (k) + q_{i 0} \leq \sum_{j}^{n} q_{j 0} = \hat{y} \leq \bar{y}$ , and hence, $\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) \geq 0$ . Note that there must exist at least one market maker i for whom $P [k_{i} = 1] > 0$ , and their first-order derivative

\begin{array}{l} E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0})] = P [k_{i} = 1] E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) | k_{i} = 1] + (1 - P [k_{i} = 1]) E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) | k_{i} > 1] \\ = \underbrace{P [k_{i} = 1] \dot{π} (q_{i 0})}_{> 0} + \underbrace{(1 - P [k_{i} = 1]) E [\dot{π} (Q_{i}^{≺} (k) + q_{i 0}) | k_{i} > 1]}_{\geq 0} > 0, \end{array}

where the first component is strictly positive because the previous paragraph has established that $q_{i 0} < \bar{y}$ and because Lemma 1 has shown that $\dot{π} (y) > 0$ for all $y \in [0, \bar{y})$ . This market maker can therefore always improve their expected profit by raising $q_{i 0}$ , thus disproving $\hat{y} \leq \bar{y}$ . Thus, $P [k_{i} = 1] < 1$ for all i $\Rightarrow$ $\hat{y} > \bar{y}$ must be true. □
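The overshoot in Proposition 1 can be checked numerically in a simple special case. The sketch below assumes a symmetric equilibrium with $η = 0$ in which each of n market makers posts the same size q and is equally likely to land at any queue position, so that the first-order condition reads $\frac{1}{n} \sum_{k = 1}^{n} \dot{π} (k q) = 0$ . The closed form of $\dot{π}$ follows from the normal distribution of z; all parameter values and function names are hypothetical.

```python
import math

A, RHO, PHI, VAR = 1.0, 1.0, 0.6, 1.0  # hypothetical parameter values


def pi_dot(y):
    """Marginal profit: a (1 - F(c)) - phi var[z] f(c), with c = a + rho y."""
    c = A + RHO * y
    survival = 0.5 * math.erfc(c / math.sqrt(2.0 * VAR))
    density = math.exp(-c * c / (2.0 * VAR)) / math.sqrt(2.0 * math.pi * VAR)
    return A * survival - PHI * VAR * density


def bisect(fn, lo, hi, tol=1e-12):
    """Root of fn on [lo, hi], assuming fn(lo) > 0 > fn(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if fn(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


y_bar = bisect(pi_dot, 0.0, 10.0)  # marginally break-even depth


def total_depth(n):
    """Aggregate depth n * q under the assumed symmetric first-order condition."""
    if n == 1:
        return y_bar  # a sure first-in-queue maker posts exactly y_bar

    def foc(q):
        return sum(pi_dot(k * q) for k in range(1, n + 1)) / n

    return n * bisect(foc, 0.0, y_bar)


for n in range(1, 11):
    print(n, round(total_depth(n), 4), round(y_bar, 4))
```

With a single market maker the depth equals $\bar{y}$ , and for every $n \geq 2$ in this sketch the computed depth exceeds $\bar{y}$ , in line with the proposition.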

Proof of Proposition 2.

The First-Order Condition (10) implies that $- (1 - η) E [\dot{π} (Q_{i}^{≺} + q_{i 0})] = η P [{ITM}_{i}] E [\dot{π} (Q_{i}^{≺} + q_{i 0}) | {ITM}_{i}] \geq 0$ , where the inequality holds because of the conditioning on ${ITM}_{i}$ . Therefore, by the implicit function theorem,

\frac{d q_{i 0}}{d η} = - \frac{- E [\dot{π} (Q_{i}^{≺} + q_{i 0})] + P [{ITM}_{i}] E [\dot{π} (Q_{i}^{≺} + q_{i 0}) | {ITM}_{i}]}{(1 - η) E [\ddot{π} (Q_{i}^{≺} + q_{i 0})] + η P [{ITM}_{i}] E [\ddot{π} (Q_{i}^{≺} + q_{i 0}) | {ITM}_{i}]} \geq 0,

as the denominator is strictly negative because the second-order condition must hold in equilibrium. That is, every $q_{i 0}$ is monotone increasing in $η$ . As such, the aggregate depth $\hat{y} = \sum_{i}^{n} q_{i 0}$ is also increasing in $η$ . Finally, by Proposition 1, $\hat{y} > \bar{y}$ when $η = 0$ , implying that $\hat{y} > \bar{y}$ for all $η$ . The proof of $P [k_{i} = 1] < 1$ for all i $\Leftrightarrow$ $\hat{y} > \bar{y}$ is the same as in the Proof of Proposition 1 and is omitted. □

Proof of Lemma 3.

Under symmetry, the unique break-even queue position $\bar{k} \in \{2, \dots, n\}$ is defined by $\bar{k} q_{0} \leq \bar{y} < (\bar{k} + 1) q_{0}$ . Equivalently, scaling the inequality by $n q_{0}$ ,

\frac{\bar{k}}{n} \leq \frac{\bar{y}}{n q_{0}} = \frac{\bar{y}}{\hat{y}} < \frac{\bar{k}}{n} + \frac{1}{n} \Leftrightarrow \frac{\bar{y}}{\hat{y}} - \frac{1}{n} < \frac{\bar{k}}{n} \leq \frac{\bar{y}}{\hat{y}} .

Note that both the upper and the lower bounds of $\frac{\bar{k}}{n}$ above converge to $\frac{\bar{y}}{\hat{y}}$ as $n \to \infty$ . Therefore, the $c / a$ Ratio (11) also converges to $1 - \frac{\bar{y}}{\hat{y}} = \frac{α}{1 + α}$ . □
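The convergence can be seen in a short numerical sketch. It assumes that the $c / a$ ratio equals the share of orders queued beyond the break-even position, $1 - \bar{k} / n$ (a reading inferred from this proof rather than quoted from (11)); the values of $\bar{y}$ and $α$ and the function name are hypothetical.

```python
import math


def cancel_to_add_count_ratio(n, y_bar, y_hat):
    """1 - k_bar / n, where k_bar is the last position with k_bar * q0 <= y_bar."""
    q0 = y_hat / n  # symmetric order size
    k_bar = math.floor(y_bar / q0)
    return 1.0 - k_bar / n


y_bar, alpha = 1.0, 0.5  # hypothetical break-even depth and overshoot
y_hat = (1.0 + alpha) * y_bar
limit = alpha / (1.0 + alpha)

for n in (10, 100, 1000, 10**6):
    print(n, cancel_to_add_count_ratio(n, y_bar, y_hat), limit)
```

The gap between the finite-n ratio and $\frac{α}{1 + α}$ stays below $\frac{1}{n}$ , matching the bounds in the displayed inequality.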

Proof of Proposition 3.

The proof proceeds in three steps. The first is to derive an explicit expression of $Π$ . Note that the expected profit for a market maker i is $(1 - η) E [π (Q_{i}^{≺} + q_{i 0}) - π (Q_{i}^{≺})] + η E [π (Q_{i}^{≺} + q_{i 1}) - π (Q_{i}^{≺})]$ . Adding this up for all $i \in \{1, 2, \dots, n\}$ yields

Π = (1 - η) π (\hat{y}) + η π (\bar{y}) = (1 - η) π ((1 + α) \bar{y}) + η π (\bar{y}),

because regardless of the queue realization, the aggregate depth with revision is always $\bar{y}$ and that without is always $\hat{y} = (1 + α) \bar{y}$ .

The second step is to show that it is a necessary equilibrium outcome that $Π = 0$ when $n \to \infty$ . From the Proof of Lemma 3, $\lim_{n \to \infty} \frac{\bar{k}}{n} = \frac{\bar{y}}{\hat{y}}$ , where $\bar{k}$ is the break-even queue position such that $\bar{k} q_{0} \leq \bar{y} < (\bar{k} + 1) q_{0}$ . Every market maker has the same ITM probability of $P [{ITM}_{i}] = \frac{\bar{k} - 1}{n} \to \frac{\bar{y}}{\hat{y}}$ . Then, in the limit of $n \to \infty$ , the First-Order Condition (10) becomes

\begin{array}{l} 0 = (1 - η) \lim_{n \to \infty} (E [\dot{π} (Q_{i}^{≺} + q_{0})]) + η \lim_{n \to \infty} (P [{ITM}_{i}] E [\dot{π} (Q_{i}^{≺} + q_{0}) | {ITM}_{i}]) \\ = (1 - η) \lim_{n \to \infty} \sum_{k = 1}^{n} [\dot{π} (\frac{k}{n} \hat{y}) \frac{1}{n}] + η \lim_{n \to \infty} (\frac{\bar{k} - 1}{n} \sum_{k = 1}^{\bar{k}} [\dot{π} (\frac{k}{n} \hat{y}) \frac{1}{\bar{k}}]) \\ = (1 - η) \frac{1}{\hat{y}} \int_{0}^{\hat{y}} \dot{π} (z) d z + η \frac{1}{\hat{y}} \int_{0}^{\bar{y}} \dot{π} (z) d z = (1 - η) \frac{1}{\hat{y}} π (\hat{y}) + η \frac{1}{\hat{y}} π (\bar{y}), \end{array}

where the second line expands the expectations, and the third follows the definition of definite integrals. Multiplying both sides with $\hat{y}$ then yields $Π = 0$ .

Finally, note that by fixing $α$ , any exogenous parameter $ζ$ , other than $η$ , affects $Π$ via either $π (\cdot)$ or $\bar{y}$ ; that is, $Π$ can be written as the function $Π (α, ζ) = (1 - η) π ((1 + α) \bar{y} (ζ); ζ) + η π (\bar{y} (ζ); ζ)$ . If $ζ$ is $η$ , then $Π (α, ζ) = (1 - ζ) π ((1 + α) \bar{y}) + ζ π (\bar{y})$ . Because $Π (α, ζ) = 0$ , by the implicit function theorem, $\frac{d α}{d ζ} = - \frac{\partial Π / \partial ζ}{\partial Π / \partial α}$ . Note that $\frac{\partial Π}{\partial α} = (1 - η) \dot{π} ((1 + α) \bar{y}) \bar{y} < 0$ because of the top-of-queue advantage (Lemma 1). Because the $c / a$ ratio increases in $α$ , it follows that $sign [\frac{d}{d ζ} (c / a)] = sign [\frac{d α}{d ζ}] = sign [\frac{\partial Π}{\partial ζ}]$ . □
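The zero-profit condition $Π = (1 - η) π (\hat{y}) + η π (\bar{y}) = 0$ can be solved numerically for the large-n depth. The sketch below is only an illustration: it integrates the normal closed form of $\dot{π}$ with a Simpson rule to obtain $π$ , and it uses hypothetical parameter values chosen so that $\dot{π} (0) > 0$ and $π_{\infty} < 0$ both hold.

```python
import math

A, RHO, PHI, VAR, ETA = 1.0, 1.0, 0.6, 1.0, 0.1  # hypothetical values


def pi_dot(y):
    """Marginal profit: a (1 - F(c)) - phi var[z] f(c), with c = a + rho y."""
    c = A + RHO * y
    survival = 0.5 * math.erfc(c / math.sqrt(2.0 * VAR))
    density = math.exp(-c * c / (2.0 * VAR)) / math.sqrt(2.0 * math.pi * VAR)
    return A * survival - PHI * VAR * density


def pi_total(y, steps=2000):
    """pi(y): integral of pi_dot over [0, y] by composite Simpson (steps even)."""
    h = y / steps
    total = pi_dot(0.0) + pi_dot(y)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pi_dot(i * h)
    return total * h / 3.0


def bisect(fn, lo, hi, tol=1e-10):
    """Root of fn on [lo, hi], assuming fn(lo) > 0 > fn(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if fn(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


y_bar = bisect(pi_dot, 0.0, 10.0)
pi_bar = pi_total(y_bar)


def aggregate_profit(y_hat):
    """Pi = (1 - eta) pi(y_hat) + eta pi(y_bar)."""
    return (1.0 - ETA) * pi_total(y_hat) + ETA * pi_bar


y_hat = bisect(aggregate_profit, y_bar, 20.0)
alpha = y_hat / y_bar - 1.0
ca_limit = alpha / (1.0 + alpha)  # limiting c/a ratio from Lemma 3
print(y_bar, y_hat, alpha, ca_limit)
```

Re-running the sketch with a different parameter value gives a quick sense of how $α$ , and hence the limiting $c / a$ ratio, moves with that parameter.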

Proof of Proposition 4.

After all n market makers submit limit orders at $t = 0$ , by time $t \in (0, 1)$ , the fraction of these n orders that have been processed is, by the law of large numbers,

\frac{1}{n} \sum_{i = 1}^{n} 𝟙_{{t_{i} \leq t}} \overset{n \to \infty}{\to} ψ_{F} G_{F} (t) + ψ_{s} G_{S} (t) .

Define $u_{j} = q_{j} n$ as the “intensity” of the group j’s limit order size.¹² Then, in the limit, the total depth that will have been processed by time t is

y (t) ≔ \sum_{i = 1}^{n} 𝟙_{{t_{i} \leq t}} q_{i} = \sum_{i = 1}^{n} 𝟙_{{t_{i} \leq t}} \frac{u_{i}}{n} \overset{n \to \infty}{\to} ψ_{F} G_{F} (t) u_{F} + ψ_{s} G_{S} (t) u_{S} .

(A.3)

(For now, it is a conjecture that $u_{j} = q_{j} n$ exists in the limit for both $j \in \{F, S\}$ , and it is verified below.) Clearly, $y (t)$ is monotone increasing in $t \in (0, 1)$ , with $y (0) = 0$ , and by Proposition 2, $y (1) = \hat{y} > \bar{y}$ . Therefore, there exists a unique $\bar{t} \in (0, 1)$ such that $y (\bar{t}) = \bar{y}$ . That is, $\bar{t}$ is the latency threshold below which a limit order is ITM. A group j market maker i’s probability of being ITM is $P [{ITM}_{i}] \to G_{j} (\bar{t})$ .

Suppose a market maker i is from group j, and their order arrives at $t_{i}$ . Following the definition of $y (t)$ above, the book depth at that time, including their order, converges to

Q_{i}^{≺} + q_{j} = (\sum_{l = 1}^{n} 𝟙_{{t_{l} \leq t_{i}}} q_{l}) + \frac{u_{j}}{n} \to y (t_{i}),

because $q_{j} = \frac{u_{j}}{n} \to 0$ . Then, for a group j market maker, their first-order derivative (the left-hand side of (10)) becomes

(1 - η) \int_{0}^{1} \dot{π} (y (t)) g_{j} (t) d t + η \int_{0}^{\bar{t}} \dot{π} (y (t)) g_{j} (t) d t ≕ κ_{j} .

(A.4-j)

Depending on whether $κ_{j} ≶ 0$ , an equilibrium must be of one of the following three cases: (a) $κ_{F} = 0$ but $κ_{S} < 0$ , (b) $κ_{F} < 0$ but $κ_{S} = 0$ , or (c) $κ_{F} = κ_{S} = 0$ . The other cases cannot be an equilibrium: if $κ_{j} > 0$ , a group j market maker will always want to increase their order size intensity $u_{j}$ , and if both $κ_{F} < 0$ and $κ_{S} < 0$ , then the equilibrium depth $\hat{y} = 0$ , contradicting $\dot{π} (0) > 0$ (Lemma 1). If $κ_{j} < 0$ , it is a corner equilibrium with $u_{j} = 0$ , but this does not mean that the group j market makers do not submit limit orders. Instead, they submit negligibly small orders in the limit of $n \to \infty$ (and cancel them if OTM).

In any of the three cases, the following property holds: scale (A.4-j) with the respective aggregate depth from group j, that is, $ψ_{F} u_{F}$ for F and $ψ_{s} u_{S}$ for S, and sum them up to get

\begin{array}{l} (1 - η) \int_{0}^{1} \dot{π} (y (t)) (ψ_{F} u_{F} g_{F} (t) + ψ_{s} u_{S} g_{S} (t)) d t + η \int_{0}^{\bar{t}} \dot{π} (y (t)) (ψ_{F} u_{F} g_{F} (t) + ψ_{s} u_{S} g_{S} (t)) d t \\ = (1 - η) \int_{0}^{\hat{y}} \dot{π} (y) d y + η \int_{0}^{\bar{y}} \dot{π} (y) d y = (1 - η) π (\hat{y}) + η π (\bar{y}) = 0, \end{array}

(A.5)

where the first equality holds because $d y (t) = (ψ_{F} u_{F} g_{F} (t) + ψ_{s} u_{S} g_{S} (t)) d t$ , and the last “ $= 0$ ” holds because at least one $κ_{j} = 0$ . (If a $κ_{j} < 0$ , it is scaled by $u_{j} = 0$ , hence not affecting the above.) Thus, the equilibrium depth $\hat{y}$ is pinned down by (A.5) and is independent of market makers’ speed heterogeneity. Lemma 1 ensures that such a $\hat{y}$ is unique and always exists for sufficiently small $η$ . Because such a $\hat{y} = ψ_{F} u_{F} + ψ_{s} u_{S}$ is finite, both $u_{F}$ and $u_{S}$ must also be finite, verifying the earlier conjecture.

The equilibrium $\bar{t}$ can then be determined: Define $H (t) ≔ \frac{y (t)}{\hat{y}}$ , which is monotone increasing in t from $H (0) = 0$ to $H (1) = 1$ . The break-even latency $\bar{t}$ is then uniquely determined by $\bar{y} = y (\bar{t}) = \hat{y} H (\bar{t})$ , or $\bar{t} = H^{- 1} (\frac{\bar{y}}{\hat{y}})$ .
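The determination of $\bar{t}$ can be sketched numerically. The example below takes the size intensities $u_{F}$ and $u_{S}$ as given inputs instead of solving them from (A.4-j), uses two hypothetical latency distributions on $[0, 1]$ with $G_{F} (t) \geq G_{S} (t)$ , and sets the slow group's share to $1 - ψ_{F}$ . All names and numerical values are assumptions made for illustration.

```python
def g_fast(t):
    """Hypothetical latency CDF of the fast group on [0, 1]."""
    return 1.0 - (1.0 - t) ** 2


def g_slow(t):
    """Hypothetical latency CDF of the slow group on [0, 1]."""
    return t ** 2


def depth(t, psi_f, u_f, u_s):
    """y(t) as in (A.3): depth processed by time t."""
    return psi_f * g_fast(t) * u_f + (1.0 - psi_f) * g_slow(t) * u_s


def break_even_latency(y_bar, psi_f, u_f, u_s, tol=1e-12):
    """Bisection for t_bar with y(t_bar) = y_bar; y(t) is increasing in t."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if depth(mid, psi_f, u_f, u_s) < y_bar:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical inputs; u_f and u_s are not solved from the model here.
psi_f, u_f, u_s, y_bar = 0.4, 2.0, 1.0, 0.9
y_hat = depth(1.0, psi_f, u_f, u_s)
t_bar = break_even_latency(y_bar, psi_f, u_f, u_s)

# Share of orders arriving after t_bar (the count ratio c/a described in the proof).
count_ratio = 1.0 - (psi_f * g_fast(t_bar) + (1.0 - psi_f) * g_slow(t_bar))
print(y_hat, t_bar, count_ratio)
```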

Following (13), $C / A = \frac{α}{1 + α} \frac{1}{c / a}$ . In equilibrium, $c / a$ is the fraction of orders that arrive after $\bar{t}$ , that is, $c / a = 1 - (ψ_{F} G_{F} (\bar{t}) + ψ_{s} G_{S} (\bar{t})) > 1 - G_{F} (\bar{t})$ because, following the stochastic dominance, $G_{F} (\bar{t}) > G_{S} (\bar{t})$ . Note that $\hat{y} = ψ_{F} u_{F} + ψ_{s} u_{S}$ , and so, $H (t)$ is effectively a weighted average between $G_{F} (t)$ and $G_{S} (t)$ , and hence, $G_{F} (t) \geq H (t)$ and $G_{F}^{- 1} (p) \leq H^{- 1} (p)$ . Therefore, $G_{F} (\bar{t}) = G_{F} (H^{- 1} (\frac{\bar{y}}{\hat{y}})) \geq G_{F} (G_{F}^{- 1} (\frac{\bar{y}}{\hat{y}})) = \frac{\bar{y}}{\hat{y}}$ . That is, in any of the cases, $c / a > 1 - \frac{\bar{y}}{\hat{y}} = \frac{α}{1 + α}$ , and hence, $C / A < 1$ .

Next, consider the cases separately to examine the effect of $ψ_{F}$ on $C / A$ .

Cases (a) and (b): These two cases can be combined as $κ_{j} = 0$ and $κ_{- j} < 0$ , and therefore, $u_{j} > 0$ and $u_{- j} = 0$ . Then, $H (t) = G_{j} (t)$ , and hence, $\bar{t} = G_{j}^{- 1} (\frac{\bar{y}}{\hat{y}})$ . Neither $α = \frac{\hat{y}}{\bar{y}} - 1$ nor $\bar{t} = G_{j}^{- 1} (\frac{\bar{y}}{\hat{y}})$ is affected by $ψ_{F}$ . Therefore, $\frac{d}{d ψ_{F}} (c / a) = - G_{F} (\bar{t}) + G_{S} (\bar{t}) < 0$ because of the stochastic dominance, and so, a higher $ψ_{F}$ increases $C / A$ .

Case (c): In this case, the two $u_{j} > 0$ are endogenously determined by the two first-order conditions of $κ_{j} = 0$ . Consider either $κ_{j} = 0$ , which involves $y (t) = \hat{y} G_{S} (t) + (ψ_{F} u_{F}) (G_{F} (t) - G_{S} (t))$ (using $\hat{y} = ψ_{F} u_{F} + ψ_{s} u_{S}$ ). Therefore, by the implicit function theorem,

\frac{d u_{F}}{d ψ_{F}} = - \frac{\partial κ_{j} / \partial ψ_{F}}{\partial κ_{j} / \partial u_{F}} = - \frac{(1 - η) \int_{0}^{1} \ddot{π} (y) g_{j} Δ d t u_{F} + η \int_{0}^{\bar{t}} \ddot{π} (y) g_{j} Δ d t u_{F}}{(1 - η) \int_{0}^{1} \ddot{π} (y) g_{j} Δ d t ψ_{F} + η \int_{0}^{\bar{t}} \ddot{π} (y) g_{j} Δ d t ψ_{F}} = - \frac{u_{F}}{ψ_{F}},

where for notation simplicity, $Δ (t) ≔ G_{F} (t) - G_{S} (t)$ , and the argument $(t)$ is omitted for $y (t)$ , $Δ (t)$ , and $g_{F} (t)$ . (Note that $\dot{π} (y (\bar{t})) = \dot{π} (\bar{y}) = 0$ .) Similarly, rewriting $y (t) = \hat{y} G_{F} (t) - (ψ_{s} u_{S}) (G_{F} (t) - G_{S} (t))$ , it can be found that

\frac{d u_{S}}{d ψ_{F}} = - \frac{\partial κ_{j} / \partial ψ_{F}}{\partial κ_{j} / \partial u_{S}} = - \frac{(1 - η) \int_{0}^{1} \ddot{π} (y) g_{j} Δ d t u_{S} + η \int_{0}^{\bar{t}} \ddot{π} (y) g_{j} Δ d t u_{S}}{- (1 - η) \int_{0}^{1} \ddot{π} (y) g_{j} Δ d t ψ_{s} - η \int_{0}^{\bar{t}} \ddot{π} (y) g_{j} Δ d t ψ_{s}} = \frac{u_{S}}{ψ_{s}} .

The above then implies that the product $ψ_{j} u_{j}$ is unaffected by $ψ_{F}$ : $\frac{d (ψ_{F} u_{F})}{d ψ_{F}} = u_{F} + ψ_{F} \cdot \frac{d u_{F}}{d ψ_{F}} = 0$ , and $\frac{d (ψ_{s} u_{S})}{d ψ_{F}} = - u_{S} + ψ_{s} \cdot \frac{d u_{S}}{d ψ_{F}} = 0$ .

Recall that $\bar{t} = H^{- 1} (\frac{\bar{y}}{\hat{y}})$ , or equivalently, $\bar{y} = y (\bar{t}) = ψ_{F} u_{F} G_{F} (\bar{t}) + ψ_{s} u_{S} G_{S} (\bar{t})$ , which implies

\frac{d \bar{t}}{d ψ_{F}} = - \frac{\frac{d (ψ_{F} u_{F})}{d ψ_{F}} G_{F} (\bar{t}) + \frac{d (ψ_{s} u_{S})}{d ψ_{F}} G_{S} (\bar{t})}{ψ_{F} u_{F} g_{F} (\bar{t}) + ψ_{s} u_{S} g_{S} (\bar{t})} = 0 .

That is, the break-even time is not affected either. Then, finally, $\frac{d}{d ψ_{F}} (c / a) = - (G_{F} (\bar{t}) - G_{S} (\bar{t})) - (ψ_{F} g_{F} (\bar{t}) + ψ_{s} g_{S} (\bar{t})) \frac{d \bar{t}}{d ψ_{F}} = - (G_{F} (\bar{t}) - G_{S} (\bar{t})) < 0 .$ Hence, a higher $ψ_{F}$ also increases $C / A$ in this case. □

Proof of Proposition 5.

Note from (16) that the equilibrium depth $\hat{y}$ only affects the case of no revision, that is, the first expectation. Because this proposition is about the shape of $w (\hat{y})$ , setting $η = 0$ does not change $sign [\dot{w} (\cdot)]$ . Fix a depth level of $\hat{y} = y \geq 0$ . Direct computation yields $\dot{w} (y) - \dot{π} (y) = \int_{a + ρ y}^{\infty} (z - a - ρ y) f (z) d z$ , which is positive because $z > a + ρ y$ in the integrand. Therefore, for all $y \geq 0$ , $\dot{w} (y) > \dot{π} (y)$ . Recall that $\dot{π} (y) \geq 0$ for all $y \in [0, \bar{y}]$ . Then, $\dot{w} (y) > \dot{π} (y) \geq 0$ ; that is, $w (y)$ is strictly increasing initially on $y \in [0, \bar{y}]$ . In particular, the $y^{*}$ that maximizes $w (y)$ must satisfy $y^{*} > \bar{y}$ because $\dot{w} (\bar{y}) > \dot{π} (\bar{y}) = 0$ .

To show that $w (y)$ eventually decreases, directly compute $\dot{w} (y)$ to get

\dot{w} (y) = \int_{a + ρ y}^{\infty} ((1 - ϕ) z - ρ y) f (z) d z = (1 - ϕ) var [z] f (a + ρ y) - ρ y \cdot (1 - F (a + ρ y)),

where $ϕ \in (0, 1)$ , and the last equality uses the property of the normal density that $z f (z) = - var [z] \dot{f} (z)$ . Therefore, $sign [\dot{w} (y)] = sign [(1 - ϕ) var [z] - ρ y r (a + ρ y)]$ , where $r (z) ≔ (1 - F (z)) / f (z)$ is the Mills ratio. Note that

\lim_{y \to \infty} ρ y r (a + ρ y) = \lim_{y \to \infty} \frac{ρ y}{a + ρ y} (a + ρ y) r (a + ρ y) = var [z],

where the last equality uses the property that $z r (z)$ monotonically increases from zero to $var [z]$ on $z \in [0, \infty)$ ; see, for example, Pinelis (2002, proposition 1.2). Therefore, $sign [\dot{w} (y)] \to sign [- ϕ var [z]] < 0$ . □
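The ordering $y^{*} > \bar{y}$ and the eventual decline of welfare can be checked numerically from the two closed forms used in the proofs, $\dot{π} (y) = a (1 - F (c)) - ϕ var [z] f (c)$ and $\dot{w} (y) = (1 - ϕ) var [z] f (c) - ρ y (1 - F (c))$ with $c = a + ρ y$ . The following sketch uses hypothetical parameter values (not those of Figure 7) and a plain bisection; it is an illustration, not a replication of the paper's computations.

```python
import math

A, RHO, PHI, VAR = 1.0, 1.0, 0.6, 1.0  # hypothetical parameter values


def tail(c):
    """Return (1 - F(c), f(c)) for z ~ N(0, VAR)."""
    survival = 0.5 * math.erfc(c / math.sqrt(2.0 * VAR))
    density = math.exp(-c * c / (2.0 * VAR)) / math.sqrt(2.0 * math.pi * VAR)
    return survival, density


def pi_dot(y):
    """Marginal market-making profit at depth y."""
    survival, density = tail(A + RHO * y)
    return A * survival - PHI * VAR * density


def w_dot(y):
    """Marginal welfare at depth y."""
    survival, density = tail(A + RHO * y)
    return (1.0 - PHI) * VAR * density - RHO * y * survival


def bisect(fn, lo, hi, tol=1e-12):
    """Root of fn on [lo, hi], assuming fn(lo) > 0 > fn(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if fn(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


y_bar = bisect(pi_dot, 0.0, 10.0)    # marginally break-even depth
y_star = bisect(w_dot, y_bar, 10.0)  # welfare-maximizing depth
print(y_bar, y_star, w_dot(y_bar), w_dot(2.0 * y_star))
```

For these assumed values, marginal welfare is still positive at $\bar{y}$ and negative beyond $y^{*}$ , which is the hump shape stated in the proposition.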

Proof of Corollary 1.

Consider the direct effect first. Directly evaluate the difference $(g (\hat{y}) + π (\hat{y})) - (g (\bar{y}) + π (\bar{y})) = \int_{\bar{y}}^{\hat{y}} (\dot{g} (y) + \dot{π} (y)) d y = \int_{\bar{y}}^{\hat{y}} \int_{a + ρ y}^{\infty} ((1 - ϕ) z - ρ y) f (z) d z d y$ $> \int_{\bar{y}}^{\hat{y}} \int_{a + ρ y}^{\infty} ((1 - ϕ) a - ϕ ρ y) f (z) d z d y = \int_{\bar{y}}^{\hat{y}} ((1 - ϕ) a - ϕ ρ y) (1 - F (a + ρ y)) d y = \int_{\bar{y}}^{\hat{y}} - \frac{1}{ρ} \ddot{π} (y) r (a + ρ y) d y,$ where the inequality follows because $z \geq a + ρ y$ in the inner definite integral, and the last equality uses the expression of $\ddot{π} (y)$ derived in the Proof of Lemma 1, with $r (z) = \frac{1 - F (z)}{f (z)}$ denoting the Mills ratio. Integrating by parts, $\int_{\bar{y}}^{\hat{y}} - \frac{1}{ρ} \ddot{π} (y) r (a + ρ y) d y = - \frac{1}{ρ} \dot{π} (\hat{y}) r (a + ρ \hat{y}) + \int_{\bar{y}}^{\hat{y}} \dot{π} (y) \dot{r} (a + ρ y) d y > 0$ because $\dot{π} (y) < 0$ for $y > \bar{y}$ and because $r (\cdot)$ is monotone decreasing. Therefore, $g (\hat{y}) + π (\hat{y}) > g (\bar{y}) + π (\bar{y})$ .

Next, consider the indirect effect, $(1 - η) (\dot{g} (\hat{y}) + \dot{π} (\hat{y})) \frac{d \hat{y}}{d η} = (1 - η) \dot{w} (\hat{y}) \frac{d \hat{y}}{d η}$ . Recall from Proposition 2 that $\frac{d \hat{y}}{d η} > 0$ . If $\hat{y} \geq y^{*}$ , then $\dot{w} (\hat{y}) \leq 0$ as seen in Figure 7(a), and the indirect effect is negative. Consider the case of $\bar{y} \leq \hat{y} < y^{*}$ . In this range, as $η \to 1$ , the indirect effect converges to zero if $\dot{w} (\hat{y})$ and $\frac{d \hat{y}}{d η}$ are both bounded, which is indeed the case: direct evaluation yields $\dot{w} (\hat{y}) = (1 - ϕ) P [z > a + ρ \hat{y}] E [z | z > a + ρ \hat{y}] - ρ \hat{y} \cdot (1 - F (a + ρ \hat{y}))$ , which is clearly finite, because the upper tail expectation of a normally distributed z is finite. Also, $\frac{d \hat{y}}{d η}$ is bounded because $\bar{y} \leq \hat{y} < y^{*}$ . By continuity, there exists an $η^{*} \in [0, 1)$ such that for all $η > η^{*}$ , $\frac{d w}{d η} < 0$ . □

Endnotes

¹ Almost all exchanges adopt the time priority rule, under which limit orders of the same limit price are matched with market(able) orders on a first-in-first-out basis.
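
The first-in-first-out rule described in this endnote can be sketched as follows. This is a minimal illustration at a single price level, not any exchange's actual matching engine; the function name, the trader labels, and the order sizes are all hypothetical.

```python
from collections import deque

def match_fifo(queue, market_size):
    """Fill resting limit orders at one price level in arrival order.

    `queue` holds (trader, size) pairs, earliest arrival first.
    Returns a list of (trader, filled_size); unfilled remainders
    stay in the queue and keep their position.
    """
    fills = []
    remaining = market_size
    while remaining > 0 and queue:
        trader, size = queue[0]
        filled = min(size, remaining)
        fills.append((trader, filled))
        remaining -= filled
        if filled == size:
            queue.popleft()                     # order fully executed
        else:
            queue[0] = (trader, size - filled)  # partial fill keeps priority
    return fills

# Three limit orders at the same price, queued by arrival time
book = deque([("A", 3), ("B", 2), ("C", 4)])
fills = match_fifo(book, 4)
print(fills, list(book))
```

The order at the end of the queue (here, trader C) executes only when the market order is large, which is when adverse selection is most severe.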

² Whereas in the model, limit orders are posted simultaneously, it should be emphasized that these liquidity suppliers are not required to move simultaneously in reality. Rather, they only need to move within a relatively tight time interval, unable to observe each other’s decision. The simultaneity is a means of modeling queuing uncertainty.

³ When the liquidity supply is a continuous and smooth function of price, any two (nonzero measure) limit orders must differ in their limit prices, for otherwise, the supply would not be smooth. Then, the time priority is no longer meaningful, and neither is queuing uncertainty, as the orders’ execution sequence is only determined by price priority.

⁴ Indeed, the arrival time of a market order is unpredictable: investors make trading decisions at unpredictable times (e.g., a trader might be taking a coffee break), market orders’ transmission latencies are random, (re)routing across venues can take an uncertain amount of time, speed bumps slow trading randomly, etc.

⁵ The threshold $a^{*}$ is, in fact, the lowest ask price at which competitive market makers will post limit orders. Its Definition (A.1) coincides with the one under the normal distribution example of Glosten (1994). As such, $a > a^{*}$ is a standard assumption inherited from the literature to ensure nonempty limit order books.

⁶ Lemma 2 only establishes the existence of the equilibrium. In general, multiple equilibria are possible, as is common in quantity competition games (see, e.g., Tirole 1988, chapter 5).

⁷ Whereas this paper focuses on discrete-price limit order books, the distinction between the two definitions does not rely on price discreteness. Even when prices are continuous, the difference is still meaningful, as can be most clearly seen by comparing the continuous-price part of Glosten (1994), which is a zero-profit equilibrium, with the monopolist specialist part of Glosten (1989), which is a marginally break-even equilibrium. The resulting shapes of the equilibrium order books are clearly different, even though prices are continuous in both cases.

⁸ Quantifying the severity of queuing uncertainty in general requires a huge number of parameters: the random queue $k$ has $n!$ possible realizations, and hence, $(n! - 1)$ parameters are needed to fully characterize its distribution. The homogeneity of market makers simplifies this quantification by naturally directing the attention to the symmetric-strategy equilibrium so that a single parameter $n$ suffices.
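
The parameter count in this endnote grows quickly. The sketch below enumerates the possible queue orderings for small $n$ and reports the number of free probabilities needed to describe an arbitrary distribution over them (one less than the number of orderings, because probabilities sum to one). The function names are hypothetical.

```python
import itertools
import math

def queue_realizations(n):
    """All possible queue orderings of market makers 1..n."""
    return list(itertools.permutations(range(1, n + 1)))

def free_parameters(n):
    """Free probabilities in a general distribution over n! orderings."""
    return math.factorial(n) - 1

for n in range(1, 6):
    print(n, len(queue_realizations(n)), free_parameters(n))
```

Already with ten market makers, `free_parameters(10)` exceeds three million, which is why the symmetric case with the single parameter $n$ is so much more tractable.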

⁹ The welfare analysis is isomorphic if instead the investor has mean-variance utility or constant absolute risk aversion (CARA) utility. The advantage of adopting quadratic utility is that it gives a clear decomposition of the gains from trade, thus making the intuition transparent.

¹⁰ More formally, denote the degree of queuing uncertainty by $ς$ , which can be the $| β - \frac{1}{2} |$ in Example 1 or the n in Example 2. Then, $w (\hat{y}) = w (\hat{y} (ς))$ , where $\hat{y} (ς)$ is monotone increasing in $ς$ . As such, for the purpose of examining welfare, varying $ς$ is equivalent to varying $\hat{y}$ .

¹¹ To see this, consider the limit case of $ρ \to 0$ ; that is, the investor has only the speculative trading motive. Then, welfare $w (\hat{y})$ indeed reduces to zero. For example, consider the first expectation in Expression (16), which reduces to $E [(z - V) x (z; \hat{y})]$ . Note from (3) that as $ρ \to 0$ , $x (z; \hat{y}) \to 𝟙_{\{z > a\}} \hat{y}$ . Therefore, $E [(z - V) x (z; \hat{y})] = P [z > a] E [(z - V) \hat{y} | z > a] = \hat{y} P [z > a] E [E [z - V | z] | z > a]$ , where the last equality holds by iterated expectations. Recall that $E [V | z] = ϕ z \to z$ , where $ϕ = \frac{θ τ_{U}}{θ τ_{U} + ρ^{2} τ_{V}} \to 1$ as $ρ \to 0$ . Therefore, the above expectation is indeed zero, and the same holds for the second expectation, yielding $w (\hat{y}) \to 0$ .
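
The limit $ϕ \to 1$ used in this endnote can be illustrated numerically. The sketch below evaluates $ϕ = \frac{θ τ_{U}}{θ τ_{U} + ρ^{2} τ_{V}}$ for a shrinking sequence of $ρ$ ; the parameter values $θ = τ_{U} = τ_{V} = 1$ are arbitrary assumptions chosen only for illustration.

```python
def phi(rho, theta, tau_u, tau_v):
    """phi = theta*tau_U / (theta*tau_U + rho^2 * tau_V)."""
    return theta * tau_u / (theta * tau_u + rho ** 2 * tau_v)

# Illustrative parameter values (assumptions, not calibrated to the paper)
rhos = (1.0, 0.1, 0.01, 0.001)
values = [phi(rho, theta=1.0, tau_u=1.0, tau_v=1.0) for rho in rhos]

for rho, value in zip(rhos, values):
    print(rho, value)
```

As $ρ$ shrinks, $E [V | z] = ϕ z$ approaches $z$ , so the conditional expectation $E [z - V | z]$ vanishes.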

¹² More generally, one can define a measure $μ (\cdot)$ for the power set of the n market makers, where each market maker i has the same measure of $μ (i) = 1 / n$ . This way, the measure of the fast group is $ψ_{F}$ , and that of the slow group is $ψ_{S}$ . Then, a market maker i’s intensity $u_{j} = q_{j} n = q_{j} / (1 / n) = q_{j} / μ (i)$ is essentially their order size $q_{j}$ relative to their measure $μ (i)$ . This construct will be convenient in analyzing the limit of $n \to \infty$ . For example, consider the aggregate depth, which is $\hat{y} = \sum_{i} q_{i} = \sum_{i} u_{i} \frac{1}{n} = \sum_{i} u_{i} μ (i)$ , where $u_{i} = u_{j}$ if i is in group j. In the limit of $n \to \infty$ , it has an intuitive integral equivalent of $\hat{y} = \int_{0}^{1} u_{i} d i = ψ_{F} u_{F} + ψ_{S} u_{S}$ .
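
The aggregation in this endnote can be checked with a finite number of market makers. The sketch below builds the order sizes $q_{i} = u_{i} μ (i)$ with $μ (i) = 1 / n$ , sums them, and compares the total with $ψ_{F} u_{F} + ψ_{S} u_{S}$ . The numerical values and the assumption $ψ_{S} = 1 - ψ_{F}$ are illustrative choices made here, and the function name is hypothetical.

```python
import math

def aggregate_depth(n, psi_f, u_f, u_s):
    """Sum order sizes q_i = u_i / n over n market makers.

    A share psi_f of them is fast (intensity u_f); the rest are slow
    (intensity u_s). Assumes psi_f * n is (close to) an integer.
    """
    n_fast = round(psi_f * n)
    intensities = [u_f] * n_fast + [u_s] * (n - n_fast)
    order_sizes = [u / n for u in intensities]  # q_i = u_i * mu(i)
    return math.fsum(order_sizes)

# Illustrative values (assumptions): 30% fast, intensities 2.0 and 0.5
depth = aggregate_depth(n=1000, psi_f=0.3, u_f=2.0, u_s=0.5)
closed_form = 0.3 * 2.0 + (1 - 0.3) * 0.5
print(depth, closed_form)
```

Because each intensity is weighted by the measure $1 / n$ , the total depends only on the group shares and intensities, which is what makes the limit of $n \to \infty$ well behaved.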

References

Aspris A, Foley S, Harris D, O’Neill P (2015) Time pro-rata matching: Evidence of a change in LIFFE STIR futures. J. Futures Markets 35(6):522–541.
Back K, Baruch S (2013) Strategic liquidity provision in limit order markets. Econometrica 81(1):363–392.
Baldauf M, Mollner J (2020) High-frequency trading and market performance. J. Finance 75(3):1495–1526.
Baldauf M, Mollner J (2022) Fast traders make a quick buck: The role of speed in liquidity provision. J. Financial Markets 58:100621.
Baruch S, Glosten LR (2013) Fleeting orders. Working paper, Carlson School of Management, Minneapolis.
Bhattacharya A, Saar G (2021) Limit order markets under asymmetric information. Working paper, Cornell SC Johnson College of Business, Ithaca, NY.
Biais B, Hillion P, Spatt C (1995) An empirical analysis of the limit order book and the order flow in the Paris bourse. J. Finance 50(5):1655–1689.
Biais B, Martimort D, Rochet J-C (2000) Competing mechanisms in a common value environment. Econometrica 68(4):799–837.
Biais B, Martimort D, Rochet J-C (2013) Corrigendum to “competing mechanisms in a common value environment”. Econometrica 81(1):393–406.
Brolley M, Cimon DA (2020) Order flow segmentation, liquidity and price discovery: The role of latency delays. J. Financial Quant. Anal. 55(8):2555–2587.
Budish E, Cramton P, Shim J (2015) The high-frequency trading arms race: Frequent batch auctions as a market design response. Quart. J. Econom. 130(4):1547–1621.
Buti S, Consonni F, Rindi B, Wen Y, Werner IM (2015) Sub-penny and queue-jumping. Working paper, Bocconi University, Milan.
Chen H, Foley S, Goldstein M, Ruf T (2017) The value of a millisecond: Harnessing information in fast, fragmented markets. Preprint, submitted October 28, https://doi.org/10.2139/ssrn.2860359.
CME Group (2023) EBS dealing rules—Appendix. Technical report, CME Group, Chicago, https://www.cmegroup.com/trading/market-tech-and-data-services/files/ebs-dealing-rules-ebs-market-appendix-effective-20230811.pdf.
Colliard J-E, Foucault T (2012) Trading fees and efficiency in limit order markets. Rev. Financial Stud. 25(11):3389–3421.
Dahlström P, Hagströmer B, Nordén LL (2024) The determinants of limit order cancellations. Financial Rev. 59(1):181–201.
Degryse H, Karagiannis N (2022) Priority rules. Preprint, submitted May 29, https://doi.org/10.2139/ssrn.3186009.
Degryse H, De Winne R, Gresse C, Payne R (2024) Duplicated orders, swift cancellations, and fast market making in fragmented markets. Preprint, submitted April 10, https://doi.org/10.2139/ssrn.3356695.
Ding S, Hanna J, Hendershott T (2014) How slow is the NBBO? A comparison with direct exchange feeds. Financial Rev. 49(2):313–332.
Dugast J (2018) Unscheduled news and market dynamics. J. Finance 73(6):2537–2586.
Easley D, O’Hara M (1987) Price, trade size, and information in securities markets. J. Financial Econom. 19(1):69–90.
Easley D, O’Hara M (1992) Time and the process of security price adjustment. J. Finance 47(2):576–605.
Egginton J, van Ness BF, van Ness RA (2016) Quote stuffing. Financial Management 45(3):583–608.
Foucault T, Menkveld AJ (2008) Competition for order flow and smart order routing systems. J. Finance 63(1):119–158.
Glosten LR (1989) Insider trading, liquidity, and the role of the monopolist specialist. J. Bus. 62(2):211–235.
Glosten LR (1994) Is the electronic limit order book inevitable? J. Finance 49(4):1127–1161.
Glosten LR, Milgrom PR (1985) Bid, ask, and transaction prices in a specialist market with heterogeneously informed agents. J. Financial Econom. 14(1):71–100.
Hasbrouck J (2018) High frequency quoting: Short-term volatility in bids and offers. J. Financial Quant. Anal. 53(2):613–641.
Hasbrouck J, Saar G (2009) Technology and liquidity provision: The blurring of traditional definitions. J. Financial Markets 12(2):143–172.
Hasbrouck J, Saar G (2013) Low-latency trading. J. Financial Markets 16(4):646–679.
Hu E (2019) Intentional access delays, market quality, and price discovery: Evidence from IEX becoming an exchange. Preprint, submitted June 27, https://doi.org/10.2139/ssrn.3195001.
Kavajecz KA (1999) A specialist’s quoted depth as a strategic choice variable. J. Finance 54(2):747–771.
Kavajecz KA, Odders-White ER (2001) An examination of changes in specialists’ posted price schedules. Rev. Financial Stud. 14(3):681–704.
Kavajecz KA, Odders-White ER (2004) Technical analysis and liquidity provision. Rev. Financial Stud. 17(4):1043–1071.
Khapko M, Zoican M (2021) Do speed bumps curb low-latency investments? Evidence from a laboratory market. J. Financial Markets 55:100601.
Li S, Ye M (2023) Discrete prices, discrete quantities, and the optimal price of a stock. Preprint, submitted March 8, https://doi.org/10.2139/ssrn.3763516.
Menkveld AJ, Zoican MA (2017) Need for speed? Exchange latency and market quality. Rev. Financial Stud. 30(4):1188–1228.
O’Hara M (2015) High frequency market microstructure. J. Financial Econom. 116(2):257–270.
Parlour CA, Seppi DJ (2008) Limit order markets: A survey. Boot AWA, Thakor AV, eds. Handbook of Financial Intermediation and Banking (Elsevier Publishing, Amsterdam) 63–96.
Pinelis I (2002) Monotonicity properties of the relative error of a Padé approximation for Mills’ ratio. J. Inequalities Pure Appl. Math. 3(2):1–8.
Riccó R, Rindi B, Seppi DJ (2021) Optimal market access pricing. Working paper, Bocconi University, Milan.
Sandås P (2001) Adverse selection and competitive market making: Empirical evidence from a limit order market. Rev. Financial Stud. 14(3):705–734.
Seppi DJ (1997) Liquidity provision with limit orders and a strategic specialist. Rev. Financial Stud. 10(1):103–150.
Tirole J (1988) The Theory of Industrial Organization (MIT Press, Cambridge, MA).
Yao C, Ye M (2018) Why trading speed matters: A tale of queue rationing under price controls. Rev. Financial Stud. 31(6):2158–2183.

Volume 72, Issue 6

June 2026

Pages 4569-5489, iv-vi

Received: October 16, 2023
Accepted: February 26, 2025
Published Online: September 17, 2025

Cite as

Bart Zhou Yueshen (2025) Queuing Uncertainty of Limit Orders. Management Science 72(6):4760-4779.

https://doi.org/10.1287/mnsc.2023.03371
