Open Access

Asymptotically Optimal Inventory Control for Assemble-to-Order Systems

Martin I. Reiman
https://orcid.org/0000-0003-4919-2894
Industrial Engineering and Operations Research, Columbia University, New York, New York 10027

Haohua Wan
Industrial and Enterprise Systems Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois 61801

Qiong Wang (Corresponding Author)
https://orcid.org/0000-0002-8782-0460
Industrial and Enterprise Systems Engineering, University of Illinois at Urbana–Champaign, Urbana, Illinois 61801

Published Online: 26 Oct 2022
https://doi.org/10.1287/stsy.2022.0099

Abstract

We consider assemble-to-order (ATO) inventory systems with a general bill of materials and general deterministic lead times. Unsatisfied demands are always backlogged. We apply a four-step asymptotic framework to develop inventory policies for minimizing the long-run average expected total inventory cost. Our approach features a multistage stochastic program (SP) to establish a lower bound on the inventory cost and determine parameter values for inventory control. Our replenishment policy deviates from the conventional constant base stock policies to accommodate nonidentical lead times. Our component allocation policy differentiates demands based on backlog costs, bill of materials, and component availabilities. We prove that our policy is asymptotically optimal on the diffusion scale, that is, as the longest lead time grows, the percentage difference between the average cost under our policy and its lower bound converges to zero. In developing these results, we formulate a broad stochastic tracking model and prove general convergence results from which the asymptotic optimality of our policy follows as specialized corollaries.

Funding: This study is based on work supported by the National Science Foundation [Grant CMMI-1363314].

1. Introduction

Optimal control of assemble-to-order (ATO) inventory systems is a canonical problem in inventory theory. In an ATO system, product assembly is assumed to take a negligible amount of time, so there is no need to keep inventories of final products. Component supplies are not capacity constrained, but it is necessary to hold inventories for them to accommodate replenishment lead times (i.e., delays between ordering and receiving components). The goal of ATO inventory management is to minimize the total inventory cost, which consists of both holding and backlog costs, by controlling the timing and quantities of component orders and allocation of available components to different product demands.

Although optimal control of single-product ATO systems has long been settled (see Karlin and Scarf (1958) and Rosling (1989) for systems with single and multiple components, respectively), optimizing multiproduct ATO systems is an immensely more difficult problem that remains unsolved. The complexity arises from the need to allocate components that are used by multiple products. Optimal allocation depends on component availabilities, which are outcomes of replenishment decisions. Optimal replenishment needs to take into account how components will be allocated. A joint optimal solution of both decisions is contingent on not only the current inventory and backlog levels, but also the arrival times of all outstanding replenishment orders, giving rise to an enormous state space. The problem becomes even more complicated in systems where components do not have identical lead times, because the ordering of components often needs to be coordinated with the availabilities of other components with longer lead times.

There has been a large body of studies on managing multiproduct ATO systems. One may find a thorough review of the related literature in Song and Zipkin (2003), and a more recent one in Atan et al. (2017). Many previous studies focus on particular types of policies, such as base stock replenishment policies, FIFO, or no-hold-back allocation policies (see Lu and Song (2005), Lu et al. (2010), and Huang and de Kok (2015) for some samples), and for periodic-review systems, allocation policies that always satisfy demands from previous periods first (Zhang 1997, Hausman et al. 1998, Agrawal and Cohen 2001, Akçay and Xu 2004). Restricting the consideration to these types of policies makes the problem more tractable, but also sacrifices optimality, because both analytical and numerical studies (Doğru et al. 2010, 2017) have shown that there are often better alternatives in other types of policies.

Asymptotic analysis has recently emerged as a powerful tool for analyzing inventory systems. Although a weaker standard than exact optimality, asymptotic optimality provides vital guidance for formulating novel inventory policies and evaluating their performance when exact optimality is analytically intractable. Asymptotic optimality has been used to justify the use of base stock policies in lost-sales inventory systems (in the regime of high penalty costs; Huh and Rusmevichientong 2009), ATO inventory systems with identical lead times (Reiman and Wang 2015) or when differences in lead times are small relative to the lead times themselves (Reiman et al. 2016), or systems with nonstationary demands and probabilistic service level constraints (Wei et al. 2021). It also supports the use of no-hold-back allocation policies in assemble-to-order N and W systems (Lu et al. 2015). Asymptotic analysis has led to several surprising discoveries. For instance, simple constant ordering policies can be highly effective in managing lost-sales inventory systems with long lead times (Reiman 2004, Goldberg et al. 2016, Xin and Goldberg 2016, Bu et al. 2020), and randomness of lead times is a useful feature that can be exploited to reduce the inventory cost in backlog systems when orders can cross in time (Stolyar and Wang 2022). We refer readers to Goldberg (2021) for a comprehensive review of these developments.

In this paper, we study ATO inventory systems with nonidentical, deterministic lead times and a general bill of materials (BOM). We develop an inventory policy that includes both replenishment and allocation decisions. We prove that our policy is asymptotically optimal, that is, as the longest lead time increases, the percentage difference between the long-run average inventory cost under our policy and the optimal policy converges to zero.

Our analysis follows a general four-step asymptotic framework that has produced many ground-breaking results in the study of stochastic processing networks (Harrison 1988, 1996; Harrison and Wein 1990; Harrison and López 1999). It has also been adopted for optimizing pricing, capacity, and allocation decisions in ATO production-inventory systems (Plambeck and Ward 2006). In Reiman and Wang (2015), the framework is formally summarized where each step is explicitly described and specialized to the study of ATO inventory systems with identical lead times. As a continuation of the latter work, here we apply the framework to address ATO inventory systems with general deterministic lead times. A step-by-step discussion of previous results and our new contributions is provided.

Step 1: Relax some feasibility constraints of the original system to formulate a proxy model that is easier to solve and provides a lower bound on the cost achievable under any feasible policy.
In Reiman and Wang (2012), a multistage stochastic program (SP) is formulated as a static analog to dynamic ATO systems. The optimal objective value of the SP is proven to be a lower bound on the average inventory cost of the latter systems under any feasible policy. Unfortunately, this lower bound takes the form of an infimum that is often not attained at finite values of the decision variables.
As a contribution of this paper, we transform this SP into an equivalent form where the optimal objective value is the same, whereas the optimal decision variables are finite. This paves the way for using these optimal decision variables as parameters for ATO inventory control policies.
Step 2: Solve the proxy model.
There have been many studies on the model structure and computational efficiency of the aforementioned SP, mainly on special cases that have two stages and correspond to systems with particular BOMs (Doğru et al. 2010, Nadar et al. 2014, van Jaarsveld and Scheller-Wolf 2015, Zipkin 2016, DeValve et al. 2020). Developing efficient algorithms for multistage SPs is an active area of research. Although we solve the SP for some simple examples in Section 8, solving more general cases is well beyond the scope of this paper, and we focus on a different aspect of the problem that is critical for proving asymptotic optimality of our policy. To use optimal solutions of the SP as parameters of inventory control policies, we need to show that these values are stable; that is, they do not change drastically in response to small fluctuations of inputs. The analysis should be applicable to the SP with a general number of stages and general BOMs.
To this end, we show that with a negligible loss of accuracy, the SP can be approximated by a finite-dimensional linear program (LP), which has a unique optimal solution that is Lipschitz continuous in input parameters. By characterizing and making use of the structure of constraints of the approximating LPs, we prove that their optimal solutions are Lipschitz continuous in inputs that change with the state of the ATO system, and importantly, the Lipschitz constant is independent of the problem size.
Step 3: Use the optimal solution of the proxy model to formulate inventory control policies for the original systems.
In Reiman and Wang (2015), a two-stage SP is used to set asymptotically optimal inventory policies for systems with identical lead times. Replenishment decisions follow a base stock policy with the base stock levels specified by the first stage SP solution. Allocation decisions are controlled by an allocation principle, which uses the second-stage SP solution to set backlog targets to dynamically choose the amount of each demand to serve. However, it is well known that base stock policies are inefficient for general ATO systems considered in this paper, which allow significantly different lead times (Zipkin 2000).
To the best of our knowledge, except for single-product ATO systems (Rosling 1989), an asymptotically optimal replenishment policy remains to be developed for systems with general deterministic lead times and general BOMs. We fill the gap in this paper by formulating such a policy, which uses the optimal solutions of the aforementioned multistage SP to set dynamic targets for inventory positions that determine order quantities. The policy generalizes previous work: it specializes to the policy in Rosling (1989) in systems with a single product and the base stock policy in Reiman and Wang (2015) in systems with identical lead times. We adopt the same allocation principle in Reiman and Wang (2015) with minor twists to fit with the new replenishment policy.
Step 4: Show the performance benefit of the policy by proving that it is asymptotically optimal.
For the special case of identical lead times, an SP-based policy has been shown to be asymptotically optimal (Reiman and Wang 2015). However, the proof relies on the fact that the constant inventory positions prescribed by the two-stage SP can be exactly followed by a base stock policy. Also, the state of the system under this policy is completely determined by the history within the previous lead time. In contrast, for systems with nonidentical lead times, the ideal inventory positions prescribed by the multistage SP may not always be attainable. The state of the system can be affected by the history in the unbounded past. Therefore, new techniques are needed to prove asymptotic optimality of our policy for general systems.

We introduce a broadly defined stochastic tracking model, which features a general target process and a general tracking process. We prove that the expected difference between these two processes converges to zero. We apply this model to compare the inventory position and backlog targets prescribed by the SP for reaching the cost lower bound with their actual levels under our policy. We show that these comparisons are special cases of the convergence results of this tracking model and use this to establish asymptotic optimality of our policy. We also perform simulations on several special ATO systems to illustrate that our policy performs well, even under “nonasymptotic” conditions.

The rest of the paper is organized as follows. We define the problem in Section 2. In Section 3, we formulate the SP, develop bounds on its optimal solutions (Theorem 1), and show that its optimal objective value is the same as that of the SP in Reiman and Wang (2012) and thus is a lower bound on the average inventory cost of the ATO system (Theorem 2). In Section 4, we present our inventory policy. In Section 5, we show that the policy is asymptotically optimal if the resulting inventory positions and backlog levels converge to their respective SP-based targets (Theorem 3). The latter convergence results are proven in Section 6 with the formulation and analysis of the aforementioned stochastic tracking model. There, Theorem 4 proves convergence for the general stochastic tracking model. Its corollaries show that inventory positions and backlog levels converge to their targets if the latter satisfy stability conditions that require them to be asymptotically Lipschitz continuous in changes of demands in some previous periods. In Section 7, we prove that the target stability conditions are indeed satisfied with the aforementioned development of finite-dimensional LP approximations. We support our analysis with numerical studies in Section 8 and conclude the paper in Section 9.

As for notation, $R^{l}$ and $R_{+}^{l}$ are, respectively, the sets of $l$-dimensional real vectors and nonnegative real vectors ($l \geq 1$). Their superscripts are omitted when $l = 1$. We define $1\{\cdot\}$ to be an indicator function, which equals one if the statement inside the braces is true and zero otherwise. The maximum and minimum of $x_{1}$ and $x_{2}$ are denoted by $x_{1} \lor x_{2}$ and $x_{1} \land x_{2}$, respectively, and $\max (x, 0)$ is denoted by $x^{+}$. Vector symbols are always in bold, and as two special vectors, $e_{j}$ is the unit vector with the $j$th element taking the value of unity, and $\vec{1}$ is the vector of all 1s (dimensions of both vectors depend on the context). The norm $\| x \|_{β}$ on $R^{l}$ is defined by $\| x \|_{β} = (\sum_{i = 1}^{l} | x_{i} |^{β})^{1 / β}$ for $l \geq 1$ and $β \geq 1$. For each pair of vectors $x_{1}$ and $x_{2}$, the maximum and minimum, $x_{1} \lor x_{2}$ and $x_{1} \land x_{2}$, are taken componentwise. Between any pair of vectors, $x_{1} \geq (\leq)\ x_{2}$ if every component of $x_{1}$ is greater (less) than or equal to its corresponding component in $x_{2}$, and $x_{1} \neq x_{2}$ unless each component in $x_{1}$ equals the corresponding component in $x_{2}$.

To improve the flow and avoid distracting from our main results, we leave proofs of all lemmas and some theorems in Appendix A. To highlight the major ideas of our work, we include proofs of key results, Theorems 3, 4, and 6, and related corollaries, in the main body of the paper.

2. Problem Formulation

We consider continuous-review ATO inventory systems. Inventories are nonperishable and unserved demands are always backlogged. Component lead times are deterministic but may differ from each other. Our policy and analysis apply to periodic-review systems but do not extend to cases with perishable inventories, lost sales, or stochastic lead times.

2.1. System

There are m products and n components. The BOM, given as an n × m nonnegative integer matrix, A, specifies the use of components by different products. Each element of A, $a_{ji}$, represents the amount of component j needed to assemble product i ($1 \leq j \leq n$, $1 \leq i \leq m$). Thus, the jth row of A, $A_{j}$, specifies the amounts of component j needed by all products ($1 \leq j \leq n$). Without loss of generality, we assume that there is at least one nonzero entry in every row and every column of A.

Figure 1 shows three ATO systems that are common in the literature. The W system has two products and three components, one of which is used by both products. Each product is assembled from one unit of the common component and one unit of a product-specific component. With a slight deviation from the aforementioned notation, the common component is referred to as component 0. The M system uses two components to build three products. Products 1 and 2 use one unit of components 1 and 2, respectively. The third product, referred to as product 0 in a slight deviation from the aforementioned notation, uses one unit of both components. The N system is a special case of both the W and M systems. It has two products and two components. Product 0 uses one unit of both components 0 and 1, and product 1 uses only one unit of component 0.

**Figure 1. Examples: The W, M, and N Systems**
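As a concrete reading of the three examples, the BOM matrices can be written out directly from the descriptions above. The following is a minimal sketch; the row and column orderings and the helper names `is_valid_bom` and `components_needed` are illustrative assumptions, not notation from the paper.

```python
# BOM matrices (rows = components, columns = products), read off the
# verbal descriptions of the W, M, and N systems. The ordering of rows
# and columns is an illustrative assumption.

# W system: components (0, 1, 2), products (1, 2); component 0 is common.
A_W = [
    [1, 1],  # component 0 (used by both products)
    [1, 0],  # component 1 (specific to product 1)
    [0, 1],  # component 2 (specific to product 2)
]

# M system: components (1, 2), products (0, 1, 2); product 0 uses both.
A_M = [
    [1, 1, 0],  # component 1
    [1, 0, 1],  # component 2
]

# N system: components (0, 1), products (0, 1).
A_N = [
    [1, 1],  # component 0: used by both products
    [1, 0],  # component 1: used by product 0 only
]


def is_valid_bom(A):
    """Check nonnegative integer entries and no all-zero row or column."""
    entries_ok = all(isinstance(a, int) and a >= 0 for row in A for a in row)
    rows_ok = all(any(row) for row in A)
    cols_ok = all(any(col) for col in zip(*A))
    return entries_ok and rows_ok and cols_ok


def components_needed(A, z):
    """Component usage A z when z[i] units of product i are assembled."""
    return [sum(a * zi for a, zi in zip(row, z)) for row in A]
```

For instance, `components_needed(A_W, [2, 3])` gives the usage of components 0, 1, and 2 when two units of product 1 and three units of product 2 are assembled.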

There are K distinct component lead times, $L_{1} < \dots < L_{K}$. Define $L_{0} = 0$ for notational convenience. Let $n_{k}$ be the number of components with lead time $L_{k}$ ($1 \leq k \leq K$). Components are indexed in ascending order of their lead times. Let $\bar{n}_{0} = 0$ and $\bar{n}_{k} = \sum_{k' \leq k} n_{k'}$ ($1 \leq k \leq K$). Observe that $\bar{n}_{K} = n$. Thus, $\{\bar{n}_{k - 1} + 1, \dots, \bar{n}_{k}\}$ are the indexes of components with lead time $L_{k}$ ($1 \leq k \leq K$). We associate each component j with an index $k_{j}$ ($1 \leq k_{j} \leq K$) such that $L_{k_{j}}$ is the lead time of component j ($1 \leq j \leq n$). Without loss of generality, we arrange the rows of A in an order such that the submatrix $A^{k}$, composed of rows $\bar{n}_{k - 1} + 1, \dots, \bar{n}_{k}$ of A, specifies the use of components with lead time $L_{k}$ ($1 \leq k \leq K$).
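The indexing convention above amounts to sorting components by lead time and cutting the BOM into row blocks. A minimal sketch follows; the function name `group_by_lead_time` and the list-of-rows representation are assumptions made for illustration.

```python
from itertools import groupby


def group_by_lead_time(A, lead_times):
    """Sort components by lead time and split the BOM into blocks A^k.

    A is a list of rows (one per component) and lead_times holds one
    value per row. Returns (L, n_bar, blocks), where
    L = [L_1 < ... < L_K], n_bar = [n_bar_0, ..., n_bar_K], and
    blocks[k - 1] holds the rows of A^k.
    """
    # Stable sort of component indexes in ascending order of lead time.
    order = sorted(range(len(A)), key=lambda j: lead_times[j])
    L, n_bar, blocks = [], [0], []
    for lead_time, idx in groupby(order, key=lambda j: lead_times[j]):
        rows = [A[j] for j in idx]
        L.append(lead_time)
        blocks.append(rows)
        n_bar.append(n_bar[-1] + len(rows))  # cumulative component count
    return L, n_bar, blocks
```

With the W system and hypothetical lead times of 3 for the common component and 1 for the two product-specific components, the call returns K = 2 blocks: the product-specific rows form $A^{1}$ and the common-component row forms $A^{2}$.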

Without loss of generality and for simplicity, we let the system start at a time when there is no inventory on-hand, no order in transit, and no backlog, and define that time to be $t = - L_{K}$. Demand arrives according to an integer-vector-valued compound Poisson process

D (t) = (D_{1} (t), \dots, D_{m} (t)), \quad t \geq - L_{K},

where $D_{i} (t)$ is the amount of demand for product i ($1 \leq i \leq m$) that arrives during $[- L_{K}, t]$. The process $\{D (t), t \geq 0\}$ is right continuous. The number of demand orders arriving during $[- L_{K}, t]$ ($t \geq - L_{K}$) is a Poisson process $Λ = \{Λ (t), t \geq - L_{K}\}$, and there is an associated independent and identically distributed (i.i.d.) sequence of random vectors that give order sizes. A generic element of this sequence is denoted by $S = (S_{1}, S_{2}, \dots, S_{m})$, where $S_{i}$ is the order size for product i. Although the elements of the sequence (the order size vectors) are independent, the components $(S_{1}, S_{2}, \dots, S_{m})$ within each vector can be dependent. Let $\underline{λ} = E [Λ (1) - Λ (0)]$ denote the order arrival rate. Mean demand arriving within a unit of time is

μ = (μ_{1}, \dots, μ_{m}) ≔ E [D (1) - D (0)] = \underline{λ} E [S] .

The covariance matrix of $(D (1) - D (0))$ is denoted by $Σ$ , of which the diagonal elements, σ_ii, are variances of demand i ( $1 \leq i \leq m$ ) over a unit of time. Because the demand process is stationary, $μ$ and $Σ$ are also, respectively, the means and the covariance matrix of demands over $[t, t + 1]$ ( $t \geq - L_{K}$ ).
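To make the demand model concrete, the sketch below samples one unit-time increment of a compound Poisson process and computes its first two moments for a finitely supported order-size distribution. The mean formula is the one stated above; the covariance formula $Σ = \underline{λ} E [S S']$ is a standard property of compound Poisson processes that the text does not state explicitly. The function names and the example distribution are hypothetical.

```python
import random


def unit_demand(lam, m, draw_order_size, rng):
    """One sample of D(1) - D(0) for a compound Poisson demand process.

    lam is the order arrival rate, m the number of products, and
    draw_order_size(rng) returns one order-size vector S of length m.
    """
    total = [0] * m
    # Order arrivals in a unit interval, generated from exponential gaps.
    clock = rng.expovariate(lam)
    while clock <= 1.0:
        s = draw_order_size(rng)
        total = [a + b for a, b in zip(total, s)]
        clock += rng.expovariate(lam)
    return total


def theoretical_moments(lam, sizes, probs):
    """Return (mu, Sigma) with mu = lam * E[S] and Sigma = lam * E[S S'].

    sizes lists the possible order-size vectors and probs their
    probabilities (a finitely supported S is assumed for illustration).
    """
    m = len(sizes[0])
    mu = [lam * sum(p * s[i] for s, p in zip(sizes, probs)) for i in range(m)]
    sigma = [
        [lam * sum(p * s[i] * s[j] for s, p in zip(sizes, probs))
         for j in range(m)]
        for i in range(m)
    ]
    return mu, sigma
```

A usage example: with `sizes = [(1, 0), (0, 1), (1, 1)]`, `probs = [0.5, 0.25, 0.25]`, and `lam = 2.0`, an order requests product 1 only, product 2 only, or both, so the two order sizes are dependent and the off-diagonal entry of the covariance matrix is positive.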

We assume that S_i has a finite moment of order 6, that is,

η_{i} ≔ E [S_{i}^{6}] < \infty, 1 \leq i \leq m .

As will be evident from the proof of Theorem 4, we assume a finite moment of order 6 for a similar reason as that in Huh et al. (2009), which is to use the moment to bound the difference between two stochastic processes. (Beyond that, both the time horizon and stochastic processes involved are completely different between the two studies.)

2.2. Inventory Control Problem

Inventory control includes both replenishment and allocation decisions. For components with lead time $L_{k}$, the replenishment starts from time $- L_{k}$ and is defined by $R^{k} (t)$ ($t \geq - L_{k}$), an integer-valued vector process with $n_{k}$ components. Each component of $R^{k} (t)$, $R_{j} (t)$, represents the amount of component j ordered over the period $[- L_{k}, t]$ ($\bar{n}_{k - 1} + 1 \leq j \leq \bar{n}_{k}$). In this context, $R^{k} (t)$ ($t \geq - L_{k}$) should be nonnegative and nondecreasing, which we assume in the paper. Orders are placed at distinct points of time. Hence, we assume the process is right-continuous.

The allocation decision is specified via an m-dimensional, integer-valued process $Z (t)$. Each component of $Z (t)$, $Z_{i} (t)$ ($t \geq - L_{K}$), represents the amount of demand for product i ($1 \leq i \leq m$) served over the period $[- L_{K}, t]$. We assume that $Z (t)$ ($t \geq - L_{K}$) is also nonnegative, nondecreasing, and right-continuous.

Both $R^{k} (t)$ ( $t \geq - L_{k}, 1 \leq k \leq K$ ) and $Z (t)$ ( $t \geq 0$ ) must be adapted to the filtration generated by the initial states of the system, as well as $D (s)$ ( $- L_{K} \leq s \leq t$ ), $R^{k} (s)$ ( $- L_{k} \leq s < t, 1 \leq k \leq K$ ), and $Z (s)$ ( $- L_{K} \leq s < t$ ), which means that policies under our consideration are nonanticipating.

For the inventory control to be feasible, at any point of time, the total amounts of demand served cannot exceed the amounts that have arrived. The amounts of components used cannot exceed the amounts that have been received. Because the replenishment of components with the lead time L_k starts at time $- L_{k}$ $(1 \leq k \leq K)$ , no unit is received and thus no demand can be served before time 0. In our model, these restrictions are formalized by the following assumptions:

\begin{aligned} & Z (t) = 0, \quad t < 0, \qquad Z (t) \leq D (t), \quad t \geq 0, \\ & \text{and} \quad A^{k} Z (t) \leq R^{k} (t - L_{k}), \quad 1 \leq k \leq K, \ t \geq 0 . \end{aligned}

(1)

Observe that by definition, $R^{k} (t - L_{k})$ $(t \geq 0, 1 \leq k \leq K)$ are the amounts of components ordered at least one lead time before time t.

For $t \geq 0$ , the inventory level of components with the lead time L_k is

I^{k} (t) ≔ R^{k} (t - L_{k}) - A^{k} Z (t), k = 1, \dots, K,

(2)

and the backlog level is

B (t) ≔ D (t) - Z (t) .

(3)

By (1), both $I^{k} (t)$ $(1 \leq k \leq K)$ and $B (t)$ $(t \geq 0)$ are nonnegative. Also by (1) and our assumption about the replenishment starting times,

I^{k} (t) = 0, \quad - L_{k} \leq t < 0, \quad \text{and} \quad B (t) = D (t), \quad t < 0 .

Let b_i be the cost of backlogging one unit of demand of product i ( $1 \leq i \leq m$ ) per unit of time. Let h_j be the cost of holding one unit of inventory of component j ( $1 \leq j \leq n$ ) per unit of time. We assume that h_j ( $1 \leq j \leq n$ ) and b_i ( $1 \leq i \leq m$ ) are strictly positive. Let

b = (b_{1}, \dots, b_{m}) \quad \text{and} \quad h^{k} = (h_{{\bar{n}}_{k - 1} + 1}, \dots, h_{{\bar{n}}_{k}}), \quad 1 \leq k \leq K .

Then at each time t ( $t \geq 0$ ), the system incurs the total inventory cost at the expected rate

C (t) = \sum_{k = 1}^{K} h^{k} \cdot E [I^{k} (t)] + b \cdot E [B (t)] .

(4)

The problem of inventory control is to develop integer-valued, nonnegative, nondecreasing, right-continuous, and nonanticipating vector processes $R^{k} (t)$ ( $t \geq - L_{k}, 1 \leq k \leq K$ ) and $Z (t)$ ( $t \geq - L_{K}$ ), subject to (1)–(3), to minimize the following long-run average expected total inventory cost:

C = \limsup_{T \to \infty} \frac{1}{T} \int_{0}^{T} C (t) \, d t .

(5)
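The feasibility constraint (1), the state equations (2) and (3), and the cost rate can be illustrated at a single time instant. The sketch below works with realized (sample-path) quantities, so `cost_rate` is the quantity inside the expectations in (4) rather than $C (t)$ itself; all function names and the numbers in the usage note are hypothetical.

```python
def mat_vec(A, z):
    """Matrix-vector product A z for a matrix stored as a list of rows."""
    return [sum(a * zi for a, zi in zip(row, z)) for row in A]


def snapshot(A_blocks, received, D_t, Z_t):
    """Return (I, B) at some time t >= 0 after checking constraint (1).

    A_blocks[k] holds A^k, received[k] holds R^k(t - L_k), and D_t, Z_t
    are the cumulative demand D(t) and cumulative served demand Z(t).
    """
    if any(z > d for z, d in zip(Z_t, D_t)):
        raise ValueError("served more demand than has arrived")
    I = []
    for A_k, r_k in zip(A_blocks, received):
        used = mat_vec(A_k, Z_t)
        if any(u > r for u, r in zip(used, r_k)):
            raise ValueError("used more components than were received")
        I.append([r - u for r, u in zip(r_k, used)])  # equation (2)
    B = [d - z for d, z in zip(D_t, Z_t)]  # equation (3)
    return I, B


def cost_rate(h_blocks, b, I, B):
    """Holding plus backlog cost rate for one realized state (I, B)."""
    holding = sum(
        h * i for h_k, I_k in zip(h_blocks, I) for h, i in zip(h_k, I_k)
    )
    backlog = sum(b_i * B_i for b_i, B_i in zip(b, B))
    return holding + backlog
```

For a W system whose product-specific components share the shorter lead time, taking `A_blocks = [[[1, 0], [0, 1]], [[1, 1]]]`, `received = [[4, 3], [6]]`, `D_t = [5, 4]`, and `Z_t = [3, 3]` leaves one unit of component 1 on hand and backlogs of two and one units. Averaging `cost_rate` over time and over sample paths would approximate the objective in (5).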

2.3. Additional Variables, Processes, and Relationships

For the needs of our analysis, we introduce additional variables and processes and specify their relationships.

2.3.1. Variables and Processes.

The additional variables and processes, defined in terms of the more “primitive” processes introduced in Sections 2.1 and 2.2, are listed in Table 1 along with their definitions. For the convenience of the reader, Table 1 also includes the definitions of the inventory and backlog processes defined in Section 2.2. Further discussions of these quantities are as follows:

Table 1. Key Variables: Notation and Value

| Category | Variables | Definitions |
| --- | --- | --- |
| Demand | $D (t_{1}, t_{2})$, $- L_{K} \leq t_{1} < t_{2}$ | $≔ D (t_{2}) - D (t_{1})$ |
| | $D^{k} (t)$, $t \geq - L_{K} + L_{k}$, $1 \leq k \leq K$ | $≔ D (t - L_{k - 1}) - D (t - L_{k})$ |
| | $\bar{D}^{k} (t)$, $t \geq 0$, $1 \leq k \leq K$ | $≔ D (t - L_{k - 1}) - D (t - L_{K})$ |
| | $\bar{D} (t)$, $t \geq 0$ | $≔ D (t) - D (t - L_{K})$ |
| | $d (t)$, $t \geq - L_{K}$ | $≔ \begin{cases} D (- L_{K}), & t = - L_{K} \\ D (t) - D (t^{-}), & t > - L_{K} \end{cases}$ |
| | $D^{k}$, $1 \leq k \leq K$ | $\overset{d}{=} D^{k} (t)$ |
| | $\bar{D}^{k}$, $1 \leq k \leq K$ | $\overset{d}{=} \bar{D}^{k} (t)$ |
| | $\bar{D}$ | $\overset{d}{=} \bar{D} (t)$ |
| | $\underline{D}^{k}$, $1 \leq k \leq K$ | $≔ D^{k} + \dots + D^{1} \overset{d}{=} D (t - L_{k}, t)$ |
| Replenishment | $\mathcal{R}^{k} (t)$, $t \geq - L_{k}$, $1 \leq k \leq K$ | $≔ \begin{cases} R^{k} (t) - R^{k} (t - L_{k}), & t \geq 0 \\ R^{k} (t), & t < 0 \end{cases}$ |
| Allocation | $Z (t_{1}, t_{2})$, $- L_{K} \leq t_{1} < t_{2}$ | $≔ Z (t_{2}) - Z (t_{1})$ |
| Inventory | $I^{k} (t)$, $t \geq 0$, $1 \leq k \leq K$ | $≔ R^{k} (t - L_{k}) - A^{k} Z (t)$ |
| Backlog | $B (t)$, $t \geq 0$ | $≔ D (t) - Z (t)$ |
| | $B^{-} (t)$, $t \geq 0$ | $≔ \begin{cases} D (t), & t = 0 \\ B (t^{-}) + d (t), & t > 0 \end{cases}$ |
| Inventory position | $IP^{k} (t)$, $t \geq - L_{k}$, $1 \leq k \leq K$ | $≔ R^{k} (t) - A^{k} D (t)$ |
| | $IP^{k -} (t)$, $t \geq - L_{k}$, $1 \leq k \leq K$ | $≔ \begin{cases} - A^{k} D (- L_{k}), & t = - L_{k} \\ IP^{k} (t^{-}) - A^{k} d (t), & t > - L_{k} \end{cases}$ |
| Inventory balance | $Q^{k} (t)$, $t \geq 0$, $1 \leq k \leq K$ | $≔ A^{k} D (t) - R^{k} (t - L_{k})$ |

The first processes we introduce here denote increments of the processes $D (t)$ ($t \geq - L_{K}$), $R^{k} (t)$ ($t \geq - L_{k}$, $1 \leq k \leq K$), and $Z (t)$ ($t \geq - L_{K}$) over particular time intervals. Here $D (t_{1}, t_{2})$ denotes demand that arrives during the time interval $(t_{1}, t_{2}]$ ($- L_{K} \leq t_{1} < t_{2}$), $D^{k} (t)$ are special cases of $D (t_{1}, t_{2})$ with $t_{1} = t - L_{k}$ and $t_{2} = t - L_{k - 1}$ ($t \geq - L_{K} + L_{k}$, $1 \leq k \leq K$), and

{\bar{D}}^{k} (t) = \sum_{k^{'} = k}^{K} D^{k^{'}} (t), t \geq 0 .

We also use $\bar{D} (t)$ as a shorthand for ${\bar{D}}^{1} (t)$ ($t \geq 0$). (Recall that $L_{0} = 0$.) We also introduce some random vectors. Let $D^{k}$ and ${\bar{D}}^{k}$ be random vectors that follow the same distributions as $D^{k} (t)$ and ${\bar{D}}^{k} (t)$ ($1 \leq k \leq K$, $t \geq 0$), respectively. Let $\bar{D}$ be a random vector that follows the same distribution as $\bar{D} (t)$. We also define

{\underline{D}}^{k} ≔ D^{k} + \dots + D^{1},

which follows the same distribution as $D (t - L_{k}, t)$ ($t \geq - L_{K} + L_{k}$, $1 \leq k \leq K$). By stationarity of $D (t)$, these distributions do not depend on t. Recall that we assumed that the demand process is right continuous. We let $d (t)$ denote the demand that arrives at time $t$, $t \geq - L_{K}$ (if any).

Similarly, $\mathcal{R}^{k} (t)$ ($1 \leq k \leq K$), defined in Table 1 as $R^{k} (t) - R^{k} (t - L_{k})$ for $t \geq 0$ and as $R^{k} (t)$ for $t < 0$, denotes the amounts of components ordered over the previous lead time, or for $t < 0$, since the beginning of the replenishment process. It is written in calligraphic type to distinguish it from the cumulative order process $R^{k} (t)$ of Section 2.2. Each component of $\mathcal{R}^{k} (t)$, $\mathcal{R}_{j} (t)$ ($t \geq - L_{k}$, $\bar{n}_{k - 1} + 1 \leq j \leq \bar{n}_{k}$), represents the amount of component j that is in transit: ordered from the supplier but not yet arrived. The amounts of demand served over the interval $(t_{1}, t_{2}]$ are denoted by $Z (t_{1}, t_{2})$ ($- L_{K} \leq t_{1} < t_{2}$).

Next we introduce two additional processes: inventory position $IP^{k} (t)$ and component balance $Q^{k} (t)$ ($t \geq - L_{k}$, $1 \leq k \leq K$). For $k = 1, \dots, K$, entries of $IP^{k} (t)$, $IP_{j} (t)$ ($\bar{n}_{k - 1} + 1 \leq j \leq \bar{n}_{k}$), are the differences between the total amounts of components that have been ordered up to time t ($t \geq - L_{k}$) and the total amounts needed to serve all demands that have arrived by that time. Entries of $Q^{k} (t)$, $Q_{j} (t)$ ($\bar{n}_{k - 1} + 1 \leq j \leq \bar{n}_{k}$), are the differences between the amounts needed to serve demands and the total amounts that have been ordered and received.

Recall that the replenishment and allocation processes (which are the control processes) are assumed to be integer valued and right continuous. Thus, the controls are exercised at discrete time points. It is helpful to distinguish the values of certain processes immediately before these actions are taken. To this end, we let $IP^{k -} (t)$ ($1 \leq k \leq K$) denote the inventory positions at time t ($t \geq - L_{k}$) before placing any order at that time, and let $B^{-} (t)$ denote the backlog levels at time t ($t \geq - L_{K}$) before serving any demand at that time. In Table 1, $d (t)$, $IP^{k -} (t)$, and $B^{-} (t)$ are defined using the left limits $D (t^{-})$ ($t > - L_{K}$), $IP^{k} (t^{-})$ ($t > - L_{k}$, $1 \leq k \leq K$), and $B (t^{-})$ ($t > - L_{K}$). These left limits exist because in (2) and (3), $D (t)$ ($t \geq - L_{K}$), $R^{k} (t)$ ($t \geq - L_{k}$, $1 \leq k \leq K$), and $Z (t)$ ($t \geq - L_{K}$) are nondecreasing and right-continuous. One may also observe that by definition,

\begin{aligned} IP^{k} (t) &= IP^{k -} (t) + R^{k} (t) - R^{k} (t^{-}), \quad t \geq - L_{k}, \ 1 \leq k \leq K, \\ \text{and} \quad B (t) &= B^{-} (t) - (Z (t) - Z (t^{-})), \quad t \geq - L_{K} . \end{aligned}

2.3.2. Relationships.

The key definitions (2) and (3), along with the definitions introduced in Section 2.3.1, allow us to obtain relationships that are useful in our analysis. Using (2) and (3), and the Table 1 definitions of the in-transit amounts $\mathcal{R}^{k} (t)$ (equal to $R^{k} (t) - R^{k} (t - L_{k})$ for $t \geq 0$ and to $R^{k} (t)$ for $t < 0$) and of the served amounts $Z (t_{1}, t_{2})$, changes of inventory and backlog levels over a lead time $L_{k}$ can be determined by

\begin{aligned} I^{k} (t) &= I^{k} (t - L_{k}) + \mathcal{R}^{k} (t - L_{k}) - A^{k} Z (t - L_{k}, t), \quad t \geq 0, \ 1 \leq k \leq K, \\ \text{and} \quad B (t) &= B (t - L_{k}) + D (t - L_{k}, t) - Z (t - L_{k}, t), \quad t \geq - L_{K} + L_{k} . \end{aligned}

(6)

Again using (2) and (3), along with the definitions in the table of the in-transit amounts $\mathcal{R}^{k} (t)$ (equal to $R^{k} (t) - R^{k} (t - L_{k})$ for $t \geq 0$) and of $IP^{k} (t)$ ($t \geq - L_{k}$, $1 \leq k \leq K$),

\begin{aligned} IP^{k} (t) &= I^{k} (t) + (R^{k} (t) - R^{k} (t - L_{k})) - A^{k} B (t) \\ &= I^{k} (t) + \mathcal{R}^{k} (t) - A^{k} B (t), \quad t \geq - L_{k} . \end{aligned}

(7)

The second line corresponds to another definition of the inventory position in the literature: the total amount of the component in the system, including both on-hand inventory and orders in-transit, minus the amount needed to clear all existing backlog at that time.

Yet again using (2) and (3), along with the definition of $Q^{k} (t)$ in the table, and then applying (6) and (7) to replace $B (t)$ and $I^{k} (t)$ ,

\begin{aligned} Q^{k} (t) &= A^{k} B (t) - I^{k} (t) \\ &= A^{k} D (t - L_{k}, t) - IP^{k} (t - L_{k}), \quad t \geq 0, \ 1 \leq k \leq K . \end{aligned}

(8)

Observe that the negative of an entry of $Q^{k} (t)$,

- Q_{j} (t) = I_{j} (t) - A_{j} \cdot B (t), {\bar{n}}_{k - 1} + 1 \leq j \leq {\bar{n}}_{k},

is commonly referred to as the net inventory of component j at time t $(t \geq 0)$: there is a shortage (surplus) of component j to clear existing backlogs at time t if $Q_{j} (t) > (<) 0$.

2.3.3. Other Parameters.

Define

c = (c_{1}, \dots, c_{m}) \equiv b + \sum_{k = 1}^{K} {(A^{k})}^{'} h^{k},

(9)

where element $c_{i}$ of $c$ represents the amount of inventory cost that can be removed from the system by serving one unit of demand i ($1 \leq i \leq m$).

For convenience, we define additional variables to denote the smallest (nonzero) and largest components of $A$, $b$, $h$, and $c$ in Table 2 below.
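As a small illustration of (9) and of the constants listed in Table 2, the following sketch computes $c$ for a two-product, two-component instance. The numerical data are hypothetical, and the function names `effective_cost` and `table2_constants` are ours. We assume that the blocks $A^{k}$ and $h^{k}$ are stacked over k into a single usage matrix `A` (rows are components, columns are products) and a single holding cost vector `h`, so that the sum in (9) becomes one matrix-vector product.

```python
def effective_cost(b, h, A):
    """Return c = b + A' h as in (9).

    b: unit backlog costs, one per product (length m)
    h: unit holding costs, one per component (length n)
    A: n-by-m usage matrix, A[j][i] = units of component j per unit of product i
    """
    m, n = len(b), len(h)
    return [b[i] + sum(A[j][i] * h[j] for j in range(n)) for i in range(m)]


def table2_constants(b, h, A):
    """Smallest (nonzero) and largest entries, mirroring Table 2."""
    entries = [a for row in A for a in row]
    c = effective_cost(b, h, A)
    return {
        "a_min": min(a for a in entries if a > 0),
        "a_max": max(entries),
        "h_max": max(h),
        "h_min": min(h),
        "b_max": max(b),
        "b_min": min(b),
        "c_max": max(c),
    }


# Hypothetical data: product 1 uses component 1 only, product 2 uses both.
A = [[1, 1],
     [0, 1]]
b = [4.0, 6.0]
h = [1.0, 2.0]

c = effective_cost(b, h, A)
consts = table2_constants(b, h, A)
```

Serving one unit of product 2 in this instance removes its backlog cost and frees the holding cost of one unit of each component, which is why its entry of `c` is the larger one.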

3. Stochastic Program

As we outlined in Section 1, the first step of our analysis is to develop a multistage SP that provides a lower bound on the average inventory cost and sets the stage for developing inventory control policies to drive the cost toward that bound. To this end, we first present a relevant SP from the literature and discuss the intuition that underlies its formulation in Section 3.1. In Section 3.2, we transform this SP into an alternative SP that we use for policy development in Section 4.

3.1. Previous Result

For the ATO systems formulated in the last section, theorem 1 in Reiman and Wang (2012) proves that

\underline{C} = \inf_{α \geq 0} \{b \cdot α + Φ^{K} (α)\} + b \cdot E [\bar{D}],

(10)

is a lower bound on $C$, the long-run average cost defined in (5). Here $\bar{D}$ is defined in Table 1, and $Φ^{K} (α)$ is the minimum objective value of the K + 1 stage stochastic program (SP):

\begin{array}{rl} Φ^{K} (α) & = \inf_{y^{K} \geq 0} \{h^{K} \cdot y^{K} + E [Φ^{K - 1} (y^{K}, α + D^{K})]\}, \\ Φ^{k} (y^{k + 1}, \dots, y^{K}, x) & = \inf_{y^{k} \geq 0} \{h^{k} \cdot y^{k} + E [Φ^{k - 1} (y^{k}, \dots, y^{K}, x + D^{k})]\}, \quad k = K - 1, \dots, 1, \\ Φ^{0} (y^{1}, \dots, y^{K}, x) & = - \max_{z \geq 0} \{c \cdot z \mid z \leq x, A^{k} z \leq y^{k}, 1 \leq k \leq K\}, \end{array}

(11)

where $D^{k}$ ($1 \leq k \leq K$) are also defined in Table 1.

We now take the rest of this section to provide some intuition as to why the SP (10)–(11) yields a lower bound on the cost in the ATO inventory control problem. The basic idea is that the SP (10)–(11) is a relaxation of the actual ATO inventory control problem. The SP corresponds to myopically focusing on a single point of time, with no concern for the effect that any replenishment or allocation decision might have on costs at other points of time. Thus, in the presence of random demand, all decisions are put off until the last possible moment, allowing as much information about actual demand as possible to be used for the decisions.

To explain the SP in more detail, using (6) and (9), with k = K when substituting for $B (t)$ and

Z (t - L_{k}, t) = Z (t - L_{K}, t) - Z (t - L_{K}, t - L_{k})

when substituting for $I^{k} (t)$ $(1 \leq k \leq K)$, we can write the inventory cost at any time t $(t \geq 0)$ as

\begin{array}{l} \sum_{k = 1}^{K} h^{k} \cdot I^{k} (t) + b \cdot B (t) = b \cdot B (t - L_{K}) - c \cdot Z (t - L_{K}, t) \\ + \sum_{k = 1}^{K} h^{k} \cdot [I^{k} (t - L_{k}) + R^{k} (t - L_{k}) + A^{k} Z (t - L_{K}, t - L_{k})] + b \cdot D (t - L_{K}, t) . \end{array}

The cost at t depends only on states and processes over the period $[t - L_{K}, t]$ . Comparing (10)–(11) with this expression for the cost, $α$ corresponds to the initial backlog levels $B (t - L_{K})$ and $z$ corresponds to $Z (t - L_{K}, t)$ , the amounts of demand served in the period $[t - L_{K}, t]$ . For components with lead time L_k, $I^{k} (t - L_{k}) + R^{k} (t - L_{k}) + A^{k} Z (t - L_{K}, t - L_{k})$ includes the amounts in inventory at time $t - L_{k}$ , the amounts that will arrive between $t - L_{k}$ and t from the pipeline, and the amounts used in the period $[t - L_{K}, t - L_{k}]$ $(1 \leq k \leq K)$ . Corresponding to the sum of these three quantities, $y^{k}$ represents the total amounts of these components that can be used to serve demands in period $[t - L_{K}, t]$ .

In (10)–(11), $α, z$ , and $y^{k}$ $(1 \leq k \leq K)$ are chosen to minimize the inventory cost with no constraint other than the ones that must be satisfied by the aforementioned counterparts of these variables in the ATO system. By (1)–(3), $B (t)$ and $I^{k} (t)$ $(1 \leq k \leq K, t \geq 0$ ) are nonnegative, so using (6):

Z (t - L_{K}, t) \leq B (t - L_{K}) + D (t - L_{K}, t),

and

\begin{array}{rl} A^{k} Z (t - L_{k}, t) & \leq I^{k} (t - L_{k}) + R^{k} (t - L_{k}), \\ \text{i.e.,} \quad A^{k} Z (t - L_{K}, t) & \leq I^{k} (t - L_{k}) + R^{k} (t - L_{k}) + A^{k} Z (t - L_{K}, t - L_{k}), \quad 1 \leq k \leq K . \end{array}

Correspondingly, in (11),

z \leq α + \bar{D} = α + D^{K} + \dots + D^{1} \quad \text{and} \quad A^{k} z \leq y^{k} \ (1 \leq k \leq K) .

The information constraint in the SP is less obvious, as the sequential/recursive nature of its formulation both encodes and potentially obscures the information available when certain decisions are made. Although it is not needed in the definition of the SP, the information available when decisions are made in the SP can be described via a discrete filtration

F_{K} \subseteq F_{K - 1} \subseteq \dots \subseteq F_{0},

(12)

where $F_{K} = \{\emptyset, Ω\}$ and $F_{k}$ is the σ-field generated by $\{D^{K}, \dots, D^{k + 1}, y^{K}, \dots, y^{k + 1}\}$ ($0 \leq k < K$). The set of optimal values of $y^{k}$ is $F_{k}$-measurable ($1 \leq k \leq K$) and that of $z$ is $F_{0}$-measurable.

The filtration $F_{k}$ $(0 \leq k \leq K)$ imitates the information available in the associated ATO system and is generated by both $D^{k^{'}}$ and $y^{k^{'}}$, corresponding, respectively, to histories of demand arrivals in periods $[t - L_{k^{'}}, t - L_{k^{'} - 1}]$ and decision making at times $t - L_{k^{'}}$ $(k < k^{'} \leq K)$. In ATO systems, $t - L_{k}$ is the last moment of decision making that can affect the value of $I^{k} (t - L_{k}) + R^{k} (t - L_{k}) + A^{k} Z (t - L_{K}, t - L_{k})$, so letting $y^{k}$ be adapted to $F_{k}$ allows its value to be chosen with the maximum amount of information. This adaptedness is not explicitly enforced in the SP but is implicit in the recursive structure of the SP. When $Φ^{k}$ (which corresponds to stage $K + 1 - k$) is solved to obtain $y^{k}$, the prior decisions $y^{K}, \dots, y^{k + 1}$ are all known, and $x = D^{K} + \dots + D^{k + 1}$ is known as well. Similarly, t is the last moment of decision making that can affect $Z (t)$, so the choice of $z$ is adapted to $F_{0}$. It is not surprising that, with $α$, $z$, and $y^{k}$ $(1 \leq k \leq K)$ chosen under the minimum constraints and maximum information, (10) is a lower bound on the expected cost of the ATO system at any given time t and hence a lower bound on its average cost $C$.

3.2. New Development

The SP in (11) is not directly applicable to our analysis. As one may observe from (10)–(11), and as is shown in the proof of Theorem 2, $b \cdot α + Φ^{K} (α)$ decreases in $α$ , and strictly so in many cases. Therefore, the lower bound $\underline{C}$ is not directly computable by solving (11) with any fixed $α$ . To reach the infimum in (10), $α$ , and thus $y^{k}$ ( $1 \leq k \leq K$ ) and $z$ often have to approach infinity, so one cannot make use of the optimal values of these decision variables for policy development.

To address this issue, we transform (11) into an alternative SP by replacing $y^{k}$ in (11) with $y^{k} - A^{k} α$ $(1 \leq k \leq K)$ and $z$ with $z - α$ and, following (10), letting $α$ approach infinity. The transformation removes $α$ and the nonnegativity constraints in (11) and leads to

\begin{array}{rl} φ^{K} & = \inf_{y^{K} \in \mathbb{R}^{n_{K}}} \{h^{K} \cdot y^{K} + E [φ^{K - 1} (y^{K}, D^{K})]\}, \\ φ^{k} (y^{k + 1}, \dots, y^{K}, x) & = \inf_{y^{k} \in \mathbb{R}^{n_{k}}} \{h^{k} \cdot y^{k} + E [φ^{k - 1} (y^{k}, \dots, y^{K}, x + D^{k})]\}, \quad 1 \leq k < K, \\ φ^{0} (y^{1}, \dots, y^{K}, x) & = - \max_{z \in \mathbb{R}^{m}} \{c \cdot z \mid z \leq x, A^{k} z \leq y^{k}, 1 \leq k \leq K\} . \end{array}

(13)

For any feasible solution $α$, $y^{k}$ $(1 \leq k \leq K)$, and $z$ in (11), $y^{k} - A^{k} α$ $(1 \leq k \leq K)$ and $z - α$ form a feasible solution to (13). By (9), the objective value of this transformed solution in (13) equals that of the original solution in (11) plus $b \cdot α$, so $φ^{K} \leq b \cdot α + Φ^{K} (α)$ for all $α \geq 0$. Therefore, (13) can also be used to set a lower bound on the inventory cost of an ATO system, a result that we will formally present in Theorem 2. Here, we first prove that the optimal solution of (13) is bounded.

Theorem 1

(See Appendix A for Proof). For any given values of $y^{k + 1}, \dots, y^{K}$, and $x$, if $y^{k *} = (y_{{\bar{n}}_{k - 1} + 1}^{*}, \dots, y_{{\bar{n}}_{k}}^{*})$ is an optimal solution to the SP in (13) at stage k, that is,

h^{k} \cdot y^{k *} + E [φ^{k - 1} (y^{k *}, y^{k + 1}, \dots, y^{K}, x + D^{k})]

attains the value of $φ^{k} (y^{k + 1}, \dots, y^{K}, x)$ $(k = K, \dots, 1)$, then

- \underline{β} \left(\| x \|_{1} + E [\| {\underline{D}}^{k} \|_{1}] + \sum_{l = k + 1}^{K} \| y^{l} \|_{1}\right) \leq y_{j}^{*} \leq \bar{β} \left(\| x \|_{1} + E [\| {\underline{D}}^{k} \|_{1}] + 1\right), \quad {\bar{n}}_{k - 1} < j \leq {\bar{n}}_{k},

(14)

where ${\underline{D}}^{k}$ ($1 \leq k \leq K$) are random vectors defined in Table 1, and $\underline{β}$ and $\bar{β}$ are any constants that satisfy

\underline{β} \geq n \frac{{\bar{a}}^{2}}{\underline{a}} \frac{\bar{c}}{\underline{b}} \quad \text{and} \quad \bar{β} \geq \frac{\bar{a}}{\underline{a}} \frac{\bar{c}}{\underline{h}},

with constants on the right-hand side (RHS) of the inequalities given in Table 2.

Table 2. Smallest and Largest Components of Relevant Vectors and Matrix

| Symbol | $\underline{a}$ | $\bar{a}$ | $\bar{h}$ | $\underline{h}$ | $\bar{b}$ | $\underline{b}$ | $\bar{c}$ |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Definition | $\min_{i, j : a_{i j} > 0} \{a_{i j}\}$ | $\max_{i, j} \{a_{i j}\}$ | $\max_{j} \{h_{j}\}$ | $\min_{j} \{h_{j}\}$ | $\max_{i} \{b_{i}\}$ | $\min_{i} \{b_{i}\}$ | $\max_{i} \{c_{i}\}$ |

The formulation in (13) helps us to avoid the aforementioned issue with (10)–(11). By (14), the optimal solutions to the new SP are finite, so they can and will be used as parameters of our inventory policy. Moreover, instead of taking the infimum in (10), we can use the optimal objective value of (13), which is directly computable, to set the same lower bound on the cost objective.

Theorem 2

(See Appendix A for Proof). Let $\underline{C}$ be the lower bound defined in (10) and $φ^{K}$ be the optimal objective value of the SP defined in (13). Then

\underline{C} = φ^{K} + b \cdot E [\bar{D}] .

(15)
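To make (13) and (15) concrete, consider the simplest special case, which is our own illustration rather than part of the analysis: one product, one component, unit usage, and a single lead time (K = 1). The last stage then gives $φ^{0} (y, x) = - c \min (x, y)$ with $c = b + h$, and (13) reduces to a newsvendor problem. The sketch below evaluates it for a hypothetical discrete lead time demand distribution; the function names and the data are assumptions made for illustration only.

```python
def phi0(y, x, c):
    # Last stage for one product, one component, unit usage:
    # the optimal z is min(x, y), so phi^0 = -c * min(x, y).
    return -c * min(x, y)


def solve_single_item(h, b, demand_pmf):
    """Return (y*, phi^1, lower bound) for the single-item version of (13).

    demand_pmf maps each possible lead time demand value to its probability.
    The objective is piecewise linear and convex in y with kinks at the
    support points, so it suffices to search over the support.
    """
    c = b + h
    best_y, best_val = None, float("inf")
    for y in sorted(demand_pmf):
        val = h * y + sum(p * phi0(y, d, c) for d, p in demand_pmf.items())
        if val < best_val - 1e-12:
            best_y, best_val = y, val
    mean_demand = sum(p * d for d, p in demand_pmf.items())
    return best_y, best_val, best_val + b * mean_demand  # last entry is (15)


# Hypothetical data: demand uniform on {0, 1, 2, 3}, h = 1, b = 2.
pmf = {0: 0.25, 1: 0.25, 2: 0.25, 3: 0.25}
y_star, phi_value, lower_bound = solve_single_item(h=1.0, b=2.0, demand_pmf=pmf)
```

In this instance the lower bound returned through (15) coincides with the classical newsvendor cost $h E [(y - D)^{+}] + b E [(D - y)^{+}]$ evaluated at the minimizing $y$, which is the expected behavior for a single-item system.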

For future analysis, we replace $z$ with $B = x - z$ in (13) to transform the last stage LP into

\begin{array}{rl} φ^{0} (y^{1}, \dots, y^{K}, x) & = φ_{B}^{0} (y^{1}, \dots, y^{K}, x) - c \cdot x, \\ \text{where} \quad φ_{B}^{0} (y^{1}, \dots, y^{K}, x) & = \min_{B} \{c \cdot B \mid B \geq 0, A^{k} B \geq A^{k} x - y^{k}, 1 \leq k \leq K\} . \end{array}

(16)
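The change of variables behind (16) can be checked numerically on a tiny instance. The sketch below is only an illustration under our own assumptions: it uses hypothetical integer data, restricts $z$ and $B$ to integers in a finite box (the parameter `lo` is an arbitrary lower limit on $z$), and solves both forms of the last stage by brute-force enumeration rather than by an LP solver.

```python
from itertools import product


def phi0_via_z(c, A, y, x, lo=-10):
    """-max{c.z : z <= x, A z <= y} over integer z with lo <= z_i <= x_i."""
    m, n = len(c), len(y)
    best = None
    for z in product(*(range(lo, x[i] + 1) for i in range(m))):
        if all(sum(A[j][i] * z[i] for i in range(m)) <= y[j] for j in range(n)):
            value = sum(c[i] * z[i] for i in range(m))
            best = value if best is None else max(best, value)
    return -best


def phi0_via_backlog(c, A, y, x, lo=-10):
    """min{c.B : B >= 0, A B >= A x - y} - c.x, with B = x - z as in (16)."""
    m, n = len(c), len(y)
    need = [sum(A[j][i] * x[i] for i in range(m)) - y[j] for j in range(n)]
    best = None
    for B in product(*(range(0, x[i] - lo + 1) for i in range(m))):
        if all(sum(A[j][i] * B[i] for i in range(m)) >= need[j] for j in range(n)):
            value = sum(c[i] * B[i] for i in range(m))
            best = value if best is None else min(best, value)
    return best - sum(c[i] * x[i] for i in range(m))


# Hypothetical instance: product 1 uses component 1, product 2 uses both.
c = [5, 9]
A = [[1, 1],
     [0, 1]]
y = [4, 1]   # component amounts available
x = [3, 2]   # demands to be served

value_z = phi0_via_z(c, A, y, x)
value_B = phi0_via_backlog(c, A, y, x)
```

Both enumerations return the same value here, with the minimizing $B$ leaving one unit of product 2 unserved because only one unit of component 2 is available.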

In $φ_{B}^{0} (y^{1}, \dots, y^{K}, x), B$ represents the amounts of unserved demands and thus is analogous to $B (t)$ ( $t \geq - L_{K}$ ), backlog levels in the ATO system. Let $y^{K *}$ be an optimal solution to $φ^{K}, y^{k *}$ be an optimal solution to $φ^{k} (y^{k + 1 *}, \dots, y^{K *}, {\bar{D}}^{k + 1})$ ( $1 \leq k < K$ ), $z^{*}$ be an optimal solution to $φ^{0} (y^{1 *}, \dots, y^{K *}, \bar{D})$ , and $B^{*} = \bar{D} - z^{*}$ . Then by Theorem 2:

\begin{array}{l} \underline{C} = \sum_{k = 1}^{K} h^{k} \cdot E [y^{k *}] - c \cdot E [z^{*}] + b \cdot E [\bar{D}] = \sum_{k = 1}^{K} h^{k} \cdot E [y^{k *}] + c \cdot E [B^{*}] - \sum_{k = 1}^{K} [(A^{k})' h^{k}] \cdot E [\bar{D}] . \end{array}

(17)

Because both components and products are measured in discrete units, (11) and (13) should be discrete optimization problems that can be difficult to solve exactly. For the purpose of setting a lower bound, it suffices to solve (13) without the integrality constraint, which is a continuous relaxation of the discrete problem. However, as will be shown in the next section, we will also use the SP to formulate inventory control policies, and for that purpose, the solutions need to be integer valued and satisfy certain uniqueness and continuity conditions. These issues will be addressed in Section 7.

4. Inventory Policy

Here, we will first describe the general idea that motivates our approach in Section 4.1, followed by developments of our replenishment and allocation policies in Sections 4.2 and 4.3, respectively. We will then make a few important observations in Section 4.4. A simple example that illustrates the application of our inventory policy, involving the N system, is given in Appendix B.

4.1. General Idea

To drive the average cost of the ATO system toward its SP-based lower bound in (13), we formulate inventory policies that mimic the optimal solution of this SP. As is commonly known and easily verifiable from (6)–(7), in an ATO system, the net inventory levels satisfy

\begin{array}{rl} I^{k} (t) - A^{k} B (t) & = I P^{k} (t - L_{k}) - A^{k} D (t - L_{k}, t) \\ & = I P^{k} (t - L_{k}) - A^{k} [D (t - L_{K}, t) - D (t - L_{K}, t - L_{k})], \quad 1 \leq k \leq K, t \geq 0 . \end{array}

The second equality allows us to rewrite the expected cost rate as the following function of inventory positions and backlog levels:

\begin{array}{rl} C (t) & = \sum_{k = 1}^{K} h^{k} \cdot E [I^{k} (t)] + b \cdot E [B (t)] \\ & = \sum_{k = 1}^{K} h^{k} \cdot (E [I P^{k} (t - L_{k})] + A^{k} E [D (t - L_{K}, t - L_{k})]) + c \cdot E [B (t)] \\ & \quad - \sum_{k = 1}^{K} [{(A^{k})}^{'} h^{k}] \cdot E [D (t - L_{K}, t)], \quad t \geq 0, \end{array}

(18)

where $c$ is defined in (9). Because $\bar{D} \overset{d}{=} D (t - L_{K}, t)$, $E [\bar{D}] = E [D (t - L_{K}, t)]$. Thus, by a direct comparison of the cost rate in (18) with its lower bound in (17), at any time t $(t \geq 0)$, if

E [I P^{k} (t - L_{k})] + A^{k} E [D (t - L_{K}, t - L_{k})] = E [y^{k *}], 1 \leq k \leq K,

(19)

and

E [B (t)] = E [B^{*}],

(20)

then $C (t) = \underline{C}$.

Hence, the average inventory cost of an ATO system $(C)$ reaches its lower bound $(\underline{C})$ if (19)–(20) are both satisfied for all time. Based on this observation, we develop replenishment and allocation policies by using the SP solutions to set targets for inventory positions and backlog levels, keeping actual values “close” to these targets under feasibility constraints.

4.2. Replenishment Policy

4.2.1. Overview.

The RHS of (19) is determined in (13). For k = K, $y^{K *}$ is constant, and for $k = K - 1, \dots, 1$, $y^{k *}$ are determined recursively (going backward in k) on each sample path of $(D^{K}, \dots, D^{k + 1})$ as values of $y^{k}$ that minimize

h^{k} \cdot y^{k} + E [φ^{k - 1} (y^{k}, y^{(k + 1) *}, \dots, y^{K *}, x + D^{k})],

(21)

where $x$ denotes the value of $D^{K} + \dots + D^{k + 1}$ on that path.

Our replenishment policy is built on processes $Y^{k} (t)$ $(t \geq - L_{k})$ , with their distributions at each point of time mimicking those of $y^{k *}$ $(1 \leq k \leq K)$ . Figure 2 shows a match of these two quantities in any period $[t - L_{K}, t]$ ( $t \geq 0$ ). Let $Y^{K} (t - L_{K}) = y^{K *}$ , and determine $Y^{k} (t - L_{k})$ $(k = K - 1, \dots 1)$ recursively on each sample path of $D (t - L_{K}, t - L_{k})$ to minimize (21) with $x = D (t - L_{K}, t - L_{k})$ . Because $D (t - L_{k}, t - L_{k - 1})$ and $D^{k}$ $(1 \leq k \leq K)$ have the same distribution,

E [Y^{k} (t - L_{k})] = E [y^{k *}], 1 \leq k \leq K .

(22)

**Figure 2. Match Between Processes $Y^{k} (t)$ in the ATO System with SP Solutions $y^{k *}$ $(t \geq - L_{k}, 1 \leq k \leq K)$**

Motivated by (19), our policy uses $Y^{k} (t - L_{k})$ and $A^{k} D (t - L_{K}, t - L_{k})$ to set inventory position targets $I ℙ^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ and makes ordering decisions to move actual inventory positions $I P^{k} (t - L_{k})$ $(t \geq - L_{k}, 1 \leq k \leq K)$ toward these targets.

Except for components with the longest lead time L_K, inventory position targets typically change over time. When a target rises above the actual inventory position, it is feasible to bring the latter position to its target immediately by ordering an appropriate amount of the component. When a target falls below the actual position, it is not feasible to close the gap immediately before the target is reduced and/or new demand arrives. Recognizing the latter restriction, our replenishment policy orders a component when and only when its inventory position is below the target, in the exact amount needed to eliminate the deficit.

4.2.2. Specific Policy Procedure.

For components with the longest lead time $L_{K}$, the inventory position target process starts from time $- L_{K}$ and is determined by

I ℙ^{K} (t) = Y^{K}, t \geq - L_{K},

(23)

where $Y^{K}$ is the value of $y^{K}$ that minimizes

h^{K} \cdot y^{K} + E [φ^{K - 1} (y^{K}, D^{K})] .

(24)

For components with lead time $L_{k}$ $(k = K - 1, \dots, 1)$, the target process starts from $- L_{k}$, and its values are given by

I ℙ^{k} (t) = Y^{k} (t) - A^{k} D (t + L_{k} - L_{K}, t), t \geq - L_{k}, 1 \leq k \leq K - 1,

(25)

where, referring to (21) and the match shown in Figure 2, $Y^{k} (t)$ is the value of $y^{k}$ that minimizes

h^{k} \cdot y^{k} + E [φ^{k - 1} (y^{k}, Y^{k + 1} (t + L_{k} - L_{k + 1}), \dots, Y^{K}, D (t + L_{k} - L_{K}, t) + D^{k})], t \geq - L_{k} .

(26)

Starting from time $- L_{k}$, the actual inventory position of a component with lead time $L_{k}$ is compared with its target. New replenishment is ordered to keep the actual inventory position at

I P^{k} (t) = ⌈ I ℙ^{k} (t) ⌉ \lor I P^{k -} (t), t \geq - L_{k}, 1 \leq k \leq K,

(27)

where, recognizing that inventory positions need to be integers, the ceiling function is applied to the target $I ℙ^{k} (t)$. The “preorder” inventory position $I P^{k -} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ is defined in Table 1.
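The ordering rule in (27) amounts to a one-sided correction: order only when the rounded-up target exceeds the preorder position. A minimal sketch for a single component follows; the function names are ours, and the target value is taken as given rather than computed from (26).

```python
import math


def order_quantity(target, ip_pre):
    """Units to order for one component: (ceil(target) - preorder position)^+."""
    return max(math.ceil(target) - ip_pre, 0)


def post_order_position(target, ip_pre):
    """Inventory position after ordering; equals max(ceil(target), ip_pre) as in (27)."""
    return ip_pre + order_quantity(target, ip_pre)


# Target above the preorder position: order up to the rounded-up target.
q_up = order_quantity(7.3, 5)
ip_up = post_order_position(7.3, 5)

# Target below the preorder position: no order, the position stays where it is.
q_down = order_quantity(4.2, 6)
ip_down = post_order_position(4.2, 6)
```

In the second case the gap between position and target cannot be closed by ordering; it shrinks only when the target rises again or new demand arrives.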

Although the targets $I ℙ^{k} (t)$ and the actual positions $I P^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ are both continuous-time processes, their values change only at discrete points of time, corresponding to new demand arriving at t, or a demand arrival at one of a particular set of previous times, as follows. For components with lead time $L_{K}$, as seen in (23), the procedure reduces to a constant base stock policy with $Y^{K}$ as base stock levels. For components with lead time $L_{K - 1}$, $I ℙ^{K - 1} (t)$ needs to be updated only when there is a demand arrival at time $t + L_{K - 1} - L_{K}$ or t, which changes the value of $D (t + L_{K - 1} - L_{K}, t)$, so (26) needs to be re-solved to update $Y^{K - 1} (t)$ $(t \geq - L_{K - 1})$. Similarly, by induction on k in (26), for $k < K - 1$, $I ℙ^{k} (t)$ can change only when there is a demand arrival at time (i) t, which changes $D (t + L_{k} - L_{K}, t)$, or (ii) $t + L_{k} - L_{k^{'}}$ ($k < k^{'} < K$), which might change $Y^{k^{'}} (t + L_{k} - L_{k^{'}})$, or (iii) $t + L_{k} - L_{K}$, which changes both $D (t + L_{k} - L_{K}, t)$ and $Y^{k^{'}} (t + L_{k} - L_{k^{'}})$ ($k < k^{'} < K$). Moreover, (27) implies that only at these times can the actual position $I P^{k} (t)$ change: the target $I ℙ^{k} (t)$ can only change at these times, and as Table 1 shows, $I P^{k -} (t)$ can differ from $I P^{k} (t^{-})$ only when there is a demand arrival at time t $(t \geq - L_{k}, 1 \leq k \leq K)$.

Although it may seem that an exact implementation of the previous idea would require looking back in time at every moment t, to see if there is any demand arrival at times $t + L_{k} - L_{k^{'}}$ $(k < k^{'} < K)$, to decide whether to update the targets $I ℙ^{k} (t)$ and place orders to reach the new targets $(t \geq - L_{k}, 1 \leq k \leq K)$, there is an equivalent implementation that is simpler and more intuitive: it lets demand arrivals drive future updates. Specifically, when there is a demand arrival at time t, schedule updates of inventory position targets and corresponding ordering decisions at times $t, t + L_{k + 1} - L_{k}, \dots, t + L_{K} - L_{k}$ $(1 \leq k \leq K - 1)$.
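The demand-driven scheduling just described can be sketched as a small helper that, given an arrival time, lists the update times to be added for each lead time class. The function name and the numerical lead times are hypothetical, and we assume the lead times are supplied in increasing order $L_{1} < \dots < L_{K}$.

```python
def scheduled_update_times(t, lead_times):
    """Update times triggered by a demand arrival at time t.

    lead_times: [L_1, ..., L_K], assumed sorted in increasing order.
    Returns a dict mapping k (1-based) to the list
    [t, t + L_{k+1} - L_k, ..., t + L_K - L_k].
    """
    K = len(lead_times)
    return {
        k: [t + lead_times[l - 1] - lead_times[k - 1] for l in range(k, K + 1)]
        for k in range(1, K + 1)
    }


# Hypothetical lead times L_1 = 2, L_2 = 5, L_3 = 9 and an arrival at t = 10.
updates = scheduled_update_times(10, [2, 5, 9])
```

For the class with the longest lead time the list contains only the arrival time itself, consistent with its target being constant.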

Figure 3 illustrates the process. The vertical line represents a particular time $t^{'}$ when there is a demand arrival. Each horizontal line corresponds to a lead time. On the line corresponding to $L_{k}$, circles mark points of time when (26) needs to be re-solved to update $Y^{k} (\cdot)$ and $I ℙ^{k} (\cdot)$ $(1 \leq k < K)$. The expression next to a circle shows the distance in time from time $t^{'}$ to the circle. Updates of $Y^{k} (\cdot)$ take place at times $t^{'}$ and $t^{'} + L_{K} - L_{k}$ because the demand arrival at time $t^{'}$ changes $D (t + L_{k} - L_{K}, t)$ in (26) when $t = t^{'}$ or $t = t^{'} + L_{K} - L_{k}$ $(1 \leq k < K)$. Moreover, as shown by two parallel dashed lines, updates of $Y^{K - 1} (\cdot)$ at times $t^{'}$ and $t^{'} + L_{K} - L_{K - 1}$ change values of $Y^{K - 1} (t + L_{k} - L_{K - 1})$ in (26) when $t = t^{'} + L_{K - 1} - L_{k}$ and $t = t^{'} + L_{K} - L_{k}$, so $Y^{k} (\cdot)$ needs to be updated at the latter two times $(1 \leq k < K - 1)$. In general, updates of $Y^{k} (\cdot)$ at times $t^{'}$ and $t^{'} + L_{K} - L_{k}$ trigger updates of $Y^{l} (\cdot)$ at times $t^{'} + L_{k} - L_{l}$ and $t^{'} + L_{K} - L_{l}$, respectively $(1 \leq l < k)$.

**Figure 3. Inventory Position Target Updates and Ordering Decisions That Follow Demand Arrival at Time t**

The complete replenishment procedure is presented in Algorithm 1. The collection of times at which $Y^{k} (\cdot)$, $I ℙ^{k} (\cdot)$, and $I P^{k} (\cdot)$ possibly change is denoted by $Γ^{k}$ $(1 \leq k \leq K)$. The ceiling function on $I ℙ_{j} (t)$ in Step 2(c) ensures that both $I P_{j} (t)$ and $R_{j} (t)$ are integer-valued ($t \geq - L_{k_{j}}, 1 \leq j \leq n$). In Steps 2(a) and 2(b), inventory position targets $I ℙ^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ are updated repeatedly over time except for k = K, in which case $I ℙ^{K} (t)$ are constants that are set once at time $- L_{K}$. It is easy to verify that $R_{j} (t)$ $(t \geq - L_{k_{j}}, 1 \leq j \leq n)$ prescribed in Algorithm 1 satisfy the feasibility requirements defined in Section 2: integer-valued, nonnegative, nondecreasing, right-continuous, and nonanticipating.

It is worth pointing out that in ATO systems with a single product, this replenishment policy specializes to the one prescribed by Rosling (1989). In this special case, the latter policy is exactly optimal because all components are used according to fixed proportions, so one can always keep inventory positions at their targets (after some initial lapse for these positions to reach “coordinated levels”). In systems with identical lead times, our policy specializes to the one formulated in Reiman and Wang (2015), which uses base stock policies for replenishment and is asymptotically optimal in the large lead time regime.

Algorithm 1

(Replenishment Policy Procedure)

Initialization: for $k = 1, \dots, K$, let

Γ^{k} = \{- L_{k}\}, \quad R^{k} (- L_{k}^{-}) = 0, \quad \text{and} \quad I P^{k} (- L_{k}) = - A^{k} D (- L_{k}) .

For $t \geq - L_{K}$:

1. if $d (t) > 0$, then for $k = 1, \dots, K$ such that $t \geq - L_{k}$, let
   $\begin{array}{l} I P^{k -} (t) = I P^{k} (t^{-}) - A^{k} d (t), \\ \text{and} \quad Γ^{k} = Γ^{k} \cup \{t, t + L_{k + 1} - L_{k}, \dots, t + L_{K} - L_{k}\} . \end{array}$
2. for $k = 1, \dots, K$, if $t \in Γ^{k}$, then:
   - (a) let $Y^{k} (t)$ be the value of $y^{k}$ that minimizes (26);
   - (b) as in (25), let
     $I ℙ^{k} (t) = Y^{k} (t) - A^{k} D (t + L_{k} - L_{K}, t) ;$
   - (c) for each component j with lead time $L_{k}$, let
     $R_{j} (t) = R_{j} (t^{-}) + {(⌈ I ℙ_{j} (t) ⌉ - I P_{j}^{-} (t))}^{+},$
     by ordering ${(⌈ I ℙ_{j} (t) ⌉ - I P_{j}^{-} (t))}^{+}$ units, changing its inventory position to
     $I P_{j} (t) = I P_{j}^{-} (t) + R_{j} (t) - R_{j} (t^{-}) .$

End of Algorithm 1

4.3. Allocation Policy

4.3.1. Overview.

On the RHS of (20), $B^{*}$ is the optimal solution of the LP

\min_{B} \{c \cdot B \mid B \geq 0, A^{k} B \geq A^{k} x - y^{k *}, 1 \leq k \leq K\},

(28)

which is defined on each sample path of $(D^{K}, \dots, D^{1})$, with $x$ denoting $D^{K} + \dots + D^{1}$ and $y^{k *}$ determined recursively by minimizing (21) for values of $(D^{K}, \dots, D^{k + 1})$ ($1 \leq k \leq K$) on the same path. In the corresponding ATO system, $D (t - L_{K}, t)$ has the same distribution as $D^{K} + \dots + D^{1}$ and $Y^{k} (t - L_{k})$ has the same distribution as $y^{k *}$ $(t \geq 0, 1 \leq k \leq K)$. Hence, we can mimic $B^{*}$ at time t $(t \geq 0)$ by letting $B^{*} (t)$ be the value of $B$ that minimizes (28) with $A^{k} x - y^{k *}$ replaced by

Q^{k} (t) ≔ A^{k} D (t - L_{K}, t) - Y^{k} (t - L_{k}) .

(29)

As a result, (20) holds for $B^{*} (t)$ , that is,

E [B^{*} (t)] = E [B^{*}] .

(30)

Although $B^{*} (t)$ $(t \geq 0)$ gives the desired backlog levels for the inventory cost to reach its lower bound, it is determined by (see (25) and (29))

\begin{array}{rl} Q^{k} (t) & = A^{k} D (t - L_{K}, t) - (I ℙ^{k} (t - L_{k}) + A^{k} D (t - L_{K}, t - L_{k})) \\ & = A^{k} D (t - L_{k}, t) - I ℙ^{k} (t - L_{k}), \quad 1 \leq k \leq K . \end{array}

Compared with the actual component balance $Q^{k} (t)$ in Table 1 (see also (8)), the quantity defined in (29) has inventory position targets $I ℙ^{k} (t - L_{k})$ in place of actual positions $I P^{k} (t - L_{k})$ $(t \geq 0, 1 \leq k \leq K)$. Because our replenishment policy in general does not keep inventory positions at their targets, the quantity in (29) does not always capture the exact state of component availability in the system. Recognizing this discrepancy, we set the backlog target at time t $(t \geq 0)$ to the value that, like $B^{*} (t)$, minimizes (28), but with the actual balance $Q^{k} (t)$ of Table 1, rather than its target-based counterpart in (29), replacing $A^{k} x - y^{k *}$ $(1 \leq k \leq K)$. Although (20) may not hold at equality for this backlog target, our analysis shows that the difference is asymptotically negligible.

Actual backlog levels may exceed or fall below their targets. It is generally infeasible to keep all backlog levels at their targets: a below-target backlog level cannot be raised immediately to the target if there is no new demand arrival, and an above-target backlog level cannot be reduced if a required component is not available. Naturally, we prescribe an allocation policy that uses all available components to reduce backlogs that are above their targets and never serves a demand when its backlog level is at or below the target.

Algorithm 2

(Allocation Policy Procedure)

Initialization: let

Z (t) = 0 \text{ for all } t < 0 \ (\text{so } B (0^{-}) = D (0^{-})) .

For $t \geq 0$ ,

if t = 0 or $Q_{j} (t) - Q_{j} (t^{-}) \neq 0$ for some $j = 1, \dots, n$ , then

1. let $B (t)$ be the value of $x$ that minimizes:
$\{c \cdot x \mid x \geq 0, A^{k} x \geq Q^{k} (t), 1 \leq k \leq K\} .$ (31)
2. let
$Z (t) = D (t) - B (t),$ (32)
where $B (t)$ must be integer-valued and its choice satisfies:
- (a) for all $i = 1, \dots, m$ ,
  ${[⌊ B_{i} (t) - B_{i} (t) ⌋]}^{+} \land \min_{1 \leq j \leq n} {{(I_{j} (t) - a_{j i} + 1)}^{+}} = 0,$ (33)
  and
  $B_{i}^{-} (t) \land B_{i} (t) \leq B_{i} (t) \leq B_{i}^{-} (t),$ (34)
  where
  $B_{i}^{-} (t) = B_{i} (t^{-}) + d_{i} (t),$ (35)
  and by definition (8),
  $I_{j} (t) = \sum_{i^{'} = 1}^{m} a_{j i^{'}} B_{i^{'}} (t) - Q_{j} (t) .$
- (b) for all $j = 1, \dots, n$ ,
  $\sum_{i = 1}^{m} a_{j i} B_{i} (t) \geq Q_{j} (t) .$ (36)
  End of Algorithm 2

4.3.2. Specific Policy Procedures.

Algorithm 2 prescribes the specific policy procedure that implements the previous idea on component allocation.

To explain, the allocation decision starts at time 0 when the first batches of replenishment orders arrive. Before that time, backlogs simply accumulate as new demands arrive. After that time, new allocations are triggered by changes of $Q_{j} (t)$ ( $t \geq 0, 1 \leq j \leq n)$ , the balance of some components. (This occurs as demands and replenishments arrive.) As Table 1 shows, $Q^{k} (t)$ is the difference between the accumulated demand and supply, $A^{k} D (t) - R^{k} (t - L_{k})$ $(t \geq 0, 1 \leq k \leq K)$ . The former is a compound Poisson process and the latter is a pure jump process under the replenishment policy executed by Algorithm 1. Thus, allocation actions are taken at discrete points of time, which include using (31) to set backlog targets and serving demand under conditions (32)–(36). Of the latter conditions, (32) and (35) simply repeat definitions of $B^{-} (t)$ and $B (t)$ in Table 1. Distinct features of the policy are characterized by (33), (34), and (36).

In (33), the actual backlog level $B_{i} (t)$ is specified as an integer, the backlog target $B_{i} (t)$ is a real number, and the floor function is applied to eliminate the rounding error in their difference. The equation defines the key feature of the policy: a backlog level can exceed its target (within the rounding error) only when the system runs out of a needed component to serve the demand, that is, when
$a_{j i} > \sum_{i^{'} = 1}^{m} a_{j i^{'}} B_{i^{'}} (t) - Q_{j} (t) = I_{j} (t) \text{ for some } j \ (1 \leq j \leq n) .$
The left inequality of (34) requires that the backlog level of a demand ($B_{i} (t)$) be kept at its existing level ($B_{i}^{-} (t)$) if the latter level does not exceed its target ($B_{i} (t)$).
The right inequality of (34) encodes that serving a demand can only reduce its backlog level.
Referring to (8), (36) is equivalent to
$I_{j} (t) \geq 0, 1 \leq j \leq n,$
that is, demands cannot be served with a nonexisting component.

In prescribing this procedure, we have omitted specific processes for determining $B (t)$ $(t \geq 0)$ to satisfy (33), (34), and (36) because there can be many different ones. For instance, one can select all backlogs that exceed their targets and use available components to reduce them one at a time according to a particular order. Each backlog is reduced to the point where (33) applies, that is, either the backlog reaches the target or there are not enough components to bring it down further. This process obviously also satisfies (34) and (36). We can formulate many other processes by, for instance, changing the order by which backlogs are processed or not following a fixed order at all. Therefore, Algorithm 2 defines not a single policy but a family of eligible policies. Such policies always exist, for example, the one that follows the process we just described.
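The one-at-a-time reduction process described above can be sketched as follows. This is only one member of the family of eligible policies, written under our own assumptions: the backlog targets are taken as given (they would come from solving (31)), the processing order is a free parameter, and on-hand inventory is recovered from the identity $I_{j} (t) = \sum_{i} a_{j i} B_{i} (t) - Q_{j} (t)$ quoted in Algorithm 2. Function names and data are hypothetical.

```python
def on_hand(A, B, Q):
    """I_j = sum_i a_ji * B_i - Q_j for every component j."""
    return [sum(a * bi for a, bi in zip(row, B)) - q for row, q in zip(A, Q)]


def greedy_allocate(B_pre, target, A, Q, order=None):
    """Reduce above-target backlogs one unit at a time.

    B_pre:  integer backlog levels before serving (B^-(t))
    target: real-valued backlog targets
    A:      n-by-m usage matrix, A[j][i] = a_ji
    Q:      integer component balances
    Returns the new backlog levels and the remaining on-hand inventory.
    """
    B = list(B_pre)
    I = on_hand(A, B, Q)
    n = len(A)
    for i in (order if order is not None else range(len(B))):
        # Serve product i while it is at least one unit above target
        # and every component it needs is available, cf. (33).
        while B[i] - target[i] >= 1 and all(I[j] >= A[j][i] for j in range(n)):
            B[i] -= 1
            for j in range(n):
                I[j] -= A[j][i]
    return B, I


# Hypothetical state: product 1 uses component 1, product 2 uses both.
A = [[1, 1],
     [0, 1]]
B_pre = [4, 3]
target = [1.5, 1.0]
Q = [4, 2]          # implies on-hand inventory [3, 1] before serving

B_new, I_new = greedy_allocate(B_pre, target, A, Q)
```

In this instance product 1 is served down to within one unit of its target, while product 2 stays one unit above its target because both components are exhausted, which is the situation permitted by (33).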

We now show that this family of eligible policies satisfies the requirements for feasible policies defined in Section 2. Once $B (t)$ $(t \geq 0)$ is determined, the original allocation process $Z (t)$ $(t \geq 0)$ is fully specified by (32). By (32) and (35), we can write

Z (t) = Z (t^{-}) + B^{-} (t) - B (t), t \geq 0,

and use the right inequality of (34) to show that $Z (t)$ $(t \geq 0)$ is nondecreasing, and also nonnegative as $Z (t) = 0$ for t < 0. Because the value of $Z (t)$ $(t \geq 0)$ changes only at discrete points of time, the process is right-continuous. Applying the definition of $Q^{k} (t)$ in Table 1 and its alternative expression in (8), and using (32), we can write

A^{k} Z (t) = A^{k} D (t) - A^{k} B (t) = A^{k} D (t) - (Q^{k} (t) + I^{k} (t)) = R^{k} (t - L_{k}) - I^{k} (t), 1 \leq k \leq K, t \geq 0,

from which it is easy to see that $Z (t)$ $(t \geq 0)$ satisfies all conditions in (1).

From the first expression of $Q^{k} (t)$ $(1 \leq k \leq K, t \geq 0)$ in (8) and the constraints in the LP (31),

I_{j} (t) = \sum_{i = 1}^{m} a_{j i} B_{i} (t) - Q_{j} (t) \geq \sum_{i = 1}^{m} a_{j i} (B_{i} (t) - B_{i} (t)), 1 \leq j \leq n, t \geq 0 .

Therefore, for any given product i $(1 \leq i \leq m)$ ,

I_{j} (t) + \sum_{i^{'} \neq i} a_{j i^{'}} [B_{i^{'}} (t) - B_{i^{'}} (t)] \geq a_{j i} [B_{i} (t) - B_{i} (t)], 1 \leq j \leq n, t \geq 0 .

Applying this inequality to (33) leads to the following important property of our allocation policy:

For any $i = 1, \dots, m$ , if $B_{i} (t) - B_{i} (t) \geq 1$ , then

1 + \frac{\bar{a}}{\underline{a}} \sum_{i^{'} \neq i} {(B_{i^{'}} (t) - B_{i^{'}} (t))}^{+} \geq B_{i} (t) - B_{i} (t), t \geq 0 .

(37)

See Table 2 for definitions of $\bar{a}$ and $\underline{a}$ .

By this property, our policy does not allow backlog of any product i to exceed its target by more than a rounding error when no backlog is below its target (i.e., $B_{i^{'}} (t) \leq B_{i^{'}} (t)$ for all $i^{'} \neq i$ ). A similar observation was made in Reiman and Wang (2015) for developing asymptotically optimal policies to manage ATO systems with identical lead times. Likewise, we will use this fact in Section 6.2.2.

4.4. Further Comments

Algorithms 1 and 2 show that our policies can be fully implemented by controlling inventory positions via (27) and choosing backlog levels that satisfy (33), (34), and (36). For brevity, from now on, we will only use the latter variables to characterize our policies, instead of invoking the original replenishment process $R^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ and allocation process $Z (t)$ $(t \geq - L_{K})$ .

The procedures in these tables implicitly assume that there is no ambiguity about $Y^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ and $B (t)$ $(t \geq 0)$ , which is true if the related SPs for choosing these values all have unique optimal solutions. In Section 7, we will discuss how to perturb these SPs to satisfy the uniqueness condition without compromising asymptotic optimality.

5. Asymptotic Optimality

The rest of the paper will focus on performance evaluation, especially asymptotic optimality, of our policy, and this section provides a critical lead into this analysis. We first set up the asymptotic regime in Section 5.1. We then define the asymptotic optimality criteria in Section 5.2, followed by the development of a key theorem in Section 5.3 (Theorem 3) that specifies sufficient conditions for meeting these criteria.

5.1. Large Lead Time Asymptotic Regime

We introduce a family of systems indexed by L. In the L^th system, $L_{k}^{(L)}$ is the lead time of components ${\bar{n}}_{k - 1} + 1, \dots, {\bar{n}}_{k}$ ( $1 \leq k \leq K$ ). The longest lead time $L_{K}^{(L)} = L$ . We define the large lead time asymptotic regime by letting the longest lead time $L \to \infty$ .

For convenience, define $L_{0}^{(L)} = 0$ . Let

{\hat{L}}_{k}^{(L)} ≜ \frac{L_{k}^{(L)}}{L}, 0 \leq k \leq K .

(38)

We impose no assumptions on other lead times except that for all L,

L_{0}^{(L)} < L_{1}^{(L)} < L_{2}^{(L)} < \dots < L_{K}^{(L)}, \quad \text{and therefore} \quad 0 < {\hat{L}}_{1}^{(L)} < \dots < {\hat{L}}_{K - 1}^{(L)} < 1 .

5.1.1. Demand Process.

The L^th system is empty at time –L when the demand process $D^{(L)} (t)$ ( $t \geq - L$ ) starts. This process is the same as $D (t)$ $(t \geq - L_{K})$ in Section 2 except for the starting time. Similar to definitions of $D (t_{1}, t_{2})$ and $d (t)$ in Table 1, let $D^{(L)} (t_{1}, t_{2})$ be the increment of $D^{(L)} (t)$ over $(t_{1}, t_{2}]$ $(- L \leq t_{1} < t_{2})$ , with $μ$ and $Σ$ denoting the mean and covariance matrix of $D^{(L)} (t, t + 1)$ , and $d^{(L)} (t)$ be the instantaneous change of $D^{(L)} (t)$ at t $(t \geq - L)$ . Define

\begin{array}{l} {\hat{D}}^{(L)} (t_{1}, t_{2}) ≜ \frac{D^{(L)} (L t_{1}, L t_{2}) - L (t_{2} - t_{1}) μ}{\sqrt{L}}, \quad - 1 \leq t_{1} < t_{2}, \\ \text{and} \quad {\hat{d}}^{(L)} (t) ≜ \frac{d^{(L)} (L t)}{\sqrt{L}}, \quad t \geq - 1 . \end{array}

(39)
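As a worked instance of the scaling in (39), the short Python sketch below evaluates the centered and scaled increment and the scaled jump for one product. The helper names `scale_increment` and `scale_jump` and the numerical values are illustrative assumptions, not quantities from the paper.

```python
import math


def scale_increment(raw_increment, mean_rate, L, t1, t2):
    """Centered, scaled increment: (D(L*t1, L*t2) - L*(t2 - t1)*mu) / sqrt(L)."""
    return (raw_increment - L * (t2 - t1) * mean_rate) / math.sqrt(L)


def scale_jump(raw_jump, L):
    """Scaled jump: d(L*t) / sqrt(L); jumps are scaled but not centered."""
    return raw_jump / math.sqrt(L)


# Hypothetical numbers: L = 100, mean demand mu = 1 per unit time, and
# 230 units observed over (0, 200], i.e., over the scaled interval (0, 2].
hat_D = scale_increment(raw_increment=230, mean_rate=1.0, L=100, t1=0.0, t2=2.0)
hat_d = scale_jump(raw_jump=5, L=100)
print(hat_D, hat_d)
```

The increment is measured relative to its mean $L (t_{2} - t_{1}) μ$ in units of $\sqrt{L}$, whereas a single jump is only divided by $\sqrt{L}$.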

Let $D^{k (L)}$ be a random vector with the same distribution as $D^{(L)} (0, L_{k}^{(L)} - L_{k - 1}^{(L)})$ $(1 \leq k \leq K)$ . Then

{\hat{D}}^{k (L)} ≜ \frac{D^{k (L)} - (L_{k}^{(L)} - L_{k - 1}^{(L)}) μ}{\sqrt{L}}

has the same distribution as ${\hat{D}}^{(L)} (t - {\hat{L}}_{k}^{(L)}, t - {\hat{L}}_{k - 1}^{(L)})$ $(t \geq - 1, 1 \leq k \leq K)$ .

5.1.2. Replenishment and Inventory.

In the L^th system, replenishments of components with lead time $L_{k}^{(L)}$ start at time $- L_{k}^{(L)}$ ( $1 \leq k \leq K$ ). Following the policy description in Section 4.2, the inventory position targets are

I \mathbb{P}^{k (L)} (t) = Y^{k (L)} (t) - A^{k} D^{(L)} (t + L_{k}^{(L)} - L, t), t \geq - L_{k}^{(L)}, 1 \leq k \leq K,

where $Y^{K (L)} (t)$ is a constant vector (and thus will be denoted by $Y^{K (L)}$ ) that minimizes

h^{K} y^{K} + E [φ^{K - 1} (y^{K}, D^{K (L)})],

(40)

and $Y^{k (L)} (t)$ $(1 \leq k < K)$ minimizes

h^{k} y^{k} + E [φ^{k - 1} (y^{k}, Y^{k + 1 (L)} (t + L_{k}^{(L)} - L_{k + 1}^{(L)}), \dots, Y^{K (L)}, x + D^{k (L)})]

(40′)

on the sample path of

D^{(L)} (t + L_{k}^{(L)} - L_{k^{'}}^{(L)}, t + L_{k}^{(L)} - L_{k^{'} - 1}^{(L)}), k^{'} = K, \dots, k + 1,

with $x$ denoting the value of $D^{(L)} (t + L_{k}^{(L)} - L, t)$ on that path.

Let $I P^{k (L)} (t)$ $(t \geq 0, 1 \leq k \leq K)$ be the actual inventory position in the L^th system. Under the aforementioned replenishment policy, for $k = 1, \dots, K$ :

\begin{array}{l} I P^{k (L) -} (- L_{k}^{(L)}) = - A^{k} D^{(L)} (- L_{k}^{(L)}), \\ I P^{k (L)} (t) = I \mathbb{P}^{k (L)} (t) \lor I P^{k (L) -} (t), \quad t \geq - L_{k}^{(L)}, \\ \text{and} \quad I P^{k (L) -} (t) = I P^{k (L)} (t^{-}) - A^{k} d^{(L)} (t), \quad t > - L_{k}^{(L)}, \end{array}

where the operator $\lor$ is applied componentwise. Component balances under the targeted and actual inventory positions are, respectively,

\begin{array}{rl} \mathbb{Q}^{k (L)} (t) & = A^{k} D^{(L)} (t - L_{k}^{(L)}, t) - I \mathbb{P}^{k (L)} (t - L_{k}^{(L)}), \\ \text{and} \quad Q^{k (L)} (t) & = A^{k} D^{(L)} (t - L_{k}^{(L)}, t) - I P^{k (L)} (t - L_{k}^{(L)}), \quad t \geq 0, 1 \leq k \leq K . \end{array}

(41)

For $k = 1, \dots, K$ , let

\begin{array}{l} {\hat{Y}}^{k (L)} (t) ≜ \frac{Y^{k (L)} (L t) - L A^{k} μ}{\sqrt{L}}, \quad t \geq - {\hat{L}}_{k}^{(L)}, \\ \widehat{I \mathbb{P}}^{k (L)} (t) ≜ \frac{I \mathbb{P}^{k (L)} (L t) - A^{k} L_{k}^{(L)} μ}{\sqrt{L}}, \quad \widehat{I P}^{k (L)} (t) ≜ \frac{I P^{k (L)} (L t) - A^{k} L_{k}^{(L)} μ}{\sqrt{L}}, \quad t \geq - {\hat{L}}_{k}^{(L)}, \\ {\hat{\mathbb{Q}}}^{k (L)} (t) ≜ \frac{\mathbb{Q}^{k (L)} (L t)}{\sqrt{L}}, \quad {\hat{Q}}^{k (L)} (t) ≜ \frac{Q^{k (L)} (L t)}{\sqrt{L}}, \quad t \geq 0 . \end{array}

(42)

5.1.3. Allocation and Backlog.

Referring to our allocation policy in Algorithm 2, denote backlog targets $\mathbb{B} (t)$ in the L^th system by $\mathbb{B}^{(L)} (t)$ ( $t \geq 0$ ). Denote backlog levels $B^{-} (t)$ and $B (t)$ by $B^{(L) -} (t)$ and $B^{(L)} (t)$ ( $t \geq - L$ ), respectively. Hence, $\mathbb{B}^{(L)} (t)$ $(t \geq 0)$ is the optimal solution to

\min_{x} \{ c x \mid x \geq 0, A^{k} x \geq Q^{k (L)} (t), 1 \leq k \leq K \} .

(43)

Corresponding to $B^{*} (t)$ prescribed by (28)–(29), let $B^{* (L)} (t)$ be the optimal solution to (43) with $Q^{k (L)} (t)$ replaced by $\mathbb{Q}^{k (L)} (t)$ $(t \geq 0, 1 \leq k \leq K)$ .

Define

\begin{array}{ll} {\hat{B}}^{* (L)} (t) ≜ \frac{B^{* (L)} (L t)}{\sqrt{L}}, & {\hat{\mathbb{B}}}^{(L)} (t) ≜ \frac{\mathbb{B}^{(L)} (L t)}{\sqrt{L}}, \quad t \geq 0, \\ {\hat{B}}^{(L)} (t) ≜ \frac{B^{(L)} (L t)}{\sqrt{L}}, & {\hat{B}}^{(L) -} (t) ≜ \frac{B^{(L) -} (L t)}{\sqrt{L}}, \quad t \geq - 1 . \end{array}

(44)

Because scaling the right-hand side of the LP (43) by $1 / \sqrt{L}$ scales its optimal solution by the same factor, ${\hat{B}}^{* (L)} (t)$ and ${\hat{\mathbb{B}}}^{(L)} (t)$ $(t \geq 0)$ are optimal solutions to (43) with $Q^{k (L)} (t)$ replaced by ${\hat{\mathbb{Q}}}^{k (L)} (t)$ and ${\hat{Q}}^{k (L)} (t)$ $(1 \leq k \leq K, t \geq 0)$ , respectively.

5.1.4. Inventory Cost.

In the L^th system, the long-run average expected inventory cost is

C^{(L)} = \underset{T \to \infty}{\lim \sup} \frac{1}{T} \int_{0}^{T} C^{(L)} (t) d t .

Using (18), the expected inventory cost rate can be written as

C^{(L)} (t) = \sum_{k = 1}^{K} h^{k} \cdot (E [I P^{k (L)} (t - L_{k}^{(L)})] - A^{k} E [D^{(L)} (t - L_{k}^{(L)}, t)]) + c \cdot E [B^{(L)} (t)], t \geq 0 .

(45)

Define

{\hat{C}}^{(L)} ≜ \frac{C^{(L)}}{\sqrt{L}} .

Then

{\hat{C}}^{(L)} = \underset{T \to \infty}{\lim \sup} \frac{1}{T} \int_{0}^{T} {\hat{C}}^{(L)} (t) d t,

where the scaled inventory cost rate is defined to be

{\hat{C}}^{(L)} (t) ≜ \frac{C^{(L)} (L t)}{\sqrt{L}}, t \geq 0,

and by applying (39), (42), and (44) to (45), ${\hat{C}}^{(L)} (t)$ can be written as

{\hat{C}}^{(L)} (t) = \sum_{k = 1}^{K} h^{k} \cdot (E [\widehat{I P}^{k (L)} (t - {\hat{L}}_{k}^{(L)})] - A^{k} E [{\hat{D}}^{(L)} (t - {\hat{L}}_{k}^{(L)}, t)]) + c \cdot E [{\hat{B}}^{(L)} (t)], t \geq 0 .

(46)

5.1.5. Summary of Definitions.

For future reference, we summarize the definitions made in this subsection for asymptotic analysis. Some vectors of random processes $x^{(L)} (t)$ are centered and scaled componentwise by

{\hat{x}}^{(L)} (t) = \frac{x^{(L)} (L t) - {\bar{x}}^{(L)}}{\sqrt{L}},

where t varies in a proper range of time and the centering ${\bar{x}}^{(L)}$ is proportional to L. Similar operations apply to increments of demand processes over some periods $(t_{1}, t_{2}]$ and demand vectors used in the lower bound SP. These definitions are given in Table 3.

Some other processes and variables are scaled componentwise (without any centering) by

{\hat{x}}^{(L)} (t) = \frac{x^{(L)} (L t)}{\sqrt{L}},

and their definitions are summarized in Table 4.

5.2. Asymptotic Optimality Criterion

Applying Theorem 2 to the L^th system, the average inventory cost $C^{(L)}$ has a lower bound ${\underline{C}}^{(L)}$ , determined by the SP (13), using $D^{k (L)}$ as inputs $D^{k}$ $(1 \leq k \leq K)$ . Define

{\hat{\underline{C}}}^{(L)} ≜ \frac{{\underline{C}}^{(L)}}{\sqrt{L}} .

Let $C_{\min}^{(L)}$ denote the infimum of the average inventory cost over all feasible policies. Because these values are bounded below by ${\underline{C}}^{(L)}$ , this infimum exists. We show that our policy is asymptotically optimal in the traditional sense that

\lim_{L \to \infty} \frac{C^{(L)} - C_{\min}^{(L)}}{C_{\min}^{(L)}} = 0,

(47)

that is, the percentage difference between the cost and its minimum value converges to zero as the longest lead time becomes large.

Because $C_{\min}^{(L)} \geq {\underline{C}}^{(L)}$ , we have

\frac{C^{(L)} - C_{\min}^{(L)}}{C_{\min}^{(L)}} \leq \frac{\sqrt{L}}{C_{\min}^{(L)}} {\frac{C^{(L)} - {\underline{C}}^{(L)}}{\sqrt{L}}} .

Hence, we can prove (47) by showing that

\underset{L \to \infty}{\lim \sup} {\frac{\sqrt{L}}{C_{\min}^{(L)}}} < \infty,

(48)

and,

\lim_{L \to \infty} \frac{C^{(L)} - {\underline{C}}^{(L)}}{\sqrt{L}} = \lim_{L \to \infty} \left( {\hat{C}}^{(L)} - {\hat{\underline{C}}}^{(L)} \right) = 0 .

(49)

To show that (48) is true, apply the SP (13) to the L^th system with the following changes: let $i^{'}$ be a product with $a_{n i^{'}} > 0$ , where component n has the longest lead time L. Keep $h_{n}$ and $b_{i^{'}}$ intact while resetting $h_{j} = 0$ $(j \neq n)$ and $b_{i} = 0$ $(i \neq i^{'})$ . The problem then becomes

\min_{y_{n}} \{ h_{n} y_{n} - (a_{n i^{'}} h_{n} + b_{i^{'}}) E [\min (y_{n} / a_{n i^{'}}, {\bar{D}}_{i^{'}}^{(L)})] \},

where we use ${\bar{D}}_{i^{'}}^{(L)}$ to denote $D_{i^{'}}^{K (L)}$ , which has the same distribution as $D_{i^{'}}^{(L)} (t - L, t)$ $(t \geq 0)$ .

Let $φ_{n}^{K (L)}$ be the optimal objective value of the above minimization problem. Obviously, $φ_{n}^{K (L)} \leq φ^{K (L)}$ , where $φ^{K (L)}$ is the objective value of (13) specialized to the L^th system. Apply $φ_{n}^{K (L)}$ and the new costs to (15) and compare the resulting value (denoted by $C_{n}^{(L)}$ ) with the lower bound and the minimum cost:

C_{n}^{(L)} \equiv φ_{n}^{K (L)} + b_{i^{'}} \cdot E [{\bar{D}}_{i^{'}}^{(L)}] \leq φ^{K (L)} + \sum_{i = 1}^{m} b_{i} E [{\bar{D}}_{i}^{(L)}] = {\underline{C}}^{(L)} \leq C_{\min}^{(L)} .

Furthermore, simple transformations of $φ_{n}^{K (L)}$ show that

φ_{n}^{K (L)} + b_{i^{'}} \cdot E [{\bar{D}}_{i^{'}}^{(L)}] = \min_{y_{n}} \{ h_{n} E [{(y_{n} - a_{n i^{'}} {\bar{D}}_{i^{'}}^{(L)})}^{+}] + \frac{b_{i^{'}}}{a_{n i^{'}}} E [{(a_{n i^{'}} {\bar{D}}_{i^{'}}^{(L)} - y_{n})}^{+}] \}

is a Newsvendor model (unlike the standard formulation, here we allow $y_{n}$ to be negative, but it is easy to see that $y_{n} < 0$ is never optimal), so $C_{n}^{(L)}$ is proportional to the standard deviation of ${\bar{D}}_{i^{'}}^{(L)}$ , which is on the order of $\sqrt{L}$ , and (48) follows as a consequence.

Hence, to show (47) is satisfied, we only need to prove (49); that is, our policy is asymptotically optimal on the diffusion scale.
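The $\sqrt{L}$ growth of the newsvendor cost that underlies (48) can be checked numerically. The sketch below is a simplified illustration under assumptions that are not from the paper: a single product with $a_{n i^{'}} = 1$, unit order sizes (so lead time demand is Poisson with mean `rate * L`), and hypothetical costs `h = 1` and `b = 3`. It computes the minimal newsvendor cost by enumeration and divides by $\sqrt{L}$.

```python
import math


def poisson_pmf(mean, kmax):
    """Return [P(D = 0), ..., P(D = kmax)] for D ~ Poisson(mean), via logs."""
    return [math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))
            for k in range(kmax + 1)]


def min_newsvendor_cost(mean, h, b):
    """Minimize h*E[(y - D)^+] + b*E[(D - y)^+] over integer y by enumeration."""
    kmax = int(mean + 12 * math.sqrt(mean)) + 20  # truncation far in the tail
    pmf = poisson_pmf(mean, kmax)
    best = math.inf
    for y in range(kmax + 1):
        cost = sum(p * (h * max(y - d, 0) + b * max(d - y, 0))
                   for d, p in enumerate(pmf))
        best = min(best, cost)
    return best


rate, h, b = 1.0, 1.0, 3.0  # hypothetical demand rate and cost parameters
scaled = {L: min_newsvendor_cost(rate * L, h, b) / math.sqrt(L)
          for L in (25, 100, 400)}
for L, value in scaled.items():
    print(L, round(value, 3))
```

Under these assumptions the printed ratios should stay close to one another as L grows, which is the behavior that keeps $\sqrt{L} / C_{\min}^{(L)}$ bounded in (48).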

5.3. Sufficient Conditions for Asymptotic Optimality

The following theorem shows that (49) holds if inventory positions and backlog levels converge to their respective targets. Recall that these targets were set at levels at which the average inventory cost can attain its SP-based lower bound (see (19) and (20)). Although meeting these targets exactly is not possible in general, the theorem shows that meeting them on the diffusion scale, which we will prove to be feasible in the next section, is sufficient to guarantee asymptotic optimality.

Theorem 3.

In a family of systems indexed by their longest lead time L, if

\underset{L \to \infty}{\lim \sup} {\sup_{t \geq - {\hat{L}}_{k_{j}}^{(L)}} E [| \widehat{I P}_{j}^{(L)} (t) - \widehat{I \mathbb{P}}_{j}^{(L)} (t) |]} = 0, 1 \leq j \leq n,

(50)

and

\underset{L \to \infty}{\lim \sup} {\sup_{t \geq 0} E [| {\hat{B}}_{i}^{(L)} (t) - {\hat{B}}_{i}^{* (L)} (t) |]} = 0, 1 \leq i \leq m,

(51)

then (49) holds; that is, our policy is asymptotically optimal on the diffusion scale, and (47) follows as a consequence.

Proof.

Following (17), in the L^th system,

{\underline{C}}^{(L)} = \sum_{k = 1}^{K} h^{k} \cdot (E [y^{k * (L)}] - A^{k} E [D^{K (L)}]) + c \cdot E [B^{* (L)}],

where $y^{k * (L)}$ $(1 \leq k \leq K)$ is obtained from (13) and $B^{* (L)}$ is obtained from (16).

Specializing (22) and (30) in our policy formulation to the L^th system:

E [y^{k * (L)}] = E [Y^{k (L)} (t - L_{k}^{(L)})], 1 \leq k \leq K and E [B^{* (L)}] = E [B^{* (L)} (t)], t \geq 0 .

Moreover, $D^{K (L)}$ has the same distribution as $D^{(L)} (t - L, t)$ $(t \geq 0)$ by definition. Thus,

\begin{array}{rl} {\underline{C}}^{(L)} & = \sum_{k = 1}^{K} h^{k} \cdot (E [Y^{k (L)} (t - L_{k}^{(L)})] - A^{k} E [D^{(L)} (t - L, t)]) + c \cdot E [B^{* (L)} (t)] \\ & = \sum_{k = 1}^{K} h^{k} \cdot (E [I \mathbb{P}^{k (L)} (t - L_{k}^{(L)})] - A^{k} E [D^{(L)} (t - L_{k}^{(L)}, t)]) + c \cdot E [B^{* (L)} (t)], \quad t \geq 0, \end{array}

where the second equality results from the determination of inventory position targets in (25).

By the definition of ${\hat{\underline{C}}}^{(L)}$ and applying the second equality of the above equation:

\begin{array}{rl} {\hat{\underline{C}}}^{(L)} & = \frac{\sum_{k = 1}^{K} h^{k} \cdot (E [I \mathbb{P}^{k (L)} (L t - L_{k}^{(L)})] - A^{k} E [D^{(L)} (L t - L_{k}^{(L)}, L t)]) + c \cdot E [B^{* (L)} (L t)]}{\sqrt{L}} \\ & = \sum_{k = 1}^{K} h^{k} \cdot (E [\widehat{I \mathbb{P}}^{k (L)} (t - {\hat{L}}_{k}^{(L)})] - A^{k} E [{\hat{D}}^{(L)} (t - {\hat{L}}_{k}^{(L)}, t)]) + c \cdot E [{\hat{B}}^{* (L)} (t)], \quad t \geq 0 . \end{array}

Comparing this with (46):

\begin{array}{rl} {\hat{C}}^{(L)} - {\hat{\underline{C}}}^{(L)} = & \underset{T \to \infty}{\lim \sup} \frac{1}{T} \int_{0}^{T} ({\hat{C}}^{(L)} (t) - {\hat{\underline{C}}}^{(L)}) d t \\ \leq & \underset{T \to \infty}{\lim \sup} \frac{1}{T} \int_{0}^{T} \sum_{k = 1}^{K} h^{k} \cdot (E [\widehat{I P}^{k (L)} (t - {\hat{L}}_{k}^{(L)})] - E [\widehat{I \mathbb{P}}^{k (L)} (t - {\hat{L}}_{k}^{(L)})]) d t \\ & + \underset{T \to \infty}{\lim \sup} \frac{1}{T} \int_{0}^{T} c \cdot (E [{\hat{B}}^{(L)} (t)] - E [{\hat{B}}^{* (L)} (t)]) d t . \end{array}

(52)

The theorem follows immediately by applying (50) and (51) to (52). □

6. Proof of Sufficient Conditions

Continuing from Theorem 3, we prove in this section that (50)–(51) are satisfied under target stability conditions to be defined later. To put results obtained in this section in perspective, it was shown in Reiman and Wang (2015) that for systems with identical lead times only (51) was needed because asymptotic optimality can be achieved under a base-stock policy that keeps the inventory positions constant. The convergence (51) was shown in theorem 4 in Reiman and Wang (2015), whereas here it is shown in Corollary 2. The proof of (51) in Reiman and Wang (2015) does not carry over to this setting because it assumes the use of a base stock policy for replenishment.

Target Stability Conditions. Under our inventory policy, $Y^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ , obtained by solving (26), $B^{*} (t)$ ( $t \geq 0$ ), obtained by solving (28)–(29), and $\mathbb{B} (t)$ $(t \geq 0)$ , obtained by solving (31), have the following properties: there exists some constant κ, depending only on $h, b$ , and A, such that on each sample path of these processes,

for all $t_{2} > t_{1} \geq - L_{k_{j}}$ ,
$| Y_{j} (t_{2}) - Y_{j} (t_{1}) | \leq κ \sum_{k > k_{j}} \| D (t_{2} - (L_{k} - L_{k_{j}}), t_{2}) - D (t_{1} - (L_{k} - L_{k_{j}}), t_{1}) \|_{1}, 1 \leq j \leq n,$ (53)
for all $t \geq 0$ ,
$| \mathbb{B}_{i} (t) - B_{i}^{*} (t) | \leq κ \sum_{j = 1}^{n} | Q_{j} (t) - \mathbb{Q}_{j} (t) |, 1 \leq i \leq m,$ (54)
and for all $t_{2} > t_{1} \geq 0$ ,
$| B_{i}^{*} (t_{2}) - B_{i}^{*} (t_{1}) | \leq κ \sum_{j = 1}^{n} | \mathbb{Q}_{j} (t_{2}) - \mathbb{Q}_{j} (t_{1}) |, 1 \leq i \leq m .$ (55)

We will develop an SP solution procedure in Section 7 to satisfy (53)–(55). Here in this section, we assume these conditions hold to prove (50) and (51).

Let us first give an informal explanation of the intuition of our proof. Under our replenishment policy, an inventory position can differ from its target only by exceeding it. When this happens, our policy will stop ordering the component, so its inventory position drops at the same rate as its demand arrival rate. The target may also change as new demand arrives. However, Condition (53) ensures that the latter change in the target is “slow” in comparison with demand arrivals, so the excess of inventory position over its target can be eliminated fast enough to satisfy (50). We then use a similar argument to show that under Conditions (54)–(55), (51) cannot be violated by having a backlog level falling below its target. To show that (51) can also not be violated by having a backlog level exceeding its target, we make use of the property of our allocation policy given in (37); that is, no backlog level will exceed its target when no other backlog level is below its target.

To reduce redundancy and improve generality, in Section 6.1, we define a general problem, referred to as the stochastic tracking model, based on common features of (50) and (51). We develop Theorem 4, which applies to the general model and contains (50) and (51) as special cases.

6.1. Stochastic Tracking Model

Consider a family of systems indexed by L > 0, where the L^th system is associated with the following parameters and processes:

\{ D^{(L)} (t), t \geq - L; \; A; \; s_{l}^{(L)} (l \in K); \; t_{0}^{(L)}; \; T^{(L)} (t), t \geq t_{0}^{(L)}; \; W^{(L)} (t), t \geq t_{0}^{(L)}; \; W_{0}^{(L)} \} .

(56)

The process $D^{(L)} (t)$ ( $t \geq - L)$ is the same compound Poisson demand process defined in Section 5.1.1 for the L^th system, with the arrival rate $\underline{λ}$ and order sizes $S$ that apply to all systems. For the results of this section, it is important to recall that the components of $S$ , $S_{i}$ $(1 \leq i \leq m)$ , are assumed to have a finite moment of order 6. As in Section 5.1.1, the increment of $D^{(L)} (t)$ over interval $(t_{1}, t_{2}]$ is denoted by $D^{(L)} (t_{1}, t_{2})$ $(- L \leq t_{1} < t_{2})$ and the jump of $D^{(L)} (t)$ at time t is denoted by $d^{(L)} (t)$ $(t \geq - L)$ .
The vector $A$ is an m-dimensional vector of nonnegative integers. The value of the vector is the same for all systems and at least one of its elements is strictly positive.
Each system is associated with a set of constants $s_{l}^{(L)}$ ( $l \in K$ ). Values of $s_{l}^{(L)}$ $(l \in K)$ differ between systems, but the index set $K$ , which is finite, remains the same for all systems. In the L^th system,
$0 \leq s_{l}^{(L)} \leq L, (l \in K) .$
The process $T^{(L)} (t)$ $(t \geq t_{0}^{(L)})$ , which we refer to as the target process, is a pure jump process that starts at time $t_{0}^{(L)}$ , where in the L^th system,
$- L + s_{l}^{(L)} \leq t_{0}^{(L)} \leq 0, \text{ and thus } t_{0}^{(L)} - s_{l}^{(L)} \geq - L, \text{ for all } l \in K .$
The process $W^{(L)} (t)$ $(t \geq t_{0}^{(L)})$ , which we refer to as the tracking process, is another pure jump process that also starts at time $t_{0}^{(L)}$ from an initial level $W_{0}^{(L)}$ . The process is defined by
$\begin{array}{rl} W^{(L) -} (t_{0}^{(L)}) & = W_{0}^{(L)}, \\ W^{(L)} (t) & = W^{(L) -} (t) \lor T^{(L)} (t) = T^{(L)} (t) + {(W^{(L) -} (t) - T^{(L)} (t))}^{+}, \quad t \geq t_{0}^{(L)}, \\ W^{(L) -} (t) & = W^{(L)} (t^{-}) - A \cdot d^{(L)} (t), \quad t > t_{0}^{(L)} . \end{array}$ (57)

For convenience, we denote the initial difference between the target and tracking processes by

G_{0}^{(L)} = W^{(L)} (t_{0}^{(L)}) - T^{(L)} (t_{0}^{(L)}) = {(W_{0}^{(L)} - T^{(L)} (t_{0}^{(L)}))}^{+} .

Observe that the tracking process instantly catches up with a target that rises above its level. When the target is lower, the gap can be closed only after a sufficient amount of demand has arrived.
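The recursion (57) can be traced on a small deterministic example. The following Python sketch is a scalar, event-indexed simplification introduced here for illustration: the function name `track`, the list-based inputs, and the numbers are assumptions, with `drops[n]` standing in for $A \cdot d^{(L)} (t)$ at the n-th event.

```python
def track(w0, targets, drops):
    """Tracking recursion in the spirit of (57) on a discrete event grid.

    w0         : initial level W_0 just before the first event
    targets[n] : target level T at event n
    drops[n]   : demand-driven decrease A.d at event n (drops[0] is ignored)
    Returns the tracking level W at each event.
    """
    w = max(w0, targets[0])            # W(t_0) = W_0 v T(t_0)
    path = [w]
    for target, drop in zip(targets[1:], drops[1:]):
        w_minus = w - drop             # W^-(t) = W(t^-) - A.d(t)
        w = max(w_minus, target)       # W(t) = W^-(t) v T(t)
        path.append(w)
    return path


# Hypothetical sample path: the target jumps up at event 2, then drops.
targets = [3, 3, 6, 2, 2, 2, 2]
drops = [0, 1, 0, 1, 1, 1, 1]
path = track(5, targets, drops)
gaps = [w - t for w, t in zip(path, targets)]
print(path)
print(gaps)
```

In this trace the gap is zero immediately after the upward jump of the target, becomes positive when the target drops, and then shrinks by one unit with each unit of demand until it vanishes.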

Let ${\hat{D}}^{(L)} (t_{1}, t_{2})$ $(- 1 \leq t_{1} < t_{2})$ and ${\hat{d}}^{(L)} (t)$ $(t \geq - 1)$ be defined the same as in (39). Let

{\hat{s}}_{l}^{(L)} ≜ \frac{s_{l}^{(L)}}{L} (l \in K) and {\hat{t}}_{0}^{(L)} ≜ \frac{t_{0}^{(L)}}{L} .

Following the conditions imposed on $s_{l}^{(L)}$ and $t_{0}^{(L)}$ in their definitions,

0 \leq {\hat{s}}_{l}^{(L)} \leq 1 \quad \text{and} \quad - 1 + {\hat{s}}_{l}^{(L)} \leq {\hat{t}}_{0}^{(L)} \leq 0, \quad l \in K .

Let

\begin{array}{l} {\hat{T}}^{(L)} (t) ≜ \frac{T^{(L)} (L t)}{\sqrt{L}} \quad \text{and} \quad {\hat{W}}^{(L)} (t) ≜ \frac{W^{(L)} (L t)}{\sqrt{L}}, \quad t \geq {\hat{t}}_{0}^{(L)}, \\ \text{with} \quad {\hat{W}}_{0}^{(L)} ≜ \frac{W_{0}^{(L)}}{\sqrt{L}} \quad \text{and} \quad {\hat{G}}_{0}^{(L)} ≜ \frac{G_{0}^{(L)}}{\sqrt{L}} = {({\hat{W}}_{0}^{(L)} - {\hat{T}}^{(L)} ({\hat{t}}_{0}^{(L)}))}^{+} . \end{array}

(58)

Finally, the definition of the model also requires the following asymptotic Lipschitz continuity condition on the target processes in this family of systems:

Asymptotic Lipschitz Condition. There exists some constant κ that applies to all systems, such that for all $t_{0}^{(L)} \leq t_{1} < t_{2}$ ,

| T^{(L)} (t_{2}) - T^{(L)} (t_{1}) | \leq κ \sum_{l \in K} \| D^{(L)} (t_{2} - s_{l}^{(L)}, t_{2}) - D^{(L)} (t_{1} - s_{l}^{(L)}, t_{1}) \|_{1} + E^{(L)} (t_{1}) + E^{(L)} (t_{2}),

(59)

where $E^{(L)} (t)$ $(t \geq t_{0}^{(L)})$ satisfies the following condition: let

{\hat{E}}^{(L)} (t) ≜ \frac{E^{(L)} (L t)}{\sqrt{L}}, t \geq {\hat{t}}_{0}^{(L)} .

Then

\lim_{L \to \infty} E [\sup_{t \geq {\hat{t}}_{0}^{(L)}} | {\hat{E}}^{(L)} (t) |] = 0 .

(60)

The main conclusion we draw from this model is in Theorem 4, which shows that as L increases, the tracking process converges to the target process on the diffusion scale.

Theorem 4.

Assume that S_i $(1 \leq i \leq m)$ has a finite moment of order 6. If

\lim_{L \to \infty} E [{\hat{G}}_{0}^{(L)}] = 0,

(61)

then

\lim_{L \to \infty} E [\sup_{t \geq {\hat{t}}_{0}^{(L)}} \{ {\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t) \}] = 0 .

(62)

Before proving the theorem, we provide some context and intuition for the result. The convergence in (62) is, in essence, an example of “state-space collapse,” a phenomenon that plays a critical role in the heavy traffic analysis of queueing systems and stochastic processing networks (Reiman 1984, Harrison and Van Mieghem 1997, Bramson 1998, Bell and Williams 2001, Atar et al. 2019). As noted previously, the tracking process ${\hat{W}}^{(L)}$ is able to instantly match the upward jumps of the target process but not the downward jumps. Under the scaling that we introduced, the family of processes ${\hat{D}}^{(L)}$ satisfies a functional central limit theorem (FCLT). This implies that ${\hat{D}}^{(L)}$ has “almost continuous” paths for large L. The assumed asymptotic Lipschitz continuity of ${\hat{T}}^{(L)}$ implies that it has almost continuous paths as well. When ${\hat{W}}^{(L)} (t) > {\hat{T}}^{(L)} (t)$ , it is following total demand downward. This is uncentered demand, so that ${\hat{W}}^{(L)}$ is decreasing at a “rate” that is $O (\sqrt{L})$ when ${\hat{W}}^{(L)} (t) > {\hat{T}}^{(L)} (t)$ . Thus, ${\hat{W}}^{(L)}$ never gets “far away” from ${\hat{T}}^{(L)}$ , and the two-dimensional processes $({\hat{T}}^{(L)} (t), {\hat{W}}^{(L)} (t))$ “collapse” into a one-dimensional process with ${\hat{T}}^{(L)} (t) = {\hat{W}}^{(L)} (t)$ in the limit. The proof of Theorem 4 makes this intuitive description rigorous.

As noted previously, ${\hat{D}}^{(L)}$ satisfies an FCLT. Interestingly, we never actually invoke this fact in our proofs. A key reason is that it would not be enough to obtain the result that we need. FCLTs involve convergence over a finite time interval. To prove asymptotic optimality under the long run average cost criterion that we use, we need uniform convergence over an infinite time interval, which does not follow from the FCLT.

The proof of Theorem 4 is built on the following four lemmas. The first two are restatements of two lemmas in Reiman and Wang (2015), and the next two are new. In all lemmas, order sizes S_i are assumed to have a finite moment of order $2 + δ$ ( $δ > 0$ ), except for Lemma 3, which requires a stronger condition of having a finite moment of order 6.

Lemma 1

(Lemma 2 in Reiman and Wang 2015). For all $i = 1, \dots, m$ ,

E [\sup_{0 \leq τ \leq 1} {\hat{d}}_{i}^{(L)} (τ)] \leq 3 {\underline{λ}}^{1 / (2 + δ)} (1 + η_{i}) L^{- δ / (2 (2 + δ))},

(63)

where $\underline{λ}$ is the demand arrival rate and $η_{i} ≔ E [S_{i}^{2 + δ}]$ $(δ > 0)$ .

Lemma 2

(Lemma 3 in Reiman and Wang 2015). For all $i = 1, \dots, m$ ,

E [\sup_{0 \leq τ \leq L^{- 1 / 4}} | {\hat{D}}_{i}^{(L)} (0, τ) |] \leq (1 + σ_{i i}^{2}) L^{- 1 / 4} .

(64)

where $σ_{i i}$ is the variance of $D_{i} (0, 1)$ .

Let g be any strictly positive constant. Then for all $i = 1, \dots, m$ ,

E [\sup_{L^{- 1 / 4} \leq τ \leq 1} {(| {\hat{D}}_{i}^{(L)} (0, τ) | - \sqrt{L} τ g)}^{+}] \leq \frac{σ_{i i}^{2}}{g} L^{- 1 / 4} .

(65)

Lemma 3

(See Appendix A for Proof). Assume that S_i has a finite moment of order 6 $(1 \leq i \leq m)$ . Let ν be any strictly positive constant. Then for all $i = 1, \dots, m$ ,

\lim_{L \to \infty} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] = 0 .

(66)

Lemma 4

(See Appendix A for Proof). Let ν be any strictly positive constant. Then for all $i = 1, \dots, m$ ,

\lim_{L \to \infty} E [\sum_{τ = 0}^{\infty} \sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}] = 0 .

(67)

The theorem is proved here by showing that there is a uniform upper bound on the expected value on the left-hand side (LHS) of (62) and that this bound converges to zero as L increases.

Proof of Theorem 4.

We prove (62) by bounding ${({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t))}^{+}$ and showing that the bound converges uniformly to zero for all $t \geq {\hat{t}}_{0}^{(L)}$ . By definition,

{\hat{W}}^{(L)} (t) \geq {\hat{T}}^{(L)} (t), t \geq {\hat{t}}_{0}^{(L)},

so to bound ${({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t))}^{+}$ , we only need to consider the case where ${\hat{W}}^{(L)} (t) > {\hat{T}}^{(L)} (t)$ , and for the latter case, only need to consider changes of ${\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t)$ since the last moment before t when ${\hat{W}}^{(L)} (\cdot)$ jumps from ${\hat{W}}^{(L)} (τ^{-}) = {\hat{T}}^{(L)} (τ^{-})$ to ${\hat{W}}^{(L)} (τ) > {\hat{T}}^{(L)} (τ)$ . That moment is defined by

{\tilde{t}}^{(L)} = \sup_{{\hat{t}}_{0}^{(L)} \leq τ \leq t} \{ τ : {\hat{W}}^{(L)} (τ) = {\hat{T}}^{(L)} (τ) \},

or if the set on the RHS is empty, let ${\tilde{t}}^{(L)} = {\hat{t}}_{0}^{(L)}$ . Observe that, although not explicit in its notation, ${\tilde{t}}^{(L)}$ depends on t.

Observe that

| {\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t) | = ({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t)) 1 \{ {\tilde{t}}^{(L)} = {\hat{t}}_{0}^{(L)} \} + ({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t)) 1 \{ {\tilde{t}}^{(L)} > {\hat{t}}_{0}^{(L)} \},

(68)

so the proof of the theorem is reduced to proving

\lim_{L \to \infty} E [\sup_{t \geq {\hat{t}}_{0}^{(L)}} \{ ({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t)) 1 \{ {\tilde{t}}^{(L)} = {\hat{t}}_{0}^{(L)} \} \}] = 0,

(69)

\text{and} \quad \lim_{L \to \infty} E [\sup_{t \geq {\hat{t}}_{0}^{(L)}} \{ ({\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t)) 1 \{ {\tilde{t}}^{(L)} > {\hat{t}}_{0}^{(L)} \} \}] = 0 .

(70)

To prove (69), note that when ${\tilde{t}}^{(L)} = {\hat{t}}_{0}^{(L)}$ , we have ${\hat{W}}^{(L)} (τ) > {\hat{T}}^{(L)} (τ)$ for all $τ \in [{\hat{t}}_{0}^{(L)}, t]$ , so by (57),

{\hat{W}}^{(L)} (t) - {\hat{W}}^{(L)} ({\hat{t}}_{0}^{(L)}) = - A \cdot \frac{D^{(L)} (L {\hat{t}}_{0}^{(L)}, L t)}{\sqrt{L}} = - A \cdot ({\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) + \sqrt{L} μ (t - {\hat{t}}_{0}^{(L)})) .

By applying the previous expression and Condition (59), for any time t $(t \geq {\hat{t}}_{0}^{(L)})$ ,

\begin{array}{rl} {\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t) = & ({\hat{W}}^{(L)} (t) - {\hat{W}}^{(L)} ({\hat{t}}_{0}^{(L)})) + {\hat{G}}_{0}^{(L)} - ({\hat{T}}^{(L)} (t) - {\hat{T}}^{(L)} ({\hat{t}}_{0}^{(L)})) \\ \leq & - A \cdot ({\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) + \sqrt{L} μ (t - {\hat{t}}_{0}^{(L)})) + {\hat{G}}_{0}^{(L)} \\ & + κ \sum_{l \in K} \| {\hat{D}}^{(L)} (t - {\hat{s}}_{l}^{(L)}, t) - {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, {\hat{t}}_{0}^{(L)}) \|_{1} \\ & + | {\hat{E}}^{(L)} (t) | + | {\hat{E}}^{(L)} ({\hat{t}}_{0}^{(L)}) | . \end{array}

Therefore, under Condition (61), and because the error terms ${\hat{E}}^{(L)}$ vanish in expectation by (60), (69) holds if

\begin{array}{l} \lim_{L \to \infty} E \Big[ \sup_{t \geq {\hat{t}}_{0}^{(L)}} \Big\{ - A \cdot [{\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) + \sqrt{L} μ (t - {\hat{t}}_{0}^{(L)})] \\ \quad + κ \sum_{l \in K} \| {\hat{D}}^{(L)} (t - {\hat{s}}_{l}^{(L)}, t) - {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, {\hat{t}}_{0}^{(L)}) \|_{1} \Big\} \Big] = 0, \end{array}

(71)

which we prove next.

Let

ζ = \frac{A \cdot μ}{m (\| A \|_{\infty} + 2 κ | K |)} > 0 \quad \text{and} \quad \tilde{κ} = \| A \|_{\infty} + κ | K | > 0 .

Then

\begin{array}{l} - A \cdot [{\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) + \sqrt{L} μ (t - {\hat{t}}_{0}^{(L)})] + κ \sum_{l \in K} \| {\hat{D}}^{(L)} (t - {\hat{s}}_{l}^{(L)}, t) - {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, {\hat{t}}_{0}^{(L)}) \|_{1} \\ \quad \leq \| A \|_{\infty} \| {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) \|_{1} - \sqrt{L} A \cdot μ (t - {\hat{t}}_{0}^{(L)}) \\ \qquad + κ \sum_{l \in K} (\| {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)}, t) \|_{1} + \| {\hat{D}}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) \|_{1}) \\ \quad \leq \tilde{κ} \sum_{i = 1}^{m} (| {\hat{D}}_{i}^{(L)} ({\hat{t}}_{0}^{(L)}, t) | - \sqrt{L} ζ (t - {\hat{t}}_{0}^{(L)}))^{+} \\ \qquad + κ \sum_{l \in K} \sum_{i = 1}^{m} (| {\hat{D}}_{i}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) | - \sqrt{L} ζ (t - {\hat{t}}_{0}^{(L)}))^{+} . \end{array}

(72)

Because $D^{(L)} (t)$ ( $t \geq - L$ ) has stationary increments,

| {\hat{D}}_{i}^{(L)} ({\hat{t}}_{0}^{(L)}, t) | \overset{d}{=} | {\hat{D}}_{i}^{(L)} ({\hat{t}}_{0}^{(L)} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) | \overset{d}{=} | {\hat{D}}_{i}^{(L)} (0, t - {\hat{t}}_{0}^{(L)}) |, l \in K, 1 \leq i \leq m .

(73)

Because ${\hat{t}}_{0}^{(L)} \in [- 1, 0]$ , for all $i = 1, \dots, m$ ,

\begin{array}{l} \lim_{L \to \infty} E [\sup_{t \geq {\hat{t}}_{0}^{(L)}} {(| {\hat{D}}_{i}^{(L)} (0, t - {\hat{t}}_{0}^{(L)}) | - \sqrt{L} ζ (t - {\hat{t}}_{0}^{(L)}))}^{+}] \\ \quad \leq \lim_{L \to \infty} E [\sup_{- 1 \leq t^{'} \leq 0} \sup_{t \geq t^{'}} {(| {\hat{D}}_{i}^{(L)} (0, t - t^{'}) | - \sqrt{L} ζ (t - t^{'}))}^{+}] \\ \quad = \lim_{L \to \infty} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} ζ t)}^{+}] \\ \quad = 0 \quad \text{(Lemma 3)}, \end{array}

(74)

and (71) follows directly from (72)–(74).

To prove (70), by the definition of ${\tilde{t}}^{(L)}$ , in cases where ${\tilde{t}}^{(L)} > {\hat{t}}_{0}^{(L)}$ ,

{\hat{W}}^{(L)} ({\tilde{t}}^{(L) -}) = {\hat{T}}^{(L)} ({\tilde{t}}^{(L) -}),

and thus

\begin{array}{rl} {\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t) = & {\hat{W}}^{(L)} (t) - {\hat{T}}^{(L)} (t) - [{\hat{W}}^{(L)} ({\tilde{t}}^{(L) -}) - {\hat{T}}^{(L)} ({\tilde{t}}^{(L) -})] \\ = & [{\hat{W}}^{(L)} (t) - {\hat{W}}^{(L)} ({\tilde{t}}^{(L)})] + [{\hat{W}}^{(L)} ({\tilde{t}}^{(L)}) - {\hat{W}}^{(L)} ({\tilde{t}}^{(L) -})] \\ & + [{\hat{T}}^{(L)} ({\tilde{t}}^{(L) -}) - {\hat{T}}^{(L)} ({\tilde{t}}^{(L)})] + [{\hat{T}}^{(L)} ({\tilde{t}}^{(L)}) - {\hat{T}}^{(L)} (t)] . \end{array}

(75)

Because ${\hat{W}}^{(L)} (τ) > {\hat{T}}^{(L)} (τ)$ for all $τ \in ({\tilde{t}}^{(L)}, t]$ , by (57),

\begin{array}{rcl} {\hat{W}}^{(L)} (t) - {\hat{W}}^{(L)} ({\tilde{t}}^{(L)}) & = & - A \cdot ({\hat{D}}^{(L)} ({\tilde{t}}^{(L)}, t) + \sqrt{L} μ (t - {\tilde{t}}^{(L)})) \\ & \leq & ‖ A ‖_{\infty} \sum_{i = 1}^{m} | {\hat{D}}_{i}^{(L)} ({\tilde{t}}^{(L)}, t) | - \sqrt{L} (t - {\tilde{t}}^{(L)}) A \cdot μ, \end{array}

(76)

and

{\hat{W}}^{(L)} ({\tilde{t}}^{(L)}) - {\hat{W}}^{(L)} ({\tilde{t}}^{(L) -}) = - A \cdot {\hat{d}}^{(L)} ({\tilde{t}}^{(L)}) \leq 0 .

(77)

For the last two items in (75), by (59),

\begin{array}{l} | {\hat{T}}^{(L)} ({\tilde{t}}^{(L) -}) - {\hat{T}}^{(L)} ({\tilde{t}}^{(L)}) | \\ \leq κ \sum_{l \in K} (| | {\hat{d}}^{(L)} ({\tilde{t}}^{(L)} - {\hat{s}}_{l}^{(L)}) | |_{1} + | | {\hat{d}}^{(L)} ({\tilde{t}}^{(L)}) | |_{1}) + {\hat{E}}^{(L)} ({\tilde{t}}^{(L) -}) + {\hat{E}}^{(L)} ({\tilde{t}}^{(L)}) \\ = κ \sum_{i = 1}^{m} \sum_{l \in K} (| {\hat{d}}_{i}^{(L)} ({\tilde{t}}^{(L)} - {\hat{s}}_{l}^{(L)}) | + | {\hat{d}}_{i}^{(L)} ({\tilde{t}}^{(L)}) |) + {\hat{E}}^{(L)} ({\tilde{t}}^{(L) -}) + {\hat{E}}^{(L)} ({\tilde{t}}^{(L)}), \end{array}

(78)

and

\begin{array}{l} | {\hat{T}}^{(L)} ({\tilde{t}}^{(L)}) - {\hat{T}}^{(L)} (t) | \\ \leq κ \sum_{l \in K} | | {\hat{D}}^{(L)} (t - {\hat{s}}_{l}^{(L)}, t) - {\hat{D}}^{(L)} ({\tilde{t}}^{(L)} - {\hat{s}}_{l}^{(L)}, {\tilde{t}}^{(L)}) | |_{1} + {\hat{E}}^{(L)} ({\tilde{t}}^{(L)}) + {\hat{E}}^{(L)} (t) \\ = κ \sum_{l \in K} | | {\hat{D}}^{(L)} ({\tilde{t}}^{(L)}, t) - {\hat{D}}^{(L)} ({\tilde{t}}^{(L)} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) | |_{1} + {\hat{E}}^{(L)} ({\tilde{t}}^{(L)}) + {\hat{E}}^{(L)} (t) \\ \leq κ \sum_{i = 1}^{m} \sum_{l \in K} (| {\hat{D}}_{i}^{(L)} ({\tilde{t}}^{(L)}, t) | + | {\hat{D}}_{i}^{(L)} ({\tilde{t}}^{(L)} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) |) + {\hat{E}}^{(L)} ({\tilde{t}}^{(L)}) + {\hat{E}}^{(L)} (t) . \end{array}

(79)

Let

ν = \frac{A \cdot μ}{m (| | A | |_{\infty} + 4 κ | K |)} .

By applying (76), (77), (78), and (79) to bound each item in the last expression of (75) and replacing $A \cdot μ$ with $m (| | A | |_{\infty} + 4 κ | K |) ν$ , we can prove that (70) holds under Condition (61) by showing that for all $i = 1, \dots, m$ and $l \in K$ ,

\lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {(| {\hat{D}}_{i}^{(L)} (t^{'}, t) | - \sqrt{L} (t - t^{'}) ν)}^{+}] = 0,

(80)

\lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {(| {\hat{D}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) | - \sqrt{L} (t - t^{'}) ν)}^{+}] = 0,

(81)

\lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {({\hat{d}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}) - \sqrt{L} (t - t^{'}) ν)}^{+}] = 0,

(82)

and

\lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} < t^{'} \leq t < \infty} {({\hat{d}}_{i}^{(L)} (t^{'}) - \sqrt{L} (t - t^{'}) ν)}^{+}] = 0 .

(83)

For each given i $(1 \leq i \leq m)$ and l $(l \in K)$ ,

\begin{array}{rcl} {\hat{D}}_{i}^{(L)} (t^{'}, t) & \overset{d}{=} & {\hat{D}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) \overset{d}{=} {\hat{D}}_{i}^{(L)} (0, t - t^{'}), \quad {\hat{t}}_{0}^{(L)} \leq t^{'} \leq t, \\ {\hat{d}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}) & \overset{d}{=} & {\hat{d}}_{i}^{(L)} (t^{'}), \quad t^{'} \geq {\hat{t}}_{0}^{(L)} . \end{array}

(Recall that $- 1 + {\hat{s}}_{l}^{(L)} \leq {\hat{t}}_{0}^{(L)}$ , so $t^{'} - {\hat{s}}_{l}^{(L)} \geq - 1$ in the previous expression.) Therefore, (80) and (81) follow from an argument similar to the one that proves (74): by Lemma 3,

\lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {(| {\hat{D}}_{i}^{(L)} (t^{'}, t) | - \sqrt{L} (t - t^{'}) ν)}^{+}] = \lim_{L \to \infty} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] = 0,

and

\begin{array}{l} \lim_{L \to \infty} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {(| {\hat{D}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}, t - {\hat{s}}_{l}^{(L)}) | - \sqrt{L} (t - t^{'}) ν)}^{+}] \\ = \lim_{L \to \infty} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] = 0 . \end{array}

To prove (82), because $- 1 + {\hat{s}}_{l}^{(L)} \leq {\hat{t}}_{0}^{(L)} \leq t^{'} \leq t$ and the demand process is stationary,

\begin{array}{l} E [\sup_{{\hat{t}}_{0}^{(L)} \leq t^{'} \leq t < \infty} {({\hat{d}}_{i}^{(L)} (t^{'} - {\hat{s}}_{l}^{(L)}) - \sqrt{L} (t - t^{'}) ν)}^{+}] \\ \leq E [\sup_{- 1 \leq t^{'} \leq t < \infty} {({\hat{d}}_{i}^{(L)} (t^{'}) - \sqrt{L} (t - t^{'}) ν)}^{+}] \\ \leq E [\sup_{t \geq 0} (\sum_{τ = 0}^{⌊ t ⌋} {({\hat{d}}_{i}^{(L)} (t^{'}) - \sqrt{L} τ ν)}^{+} 1 (t - τ - 1 \leq t^{'} \leq t - τ) + \sup_{- 1 \leq t^{'} \leq 0} {\hat{d}}_{i}^{(L)} (t^{'}))] \\ \leq E [\sum_{τ = 0}^{\infty} \sup_{0 \leq t^{'} \leq 1} {({\hat{d}}_{i}^{(L)} (t^{'}) - \sqrt{L} τ ν)}^{+}] + E [\sup_{- 1 \leq t^{'} \leq 0} {\hat{d}}_{i}^{(L)} (t^{'})], \end{array}

and (82) follows from Lemmas 1 and 4. We then use the same argument to prove (83), and thus complete the proof of (70). □
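The limits (80)–(83) all rest on the same mechanism: a centered demand fluctuation of order $\sqrt{L}$ is compared with a drift term that grows like $\sqrt{L}$ times elapsed time. The following Monte Carlo sketch illustrates this for a single product. It is not part of the proof, and it rests on assumptions that are not taken from the paper: discrete unit periods, i.i.d. demand uniform on {0, 1, 2} (mean 1), the scaling ${\hat{D}}^{(L)} (0, t) = (D (0, L t) - μ L t) / \sqrt{L}$, and a finite horizon in place of the supremum over all $t \geq 0$. The function name `mean_scaled_overshoot` is hypothetical.

```python
import random


def mean_scaled_overshoot(L, nu, horizon=2000, paths=200, seed=0):
    """Estimate E[sup_t (|D_hat(0, t)| - sqrt(L) * nu * t)^+] by simulation.

    In unscaled time s = L * t the quantity inside the supremum equals
    (|D(0, s) - mu * s| - nu * s) / sqrt(L), which is what is computed here.
    """
    rng = random.Random(seed)
    mu = 1.0  # mean of the assumed per-period demand below
    total = 0.0
    for _ in range(paths):
        cumulative, worst = 0, 0.0  # worst starts at 0, giving the positive part
        for s in range(1, horizon + 1):
            cumulative += rng.choice((0, 1, 2))  # assumed i.i.d. demand, mean 1
            worst = max(worst, abs(cumulative - mu * s) - nu * s)
        total += worst / L ** 0.5
    return total / paths


if __name__ == "__main__":
    for L in (25, 100, 400):
        print(L, mean_scaled_overshoot(L, nu=0.1))
```

Under these assumptions the unscaled overshoot does not depend on L, so the estimate decays like $1 / \sqrt{L}$, which is consistent with the limits above.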

6.2. Convergence to Targets

We again consider a family of ATO systems indexed by L, with $D^{(L)} (t)$ $(t \geq - L)$ as the demand process in the Lth system. For each component and for each product, we specify the other parameters in (56) to define an instance of the stochastic tracking model and prove (50) and (51) as corollaries of Theorem 4. In fact, the theorem shows convergence of $E [\sup (\cdot)]$ over an infinite time horizon, which is stronger than the convergence of $\sup E [(\cdot)]$ that is actually needed.

The proofs use the following lemma.

Lemma 5

(See Appendix A for Proof). Let ${\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)})$ be defined in (42) with $t = - {\hat{L}}_{k_{j}}^{(L)}$ $(1 \leq j \leq n)$ and ν be any strictly positive constant. Then

\lim_{L \to \infty} E [{(| {\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) | - \sqrt{L} ν)}^{+}] = 0, 1 \leq j \leq n .

(84)

6.2.1. Proof of Condition (50).

For each component j $(1 \leq j \leq n)$ , define

\begin{array}{l} A = A_{j}, \quad K = \{k_{j} + 1, \dots, K\}, \quad s_{l}^{(L)} = L_{l}^{(L)} - L_{k_{j}}^{(L)} \ (l \in K), \quad t_{0}^{(L)} = - L_{k_{j}}^{(L)}, \\ T^{(L)} (t) = ⌈ \mathbb{IP}_{j}^{(L)} (t) ⌉, \quad t \geq t_{0}^{(L)}, \\ W^{(L)} (t) = I P_{j}^{(L)} (t), \quad t \geq t_{0}^{(L)}, \quad \text{and} \quad W_{0}^{(L)} = - A_{j} \cdot D^{(L)} (- L, - L_{k_{j}}^{(L)}) . \end{array}

(85)

To verify that this is an instance of the stochastic tracking model, note that under our replenishment policy in Algorithm 1, $⌈ \mathbb{IP}_{j}^{(L)} (t) ⌉$ ( $t \geq 0$ ) is a pure jump process, where $\mathbb{IP}_{j}^{(L)}$ denotes the inventory position target and $I P_{j}^{(L)}$ the actual inventory position. Applying (25) to the Lth system gives

\mathbb{IP}_{j}^{(L)} (t) = Y_{j}^{(L)} (t) - A_{j} \cdot D^{(L)} (t + L_{k_{j}}^{(L)} - L, t), \quad t \geq - L_{k_{j}}^{(L)} .

Thus, under (53), the asymptotic Lipschitz condition (59)–(60) is satisfied with $E^{(L)} (t) = 1$ .

By the definition of $I P_{j}^{-} (t)$ in Table 1 and by (27), in the Lth system,

\begin{array}{l} I P_{j}^{(L)} (t) = I P_{j}^{(L) -} (t) \lor ⌈ \mathbb{IP}_{j}^{(L)} (t) ⌉, \quad t \geq - L_{k_{j}}^{(L)}, \\ \text{and} \quad I P_{j}^{(L) -} (t) = I P_{j}^{(L)} (t^{-}) - A_{j} \cdot d^{(L)} (t), \quad t > - L_{k_{j}}^{(L)}, \end{array}

which fits the definition of the tracking process $W^{(L)} (t)$ $(t \geq t_{0}^{(L)})$ . The initial value $W_{0}^{(L)}$ is the inventory position of component j before the placement of the first order at time $- L_{k_{j}}^{(L)}$ .
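The tracking recursion above can be sketched in a few lines. This is only an illustration in discrete periods, not the paper's Algorithm 1: the function name `track_inventory_position` and the argument names are hypothetical, `usage[t]` stands for the component usage $A_{j} \cdot d (t)$ in period t, and `targets[t]` stands for the (possibly fractional) inventory position target before rounding up.

```python
import math


def track_inventory_position(w0, targets, usage):
    """Discrete-period sketch of IP(t) = max(IP(t-) - usage(t), ceil(target(t))).

    w0 is the position before the first order; the position is raised to the
    rounded-up target whenever it falls below it, and is never lowered.
    """
    path = []
    position = w0
    for target, used in zip(targets, usage):
        before_order = position - used  # position just before ordering
        position = max(before_order, math.ceil(target))  # order up to target
        path.append(position)
    return path


if __name__ == "__main__":
    # The position overshoots the target in period 3 and only drifts down with demand.
    print(track_inventory_position(0, [2.3, 2.3, 1.0, 4.5], [0, 1, 0, 2]))
```

The position can sit above the target when the target drops, because orders cannot be cancelled; the tracking model quantifies how quickly such gaps close.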

Corollary 1.

For all $j = 1, \dots, n$ ,

\lim_{L \to \infty} E [\sup_{t \geq - {\hat{L}}_{k_{j}}^{(L)}} | {\widehat{I P}}_{j}^{(L)} (t) - {\widehat{\mathbb{IP}}}_{j}^{(L)} (t) |] = \lim_{L \to \infty} E [\sup_{t \geq - {\hat{L}}_{k_{j}}^{(L)}} | \frac{I P_{j}^{(L)} (L t) - ⌈ \mathbb{IP}_{j}^{(L)} (L t) ⌉}{\sqrt{L}} |] = 0 .

(86)

Proof.

The first equality follows from the definitions in (42) because rounding $\mathbb{IP}_{j}^{(L)} (L t)$ up changes it by less than one, which is negligible after division by $\sqrt{L}$ as $L \to \infty$ . Theorem 4 shows that the second equality holds if

\lim_{L \to \infty} E [{\hat{G}}_{0}^{(L)}] ≜ \lim_{L \to \infty} E [\frac{{(W_{0}^{(L)} - T^{(L)} (t_{0}^{(L)}))}^{+}}{\sqrt{L}}] = 0,

where by (85), for given j $(1 \leq j \leq n)$ ,

{\hat{G}}_{0}^{(L)} = {(\frac{- A_{j} \cdot D^{(L)} (- L, - L_{k_{j}}^{(L)}) - ⌈ \mathbb{IP}_{j}^{(L)} (- L_{k_{j}}^{(L)}) ⌉}{\sqrt{L}})}^{+} .

Applying (25) to the Lth system,

\mathbb{IP}_{j}^{(L)} (- L_{k_{j}}^{(L)}) = Y_{j}^{(L)} (- L_{k_{j}}^{(L)}) - A_{j} \cdot D^{(L)} (- L, - L_{k_{j}}^{(L)}) .

Therefore, by Lemma 5 with $ν = A_{j} \cdot μ$ ,

\lim_{L \to \infty} E [{\hat{G}}_{0}^{(L)}] = \lim_{L \to \infty} E [\frac{{(- Y_{j}^{(L)} (- L_{k_{j}}^{(L)}))}^{+}}{\sqrt{L}}] \leq \lim_{L \to \infty} E [{(| {\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) | - \sqrt{L} A_{j} \cdot μ)}^{+}] = 0 . □

6.2.2. Proof of Condition (51).

It is helpful to first review the processes involved in the analysis. In the Lth system and for product i $(1 \leq i \leq m)$ , $\mathbb{B}_{i}^{(L)} (t)$ $(t \geq 0)$ is the backlog target and $B_{i}^{* (L)} (t)$ $(t \geq 0)$ is the ideal backlog level for reaching our SP-based lower bound. Both are determined by the LP in (43), using actual inventory positions and inventory position targets as their respective inputs. The backlog level under our allocation policy is $B_{i}^{(L)} (t)$ $(t \geq 0)$ . To help with the analysis, we define below an auxiliary process ${\underline{B}}_{i}^{(L)} (t)$ $(t \geq 0)$ , which mimics $B_{i}^{(L)} (t)$ $(t \geq 0)$ , except that it is not allowed to rise above the target $\mathbb{B}_{i}^{(L)} (t)$ $(t \geq 0)$ .

For each product i $(1 \leq i \leq m)$ , we construct an instance of the stochastic tracking model by letting

\begin{array}{l} A = e_{i}, \quad K = \{1, \dots, K\}, \quad s_{l}^{(L)} = L_{l}^{(L)} \ (l \in K), \quad t_{0}^{(L)} = 0, \\ T^{(L)} (t) = - \mathbb{B}_{i}^{(L)} (t), \quad t \geq 0, \\ W^{(L)} (t) = - {\underline{B}}_{i}^{(L)} (t), \quad t \geq 0, \quad \text{and} \quad W_{0}^{(L)} = - D_{i}^{(L)} (- L, 0), \end{array}

(87)

where ${\underline{B}}_{i}^{(L)} (t)$ $(t \geq 0)$ is defined by

\begin{array}{l} {\underline{B}}_{i}^{(L) -} (0) = - W_{0}^{(L)} = D_{i}^{(L)} (- L, 0), \\ {\underline{B}}_{i}^{(L)} (t) = {\underline{B}}_{i}^{(L) -} (t) \land \mathbb{B}_{i}^{(L)} (t), \quad t \geq 0, \\ \text{and} \quad {\underline{B}}_{i}^{(L) -} (t) = {\underline{B}}_{i}^{(L)} (t^{-}) + A \cdot d^{(L)} (t) = {\underline{B}}_{i}^{(L)} (t^{-}) + d_{i}^{(L)} (t), \quad t > 0 . \end{array}

Here, we will show that $- \mathbb{B}_{i}^{(L)} (t)$ $(t \geq 0)$ is a target process, which, by the previous definition, also implies that $- {\underline{B}}_{i}^{(L)} (t)$ $(t \geq 0)$ is a tracking process.

When defining our allocation policy in Section 4.3.2, we showed that $\mathbb{B}_{i}^{(L)} (t)$ ( $t \geq 0$ ) is a pure jump process. To show that $- \mathbb{B}_{i}^{(L)} (t)$ $(t \geq 0)$ is asymptotically Lipschitz continuous, let

E^{(L)} (t) = | B_{i}^{* (L)} (t) - \mathbb{B}_{i}^{(L)} (t) |, \quad t \geq 0 .

(88)

Then

| \mathbb{B}_{i}^{(L)} (t_{1}) - \mathbb{B}_{i}^{(L)} (t_{2}) | \leq | B_{i}^{* (L)} (t_{1}) - B_{i}^{* (L)} (t_{2}) | + E^{(L)} (t_{1}) + E^{(L)} (t_{2}),

which satisfies conditions (59)–(60) because

By applying (29) to (55), there exists some constant κ such that
$\begin{array}{l} | B_{i}^{* (L)} (t_{1}) - B_{i}^{* (L)} (t_{2}) | \leq κ \sum_{j = 1}^{n} (| A_{j} D^{(L)} (t_{2} - L, t_{2}) - A_{j} D^{(L)} (t_{1} - L, t_{1}) | \\ + | Y_{j}^{(L)} (t_{2} - L_{k_{j}}^{(L)}) - Y_{j}^{(L)} (t_{1} - L_{k_{j}}^{(L)}) |), \end{array}$
where under (53), there also exists some constant κ such that
$\begin{array}{l} | Y_{j}^{(L)} (t_{2} - L_{k_{j}}^{(L)}) - Y_{j}^{(L)} (t_{1} - L_{k_{j}}^{(L)}) | \\ \leq κ \sum_{k > k_{j}} | | D^{(L)} (t_{2} - L_{k}^{(L)}, t_{2} - L_{k_{j}}^{(L)}) - D^{(L)} (t_{1} - L_{k}^{(L)}, t_{1} - L_{k_{j}}^{(L)}) | |_{1} \\ \leq κ \sum_{k > k_{j}} (| | D^{(L)} (t_{2} - L_{k}^{(L)}, t_{2}) - D^{(L)} (t_{1} - L_{k}^{(L)}, t_{1}) | |_{1} + | | D^{(L)} (t_{2} - L_{k_{j}}^{(L)}, t_{2}) - D^{(L)} (t_{1} - L_{k_{j}}^{(L)}, t_{1}) | |_{1}) . \end{array}$
By (54), there exists some constant κ such that
$| E^{(L)} (t) | \leq κ \sum_{j = 1}^{n} | Q_{j}^{(L)} (t) - \mathbb{Q}_{j}^{(L)} (t) | = κ \sum_{j = 1}^{n} | I P_{j}^{(L)} (t - L_{k_{j}}^{(L)}) - \mathbb{IP}_{j}^{(L)} (t - L_{k_{j}}^{(L)}) |, \quad t \geq 0 .$
Thus, (60) follows directly from Corollary 1.

Having shown that $(- \mathbb{B}_{i}^{(L)} (t), - {\underline{B}}_{i}^{(L)} (t))$ $(t \geq 0)$ is an instance of the target and tracking processes, we can now prove (51) as the following corollary to Theorem 4.

Corollary 2.

For $i = 1, \dots, m$ , let

{\underline{\hat{B}}}_{i}^{(L)} (t) = \frac{{\underline{B}}_{i}^{(L)} (L t)}{\sqrt{L}} .

Then,

\lim_{L \to \infty} E [\sup_{t \geq 0} | {\hat{\mathbb{B}}}_{i}^{(L)} (t) - {\underline{\hat{B}}}_{i}^{(L)} (t) |] = 0,

(89)

which implies Condition (51).

Proof.

By Theorem 4, (89) holds if

\lim_{L \to \infty} E [| {\hat{\mathbb{B}}}_{i}^{(L)} (0) - {\underline{\hat{B}}}_{i}^{(L)} (0) |] = 0 .

(90)

To prove this initial condition, observe that

\mathbb{B}_{i}^{(L)} (0) - {\underline{B}}_{i}^{(L)} (0) = \mathbb{B}_{i}^{(L)} (0) - \mathbb{B}_{i}^{(L)} (0) \land {\underline{B}}_{i}^{(L) -} (0) = {(\mathbb{B}_{i}^{(L)} (0) - {\underline{B}}_{i}^{(L) -} (0))}^{+} = {(\mathbb{B}_{i}^{(L)} (0) - D_{i}^{(L)} (- L, 0))}^{+} .

Because $\mathbb{B}_{i}^{(L)} (0)$ minimizes (31) with $Q (t) = Q^{(L)} (0)$ , there exists some constant κ, which depends only on matrix A, such that

\mathbb{B}_{i}^{(L)} (0) \leq κ \sum_{j = 1}^{n} {[Q_{j}^{(L)} (0)]}^{+} = κ \sum_{j = 1}^{n} {[A_{j} \cdot D^{(L)} (- L_{k_{j}}^{(L)}, 0) - I P_{j}^{(L)} (- L_{k_{j}}^{(L)})]}^{+} .

Because $\mathbb{IP}_{j}^{(L)} (- L_{k_{j}}^{(L)}) \leq I P_{j}^{(L)} (- L_{k_{j}}^{(L)})$ $(1 \leq j \leq n)$ under our replenishment policy (see (27)),

\begin{array}{rcl} \mathbb{B}_{i}^{(L)} (0) - D_{i}^{(L)} (- L, 0) & \leq & κ \sum_{j = 1}^{n} {[A_{j} \cdot D^{(L)} (- L_{k_{j}}^{(L)}, 0) - \mathbb{IP}_{j}^{(L)} (- L_{k_{j}}^{(L)})]}^{+} - D_{i}^{(L)} (- L, 0) \\ & = & κ \sum_{j = 1}^{n} {[A_{j} \cdot D^{(L)} (- L, 0) - Y_{j}^{(L)} (- L_{k_{j}}^{(L)})]}^{+} - D_{i}^{(L)} (- L, 0) . \end{array}

Let $ν = μ_{i} / (κ n (m \bar{a} + 1) + 1)$ . By its definition, ${\underline{B}}_{i}^{(L)} (0) = \mathbb{B}_{i}^{(L)} (0) \land D_{i}^{(L)} (- L, 0)$ . Therefore,

\begin{array}{l} | {\hat{\mathbb{B}}}_{i}^{(L)} (0) - {\underline{\hat{B}}}_{i}^{(L)} (0) | \\ \leq \frac{1}{\sqrt{L}} {(κ \sum_{j = 1}^{n} {[A_{j} \cdot D^{(L)} (- L, 0) - Y_{j}^{(L)} (- L_{k_{j}}^{(L)})]}^{+} - D_{i}^{(L)} (- L, 0))}^{+} \\ \leq {(κ \sum_{j = 1}^{n} A_{j} \cdot | {\hat{D}}^{(L)} (- 1, 0) | + κ \sum_{j = 1}^{n} | {\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) | - \sqrt{L} μ_{i} - {\hat{D}}_{i}^{(L)} (- 1, 0))}^{+} \\ \leq n κ \bar{a} \sum_{i^{'} = 1}^{m} {(| {\hat{D}}_{i^{'}}^{(L)} (- 1, 0) | - \sqrt{L} ν)}^{+} + κ \sum_{j = 1}^{n} {(| {\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) | - \sqrt{L} ν)}^{+} + {({\hat{D}}_{i}^{(L)} (- 1, 0) - \sqrt{L} ν)}^{+} . \end{array}

Because $D^{(L)} (t)$ $(t \geq - L$ ) is stationary, Lemmas 3 and 5 imply (90), which proves (89).

To show that (89) implies (51),

| {\hat{B}}_{i}^{(L)} (t) - {\hat{B}}_{i}^{* (L)} (t) | \leq | {\hat{B}}_{i}^{(L)} (t) - {\hat{\mathbb{B}}}_{i}^{(L)} (t) | + | {\hat{\mathbb{B}}}_{i}^{(L)} (t) - {\hat{B}}_{i}^{* (L)} (t) | .

By (88), the last term is simply ${\hat{E}}^{(L)} (t)$ , which satisfies (60); thus, we only need to prove

\lim_{L \to \infty} E [\sup_{t \geq 0} | {\hat{B}}_{i}^{(L)} (t) - {\hat{\mathbb{B}}}_{i}^{(L)} (t) |] = 0 .

(91)

As is shown in (34), under our allocation policy,

B_{i}^{(L)} (t) \geq B_{i}^{(L) -} (t) \land \mathbb{B}_{i}^{(L)} (t) = {\underline{B}}_{i}^{(L)} (t), \quad t \geq 0,

and thus

{({\hat{\mathbb{B}}}_{i}^{(L)} (t) - {\hat{B}}_{i}^{(L)} (t))}^{+} \leq {({\hat{\mathbb{B}}}_{i}^{(L)} (t) - {\underline{\hat{B}}}_{i}^{(L)} (t))}^{+} = | {\hat{\mathbb{B}}}_{i}^{(L)} (t) - {\underline{\hat{B}}}_{i}^{(L)} (t) |, \quad t \geq 0 .

Moreover, because Property (37) applies when $B_{i}^{(L)} (t) - \mathbb{B}_{i}^{(L)} (t) \geq 1$ , in all cases,

{\hat{B}}_{i}^{(L)} (t) - {\hat{\mathbb{B}}}_{i}^{(L)} (t) \leq \frac{1}{\sqrt{L}} + \frac{\bar{a}}{\underline{a}} \sum_{i^{'} \neq i} {({\hat{\mathbb{B}}}_{i^{'}}^{(L)} (t) - {\hat{B}}_{i^{'}}^{(L)} (t))}^{+} \leq \frac{1}{\sqrt{L}} + \frac{\bar{a}}{\underline{a}} \sum_{i^{'} \neq i} | {\hat{\mathbb{B}}}_{i^{'}}^{(L)} (t) - {\underline{\hat{B}}}_{i^{'}}^{(L)} (t) |, \quad t \geq 0 .

Combining the above two inequalities,

| {\hat{B}}_{i}^{(L)} (t) - {\hat{\mathbb{B}}}_{i}^{(L)} (t) | \leq \frac{1}{\sqrt{L}} + \frac{\bar{a}}{\underline{a}} \sum_{i^{'} = 1}^{m} | {\hat{\mathbb{B}}}_{i^{'}}^{(L)} (t) - {\underline{\hat{B}}}_{i^{'}}^{(L)} (t) |, \quad t \geq 0,

so (91) follows immediately from (89). □

7. Stability of SP Optimal Solution

In this section, we finalize our analysis by showing that with proper treatments of the SP (13), Conditions (53), (54), and (55) can be satisfied. To this end, we need to address two issues: uniqueness of the optimal solutions and values of Lipschitz constants.

Recall that $\mathbb{B}_{i} (t)$ and $B_{i}^{*} (t)$ $(t \geq 0)$ are optimal solutions to the same LP (28) with different RHS coefficients in constraints. Hence, by Hoffman’s lemma, for each t ( $t \geq 0$ ), there exist $\mathbb{B}_{i} (t)$ and $B_{i}^{*} (t)$ that satisfy (54), and for each $t_{1}$ and $t_{2}$ $(t_{2} > t_{1} \geq 0)$ , there exist $B_{i}^{*} (t_{1})$ and $B_{i}^{*} (t_{2})$ that satisfy (55). Nevertheless, the lemma does not exclude the possibility that, if the optimal solution of (28) is not unique, then to satisfy (55) at the same $t_{1}$ , we may need different $B_{i}^{*} (t_{1})$ for different $t_{2}$ . In this case, a well-defined process $B_{i}^{*} (t)$ $(t \geq 0)$ that satisfies (55) for all $t_{1}$ and $t_{2}$ ( $0 \leq t_{1} < t_{2}$ ) is not guaranteed. To avoid this situation, we use perturbation to keep the optimal solution of (28) unique. As a result, both $\mathbb{B}_{i} (t)$ and $B_{i}^{*} (t)$ $(t \geq 0)$ are uniquely defined processes that satisfy (54) and (55).

It takes significantly more effort to address (53). Because the supports of $D^{k}$ $(1 \leq k \leq K)$ are unbounded, $φ^{k} (y^{k + 1}, \dots, y^{K}, x)$ in (13) are infinite-dimensional problems, so their optimal solutions may not satisfy the continuity condition specified by (53). We develop finite-dimensional LPs to approximate the latter problem and prove that we can always keep the approximation error negligible by keeping the dimension of the LP sufficiently high. We also use perturbation to maintain uniqueness of the optimal solution. Both the approximation and perturbation (which also addresses uniqueness of the optimal solution to (28)) are developed in Section 7.1.

In the asymptotic analysis, when lead times increase, probabilities of having larger sample values of $D^{k} (t)$ $(1 \leq k \leq K)$ increase, so the dimension of the approximating LP needs to increase to keep the approximation sufficiently accurate. To sustain (53), we need to rule out the possibility that the Lipschitz constant κ has to grow unboundedly with the problem dimension. In Section 7.2, we show that κ can be kept at a finite value regardless of how large the dimension of the LP becomes.

7.1. Approximation and Perturbation

We develop finite-dimensional approximations to SP (13) by taking the following steps. Let $M$ be an m-dimensional vector with all entries equal to an integer M > 0. Denote $D^{k} \land M$ by $D_{M}^{k}$ ( $1 \leq k \leq K$ ). Replace $D^{k}$ in (13) with $D_{M}^{k}$ in $φ^{k} (y^{k + 1}, \dots, y^{K}, x)$ ( $1 \leq k \leq K$ ) to define

\begin{array}{rcl} φ_{M}^{K} & = & \min_{y^{K} \in \mathbb{R}^{n_{K}}} \{h^{K} \cdot y^{K} + E [φ_{M}^{K - 1} (y^{K}, D_{M}^{K})]\}, \\ φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x) & = & \min_{y^{k} \in \mathbb{R}^{n_{k}}} \{h^{k} \cdot y^{k} + E [φ_{M}^{k - 1} (y^{k}, \dots, y^{K}, x + D_{M}^{k})]\}, \quad 1 \leq k < K, \\ φ_{M}^{0} (y^{1}, \dots, y^{K}, x) & = & φ^{0} (y^{1}, \dots, y^{K}, x) = - \max_{z \in \mathbb{R}^{m}} \{c \cdot z \mid z \leq x, \ A^{k} z \leq y^{k}, \ 1 \leq k \leq K\} . \end{array}

(92)

Because $D^{k}$ are integers, for given M, $D_{M}^{k}$ have a finite support ( $1 \leq k \leq K$ ). Hence, for each k ( $1 \leq k \leq K$ ), $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ is a finite-dimensional problem, which can be equivalently formulated as an LP. The LP formulation is described in detail in Section 3.1.3 of Shapiro et al. (2009). Appendix C describes the detailed steps of specializing this standard process to (92), which leads to
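To make the truncation step concrete, the sketch below builds the distribution of $D \land M$ for a scalar demand and shows that its support is finite while its mean approaches that of the untruncated demand as M grows. The Poisson distribution and the function names `truncated_poisson_pmf` and `mean` are assumptions for illustration only; the paper requires only integer-valued demand.

```python
import math


def truncated_poisson_pmf(rate, M):
    """Return the pmf of min(D, M) for D ~ Poisson(rate) as a list of length M + 1.

    All probability mass of {D >= M} is lumped onto the value M.
    """
    pmf = [math.exp(-rate) * rate ** d / math.factorial(d) for d in range(M)]
    pmf.append(1.0 - sum(pmf))  # P(D >= M)
    return pmf


def mean(pmf):
    """Mean of a distribution on {0, 1, ..., len(pmf) - 1}."""
    return sum(d * p for d, p in enumerate(pmf))


if __name__ == "__main__":
    for M in (3, 5, 10, 20):
        pmf = truncated_poisson_pmf(2.0, M)
        print(M, len(pmf), round(mean(pmf), 6))
```

Because the support has only M + 1 points per stage, the expectation in each stage of the truncated problem becomes a finite sum, which is what allows the LP reformulation.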

φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x) = \min_{y^{k}, \dots, y^{1}, z} \{h^{k} \cdot y^{k} + \sum_{k^{'} = 1}^{k - 1} \sum_{\bar{ω} \in Ω_{k}^{k^{'}}} P (\bar{ω}) h^{k^{'}} \cdot y^{k^{'}} (\bar{ω}) - \sum_{\bar{ω} \in Ω_{k}^{0}} P (\bar{ω}) c \cdot z (\bar{ω})\},

(93)

subject to

z (\bar{ω}) \leq x + {\underline{D}}_{M}^{k} (\bar{ω}), \bar{ω} \in Ω_{k}^{0},

(94)

A^{k^{'}} z (\bar{ω}) - y^{k^{'}} ({\bar{ω}}^{'}) \leq 0, \bar{ω} \in Ω_{k}^{0}, {\bar{ω}}^{'} \in Ω_{k}^{k^{'}}, {\bar{ω}}^{'} ⊏ \bar{ω}, 1 \leq k^{'} < k,

(95)

A^{k} z (\bar{ω}) - y^{k} \leq 0, \bar{ω} \in Ω_{k}^{0},

(96)

A^{k^{'}} z (\bar{ω}) \leq y^{k^{'}}, \bar{ω} \in Ω_{k}^{0}, k < k^{'} \leq K .

(97)

Here $Ω_{k}^{k^{'}}$ is the set of strings that encode sample paths $(D_{M}^{k^{'} + 1}, \dots, D_{M}^{k}), 0 \leq k^{'} < k \leq K$ . For each sample path $\bar{ω} \in Ω_{k}^{0}$ ,

{\underline{D}}_{M}^{k} (\bar{ω}) = D_{M}^{k} (\bar{ω}) + \dots + D_{M}^{1} (\bar{ω}), 1 \leq k \leq K .

The probability attached to sample path $\bar{ω}$ is denoted by $P (\bar{ω})$ . For ${\bar{ω}}^{'} \in Ω_{k}^{k^{'}}$ and $\bar{ω} \in Ω_{k}^{0}$ ( $0 < k^{'} < k, 1 < k \leq K$ ), we write ${\bar{ω}}^{'} ⊏ \bar{ω}$ if the sample path $(D_{M}^{k^{'} + 1}, \dots, D_{M}^{k})$ encoded by ${\bar{ω}}^{'}$ is on the same sample path $(D_{M}^{1}, \dots, D_{M}^{k})$ encoded by $\bar{ω}$ .
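The scenario sets and the relation $⊏$ can be sketched as follows for a single product, so that each stage demand is a scalar in {0, ..., M}. This is an illustrative encoding only: representing a string as a Python tuple, and the function names `sample_paths` and `is_subpath`, are assumptions not taken from the paper or its Appendix C.

```python
from itertools import product


def sample_paths(M, k_lo, k_hi):
    """All strings (D^{k_lo + 1}, ..., D^{k_hi}) with entries in {0, ..., M}.

    With k_lo = k' and k_hi = k this enumerates the set written as Omega_k^{k'}.
    """
    return list(product(range(M + 1), repeat=k_hi - k_lo))


def is_subpath(omega_prime, omega, k_lo):
    """Check whether omega_prime lies on the full path omega.

    omega encodes (D^1, ..., D^k); omega_prime encodes (D^{k_lo + 1}, ..., D^k),
    so it must coincide with the last k - k_lo entries of omega.
    """
    return tuple(omega[k_lo:]) == tuple(omega_prime)


if __name__ == "__main__":
    full = sample_paths(2, 0, 3)      # Omega_3^0
    partial = sample_paths(2, 1, 3)   # Omega_3^1
    print(len(full), len(partial))
    print(is_subpath((1, 2), (0, 1, 2), 1))
```

The number of strings grows as $(M + 1)^{k - k^{'}}$ in this scalar case, which is why the LP dimension increases with M.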

We perturb coefficients in the objective function (93) to keep the optimal solution of the LP unique. The approach is standard, so we leave the details to Appendix C. As a general description, because $A^{k}$ and $D_{M}^{k}$ $(1 \leq k \leq K)$ are integer-valued, the optimal solution must consist of rational numbers when k = K, and by induction, so must the optimal solutions when $k = K - 1, \dots, 1$ . By adding small, distinct irrational values, we replace $c, h^{k}$ , and $P (\bar{ω})$ in (93) with their perturbed values, which are denoted, respectively, by $\tilde{c}, {\tilde{h}}^{k}$ ( $1 \leq k \leq K$ ), and $\tilde{P} (\bar{ω})$ ( $\bar{ω} \in Ω_{k}^{l}, 0 \leq l < k \leq K$ ). The use of the perturbed coefficients removes the possibility that two different rational solutions yield the same objective value, and thus guarantees the uniqueness of the optimal solution.
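A toy example may help to see why perturbing the objective restores uniqueness. The LP below, maximize $c \cdot z$ over $z \geq 0$, $z_{1} + z_{2} \leq 1$, is hypothetical and much simpler than (93)–(97); with c = (1, 1) two vertices tie, and adding small multiples of $\sqrt{2}$ and $\sqrt{3}$ breaks the tie. Floating-point numbers only approximate irrational values, so this is an illustration of the idea rather than the construction in Appendix C.

```python
import math

# Vertices of the feasible region {z >= 0, z1 + z2 <= 1} of the toy LP.
VERTICES = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]


def optimal_vertices(c, tol=1e-12):
    """Return every vertex whose objective value c . z is within tol of the maximum."""
    values = [c[0] * z[0] + c[1] * z[1] for z in VERTICES]
    best = max(values)
    return [z for z, v in zip(VERTICES, values) if best - v <= tol]


c = (1.0, 1.0)
delta = 1e-6  # perturbation range, playing the role of a small Delta
c_perturbed = (c[0] + delta * math.sqrt(2), c[1] + delta * math.sqrt(3))

if __name__ == "__main__":
    print(optimal_vertices(c))            # two tied vertices
    print(optimal_vertices(c_perturbed))  # a single vertex remains optimal
```

The perturbed objective changes the optimal value by at most a multiple of delta, which is the trade-off that the range Δ in (98) controls.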

In our following discussion, we will refer to $φ_{M}^{k} (\cdot)$ as ${\tilde{φ}}_{M}^{k} (\cdot)$ $(1 \leq k \leq K)$ if they correspond to the stage k problem of (92), or equivalently, (93)–(97), with perturbed coefficient values. At each stage, the perturbation does not change with inputs $y^{k + 1}, \dots, y^{K}$ $(1 \leq k \leq K$ ) and $x$ . In other words, the same perturbed coefficients apply to all instances, including cases when the problem is solved repeatedly over time with different inputs to execute our inventory policies.

For this truncated and perturbed SP to be an effective proxy of the SP in (13), its optimal objective value, ${\tilde{φ}}_{M}^{K}$ , must stay close to that of the original problem. To this end, we define Δ as the range of perturbation, that is,

| | \tilde{c} - c | |_{1} \leq Δ, \quad | | {\tilde{h}}^{k} - h^{k} | |_{1} \leq Δ, \quad \text{and} \quad | \tilde{P} (\bar{ω}) - P (\bar{ω}) | < Δ, \quad \bar{ω} \in Ω_{k}^{l}, \ 0 \leq l < k \leq K .

(98)

The following theorem shows the convergence of the two optimal objective values under proper choices of M and Δ.

Theorem 5

(See Appendix A for Proof). For any constant $ϵ > 0$ , there exists $M_{0} > 0$ such that

φ^{K} \leq φ_{M}^{K} < φ^{K} + ϵ \quad \text{for all } M > M_{0} .

(99)

There also exists $Δ > 0$ , such that for all $\tilde{c}, {\tilde{h}}^{k}$ ( $1 \leq k \leq K$ ), and $\tilde{P} (\bar{ω})$ ( $\bar{ω} \in Ω_{k}^{l}, 0 \leq l < k \leq K$ ) in the range defined by (98),

| {\tilde{φ}}_{M}^{K} - φ_{M}^{K} | \leq ϵ .

(100)

In Appendix C, we show that for any $Δ > 0$ , it is easy to find perturbed coefficients that guarantee the uniqueness of the optimal solution while also fitting into the range defined in (98). Hence, the theorem indicates that the lower bound on the average inventory cost,

\underline{C} = φ^{K} + b \cdot E [\bar{D}],

can always be approximated by

{\tilde{φ}}_{M}^{K} + b \cdot E [\bar{D}]

with negligible error.

For inventory control, we incorporate the previous approximation into our inventory policy to specify a unique trajectory of targets on each sample path. In the replenishment policy defined in Algorithm 1, instead of (26), we set inventory position targets at Step 2(a) by solving the stage k problem of (92) (or equivalently, the LP in (93)–(97)),

{\tilde{φ}}_{M}^{k} (y^{k + 1}, \dots, y^{K}, x) = \min_{y^{k} \in \mathbb{R}^{n_{k}}} \{{\tilde{h}}^{k} \cdot y^{k} + E [{\tilde{φ}}_{M}^{k - 1} (y^{k}, y^{k + 1}, \dots, y^{K}, x + D_{M}^{k})]\},

using as inputs

y^{k^{'}} = Y^{k^{'}} (t + L_{k} - L_{k^{'}}) \ (k < k^{'} \leq K) \quad \text{and} \quad x = D (t + L_{k} - L_{K}, t) .

(101)

Observe a subtle difference from (92): here only demands of future scenarios are truncated to $D_{M}^{k}$ , which keeps the dimension of the problem finite. Demands that have already arrived by time t, $D (t + L_{k} - L_{K}, t)$ , enter the SP as inputs without truncation, allowing their amounts to be fully accounted for in the determination of desirable inventory positions.

In the allocation policy implemented by Algorithm 2, (31) is equivalent to $φ_{M}^{0} (\cdot)$ in (92) for all M, and we solve the LP with $c$ substituted by its perturbed values of $\tilde{c}$ . Similarly, as the input we use

x = D (t - L_{K}, t),

that is, the LP is solved without truncating realized demands.

To evaluate the performance of the targets set by the above process, we consider how their values and resulting inventory costs can be represented by solutions and objective values of the approximating SPs. Let $y_{p}^{k}$ $(1 \leq k \leq K)$ denote the optimal solution to stage k $(1 \leq k \leq K$ ) and $z_{p}$ denote the optimal solution to the stage 0 problem of (92) with perturbed coefficient values. Inputs to the stage k problem are ( $k = 0, \dots, K$ ),

y^{k^{'}} = y_{p}^{k^{'}}, k^{'} = k + 1, \dots, K,

which are the optimal solutions of higher stages on the same sample path, and importantly,

x = D^{K} + \dots + D^{k + 1},

which corresponds to realized demand, $D (t - L_{K}, t + L_{k} - L_{K})$ $(1 \leq k \leq K)$ , without truncation. Here the subscript p stands for policy, as these solutions correspond to target setting in our inventory policy. Referring to Section 4.1, when these approximating SPs are used in place of the original SP (13), $y_{p}^{k}$ replaces $y^{k *}$ $(1 \leq k \leq K)$ on the RHS of (19) and $\bar{D} - z_{p}$ replaces $B^{*}$ on the RHS of (20). Following the same analysis of (18), when inventory positions and backlog levels are kept at these replaced targets, the expected inventory cost rate at time t $(t \geq 0)$ is

\begin{array}{l} \tilde{C_{p}} (t) = \sum_{k = 1}^{K} {\tilde{h}}^{k} \cdot E [y_{p}^{k}] - \tilde{c} \cdot E [D (t - L_{K}, t) - B (t)] + b \cdot E [D (t - L_{K}, t)] \\ = {\tilde{φ}}_{(p), M}^{K} + b \cdot E [\bar{D}], \end{array}

where

{\tilde{φ}}_{(p), M}^{K} = \sum_{k = 1}^{K} {\tilde{h}}^{k} \cdot E [y_{p}^{k}] - \tilde{c} \cdot E [z_{p}] .

It is important to observe that the values of ${\tilde{φ}}_{(p), M}^{K}$ and ${\tilde{φ}}_{M}^{K}$ may differ even though they both are based on the same problem formulation in (92). The latter is the optimal objective value when the problem is solved in its entirety, with perturbed coefficients and truncated demand input,

x = D^{K} \land M + \dots + D^{k + 1} \land M,

at every stage k $(1 \leq k \leq K)$ . On the other hand, ${\tilde{φ}}_{(p), M}^{K}$ is obtained when the subproblems of (92) are solved separately, with the aforementioned use of demand inputs without truncation. Therefore, although Theorem 5 shows that ${\tilde{φ}}_{M}^{K} + b \cdot E [\bar{D}]$ is a close approximation of the lower bound on the inventory cost, the discrepancy prevents the direct application of the theorem’s conclusion to ${\tilde{φ}}_{(p), M}^{K} + b \cdot E [\bar{D}]$ . The following corollary closes this gap.

Corollary 3

(See Appendix A for Proof). Under any truncation parameter M and perturbed coefficient values,

{\tilde{φ}}_{(p), M}^{K} \leq {\tilde{φ}}_{M}^{K} .

(102)

Hence, given the same ϵ, $M_{0}$ , and Δ in Theorem 5, for all $M \geq M_{0}$ ,

{\tilde{φ}}_{(p), M}^{K} + b \cdot E [\bar{D}] \leq \underline{C} + 2 ϵ .

(103)

The corollary shows that we can use a perturbed finite-dimensional version of the SP (13) in our policy without compromising asymptotic optimality: fix a constant $ϵ > 0$ , and for each system L, choose a proper constant M and range Δ to keep

| φ^{K (L)} - {\tilde{φ}}_{M}^{K (L)} | \leq 2 ϵ .

This difference is asymptotically negligible on the diffusion scale.

The remaining question is whether the previous development can provide us with the targets we need. For backlogs, the answer is immediate. Because $φ_{M}^{0}$ in (92) is the same LP as (31) for all M, we only need to replace $c$ with $\tilde{c}$ and keep the latter value the same in all cases. Hoffman’s lemma immediately leads to the following.

Lemma 6.

Let $\mathbb{B} (t)$ be the (unique) optimal solution of (31) and $B^{*} (t)$ be the (unique) optimal solution of the same LP with $Q^{k} (t)$ replaced by $\mathbb{Q}^{k} (t)$ $(1 \leq k \leq K, t \geq 0)$ . Then $\mathbb{B} (t)$ $(t \geq 0)$ and $B^{*} (t)$ $(t \geq 0)$ satisfy (54) and $B^{*} (t)$ $(t \geq 0)$ satisfies (55).

For inventory position targets, the analysis is much more involved and given in Section 7.2.
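As a hedged illustration of the Lipschitz property behind Lemma 6 (it is not part of the original argument), consider the stage-0 LP in (92) with a single product and a single component. The scalar usage $a > 0$ and reward $c > 0$ are assumptions made only for this example. The optimal solution is then available in closed form and is Lipschitz in the inputs:

```latex
z^{*} (x, y) = \arg\max_{z} \{ c z \mid z \leq x, \ a z \leq y \} = \min (x, y / a),
\qquad
| z^{*} (x_{a}, y_{a}) - z^{*} (x_{b}, y_{b}) | \leq \max ( | x_{a} - x_{b} |, \ | y_{a} - y_{b} | / a ) .
```

In this toy case the Lipschitz constant depends only on the constraint coefficient $a$ , which mirrors the role of κ in (54) and (55).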

7.2. Values of Lipschitz Constants

The approximation and perturbation scheme in Section 7.1 is developed to set inventory position targets that satisfy (53) and backlog targets that satisfy (54)–(55). In the asymptotic analysis, the LP (31) stays the same when L changes, so (54)–(55) are satisfied as long as the optimal solution is kept unique by the aforementioned perturbation. On the other hand, as L increases, larger M is needed in (93)–(97) for an accurate approximation of the demand distribution. For (53) to hold, the value of κ needs to be independent of the change of M. Theorem 6 and its corollary show that such a value always exists.

Theorem 6.

For $k = 1, \dots, K$ , let $y_{M, a}^{k *}$ and $y_{M, b}^{k *}$ be (unique) optimal solutions to (93)–(97) under inputs $(y_{a}^{k + 1}, \dots, y_{a}^{K}, x_{a})$ and $(y_{b}^{k + 1}, \dots, y_{b}^{K}, x_{b})$ , respectively. Then

| | y_{M, a}^{k *} - y_{M, b}^{k *} | |_{\infty} \leq κ (| | y_{a}^{k + 1} - y_{b}^{k + 1} | |_{\infty} + \dots + | | y_{a}^{K} - y_{b}^{K} | |_{\infty} + | | x_{a} - x_{b} | |_{\infty}),

(104)

where κ depends only on $(m, n_{1}, \dots, n_{K})$ and $A^{k}$ $(1 \leq k \leq K)$ .

Before proving the theorem, we first show its implication by deriving a corollary: as is specified in Section 7.1, we solve (93)–(97) to obtain values of $Y^{k} (t)$ ( $t \geq - L_{k}, 1 \leq k \leq K$ ) for implementing our replenishment policy. For this purpose, $Y^{k} (t)$ $(t \geq - L_{k})$ needs to satisfy (53), which is clearly satisfied when k = K, in which case these values are kept constant. Theorem 6 allows us to use induction to extend (53) to k < K, which is stated as the following corollary.

Corollary 4.

For $t \geq 0$ , let $Y^{k} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ be the (unique) optimal solution of $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ defined in (93)–(97), with inputs given by (101). Then (53) holds for a constant κ that depends only on $(m, n_{1}, \dots, n_{K})$ and A^k $(1 \leq k \leq K)$ .

Proof of Corollary 4.

For given k, suppose that (53) holds for all l > k (which is the case for $k = K - 1$ ). This means that there exists some κ, depending only on $(m, n_{1}, \dots, n_{K})$ and A^k $(1 \leq k \leq K)$ , such that for all $- L_{l} \leq t_{1} < t_{2}$ ,

\begin{array}{l} | | Y^{l} (t_{2} - (L_{l} - L_{k})) - Y^{l} (t_{1} - (L_{l} - L_{k})) | |_{1} \\ \leq κ \sum_{k^{'} = l + 1}^{K} | | D (t_{2} - (L_{k^{'}} - L_{k}), t_{2} - (L_{l} - L_{k})) - D (t_{1} - (L_{k^{'}} - L_{k}), t_{1} - (L_{l} - L_{k})) | |_{1}, \\ = κ \sum_{k^{'} = l + 1}^{K} | | {D (t_{2} - (L_{k^{'}} - L_{k}), t_{2}) - D (t_{2} - (L_{l} - L_{k}), t_{2})} \\ - {D (t_{1} - (L_{k^{'}} - L_{k}), t_{1}) - D (t_{1} - (L_{l} - L_{k}), t_{1})} | |_{1} . \end{array}

(105)

Because $Y^{k} (t)$ ( $t \geq - L_{k}$ ) is obtained by solving (93)–(97) using (101) for inputs $y^{l}$ ( $k < l \leq K$ ) and $x$ , by Theorem 6,

\begin{array}{l} | | Y^{k} (t_{2}) - Y^{k} (t_{1}) | |_{1} \leq κ \sum_{l = k + 1}^{K} | | Y^{l} (t_{2} - (L_{l} - L_{k})) - Y^{l} (t_{1} - (L_{l} - L_{k})) | |_{1} \\ + κ | | D (t_{2} - (L_{K} - L_{k}), t_{2}) - D (t_{1} - (L_{K} - L_{k}), t_{1}) | |_{1} \\ \leq κ (K - k) \sum_{l = k + 1}^{K} | | D (t_{2} - (L_{l} - L_{k}), t_{2}) - D (t_{1} - (L_{l} - L_{k}), t_{1}) | |_{1}, \end{array}

(106)

which shows that (53) holds for k. □

The remainder of this section is dedicated to proving Theorem 6. We start from the following theorem 2.4 in Mangasarian and Shiau (1987) (for convenience, we denote $u$ and $v$ in their theorem by $\bar{u}$ and $\bar{v}$ , respectively, here):

Let $β \geq 1$ and $β^{*} = 1 / (1 - 1 / β)$ (so that $| | x | |_{β^{*}}$ is the dual norm of $| | x | |_{β}$ ). Let the linear program

\max_{x} \{p \cdot x \ \ \text{s.t.} \ \ A x \leq b, \ C x = d\}

have nonempty solution sets $S^{1}$ and $S^{2}$ for RHSs $(b^{1}, d^{1})$ and $(b^{2}, d^{2})$, respectively. For each $x^{1} \in S^{1}$, there exists $x^{2} \in S^{2}$ such that

| | x^{1} - x^{2} | |_{\infty} \leq ν_{β} (A; C) {\left\| \begin{matrix} b^{1} - b^{2} \\ d^{1} - d^{2} \end{matrix} \right\|}_{β},

where

ν_{β} (A; C) ≔ \sup_{\bar{u}, \bar{v}} \left\{ {\left\| \begin{matrix} \bar{u} \\ \bar{v} \end{matrix} \right\|}_{β^{*}} \ \middle| \ \begin{array}{l} | | \bar{u} A + \bar{v} C | |_{1} = 1, \\ \text{rows of } (\begin{matrix} A \\ C \end{matrix}) \text{ corresponding to nonzero} \\ \text{elements of } (\begin{matrix} \bar{u} \\ \bar{v} \end{matrix}) \text{ are linearly independent} \end{array} \right\} .

To apply the theorem to $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ defined in (93)–(97), let $A^{k}$ be the LHS constraint matrix of the latter LP. Because there is no equality constraint, we can ignore $\bar{v}$ and C. Let $β = \infty$ , and thus $β^{*} = 1$ . Then $ν_{β} (A; C)$ and $\bar{u}$ in the theorem specialize to

ν_{\infty} (A^{k}) ≔ \sup_{\bar{u}} | | \bar{u} | |_{1},

where $| | \bar{u} A^{k} | |_{1} = 1$, and rows of $A^{k}$ corresponding to nonzero elements of $\bar{u}$ are linearly independent.

The following lemma builds a critical connection between the formulation of the previous quantities and the conclusion of Theorem 6.

Lemma 7

(See Appendix A for Proof). For $k = 1, \dots, K$ , let ${\hat{A}}^{k}$ be a maximal nonsingular submatrix of $A^{k}$ . Let $u$ be a vector that has the same number of components as the number of rows in ${\hat{A}}^{k}$ . Then there exists a constant κ, depending only on $(m, n_{1}, \dots, n_{K})$ and the values of the components of A^k $(1 \leq k \leq K)$ , such that

| | u | |_{1} \leq κ | | u {\hat{A}}^{k} | |_{1} .

(107)

Proving this lemma is quite involved because we need to exploit the particular structure of $A^{k}$ to show that the result, which does not apply to an arbitrary matrix, is true here. As may be seen from the proof of this lemma in Appendix A, it takes some effort to define this structure, especially for cases where k > 1. To help with the understanding of this result, we explain the intuition behind the lemma’s conclusion, after we first use the lemma to give the following proof of Theorem 6.

Proof of Theorem 6.

Let $\bar{u}$ be any vector such that $| | \bar{u} A^{k} | |_{1} = 1$ and rows of $A^{k}$ corresponding to nonzero elements of $\bar{u}$ are linearly independent. Let ${\hat{A}}^{k}$ be a maximal nonsingular submatrix of $A^{k}$ that contains all these independent rows. Let $u$ be a subvector of $\bar{u}$ that is composed of components corresponding to rows in ${\hat{A}}^{k}$ in $\bar{u} A^{k}$ , and thus includes all nonzero elements of $\bar{u}$ .

By Lemma 7, there exists a constant κ, depending only on $(m, n_{1}, \dots, n_{K})$ and component values of $A^{k^{'}}$ $(1 \leq k^{'} \leq K)$ , such that

| | \bar{u} | |_{1} = | | u | |_{1} \leq κ | | u {\hat{A}}^{k} | |_{1} = κ | | \bar{u} A^{k} | |_{1} = κ,

and thus

ν_{\infty} (A^{k}) = \sup_{\bar{u}} | | \bar{u} | |_{1} \leq κ .

By theorem 2.4 in Mangasarian and Shiau (1987), (104) holds with $ν_{\infty} (A^{k})$ as the constant and, because $ν_{\infty} (A^{k}) \leq κ$, it also holds with κ. □

To explain the intuition behind the proof of Lemma 7, we start from the identity

u = u {\hat{A}}^{k} {({\hat{A}}^{k})}^{- 1}, \quad \text{i.e.,} \quad u^{T} = {[{({\hat{A}}^{k})}^{- 1}]}^{T} {[u {\hat{A}}^{k}]}^{T},

(108)

where T denotes transpose. Following standard definitions (Meyer 2000), the L₁ norm of a matrix A is

| | A | |_{1} = \sup_{x \neq 0} \frac{| | A x | |_{1}}{| | x | |_{1}},

and its value is the maximum absolute column sum of A. This is also the value of $| | A^{T} | |_{\infty}$, which is the maximum absolute row sum of that matrix. Applying definitions of these norms to (108) with ${[{({\hat{A}}^{k})}^{- 1}]}^{T}$ in place of A,

| | {({\hat{A}}^{k})}^{- 1} | |_{\infty} = | | {[{({\hat{A}}^{k})}^{- 1}]}^{T} | |_{1} = \sup_{x \neq 0} \frac{| | {[{({\hat{A}}^{k})}^{- 1}]}^{T} x | |_{1}}{| | x | |_{1}} \geq \frac{| | {[{({\hat{A}}^{k})}^{- 1}]}^{T} {[u {\hat{A}}^{k}]}^{T} | |_{1}}{| | {[u {\hat{A}}^{k}]}^{T} | |_{1}} = \frac{| | u | |_{1}}{| | u {\hat{A}}^{k} | |_{1}} .

Thus, we can let κ in (107) be the largest $| | {({\hat{A}}^{k})}^{- 1} | |_{\infty}$ over all ${\hat{A}}^{k}$ (nonsingular submatrices of $A^{k}$ ). What needs to be explained then is why this value of κ can stay bounded as M increases, which leads to more elements in $Ω_{k}^{0}$ and $Ω_{k}^{k^{'}}$ ( $k < k^{'} \leq K$ ) and thus a larger $A^{k}$ .
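
The inequality behind this choice of κ can be checked numerically on a small example. The following is a minimal sketch, assuming a hypothetical 3×3 nonsingular matrix `A_hat` and a hypothetical row vector `u` (neither comes from the paper); it uses exact rational arithmetic to compare $| | u | |_{1}$ with $| | {\hat{A}}^{- 1} | |_{\infty} | | u \hat{A} | |_{1}$.

```python
from fractions import Fraction

def invert(matrix):
    """Invert a square matrix exactly by Gauss-Jordan elimination over Fractions."""
    n = len(matrix)
    # Augment each row with the matching row of the identity matrix.
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        # Find a row with a nonzero pivot (raises StopIteration if singular).
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def inf_norm(matrix):
    """Maximum absolute row sum."""
    return max(sum(abs(v) for v in row) for row in matrix)

def one_norm(vec):
    """Sum of absolute values of the entries."""
    return sum(abs(v) for v in vec)

def row_times_matrix(u, matrix):
    """Row vector u multiplied by matrix, i.e., u A."""
    return [sum(u[i] * matrix[i][j] for i in range(len(u)))
            for j in range(len(matrix[0]))]

# Hypothetical nonsingular matrix and row vector, for illustration only.
A_hat = [[1, 0, 1],
         [1, 1, 0],
         [0, 1, 1]]
u = [Fraction(2), Fraction(-1), Fraction(3)]

kappa = inf_norm(invert(A_hat))
lhs = one_norm(u)
rhs = kappa * one_norm(row_times_matrix(u, A_hat))
print(lhs, "<=", rhs)
```

The check only illustrates the norm inequality for one fixed matrix; it says nothing about how κ behaves as the matrix grows, which is the question the text turns to next.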

Our explanation focuses on k = 1, in which case (95) is irrelevant and

A^{1} = (\begin{matrix} H^{1} & & & \\ & \ddots & & E^{1} \\ & & H^{1} & \end{matrix}),

where $H^{1}$ is the coefficient matrix of $z (\bar{ω})$ for a given $\bar{ω}$ and $E^{1}$ is the coefficient matrix of $y^{1}$. Submatrix $H^{1}$ has m columns, corresponding to the number of components in $z (\bar{ω})$, and $m + n_{1} + \dots + n_{K}$ rows, corresponding to the number of constraints in (94), (96), and (97). Entries of $H^{1}$ are zero, one, or values of components in $A^{k}$ $(1 \leq k \leq K)$.

Any nonsingular submatrix of $A^{1}$ can be written as

{\hat{A}}^{1} = (\begin{matrix} H_{1}^{'} & & & \\ & \ddots & & E^{'} \\ & & H_{N}^{'} & \end{matrix}),

(109)

where $H_{i}^{'}$ is a submatrix of $H^{1}$, N is the number of such submatrices contained in ${\hat{A}}^{1}$, and $E^{'}$ is a submatrix of $E^{1}$. As M increases, the number of block matrices $H^{1}$ in $A^{1}$ increases, as does N and thus the dimension of many submatrices ${\hat{A}}^{1}$, especially the maximal ones of $A^{1}$.

As a thought experiment, suppose that for every aforementioned nonsingular matrix ${\hat{A}}^{1}$ , we can find a matrix $B$ to make the following linear transformation

B {\hat{A}}^{1} = {\hat{A}}_{d} .

Here ${\hat{A}}_{d}$ is a nonsingular matrix in the form of

(\begin{matrix} {\hat{H}}_{1} & & \\ & \ddots & \\ & & {\hat{H}}_{N^{'}} \end{matrix}),

where ${\hat{H}}_{i}$ $(1 \leq i \leq N^{'})$ are nonsingular submatrices with their dimensions bounded by an integer function of m and n₁ and their entries depending only on $A^{k}$ ($1 \leq k \leq K$). Then,

{({\hat{A}}^{1})}^{- 1} = {\hat{A}}_{d}^{- 1} B = (\begin{matrix} {\hat{H}}_{1}^{- 1} & & \\ & \ddots & \\ & & {\hat{H}}_{N^{'}}^{- 1} \end{matrix}) B .

Suppose further that for all ${\hat{A}}^{1}$ , (a) entries of the corresponding matrix $B$ are values in a finite set and (b) the number of nonzero entries on each row of $B$ is bounded by an integer function of m and n₁. Then the previous expression implies that regardless of how large M is, the maximum absolute value of entries of ${({\hat{A}}^{1})}^{- 1}$ stays bounded, and the number of nonzero entries on each row of ${({\hat{A}}^{1})}^{- 1}$ remains bounded. Thus $| | {({\hat{A}}^{1})}^{- 1} | |_{\infty}$ , and hence the value of the aforementioned κ, is bounded by some constant that is independent of M.
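
The block-diagonal part of this thought experiment can be illustrated with a small computation. The sketch below assumes two hypothetical 2×2 nonsingular blocks (they are not taken from the paper), repeats them along the diagonal, and confirms that the maximum absolute row sum of the inverse does not change as the number of blocks grows. The factor $B$ is left out, so this covers only the ${\hat{A}}_{d}^{- 1}$ term.

```python
from fractions import Fraction

def inv2(h):
    """Exact inverse of a nonsingular 2x2 matrix."""
    (a, b), (c, d) = h
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]

def block_diag(blocks):
    """Place square blocks along the diagonal of an otherwise zero matrix."""
    n = sum(len(b) for b in blocks)
    out = [[Fraction(0)] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                out[offset + i][offset + j] = Fraction(v)
        offset += len(b)
    return out

def matmul(x, y):
    """Plain matrix product of two square matrices of equal size."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inf_norm(m):
    """Maximum absolute row sum."""
    return max(sum(abs(v) for v in row) for row in m)

# Hypothetical blocks standing in for the bounded-size nonsingular submatrices.
H_BLOCKS = [[[2, 1], [1, 1]], [[1, 3], [0, 1]]]

def inverse_norm(num_blocks):
    """Infinity norm of the inverse of a block-diagonal matrix with num_blocks blocks."""
    blocks = [H_BLOCKS[i % 2] for i in range(num_blocks)]
    a_d = block_diag(blocks)
    a_d_inv = block_diag([inv2(b) for b in blocks])
    n = len(a_d)
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    # Confirm that inverting block by block really inverts the whole matrix.
    assert matmul(a_d, a_d_inv) == identity
    return inf_norm(a_d_inv)

for n_blocks in (2, 5, 20):
    print(n_blocks, inverse_norm(n_blocks))
```

Because each row of the inverse only touches one block, the row sums are determined by the individual blocks and not by how many of them there are.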

The actual proof of Lemma 7 is based on a similar idea, although we do not need to specify $B$ or carry out the aforementioned transformation explicitly. Roughly speaking, we observe that in (109), all $H^{'}$ s have finite dimensions and $E^{'}$ has a finite number of columns ( $\leq n_{1}$ ) with zero or one as their entries. This structure allows all nonzero entries of ${({\hat{A}}^{1})}^{- 1}$ , which recovers $| | u | |_{1}$ from $| | u {\hat{A}}^{1} | |_{1}$ (in the sense of (108)), to be obtained by inverting submatrices of ${\hat{A}}^{1}$ . These submatrices have finite dimensions and values of their entries do not depend on M. Consequently, in ${({\hat{A}}^{1})}^{- 1}$ , both the number of nonzero entries on each row and their values are bounded, so $| | {({\hat{A}}^{1})}^{- 1} | |_{\infty}$ remains bounded as M increases.

Similar insights are used to prove Lemma 7 for cases with k > 1, but the procedure gets considerably more complicated. For instance, when k = 2, the LHS matrix of (94)–(97) is

A^{2} = (\begin{matrix} H^{2} & & & \\ & \ddots & & E^{2} \\ & & H^{2} & \end{matrix}),

where $H^{2}$ is the coefficient matrix of $z (\bar{ω})$ and $y^{1} ({\bar{ω}}^{'})$, and $E^{2}$ is the coefficient matrix of $y^{2}$. Each $H^{2}$ corresponds to a particular ${\bar{ω}}^{'}$ and contains all $\bar{ω}$ such that ${\bar{ω}}^{'} ⊏ \bar{ω}$ (see (94)–(95) with k = 2 and $k^{'} = 1$). Like $E^{1}$, $E^{2}$ has a fixed number $(n_{2})$ of columns, and like $A^{1}$, entries of $A^{2}$ come from a finite set of values that do not depend on M. Different from $H^{1}$, the dimension of $H^{2}$ is not fixed but grows with M. Because of this difference, we need to introduce additional constructs to define the matrix structure recursively and prove the lemma by induction.

8. Numerical Results

We evaluate the performance of our policy by simulating its application to the examples of ATO systems shown in Figure 1. Each system features two distinct lead times. We vary cost parameters and lead times to generate various cases. For all cases the demand for products consists of independent Poisson processes, with the arrival rate of demand for product i denoted by λ_i. In each case, $\underline{C}$ is the SP lower bound, C_s is the average inventory cost determined by the simulation, and the following optimality gap

Δ = \frac{C_{s} - \underline{C}}{\underline{C}}

serves as the performance metric of our policy.

For each case, we carry out 30 simulation runs. Depending on the lead times, the length of each run ranges from 150,000 to 600,000 time units, ensuring that it is at least 1,250 times the longer lead time. The first one-tenth of the simulation time is for warm-up. Values presented are summaries of outputs from the remaining periods, averaged over the 30 simulation runs.
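
The summary step described here can be sketched as follows. This is only an illustration under stated assumptions: the paper does not describe its simulation code, so the function name, the piecewise-constant cost-rate representation, and the toy trajectories are all hypothetical.

```python
import statistics

def average_cost_after_warmup(event_times, cost_rates, run_length, warmup_fraction=0.1):
    """Time-average a piecewise-constant cost rate after discarding the warm-up.

    cost_rates[i] is the cost rate on [event_times[i], event_times[i + 1]);
    the last rate applies until run_length.
    """
    start = warmup_fraction * run_length
    total = 0.0
    for i, rate in enumerate(cost_rates):
        left = max(event_times[i], start)
        right = event_times[i + 1] if i + 1 < len(event_times) else run_length
        right = min(right, run_length)
        if right > left:  # skip intervals that fall entirely inside the warm-up
            total += rate * (right - left)
    return total / (run_length - start)

# Hypothetical toy trajectories (event times, cost rates); not simulation output.
runs = [
    ([0, 5, 20], [10, 2, 4]),
    ([0, 8, 50], [6, 3, 5]),
]
per_run = [average_cost_after_warmup(t, r, run_length=100) for t, r in runs]
overall = statistics.mean(per_run)  # average over runs, as in the text
print(per_run, overall)
```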

We first consider the W system, using 27 parameter sets given in Doğru et al. (2010, section 4.1). The inventory holding cost of the common component, h₀, is normalized to unity and demand arrival rates are kept at $λ_{1} = λ_{2} = 25$ . Values of h₁, h₂, b₁, and b₂, which are shown in the tables, are chosen to cover a wide range of cost relationships. Components 1 and 2 have the same lead time, which differs from that of component 0.

Table 5 shows optimality gaps when component 0 has the shorter lead time. Entries marked by * correspond to cases where the SP solution is within the 95% confidence interval (CI) of the average cost estimated by 30 simulation runs. In these cases, the difference between the inventory cost under our policy and its lower bound is not statistically significant. In all other cases, the optimality gap decreases as the lead times increase, consistent with the trend predicted by Theorem 3. The optimality gaps fall to a level that is close to or below 1% in many cases when $(L_{1}, L_{2}) = (20, 30)$ , and in a majority of cases when $(L_{1}, L_{2}) = (40, 60)$ . We stop running simulations for these cases with longer lead times (as indicated by “-” in the table). In a few cases, the optimality gap stays significantly above 1% in all simulations but nevertheless follows a clear pattern of converging to zero. These cases (e.g., 15, 21, 27) are normally associated with a high $c_{1} / c_{2}$ ratio. Discussions in section 4.2 in Doğru et al. (2010) explain why the gap tends to be larger under these parameter values for systems with identical lead times. The same intuition applies here for systems with nonidentical lead times.

Table 3. Centered and Scaled Processes and Vectors $(1 \leq k \leq K)$


Process/vector	$D^{(L)} (t_{1}, t_{2})$	$D^{k (L)}$	$Y^{k (L)} (t)$	$I ℙ^{k (L)} (t)$	$I P^{k (L)} (t)$
Centering	$L (t_{2} - t_{1}) μ$	$(L_{k}^{(L)} - L_{k - 1}^{(L)}) μ$	$L A^{k} μ$	$L_{k}^{(L)} A^{k} μ$	$L_{k}^{(L)} A^{k} μ$
Centered and scaled	${\hat{D}}^{(L)} (t_{1}, t_{2})$	${\hat{D}}^{k (L)}$	${\hat{Y}}^{k (L)} (t)$	${\widehat{I ℙ}}^{k (L)} (t)$	${\widehat{I P}}^{k (L)} (t)$

In Table 6, we use the same parameter set but let the common component have the longer lead time and the other two components have the same shorter lead time. Like results in Table 5, it is evident that in each case, the optimality gap converges to zero as the lead times increase. Nevertheless, these gaps are generally larger here. For instance, when $(L_{1}, L_{2}) = (160, 240)$ , the gap is close to 2% in cases 21 and 27. We have additional runs for these two cases with $(L_{1}, L_{2}) = (320, 480)$ and found the gap drops to 1.03% and 1.26%, respectively.

Table 4. Scaled Processes and Variables $(1 \leq k \leq K)$


Process/variable	$d^{(L)} (t)$	$Q^{k (L)} (t)$	$Q^{k (L)} (t)$	$B^{(L)} (t)$	$B^{(L)} (t)$	$B^{* (L)} (t)$	$C^{(L)} (t)$	$C^{(L)}$
Scaled	${\hat{d}}^{(L)} (t)$	${\hat{Q}}^{k (L)} (t)$	${\hat{Q}}^{k (L)} (t)$	${\hat{B}}^{(L)} (t)$	${\hat{B}}^{(L)} (t)$	${\hat{B}}^{* (L)} (t)$	${\hat{C}}^{(L)} (t)$	${\hat{C}}^{(L)}$

It is interesting to focus on the first four cases where c₁ = c₂. In Table 5, the SP lower bound is within the 95% CI except for case 4 when $(L_{1}, L_{2}) = (1, 1.5)$ . A further calculation shows that this bound is within the (wider) 99.9% CI. In Table 6, the SP lower bound is outside the 95% CI in cases 1, 3, and 4 when $(L_{1}, L_{2}) = (1, 1.5)$ . Further calculations show that in cases 3 and 4, the SP lower bound is also outside the 99.9% CI. Comparing the sample mean of the average cost obtained from the simulation with the lower bound, the t value is 4.69 for case 3 and 6.48 for case 4. Given that the sample size is 30, these values suggest that the inventory cost in both cases is significantly higher than the lower bound (with $α < 0.0005$ ).
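
The two checks used in this comparison, whether the lower bound falls inside a CI and how large the t value is, can be sketched as below. This is a minimal illustration, assuming a standard one-sample t statistic; the function name and the synthetic run costs are hypothetical, and the critical values are the usual two-sided Student t quantiles for 29 degrees of freedom (30 runs).

```python
import math
import statistics

# Two-sided Student t critical values for 29 degrees of freedom.
T_CRIT_95_DF29 = 2.045
T_CRIT_999_DF29 = 3.659

def compare_with_lower_bound(run_costs, lower_bound, t_critical):
    """Return the sample mean, CI, t value, and whether the bound lies in the CI."""
    n = len(run_costs)
    mean = statistics.mean(run_costs)
    std_err = statistics.stdev(run_costs) / math.sqrt(n)
    ci = (mean - t_critical * std_err, mean + t_critical * std_err)
    t_value = (mean - lower_bound) / std_err
    return {
        "mean": mean,
        "ci": ci,
        "t_value": t_value,
        "bound_in_ci": ci[0] <= lower_bound <= ci[1],
    }

# Synthetic per-run average costs (hypothetical, not from the paper's simulations).
run_costs = [100.0 + 0.1 * ((7 * i) % 11 - 5) for i in range(30)]

far_below = compare_with_lower_bound(run_costs, 99.0, T_CRIT_999_DF29)
at_mean = compare_with_lower_bound(run_costs, statistics.mean(run_costs), T_CRIT_95_DF29)
print(far_below["t_value"], far_below["bound_in_ci"])
print(at_mean["t_value"], at_mean["bound_in_ci"])
```

The lower bound lies outside the CI exactly when the absolute t value exceeds the critical value used to build that CI, so the two checks are two views of the same comparison.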

The observation is consistent with theorem 4 in Reiman and Wang (2012), which concludes that in W systems with c₁ = c₂, the SP lower bound is reachable when the common component has the shorter lead time. However, the conclusion does not extend to W systems in which the common component has the longer lead time. Here, we use a special case of the W system, the N system shown in Figure 1, to explain this subtlety.

When c₁ = c₂, serving a unit of either product 1 or 2 removes the same amount of inventory cost from the system. Thus, the optimal allocation outcome prescribed by the SP solution is trivially attainable. However, when the common component has the longer lead time, it may not be possible to meet the SP-based inventory position target of the other component for all time. To see this point, consider a time t when a large batch of demand for product 1 has just arrived, exhausting all inventory of component 0, both on hand and in transit at the moment.

When component 0 has the longer lead time, any new order of it will not arrive until $t + L_{2}$ , which means that its net inventory level at $t + L_{1}$ is nonpositive. Correspondingly at time t, the inventory position target of component 1, constrained by the availability of component 0 at $t + L_{1}$ , will be set at a nonpositive level. The system may not be able to meet this target because the actual inventory position is affected by previous ordering decisions, and therefore can be positive at t even without ordering any new unit. Reducing it to a nonpositive level requires removing existing units, which is not feasible.

This situation does not happen when component 0 has the shorter lead time. In this case, the SP sets a constant inventory position target for component 1, which is always met (excluding the initial period) under a constant base stock policy. Any usage of component 1 is accompanied by the usage of component 0 by the same amount. Therefore, when the inventory position target of component 0 needs to be reduced because of the lack of component 1 within the next (shorter) lead time, its actual inventory position is also at a lowered level, which allows the target to be met without removing any unit from the system.

The impact of this difference on the optimality gap is shown by a few examples in Table 7. The first three columns give cost parameters. For each example, we compare the SP lower bound with the average cost from 100 simulation runs. Results in columns 4, 5, and 6 are from examples in which component 0 has the longer lead time. In each example, the SP lower bound is strictly below the lower end of the 99.9% CI of the average cost. Comparing the sample mean of the latter cost with the lower bound, the t values are 25.08, 16.81, and 33.30, respectively, in these three cases, so the optimality gap is strictly positive at exceedingly high significance levels. Results in columns 7, 8, and 9 are from examples in which component 0 has the shorter lead time. In all examples, the SP lower bound is inside the 95% CI and thus certainly inside the (wider) 99.9% CI. The t values are 1.28, 0.97, and 1.34, respectively, none of which is significant at $α = 0.05$ .

Table 5. Optimality Gaps: W System, Component 0 Has the Shorter Lead Time, $h_{0} = 1, λ_{1} = λ_{2} = 25$


Case no.	h₁	h₂	b₁	b₂	Optimality gap (L₁: component 0, L₂: components 1 and 2)
					$L_{1} = 1$	$L_{1} = 5$	$L_{1} = 10$	$L_{1} = 20$	$L_{1} = 40$	$L_{1} = 80$	$L_{1} = 160$
					$L_{2} = 1.5$	$L_{2} = 7.5$	$L_{2} = 15$	$L_{2} = 30$	$L_{2} = 60$	$L_{2} = 120$	$L_{2} = 240$
1	1	1	4	4	0.03% *	0.14% *	0.04% *	—	—	—	—
2	0.2	0.2	2.4	2.4	0.01% *	0.04% *	0.08% *	—	—	—	—
3	1	5	10	6	0.10% *	0.07% *	0.19% *	—	—	—	—
4	5	5	12	12	0.05%	0.01% *	0.00% *	—	—	—	—
5	0.2	1	6	4	0.56%	0.12% *	0.24% *	—	—	—	—
6	0.2	0.2	2.4	1.2	3.51%	1.92%	1.34%	0.83%	—	—	—
7	1	1	4	2	1.04%	0.68%	0.49%	—	—	—	—
8	5	5	12	6	0.13%	0.08%	0.04% *	—	—	—	—
9	1	0.2	4	2.4	2.20%	1.03%	0.79%	—	—	—	—
10	0.2	0.2	6	2	3.41%	1.63%	1.22%	0.87%	—	—	—
11	1	1	10	4	2.07%	1.24%	0.76%	—	—	—	—
12	5	5	30	12	0.40%	0.12% *	0.04% *	—	—	—	—
13	0.2	0.2	6	2.4	5.47%	2.89%	2.03%	1.56%	1.03%	—	—
14	1	0.2	4	1.2	5.85%	2.98%	2.07%	1.45%	0.98%	—	—
15	0.2	0.2	6	1.2	14.36%	7.17%	5.34%	3.75%	2.48%	1.92%	1.37%
16	1	1	10	2	5.20%	2.55%	2.12%	1.45%	0.86%	—	—
17	5	1	12	4	1.45%	1.09%	0.72%	—	—	—	—
18	5	5	30	6	0.84%	0.32%	0.23%	—	—	—	—
19	1	0.2	10	2.4	7.10%	3.73%	2.79%	2.04%	1.52%	1.01%	—
20	5	1	12	2	3.12%	1.63%	1.15%	0.78%	—	—	—
21	1	0.2	10	1.2	14.85%	7.98%	5.47%	4.08%	2.88%	2.17%	1.36%
22	5	0.2	12	2.4	4.65%	2.45%	1.83%	1.35%	0.87%	—	—
23	5	1	30	4	4.21%	2.21%	1.47%	1.30%	0.75%	—	—
24	5	0.2	12	1.2	9.03%	4.48%	3.53%	2.51%	1.78%	1.26%	—
25	5	1	30	2	7.79%	3.81%	3.08%	2.06%	1.59%	1.05%	—
26	5	0.2	30	2.4	9.65%	5.24%	3.91%	2.76%	1.89%	1.38%	—
27	5	0.2	30	1.2	17.01%	8.65%	6.47%	4.53%	2.92%	2.30%	1.55%

Note. See the text for an explanation of entries with * or —.
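
As an exploratory check on the trend in Table 5, one can fit a straight line to the logarithm of the gap against the logarithm of the lead time. The sketch below does this for case 27, with the values copied from the table; the regression itself is an added illustration and not an analysis carried out in the paper, which does not state a convergence rate.

```python
import math

# Optimality gaps (in percent) copied from Table 5, case 27.
LEAD_TIMES_L1 = [1, 5, 10, 20, 40, 80, 160]
GAPS_PERCENT = [17.01, 8.65, 6.47, 4.53, 2.92, 2.30, 1.55]

def loglog_slope(xs, ys):
    """Ordinary least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx

slope = loglog_slope(LEAD_TIMES_L1, GAPS_PERCENT)
print(f"estimated decay exponent for case 27: {slope:.2f}")
```

A negative fitted exponent is consistent with the gap shrinking as the lead times grow; a single row is of course too little data to draw firm conclusions about the rate.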

We also conducted simulations on the M system shown in Figure 1. We use the same cost parameters as those in Doğru et al. (2017). The inventory holding costs of both components, h₁ and h₂, are set to unity. Backlog costs, b₀, b₁, and b₂, are varied (see the second row in Tables 8 and 9) to generate four regions of different cost relationships: (1) $c_{1} + c_{2} < c_{0}$ (region A); (2) $c_{2} \leq c_{1} < c_{0} \leq c_{1} + c_{2}$ (region B); (3) $c_{2} < c_{0} \leq c_{1}$ (region C); and (4) $c_{0} \leq c_{2} \leq c_{1}$ (region D). As discussed in Doğru et al. (2017), our allocation policy specializes to allocation rules that are qualitatively different between the regions. Tables 8 and 9 show optimality gaps for cases when component 1 has the shorter and longer lead times, respectively. In both cases, the gaps are above zero when $(L_{1}, L_{2}) = (1, 1.5)$ and significantly so in regions A and D. Nevertheless, in each case, there is also a clear trend that the gap converges toward zero as the lead times increase, as is predicted by Theorem 3.

Table 6. Optimality Gaps: W System, Component 0 Has the Longer Lead Time, $h_{0} = 1, λ_{1} = λ_{2} = 25$


Case no.	h₁	h₂	b₁	b₂	Optimality gap (L₁: components 1 and 2, L₂: component 0)
					$L_{1} = 1$	$L_{1} = 5$	$L_{1} = 10$	$L_{1} = 20$	$L_{1} = 40$	$L_{1} = 80$	$L_{1} = 160$
					$L_{2} = 1.5$	$L_{2} = 7.5$	$L_{2} = 15$	$L_{2} = 30$	$L_{2} = 60$	$L_{2} = 120$	$L_{2} = 240$
1	1	1	4	4	0.16%	0.08% *	0.08% *	—	—	—	—
2	0.2	0.2	2.4	2.4	0.11% *	0.05% *	0.11% *	—	—	—	—
3	1	5	10	6	0.30%	0.10% *	0.15% *	—	—	—	—
4	5	5	12	12	0.34%	0.11% *	0.14%	—	—	—	—
5	0.2	1	6	4	0.78%	0.31%	0.20%	—	—	—	—
6	0.2	0.2	2.4	1.2	3.46%	2.04%	1.28%	0.94%	—	—	—
7	1	1	4	2	1.65%	0.72%	0.43%	—	—	—	—
8	5	5	12	6	0.94%	0.42%	0.13%	—	—	—	—
9	1	0.2	4	2.4	2.72%	1.25%	0.95%	—	—	—	—
10	0.2	0.2	6	2	3.97%	1.88%	1.39%	0.92%	—	—	—
11	1	1	10	4	2.63%	1.38%	0.97%	—	—	—	—
12	5	5	30	12	0.96%	0.37%	0.33%	—	—	—	—
13	0.2	0.2	6	2.4	5.76%	2.77%	1.97%	1.52%	1.00%	0.53%	—
14	1	0.2	4	1.2	7.74%	3.61%	2.74%	2.04%	1.33%	0.97%	—
15	0.2	0.2	6	1.2	14.47%	7.29%	5.07%	3.67%	2.67%	2.04%	1.50%
16	1	1	10	2	6.56%	3.19%	2.55%	1.81%	1.41%	0.95%	—
17	5	1	12	4	2.79%	1.55%	1.17%	0.86%	—	—	—
18	5	5	30	6	2.23%	1.11%	0.69%	—	—	—	—
19	1	0.2	10	2.4	8.78%	4.54%	3.33%	2.46%	1.60%	1.10%	0.77%
20	5	1	12	2	5.73%	3.19%	2.32%	1.58%	1.15%	0.83%	—
21	1	0.2	10	1.2	17.04%	8.88%	6.77%	4.87%	3.35%	2.79%	1.95%
22	5	0.2	12	2.4	7.43%	3.71%	2.72%	1.97%	1.33%	0.87%	—
23	5	1	30	4	6.32%	3.49%	2.44%	1.57%	1.19%	0.77%	—
24	5	0.2	12	1.2	13.02%	6.64%	4.99%	3.60%	2.62%	1.75%	1.31%
25	5	1	30	2	10.83%	5.87%	4.17%	2.98%	2.21%	1.64%	0.86%
26	5	0.2	30	2.4	13.29%	6.75%	5.02%	3.67%	2.59%	1.97%	1.03%
27	5	0.2	30	1.2	24.68%	12.13%	8.52%	6.00%	4.39%	3.09%	1.98%

Note. See the text for an explanation of entries with * or —.

Table 7. Optimality Gaps: N-System with Symmetric Costs ( $h_{0} = 1, L_{1} = 1, L_{2} = 1.5, λ_{1} = λ_{2} = 5$ )


h₁	b₁	b₂	L₁: component 1			L₁: component 0
			L₂: component 0			L₂: component 1
			Lower bound	Average	99.9% CI	Lower bound	Average	95% CI
0.1	0.4	0.5	21.38	21.56	[21.54,21.59]	18.95	18.95	[18.94,18.97]
0.1	0.7	0.8	28.82	29.00	[28.96,29.04]	25.26	25.27	[25.25,25.29]
1	1	2	51.38	51.98	[51.91,52.04]	49.24	49.25	[49.23,49.28]
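
The optimality gap Δ defined at the start of this section can be computed directly from the lower bounds and average costs in Table 7. The sketch below copies those six pairs from the table; the variable and function names are illustrative only.

```python
# Rows of Table 7: (h1, b1, b2,
#                   lower bound and average when component 0 has the longer lead time,
#                   lower bound and average when component 0 has the shorter lead time).
TABLE_7 = [
    (0.1, 0.4, 0.5, 21.38, 21.56, 18.95, 18.95),
    (0.1, 0.7, 0.8, 28.82, 29.00, 25.26, 25.27),
    (1.0, 1.0, 2.0, 51.38, 51.98, 49.24, 49.25),
]

def optimality_gap(average_cost, lower_bound):
    """Relative gap (C_s - C_lower) / C_lower between simulated cost and SP bound."""
    return (average_cost - lower_bound) / lower_bound

gaps_longer = [optimality_gap(avg, lb) for (_, _, _, lb, avg, _, _) in TABLE_7]
gaps_shorter = [optimality_gap(avg, lb) for (_, _, _, _, _, lb, avg) in TABLE_7]

for row, g_long, g_short in zip(TABLE_7, gaps_longer, gaps_shorter):
    print(row[:3], f"longer: {g_long:.2%}", f"shorter: {g_short:.2%}")
```

With the rounded table entries, the gaps are noticeably larger when component 0 has the longer lead time, in line with the discussion of the N system above.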

Table 8. Optimality Gaps: M System, Component 1 Has the Shorter Lead Time, $h_{1} = h_{2} = 1, λ_{0} = 25, λ_{1} = λ_{2} = 50$


Lead time L₁: component 1 L₂: component 2		A	B	C	D
		$b_{0} = 8$	$b_{0} = 3$	$b_{0} = 2$	$b_{0} = 1$
		$b_{1} = 3.5$	$b_{1} = 2.5$	$b_{1} = 3.5$	$b_{1} = 8$
		$b_{2} = 1$	$b_{2} = 1$	$b_{2} = 1$	$b_{2} = 3$
$L_{1} = 1$	$L_{2} = 1.5$	10.03%	4.60%	5.72%	23.20%
$L_{1} = 5$	$L_{2} = 7.5$	4.82%	2.30%	2.87%	11.27%
$L_{1} = 10$	$L_{2} = 15$	3.29%	1.72%	2.07%	8.18%
$L_{1} = 20$	$L_{2} = 30$	2.32%	1.17%	1.44%	6.13%
$L_{1} = 40$	$L_{2} = 60$	1.75%	—	1.00%	4.45%
$L_{1} = 80$	$L_{2} = 120$	1.37%	—	—	3.20%

Table 9. Optimality Gaps: M System, Component 2 Has the Shorter Lead Time, $h_{1} = h_{2} = 1, λ_{0} = 25, λ_{1} = λ_{2} = 50$


Lead time L₁: component 2 L₂: component 1		A	B	C	D
		$b_{0} = 8$	$b_{0} = 3$	$b_{0} = 2$	$b_{0} = 1$
		$b_{1} = 3.5$	$b_{1} = 2.5$	$b_{1} = 3.5$	$b_{1} = 8$
		$b_{2} = 1$	$b_{2} = 1$	$b_{2} = 1$	$b_{2} = 3$
$L_{1} = 1$	$L_{2} = 1.5$	9.07%	4.20%	5.20%	22.48%
$L_{1} = 5$	$L_{2} = 7.5$	4.42%	2.25%	2.67%	11.16%
$L_{1} = 10$	$L_{2} = 15$	3.17%	1.40%	1.87%	8.12%
$L_{1} = 20$	$L_{2} = 30$	2.25%	0.99%	1.29%	5.74%
$L_{1} = 40$	$L_{2} = 60$	1.46%	—	0.88%	3.99%
$L_{1} = 80$	$L_{2} = 120$	0.94%	—	—	2.69%

9. Conclusion

Optimal inventory control of ATO systems with multiple products is a long-standing problem in the literature. In principle, this paper has settled the issue of developing an asymptotically optimal control policy for systems with general BOMs and deterministic lead times. We have “collapsed” the design of a dynamic control policy for minimizing the average cost over an infinite time horizon to the solution of certain multistage stochastic programs. The approach leads to drastically simplified analyses and feasible inventory policies with provable optimality properties.

This paper is a continuation of a stream of past work on ATO inventory systems. The multistage SP in Section 3 generalizes the two-stage SP in (34), which applies only to systems with identical lead times. In comparison with the formulation in (33), the SP here is a better alternative: it directly sets the same lower bound on the average cost, eliminates the need to take the infimum of the objective values, and yields optimal solutions that have finite values and hence can be used as parameters of inventory control policies. Although the allocation policy follows the same principle as that in (34), we break new ground in developing a replenishment policy. The conventional base stock policy is justified by asymptotic analysis for use in systems with identical or near identical lead times (Reiman and Wang 2012, Reiman et al. 2016), whereas a different policy has been shown to be optimal for one-product systems with nonidentical lead times (Rosling 1989). Subsuming both as special cases, our policy is supported by the proof of asymptotic optimality and numerical results for its use in systems with general BOMs and lead times.

As a result of the new policy development, the asymptotic analysis is more involved than for systems with identical lead times in (34). Instead of setting constant targets for inventory positions, the new policy updates them repeatedly over time based on changes of relevant system states, raising the question of the stability of these targets. In general, it is not possible to keep actual inventory positions at their targets at all times, and in theory, the influence of the discrepancies can last into the indefinite future. We address these issues with new approaches, such as the formulation and analysis of the “stochastic tracking model” and the characterization and exploitation of the matrix structures of SP constraints. We develop these technical tools for general settings, making it possible to apply them to models other than ATO systems, for example, to prove convergence of other stochastic processes or address Lipschitz continuity of certain types of infinite LPs.

This paper is, in a sense, the culmination of the stream of previously mentioned work. On the other hand, this is not the end of the story. In particular, making our policy implementable in practical ATO systems requires new approaches to overcome the computational complexity of multistage SPs. In this paper, we consider the use of finite-dimensional LPs to approximate the original problems to generate stable targets for inventory positions and backlog levels. For the approximation to be sufficiently accurate, the dimensions of these LPs may need to be exceedingly high, especially when the system has many different and very long lead times. Moreover, the SPs need to be reoptimized repeatedly over time to update policy parameters, which makes the computation even more expensive.

Recent studies on two-stage SPs show that exploring structural properties of ATO problems implied by their BOMs can facilitate solution procedures (Zipkin 2016, Doğru et al. 2017, DeValve et al. 2020). How to extend this strategy to multistage SPs is an issue yet to be explored. It has also been shown that when the percentage difference between the longest and shortest lead times becomes insignificant, it can be asymptotically optimal to follow a base stock policy (32), reducing the SP to a two-stage problem and eliminating the need to update inventory position targets. Similarly, it also seems possible to preserve asymptotic optimality while reducing computational efforts by applying simple replenishment rules without resorting to SP solutions on components whose lead times are insignificant fractions of the longest lead time. In general, there can be opportunities to systematically exploit small lead time differences to reduce the number of stages of the SPs. Furthermore, one may also find ways to reduce the computational burden by re-solving multistage SPs less frequently. Although interesting and important, these topics are certainly beyond the scope of one paper, and we leave them for future research.

As a broader message, our work highlights the power of asymptotic analysis to tackle difficult inventory models that defy exact analysis. There is no shortage of such problems in the inventory theory literature, giving rise to ample opportunities for innovative research. On this subject, we refer to a recent survey by Goldberg et al. (2021) for detailed discussions.

Acknowledgments

The authors are grateful to the associate editor and anonymous referees for constructive comments.

Appendix A. Proof of Theorems

Proof of Theorem 1.

The proof will use the following primal-dual transformation of the last stage LP in (13):

\begin{array}{rl} φ^{0} (y^{1}, \dots, y^{K}, x) & = - \max_{z} \{c \cdot z \mid z \leq x, \ A z \leq y\} \\ & = - \min_{ν, κ \geq 0} \{x \cdot ν + y \cdot κ \mid ν + A^{'} κ = c\} \\ & = - c \cdot x - \min_{κ \geq 0} \{(y - A x) \cdot κ \mid A^{'} κ \leq c\} \\ & = \max_{κ \geq 0} \{(A x - y) \cdot κ \mid A^{'} κ \leq c\} - c \cdot x . \end{array}

(A.1)

Given that all components of A are nonnegative, it is always optimal to set $κ_{j} = 0$ for any j such that $A_{j} \cdot x - y_{j} \leq 0$ $(1 \leq j \leq n)$ . Thus, without loss of generality, we replace $(A x - y)$ with ${(A x - y)}^{+}$ in (A.1), in which case

\begin{array}{rl} φ^{0} (y^{1}, \dots, y^{K}, x) & = φ_{d}^{0} (y^{1}, \dots, y^{K}, x) - c \cdot x, \\ \text{where } φ_{d}^{0} (y^{1}, \dots, y^{K}, x) & ≔ \max_{κ \geq 0} \{{(A x - y)}^{+} \cdot κ \mid A^{'} κ \leq c\} . \end{array}

(A.2)
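
The transformation in (A.1) and (A.2) can be sanity-checked in the simplest setting of one product and one component, where both sides have closed forms. The sketch below assumes scalar data with $a > 0$ and $c > 0$ (a hypothetical special case chosen for illustration): the primal value is $- c \min (x, y / a)$, and the dual form in (A.2) is maximized at $κ = c / a$, giving $(c / a) {(a x - y)}^{+} - c x$.

```python
from fractions import Fraction

def phi0_primal(c, a, x, y):
    """-max{c z : z <= x, a z <= y} for scalars with a > 0 and c > 0."""
    return -c * min(Fraction(x), Fraction(y) / a)

def phi0_dual(c, a, x, y):
    """max{(a x - y)^+ kappa : 0 <= kappa <= c / a} - c x, as in (A.2)."""
    kappa_max = Fraction(c) / a  # largest kappa satisfying a * kappa <= c
    return kappa_max * max(a * x - y, 0) - c * x

# Compare the two expressions on a small grid of hypothetical integer inputs.
mismatches = [
    (c, a, x, y)
    for c in (1, 2, 5)
    for a in (1, 2, 3)
    for x in range(-3, 6)
    for y in range(-4, 10)
    if phi0_primal(c, a, x, y) != phi0_dual(c, a, x, y)
]
print("mismatches:", len(mismatches))
```

Exact rational arithmetic is used so that equality can be tested without rounding tolerances.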

In (A.2), for any feasible solution of $φ_{d}^{0} (y^{1}, \dots y^{K}, x)$ ,

κ_{j} \leq \bar{κ} \ (1 \leq j \leq n), \quad \text{where} \ \bar{κ} = \frac{\bar{c}}{\underline{a}},

where $\bar{c}$ and $\underline{a}$ are defined in Table 2. Let $y^{l *}$ $(1 \leq l \leq k)$ be an optimal solution on the sample path $(D^{k}, \dots, D^{1})$. Then

φ^{k} (y^{k + 1}, \dots, y^{K}, x) = \sum_{l = 1}^{k} h^{l} \cdot E [y^{l *}] + E [φ_{d}^{0} (y^{1 *}, \dots, y^{k *}, y^{k + 1}, \dots, y^{K}, x + {\underline{D}}^{k})] - c \cdot E [x + {\underline{D}}^{k}] .

(A.3)

Let $φ^{k^{'}, j} (y^{k + 1}, \dots y^{K}, x)$ be a deviation from the optimal objective value $φ^{k} (y^{k + 1}, \dots, y^{K}, x)$ , obtained by changing $y_{j}^{*}$ to $y_{j}^{*} - 1$ while keeping all other values at the optimal levels. Then

φ^{k^{'}, j} (y^{k + 1}, \dots y^{K}, x) - φ^{k} (y^{k + 1}, \dots, y^{K}, x) \geq 0,

that is,

\begin{array}{rl} & - h_{j} + E [κ_{j} {(A_{j} \cdot (x + {\underline{D}}^{k}) - (y_{j}^{*} - 1))}^{+}] - E [κ_{j} {(A_{j} \cdot (x + {\underline{D}}^{k}) - y_{j}^{*})}^{+}] \\ = & - h_{j} + E [κ_{j} 1 \{A_{j} \cdot (x + {\underline{D}}^{k}) \geq y_{j}^{*}\}] \\ \geq & 0 . \end{array}

For the latter condition to hold, it is necessary that

\bar{κ} \Pr {\bar{a} (| | x | |_{1} + | | {\underline{D}}^{k} | |_{1}) \geq y_{j}^{*}} \geq \underline{h} .

When $y_{j}^{*} > 0$ , applying Markov’s inequality to the above inequality gives

\frac{E [\bar{a} (| | x | |_{1} + | | {\underline{D}}^{k} | |_{1})]}{y_{j}^{*}} \geq \underline{h} / \bar{κ},

which yields an upper bound

y_{j}^{*} \leq \bar{β} (| | x | |_{1} + E [| | {\underline{D}}^{k} | |_{1}]),

where $\bar{β}$ can be any constant such that

\bar{β} \geq \frac{\bar{a}}{\underline{h}} \bar{κ} = \frac{\bar{a}}{\underline{a}} \frac{\bar{c}}{\underline{h}} .

This upper bound holds trivially when $y_{j}^{*} \leq 0$ , because its right-hand side is nonnegative.

To prove the lower bound, fix a component j of $y^{k}$ (i.e., $k_{j} = k$ ) and consider the following feasible solution to (A.2):

κ_{j^{'}} = \begin{cases} 0 & k_{j^{'}} > k, \\ h_{j} + \underline{b} / \bar{a} & j^{'} = j, \\ h_{j^{'}} & \text{otherwise}, \end{cases}

(A.4)

which yields a weakly lower objective value than the optimal one, that is,

\sum_{l = 1}^{k} h^{l} \cdot (A^{l} x - y^{l *}) + \underline{b} / \bar{a} (A_{j} \cdot x - y_{j}^{*}) \leq φ_{d}^{0} (y^{1 *}, \dots, y^{k *}, y^{k + 1}, \dots, y^{K}, x) .

Apply the inequality to (A.3) (notice that $x$ in $φ_{d}^{0} ()$ corresponds to $x + {\underline{D}}^{k}$ in $φ_{d}^{0} ()$ in (A.3)),

\begin{array}{l} φ^{k} (y^{k + 1}, \dots y^{K}, x) & \geq h^{k} \cdot y^{k *} + h^{k} \cdot (A^{k} (x + E [{\underline{D}}^{k}]) - y^{k *}) \\ + \underline{b} / \bar{a} [A_{j} \cdot (x + E [{\underline{D}}^{k}]) - y_{j}^{*}] + \sum_{l = 1}^{k - 1} h^{l} \cdot E [y^{l *}] \\ + \sum_{l = 1}^{k - 1} h^{l} \cdot (A^{l} (x + E [{\underline{D}}^{k}]) - E [y^{l *}]) \\ - c \cdot (x + E [{\underline{D}}^{k}]) \geq - (\underline{b} / \bar{a}) y_{j}^{*} - c^{'} \cdot (x + E [{\underline{D}}^{k}]), \end{array}

(A.5)

where

c^{'} = c - \sum_{l = 1}^{k} {(A^{l})}^{'} h^{l} - (\underline{b} / \bar{a}) A_{j} .

Let $κ^{0}$ be an optimal solution of $φ_{d}^{0} (0, \dots, 0, y^{k + 1}, \dots y^{K}, x)$ . Then ${y^{l} = 0 (1 \leq l \leq k), κ^{0}}$ is a feasible solution of $φ^{k} (y^{k + 1}, \dots y^{K}, x)$ and yields the following objective value:

\sum_{j^{'} : k_{j^{'}} \leq k} E [κ_{j^{'}}^{0} A_{j^{'}} \cdot (x + {\underline{D}}^{k})] + \sum_{j^{'} : k_{j^{'}} > k} E [κ_{j^{'}}^{0} (A_{j^{'}} \cdot (x + {\underline{D}}^{k}) - y_{j^{'}})] - c \cdot (x + E [{\underline{D}}^{k}]) .

Therefore,

φ^{k} (y^{k + 1}, \dots y^{K}, x) \leq \sum_{j^{'} = 1}^{n} A_{j^{'}} \cdot E [κ_{j^{'}}^{0} (x + {\underline{D}}^{k})] + \sum_{j^{'} : k_{j^{'}} > k} | y_{j^{'}} | E [κ_{j^{'}}^{0}] - c \cdot (x + E [{\underline{D}}^{k}]) .

(A.6)

Using (A.5) and (A.6), and applying $κ_{j} \leq \bar{κ}$ , we obtain

(\underline{b} / \bar{a}) y_{j}^{*} \geq - \bar{κ} (\sum_{j^{'} = 1}^{n} A_{j^{'}} \cdot (x + E [{\underline{D}}^{k}]) + \sum_{j^{'} : k_{j^{'}} > k} | y_{j^{'}} |) + (c - c^{'}) \cdot (x + E [{\underline{D}}^{k}]) .

Therefore,

y_{j}^{*} \geq - \underline{β} (| | x | |_{1} + E [| | {\underline{D}}^{k} | |_{1}] + \sum_{l = k + 1}^{K} | | y^{l} | |_{1}), 1 \leq j \leq n,

where $\underline{β}$ can be any constant such that

\underline{β} \geq \frac{\bar{a}}{\underline{b}} \bar{κ} (n \bar{a}) = n \frac{{(\bar{a})}^{2}}{\underline{a}} \frac{\bar{c}}{\underline{b}} . □

Proof of Theorem 2.

Substituting $y^{k}$ with $y^{k} - A^{k} α$ ( $1 \leq k \leq K$ ) and $z$ with $z - α$ in (11) gives

Φ^{K} (α) = Ψ_{α}^{K} - b \cdot α,

where

\begin{array}{l} Ψ_{α}^{K} & = \inf_{y^{K} \geq - A^{K} α} {h^{K} \cdot y^{K} + E [Ψ_{α}^{K - 1} (y^{K}, D^{K})]}, \\ Ψ_{α}^{k} (y^{k + 1}, \dots, y^{K}, x) & = \inf_{y^{k} \geq - A^{k} α} {h^{k} \cdot y^{k} + E [Ψ_{α}^{k - 1} (y^{k}, \dots, y^{K}, x + D^{k})]}, k = K - 1, \dots, 1, \\ Ψ_{α}^{0} (y^{1}, \dots, y^{K}, x) & = - \max_{z \geq - α} {c \cdot z | z \leq x, A^{k} z \leq y^{k}, 1 \leq k \leq K} . \end{array}

(A.7)

Using (10), we prove (15) by showing that

\inf_{α \geq 0} {Ψ_{α}^{K}} = φ^{K} .

(A.8)

Let M be a positive integer and, with a slight abuse of notation, let $M$ also denote the m-dimensional vector with all components equal to M. Let $φ_{M}^{K}$ be the optimal objective value of the SP defined in (13) with inputs $D^{k}$ replaced by

D_{M}^{k} ≔ D^{k} \land M, 1 \leq k \leq K .

Likewise, let $Ψ_{M, α}^{K}$ be the optimal objective value of the SP defined in (A.7) with the replacement of $D^{k}$ by $D_{M}^{k}$ $(1 \leq k \leq K)$ . Following Theorem 5, as $M \to \infty, φ_{M}^{K}$ converges to $φ^{K}$ and $Ψ_{M, α}^{K}$ converges to $Ψ_{α}^{K}$ uniformly over all $α$ (see (A.20) and the later discussion in the proof of Theorem 5). Although the latter theorem and its proof appear later in the paper, they do not rely on the conclusion that we are proving here. Thus, to prove (A.8), we only need to show that for any given M, there exists some $α$ , such that

Ψ_{M, α}^{K} = φ_{M}^{K} .

(A.9)

Let $y_{M}^{k *}$ $(1 \leq k \leq K)$ be an optimal solution of $φ_{M}^{K}$ . By Theorem 1, $y_{M}^{K *}$ is finite. For $k = K - 1, \dots, 1$ , because every entry of $x$ in $φ_{M}^{k} (y_{M}^{k + 1 *}, \dots, y_{M}^{K *}, x)$ , which is $D_{M}^{K} + \dots + D_{M}^{k + 1}$ , is bounded by $M (K - k)$ , by a simple induction on (14), $y_{M}^{k *}$ has a finite upper bound that applies to all sample paths. This means that when $α$ is sufficiently large, $y_{M}^{k *}$ ( $1 \leq k \leq K$ ) are feasible solutions to $Ψ_{M, α}^{k} (y_{M}^{k + 1 *}, \dots, y_{M}^{K *}, x)$ ( $1 \leq k \leq K$ ). On the other hand, any optimal values of $y^{k}$ for $Ψ_{M, α}^{k} (y^{k + 1}, \dots, y^{K}, x)$ are obviously feasible for $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ $(1 \leq k \leq K)$ . Thus, to prove (A.9), we only need to show that if $α$ is sufficiently large,

Ψ_{M, α}^{0} (y^{1}, \dots, y^{K}, x) = φ_{M}^{0} (y^{1}, \dots, y^{K}, x)

(A.10)

for any $x \leq K M$ and $y^{k}$ $(1 \leq k \leq K)$ bounded by a constant that depends on M but not $α$ . By definition, except for restrictions on inputs, $φ_{M}^{0} (y^{1}, \dots, y^{K}, x)$ and $Ψ_{M, α}^{0} (y^{1}, \dots, y^{K}, x)$ are the same LPs as $φ^{0} (y^{1}, \dots, y^{K}, x)$ and $Ψ_{α}^{0} (y^{1}, \dots, y^{K}, x)$ , respectively.

Obviously, for any $y^{k}$ $(1 \leq k \leq K)$ and $x$

Ψ_{M, α}^{0} (y^{1}, \dots, y^{K}, x) \geq φ_{M}^{0} (y^{1}, \dots, y^{K}, x),

(A.11)

because any feasible solution for the LP on the LHS is also feasible for the LP on the RHS. To prove the inequality holds in reverse, observe that if $z^{*}$ optimizes

\begin{array}{l} φ_{M}^{0} (y^{1}, \dots, y^{K}, x) & = & - \max_{z} {c \cdot z | A^{k} z \leq y^{k} (1 \leq k \leq K), z \leq x} \\ = & - \max_{z} {\sum_{i = 1}^{m} c_{i} z_{i} | \sum_{i = 1}^{m} a_{j i} z_{i} \leq y_{j} (1 \leq j \leq n), z_{i} \leq x_{i} (1 \leq i \leq m)}, \end{array}

then

z_{i}^{*} \geq - \underline{α} where \underline{α} ≔ \max_{1 \leq j \leq n} {\frac{| y_{j} | + \sum_{i = 1}^{m} a_{j i} x_{i}}{\min_{a_{j i} > 0} {a_{j i}}}}, 1 \leq i \leq m .

Otherwise, one can improve the objective value by increasing z_i without violating any constraint. Because $y^{k}$ ( $1 \leq k \leq K$ ) and $x$ are bounded, $\underline{α}$ is also bounded by some constant ${\underline{α}}_{\max}$ . When $α \geq {\underline{α}}_{\max}$ , any optimal solution to $φ_{M}^{0} (y^{1}, \dots, y^{K}, x)$ is a feasible solution to $Ψ_{M, α}^{0} (y^{1}, \dots, y^{K}, x)$ , so (A.11) holds in reverse. This completes the proof of the theorem. □

Proof of Lemma 3.

Observe that

\begin{array}{l} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] & \leq & \sum_{τ = 1}^{\infty} E [\sup_{τ \leq t < τ + 1} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] \\ + E [\sup_{L^{- 1 / 4} \leq t < 1} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] \\ + E [\sup_{0 \leq t < L^{- 1 / 4}} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] . \end{array}

(A.12)

Notice that ${\tilde{D}}^{(L)} (t)$ $(t \geq - 1)$ is a stationary process and $| {\hat{D}}_{i}^{(L)} (0, t) |$ ( $t \geq - 1$ ) is a sub-martingale:

\begin{array}{l} \sum_{τ = 1}^{\infty} E [{(\sup_{τ \leq t \leq τ + 1} | {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] & \leq & \sum_{τ = 1}^{\infty} E [{(\sup_{τ \leq t \leq τ + 1} | {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} τ ν)}^{+}] \\ = & \sum_{τ = 1}^{\infty} E [{(\sup_{0 \leq t \leq 1} | {\hat{D}}_{i}^{(L)} (0, t + τ) | - \sqrt{L} τ ν)}^{+}] \\ = & \sum_{τ = 1}^{\infty} \int_{\sqrt{L} τ ν}^{\infty} \Pr {\sup_{0 \leq t \leq 1} | {\hat{D}}_{i}^{(L)} (0, t + τ) | \geq x} d x \\ \leq & \sum_{τ = 1}^{\infty} \int_{\sqrt{L} τ ν}^{\infty} \frac{E [| {\hat{D}}_{i}^{(L)} (0, τ + 1) |^{p}]}{x^{p}} d x \\ = & \frac{{(\sqrt{L} ν)}^{1 - p}}{p - 1} \sum_{τ = 1}^{\infty} \frac{E [| {\hat{D}}_{i}^{(L)} (0, τ + 1) |^{p}]}{τ^{p - 1}}, \end{array}

(A.13)

where the second inequality is Doob’s inequality and p > 1.

To bound the sum in the above, observe that

E [{({\hat{D}}_{i}^{(L)} (0, τ + 1))}^{6}] = E [{(\sum_{s = 1}^{τ + 1} {\hat{D}}_{i}^{(L)} (s - 1, s))}^{6}],

where $E [{\hat{D}}_{i}^{(L)} (s - 1, s)] = 0$ . Because ${\hat{D}}_{i}^{(L)} (s - 1, s)$ ( $1 \leq s \leq τ + 1$ ) is an i.i.d. sequence,

E [{({\hat{D}}_{i}^{(L)} (s - 1, s))}^{k}] = E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{k}], 1 \leq k \leq 6, s = 1, \dots, τ + 1 .

By eliminating terms that contain $E [{\hat{D}}_{i}^{(L)} (s - 1, s)] = 0$ and thus equal zero in the expansion,

\begin{array}{l} E [{({\hat{D}}_{i}^{(L)} (0, τ + 1))}^{6}] & = & C_{1}^{τ + 1} \times E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{6}] \\ + C_{2}^{τ + 1} (2 C_{2}^{6} \times E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{4}] \times E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{2}] \\ + C_{3}^{6} \times {(E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{3}])}^{2}) \\ + C_{3}^{τ + 1} \times C_{2}^{6} \times C_{2}^{4} \times {(E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{2}])}^{3}, \end{array}

(A.14)

where $C_{r}^{q}$ denotes q choose r (q, r integers and $q \geq r$ ). Let

υ = \sum_{τ = 1}^{\infty} \frac{E [{({\hat{D}}_{i}^{(L)} (0, τ + 1))}^{6}]}{τ^{5}},

which is a finite constant: Because the jump size of the compound Poisson process is assumed to have a finite moment of order 6, $E [{({\hat{D}}_{i}^{(L)} (0, 1))}^{k}]$ ( $k = 2, 3, 4, 6$ ) are all finite. Moreover, $C_{k}^{τ + 1}$ ( $k = 1, 2, 3$ ) are on the order of $τ^{3}$ or smaller. By (A.13) with p = 6,

\sum_{τ = 1}^{\infty} E [{(\sup_{τ \leq t \leq τ + 1} | {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} t ν)}^{+}] \leq \frac{υ}{5 ν^{5}} L^{- 5 / 2} .

(A.15)

Applying the above and the bounds in (64) and (65) of Lemma 2 to (A.12) proves (66). □
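The combinatorial coefficients in (A.14) can be checked by brute force. The sketch below, which is an illustration rather than part of the proof, enumerates all outcomes of a sum of n i.i.d. zero-mean increments and compares the exact sixth moment with the right-hand side of (A.14), with n playing the role of $τ + 1$ . The three-point increment distribution is a hypothetical choice whose only required property is a zero mean.

```python
from fractions import Fraction as F
from itertools import product
from math import comb

# Hypothetical zero-mean increment: P(-1) = 1/2, P(0) = 1/4, P(2) = 1/4.
values = [F(-1), F(0), F(2)]
probs = [F(1, 2), F(1, 4), F(1, 4)]

def moment(k):
    """k-th raw moment of a single increment."""
    return sum(p * v**k for v, p in zip(values, probs))

def sixth_moment_brute(n):
    """E[(X_1 + ... + X_n)^6] by enumerating all 3^n outcomes."""
    total = F(0)
    for idx in product(range(len(values)), repeat=n):
        prob, s = F(1), F(0)
        for i in idx:
            prob *= probs[i]
            s += values[i]
        total += prob * s**6
    return total

def sixth_moment_formula(n):
    """Right-hand side of (A.14), with n in place of tau + 1."""
    m2, m3, m4, m6 = moment(2), moment(3), moment(4), moment(6)
    return (comb(n, 1) * m6
            + comb(n, 2) * (2 * comb(6, 2) * m4 * m2 + comb(6, 3) * m3**2)
            + comb(n, 3) * comb(6, 2) * comb(4, 2) * m2**3)

assert moment(1) == 0
for n in range(1, 6):
    assert sixth_moment_brute(n) == sixth_moment_formula(n)
```

Exact rational arithmetic is used so that the comparison is an equality check rather than a floating-point tolerance test. The leading term grows like $n^{3}$ , which is the growth rate used to conclude that υ is finite.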

Proof of Lemma 4.

For each $i = 1, \dots, m$ ,

\begin{array}{l} E [\sum_{τ = 0}^{\infty} \sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}] & = & E [\sup_{0 \leq t < 1} {\hat{d}}_{i}^{(L)} (t)] + \sum_{τ = 1}^{\infty} E [\sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}] \\ \leq & 3 λ^{\frac{1}{2 + δ}} (1 + η_{i}) L^{- \frac{δ}{2 (2 + δ)}} + \sum_{τ = 1}^{\infty} E [\sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}], \end{array}

where the first equality follows from Tonelli’s theorem, and the next inequality is obtained by applying lemma 2 in Reiman and Wang (2015) to bound the first expected value (τ = 0). Observe that

\begin{array}{l} \sum_{τ = 1}^{\infty} E [\sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}] = \sum_{τ = 1}^{\infty} \int_{\sqrt{L} ν τ}^{\infty} \Pr {\sup_{0 \leq t < 1} {\hat{d}}_{i}^{(L)} (t) \geq x} d x \\ \leq \sum_{τ = 1}^{\infty} \int_{\sqrt{L} ν τ}^{\infty} \frac{E [{(\sup_{0 \leq t < 1} {\hat{d}}_{i}^{(L)} (t))}^{2 + δ}]}{x^{2 + δ}} d x \\ = \sum_{τ = 1}^{\infty} \frac{E [{(\sup_{0 \leq t < 1} {\hat{d}}_{i}^{(L)} (t))}^{2 + δ}]}{(1 + δ) {(\sqrt{L} ν)}^{1 + δ}} \frac{1}{τ^{1 + δ}} \\ \leq \sum_{τ = 1}^{\infty} \frac{L λ E [{(S_{i} / \sqrt{L})}^{2 + δ}]}{(1 + δ) {(\sqrt{L} ν)}^{1 + δ}} \frac{1}{τ^{1 + δ}} \\ = L^{- (1 / 2 + δ)} \frac{λ η_{i}}{(1 + δ) ν^{1 + δ}} \sum_{τ = 1}^{\infty} \frac{1}{τ^{1 + δ}}, \end{array}

where the last inequality follows from

{(\sup_{0 \leq t < 1} {\hat{d}}_{i}^{(L)} (t))}^{2 + δ} \leq \sum_{k = 0}^{Λ^{(L)} (1)} {(\frac{S_{i}}{\sqrt{L}})}^{2 + δ},

where $Λ^{(L)} (1)$ is the number of arrivals in $[0, L]$ and $S_{i}$ is the order size, which are independent of each other. This analysis shows that

E [\sum_{τ = 0}^{\infty} \sup_{0 \leq t < 1} {({\hat{d}}_{i}^{(L)} (t) - \sqrt{L} ν τ)}^{+}] \leq ξ_{1} L^{- \frac{δ}{2 (2 + δ)}} + ξ_{2} L^{- (1 / 2 + δ)},

where

ξ_{1} = 3 λ^{1 / (2 + δ)} (1 + η_{i}) and ξ_{2} = \frac{λ η_{i}}{(1 + δ) ν^{1 + δ}} \sum_{τ = 1}^{\infty} \frac{1}{τ^{1 + δ}}

are both finite values that are independent of L. The lemma follows as a result. □
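The exponents of L in the two terms of the final bound can be tracked with exact arithmetic. The sketch below is a bookkeeping aid only; the sample values of δ are arbitrary positive numbers chosen for illustration.

```python
from fractions import Fraction as F

def second_term_exponent(delta):
    """Exponent of L in L * L^{-(2+delta)/2} / (sqrt(L))^{1+delta}."""
    return 1 - (2 + delta) / 2 - (1 + delta) / 2

def first_term_exponent(delta):
    """Exponent of L in the tau = 0 term, L^{-delta / (2 (2 + delta))}."""
    return -delta / (2 * (2 + delta))

for delta in [F(1, 10), F(1, 2), F(1), F(3)]:
    # The second term carries L^{-(1/2 + delta)}, as stated in the proof.
    assert second_term_exponent(delta) == -(F(1, 2) + delta)
    # Both exponents are negative, and the first term decays more slowly.
    assert second_term_exponent(delta) < first_term_exponent(delta) < 0
```

For every positive δ the first exponent lies strictly between -1/2 and 0, so the term with coefficient $ξ_{1}$ determines the rate at which the bound vanishes as L grows.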

Proof of Lemma 5.

It is easy to verify from the expressions in (13) that because $Y^{k (L)} (t)$ $(t \geq - L_{k}, 1 \leq k \leq K)$ minimize (40)–(40′), ${\hat{Y}}^{k (L)} (t)$ $(t \geq - {\hat{L}}_{k}^{(L)}, 1 \leq k \leq K)$ minimize the same functions with the same linear transformation of inputs as follows: at each stage k $(1 \leq k \leq K)$ , let ${\hat{D}}^{k (L)}$ replace $D^{k (L)}, x$ take the value of ${\hat{D}}^{(L)} (t + {\hat{L}}_{k + 1}^{(L)} - 1, t)$ instead of $D^{(L)} (t + L_{k + 1}^{(L)} - L, t)$ , and ${\hat{Y}}^{k^{'} (L)} (t + {\hat{L}}_{k}^{(L)} - {\hat{L}}_{k^{'}}^{(L)})$ replace $Y^{k^{'} (L)} (t + L_{k}^{(L)} - L_{k^{'}}^{(L)})$ $(k < k^{'} \leq K)$ .

For each $j = 1, \dots, n$ , applying (14), there exists a constant $\tilde{β}$ such that

{\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) \leq \tilde{β} \sum_{i = 1}^{m} \sum_{k = k_{j}}^{K} (| {\hat{D}}_{i}^{(L)} (- 1, - {\hat{L}}_{k}^{(L)}) | + E [| {\hat{D}}_{i}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}, 0) |]) .

For any constant $ν > 0$ , let

\tilde{ν} = \frac{ν / \tilde{β}}{2 m ({\bar{n}}_{k} + \dots + {\bar{n}}_{K})} .

Because $D^{(L)} (t)$ ( $t \geq - L$ ) is a stationary process and $0 \leq {\hat{L}}_{k_{j}}^{(L)} \leq 1$ $(1 \leq j \leq n)$ ,

\begin{array}{l} E [{(| {\hat{Y}}_{j}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}) | - \sqrt{L} ν)}^{+}] \\ \leq \tilde{β} \sum_{i = 1}^{m} \sum_{k = k_{j}}^{K} E [{(| {\hat{D}}_{i}^{(L)} (- 1, - {\hat{L}}_{k}^{(L)}) | - \sqrt{L} \tilde{ν})}^{+} + {(E [| {\hat{D}}_{i}^{(L)} (- {\hat{L}}_{k_{j}}^{(L)}, 0) |] - \sqrt{L} \tilde{ν})}^{+}] \\ \leq 2 \tilde{β} (K - k_{j}) \sum_{i = 1}^{m} E [\sup_{t \geq 0} {(| {\hat{D}}_{i}^{(L)} (0, t) | - \sqrt{L} \tilde{ν})}^{+}] . \end{array}

Applying Lemma 3 to the last expression in the above inequality concludes the proof. □

Proof of Theorem 5.

To prove (99), we use induction to show that for $k = 0, \dots, K$ and any $y^{k + 1}, \dots, y^{K}$ and $x$ , there exists a η, depending only on $c$ and $A^{k^{'}}$ $(1 \leq k^{'} \leq K)$ , such that

0 \leq φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x) - φ^{k} (y^{k + 1}, \dots, y^{K}, x) \leq η \sum_{k^{'} = 1}^{k} E [| | D^{k^{'}} - D_{M}^{k^{'}} | |_{1}] .

(A.16)

Because $φ_{M}^{0} (y^{1}, \dots, y^{K}, x)$ in (13) and $φ^{0} (y^{1}, \dots, y^{K}, x)$ in (92) are the same LP, (A.16) holds when k = 0. Furthermore, for any $x^{a}$ and $x^{b}$ such that $x^{a} \leq x^{b}$ ,

0 \leq φ^{0} (y^{1}, \dots, y^{K}, x^{a}) - φ^{0} (y^{1}, \dots, y^{K}, x^{b}) \leq η | | x^{a} - x^{b} | |_{1},

(A.17)

where η is a constant that depends only on $c$ and $A^{k}$ $(1 \leq k \leq K)$ . The left inequality holds because the feasible set of the LP under $x^{a}$ is a subset of that under $x^{b}$ . The right inequality and the specification of η are standard results of LP sensitivity analysis (section 10.4 of Schrijver 1986).

The induction starts from the assumption that when k = l $(0 \leq l < K)$ , there exists constant η, depending only on $c$ and A^k $(1 \leq k \leq K)$ , such that

0 \leq φ_{M}^{l} (y^{l + 1}, \dots, y^{K}, x) - φ^{l} (y^{l + 1}, \dots, y^{K}, x) \leq η \sum_{k^{'} = 1}^{l} E [| | D^{k^{'}} - D_{M}^{k^{'}} | |_{1}],

(A.18)

and for any $x^{a}$ and $x^{b}$ , where $x^{a} \leq x^{b}$ ,

0 \leq φ^{l} (y^{l + 1}, \dots, y^{K}, x^{a}) - φ^{l} (y^{l + 1}, \dots, y^{K}, x^{b}) \leq η | | x^{a} - x^{b} | |_{1} .

(A.19)

We have proved both conditions for the case with k = 0. We now prove that they are also true when $k = l + 1$ .

Let $y_{M}^{(l + 1) *}$ be an optimal solution of $φ_{M}^{l + 1} (y^{l + 2}, \dots, y^{K}, x)$ . Then

\begin{array}{l} φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x) & = \min_{y^{l + 1}} {h^{l + 1} \cdot y^{l + 1} + E [φ^{l} (y^{l + 1}, y^{l + 2}, \dots, y^{K}, x + D^{l + 1})]} \\ \leq h^{l + 1} \cdot y_{M}^{(l + 1) *} + E [φ^{l} (y_{M}^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D^{l + 1})] \\ \leq h^{l + 1} \cdot y_{M}^{(l + 1) *} + E [φ^{l} (y_{M}^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D_{M}^{l + 1})] \\ \leq h^{l + 1} \cdot y_{M}^{(l + 1) *} + E [φ_{M}^{l} (y_{M}^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D_{M}^{l + 1})] \\ = φ_{M}^{l + 1} (y^{l + 2}, \dots, y^{K}, x), \end{array}

where the second inequality follows from induction assumption (A.19) with $x^{a} = D_{M}^{l + 1}$ and $x^{b} = D^{l + 1}$ and the third inequality follows from (A.18).

Let $y^{(l + 1) *}$ be an optimal solution to $φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x)$ . Then

\begin{array}{l} φ_{M}^{l + 1} (y^{l + 2}, \dots, y^{K}, x) & = \min_{y^{l + 1}} {h^{l + 1} \cdot y^{l + 1} + E [φ_{M}^{l} (y^{l + 1}, y^{l + 2}, \dots, y^{K}, x + D_{M}^{l + 1})]} \\ \leq h^{l + 1} \cdot y^{(l + 1) *} + E [φ_{M}^{l} (y^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D_{M}^{l + 1})] \\ \leq h^{l + 1} \cdot y^{(l + 1) *} + E [φ^{l} (y^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D_{M}^{l + 1})] + η \sum_{k^{'} = 1}^{l} E [| | D^{k^{'}} - D_{M}^{k^{'}} | |_{1}] \\ \leq h^{l + 1} \cdot y^{(l + 1) *} + E [φ^{l} (y^{(l + 1) *}, y^{l + 2}, \dots, y^{K}, x + D^{l + 1})] + η \sum_{k^{'} = 1}^{l + 1} E [| | D^{k^{'}} - D_{M}^{k^{'}} | |_{1}] \\ = φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x) + η \sum_{k^{'} = 1}^{l + 1} E [| | D^{k^{'}} - D_{M}^{k^{'}} | |_{1}], \end{array}

where the second inequality follows from (A.18) and the third inequality follows from (A.19) with $x^{a} = D_{M}^{l + 1}$ and $x^{b} = D^{l + 1}$ . Hence, we have proved that (A.18) holds when $k = l + 1$ .


\begin{array}{l} φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x^{a}) & = \min_{y^{l + 1}} {h^{l + 1} \cdot y^{l + 1} + E [φ^{l} (y^{l + 1}, y^{l + 2}, \dots, y^{K}, x^{a} + D^{l + 1})]} \\ \leq \min_{y^{l + 1}} {h^{l + 1} \cdot y^{l + 1} + E [φ^{l} (y^{l + 1}, y^{l + 2}, \dots, y^{K}, x^{b} + D^{l + 1}) + η | | x^{a} - x^{b} | |_{1}]} \\ = φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x^{b}) + η | | x^{a} - x^{b} | |_{1}, \end{array}

where the inequality follows from the right inequality of (A.19). Following the same steps and using the left inequality of (A.19),

φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x^{a}) \geq φ^{l + 1} (y^{l + 2}, \dots, y^{K}, x^{b}) .

We have thus proved (99). As a slight extension, in proving Theorem 2, we refer to this theorem to show that

\lim_{M \to \infty} Ψ_{M, α}^{K} = Ψ_{α}^{K},

(A.20)

where $Ψ_{α}^{K}$ is defined in (A.7) and $Ψ_{M, α}^{K}$ is defined by the same equation with $D^{k}$ replaced by $D_{M}^{k}$ $(1 \leq k \leq K)$ . It is straightforward to follow the same process as above, with $φ_{M}^{k} ()$ and $φ^{k} ()$ replaced by $Ψ_{M, α}^{k} ()$ and $Ψ_{α}^{k} ()$ , respectively $(1 \leq k \leq K)$ , to prove (A.20). The only difference is that the determination of η in (A.17) needs to include the LHS of the constraint $z \geq - α$ . Because the value of η is independent of $α$ , which is on the RHS, the convergence is uniform over all $α$ .


To prove (100), note that $φ_{M}^{K}$ and ${\tilde{φ}}_{M}^{K}$ are optimal objective values of (92) under the original and perturbed coefficient values, respectively. Because the perturbation only applies to coefficients of the objective function, the feasible set remains the same between the two problems (so one problem’s optimal solution is also a feasible solution of the other). Therefore, we only need to show that under a chosen M, the optimal solution in either problem is bounded by some quantities that are independent of perturbed coefficient values as long as they satisfy (98). Thus, by reducing Δ in (98), the difference of the two optimal objective values can be kept arbitrarily small.

Theorem 1 shows that, with or without coefficient perturbation, $| | y^{k} | |_{1}$ in (92) satisfies $| | y^{k} | |_{1} \leq γ_{1}$ $(1 \leq k \leq K)$ for some finite constant γ₁. With $D^{k}$ truncated to $D_{M}^{k}$ ( $1 \leq k \leq K$ ), in (14), $| | x | |_{1} \leq γ_{2} M$ for some finite constant γ₂, and $E [| | {\underline{D}}^{k} | |_{1}]$ $(1 \leq k \leq K$ ) are finite even without truncation. Although the allowed ranges of $\underline{β}$ and $\bar{β}$ do depend on $b$ and $h^{k}$ $(1 \leq k \leq K)$ , the definitions of these two quantities also imply that there are non-empty ranges of perturbed coefficients to which some fixed values of $\underline{β}$ and $\bar{β}$ apply.

To show that $| | z | |_{1}$ is also bounded, first note that the constraint $z \leq x$ in $φ_{M}^{0} (\cdot)$ of (92), along with the upper bound on $x$ shown in the last paragraph, implies that $z_{i} \leq γ_{2} M$ $(1 \leq i \leq m)$ . To bound the optimal value of $z_{i}$ from below when this value is negative, note that $z_{i}$ must appear in at least one of the constraints $A^{k} z \leq y^{k}, 1 \leq k \leq K$ that holds at equality (otherwise, the objective value can be improved by increasing $z_{i}$ ). Combining the above bounds yields $z_{i} \geq - [γ_{1} + (m - 1) \bar{a} γ_{2} M], 1 \leq i \leq m$ . □
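The truncation bound (A.16) can be illustrated on the smallest nontrivial instance: one product, one component with unit usage, and a single stage. In that case the last-stage LP value is $- c \min (x, y)$ , the stage-1 problem is a newsvendor-type minimization, and the Lipschitz constant in (A.17) can be taken to be η = c. The demand distribution and the values of `h`, `c`, and `x` below are hypothetical, and the sketch assumes 0 < h < c so that the minimum is attained at a kink of the piecewise linear objective.

```python
from fractions import Fraction as F

def phi1(h, c, x, demand):
    """min_y { h*y - c*E[min(x + D, y)] } for a discrete demand {d: prob}.

    The objective is convex and piecewise linear in y with kinks at x + d,
    so for 0 < h < c it suffices to search over those kinks.
    """
    def objective(y):
        return h * y - c * sum(p * min(x + d, y) for d, p in demand.items())
    return min(objective(x + d) for d in demand)

def truncate(demand, M):
    """Distribution of min(D, M)."""
    out = {}
    for d, p in demand.items():
        out[min(d, M)] = out.get(min(d, M), F(0)) + p
    return out

# Hypothetical inputs, chosen only for illustration.
demand = {0: F(1, 4), 1: F(1, 4), 3: F(1, 4), 10: F(1, 4)}
h, c, x = F(1), F(3), F(2)

full = phi1(h, c, x, demand)
checks = []
for M in range(0, 12):
    gap = phi1(h, c, x, truncate(demand, M)) - full
    bound = c * sum(p * (d - min(d, M)) for d, p in demand.items())
    checks.append((M, gap, bound))
    assert 0 <= gap <= bound   # scalar instance of (A.16) with eta = c
```

The gap is nonnegative because truncating demand shrinks the feasible set of the last-stage LP, and it vanishes once M reaches the largest demand value, which mirrors the convergence of $φ_{M}^{K}$ to $φ^{K}$ as M grows.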

Proof of Corollary 3.

The formulation of the LP (93)–(97) implies that for any given $y^{k + 1}, \dots, y^{K}$ $(1 \leq k \leq K - 1)$ , if $x_{a} \geq x_{b}$ , then

{\tilde{φ}}_{M}^{k} (y^{k + 1}, \dots, y^{K}, x_{a} + D_{M}^{k}) \leq {\tilde{φ}}_{M}^{k} (y^{k + 1}, \dots, y^{K}, x_{b} + D_{M}^{k}) .

Let

x_{a}^{k} = D^{K} + \dots + D^{k + 1}, 0 \leq k < K,

and

x_{b}^{k} = D^{K} \land M + \dots + D^{k + 1} \land M, 0 \leq k < K .

Then, for all $k = K - 1, \dots, 1$ ,

\begin{array}{l} {\tilde{φ}}_{M}^{k} (y_{p}^{k + 1}, \dots y_{p}^{K}, x_{b}^{k}) & = & \min_{y^{k}} {{\tilde{h}}^{k} \cdot y^{k} + E [{\tilde{φ}}_{M}^{k - 1} (y^{k}, y_{p}^{k + 1}, \dots, y_{p}^{K}, x_{b}^{k} + D_{M}^{k})]} \\ \geq & \min_{y^{k}} {{\tilde{h}}^{k} \cdot y^{k} + E [{\tilde{φ}}_{M}^{k - 1} (y^{k}, y_{p}^{k + 1} \dots, y_{p}^{K}, x_{a}^{k} + D_{M}^{k})]} \\ = & {\tilde{h}}^{k} \cdot y_{p}^{k} + E [{\tilde{φ}}_{M}^{k - 1} (y_{p}^{k}, \dots, y_{p}^{K}, x_{a}^{k} + D_{M}^{k})] . \end{array}

Apply this inequality repeatedly from k = K to k = 1,

\begin{array}{l} {\tilde{φ}}_{M}^{K} & = {\tilde{h}}^{K} \cdot y_{p}^{K} + E [{\tilde{φ}}_{M}^{K - 1} (y_{p}^{K}, D_{M}^{K})] \\ \geq {\tilde{h}}^{K} \cdot y_{p}^{K} + E [{\tilde{h}}^{K - 1} \cdot y_{p}^{K - 1}] + E [{\tilde{φ}}_{M}^{K - 2} (y_{p}^{K - 1}, y_{p}^{K}, x_{a}^{K - 1} + D_{M}^{K - 1})] \\ \geq \dots \\ \geq {\tilde{h}}^{K} \cdot y_{p}^{K} + \dots + E [{\tilde{h}}^{1} \cdot y_{p}^{1}] + E [{\tilde{φ}}_{M}^{0} (y_{p}^{1}, \dots, y_{p}^{K}, x_{a}^{1} + D_{M}^{1})] \\ \geq \sum_{k = 1}^{K} {\tilde{h}}^{k} \cdot E [y_{p}^{k}] - \tilde{c} \cdot E [z_{p}] \\ = {\tilde{φ}}_{(p), M}^{K}, \end{array}

which proves (102). Therefore,

{\tilde{φ}}_{(p), M}^{K} + b \cdot E [\bar{D}] \leq {\tilde{φ}}_{M}^{K} + b \cdot E [\bar{D}] \leq (φ_{M}^{K} + ϵ) + b \cdot E [\bar{D}] \leq (φ^{K} + b \cdot E [\bar{D}]) + 2 ϵ = \underline{C} + 2 ϵ,

where the second and third inequalities follow from Theorem 5.

Proof of Lemma 7.

The proof requires a substantial amount of articulation. To facilitate understanding, we will first prove the special case with k = 1 before proving the general case in which k can vary from 1 to K.

Special Case ( $k = 1$ ). Consider $φ_{M}^{1} (y^{2}, \dots, y^{K}, x)$ , in which (94)–(97) specialize to

\begin{array}{l} z (\bar{ω}) & \leq x + D_{M}^{1} (\bar{ω}), \bar{ω} \in Ω_{1}^{0}, \\ A^{1} z (\bar{ω}) - y^{1} & \leq 0, \bar{ω} \in Ω_{1}^{0}, \\ A^{k} z (\bar{ω}) & \leq y^{k}, \bar{ω} \in Ω_{1}^{0}, 1 < k \leq K . \end{array}

(A.21)

The coefficient matrix on the LHS of constraints can be expressed as

A = (H, E) where H = (\begin{matrix} H_{1} & & \\ & ⋱ & \\ & & H_{N} \end{matrix}) .

(A.22)

All components of submatrix $H$ are zero except for those in block matrices on the diagonal (here and below we will deviate slightly from the standard definition by referring to a submatrix on the diagonal as a block even if it is not a square one). The number of blocks, N, is the number of elements in $Ω_{1}^{0}$ . Each block H_i $(1 \leq i \leq N)$ is composed of coefficients associated with $z (\bar{ω})$ of a particular $\bar{ω}$ , and thus has $m + n_{1} + \dots + n_{K}$ rows and m columns. In particular,

H_{1} = \dots = H_{N} = (\begin{matrix} I \\ A^{1} \\ \vdots \\ A^{K} \end{matrix}) .

(A.23)

The matrix $E$ is composed of coefficients of $y^{1}$ . It has n₁ columns (the number of components in $y^{1}$ ) and its components are either 0 or –1. Because every H_i $(1 \leq i \leq N)$ has an identity submatrix, $E$ has a negative identity submatrix, and these submatrices are on different rows, $A$ in (A.22) has full column rank.

In general, we define a matrix $A$ as a finite coupling block matrix (FCBM) with characterization numbers $(m, n_{1})$ if:

With necessary permutations of rows and columns, $A$ can fit into the pattern defined by (A.22);
The matrix $A$ is of full column rank;
The submatrix $H$ has a finite number of blocks (denoted by N); all blocks, H_i $(1 \leq i \leq N)$ , have the same finite number of columns (denoted by m), but can have different number of rows;
The submatrix $E$ has a finite number of columns (denoted by n₁), and its components are either 0 or –1.

Let $\hat{A}$ be a maximal nonsingular submatrix of a FCBM $A$ . Because $A$ is of full column rank, $\hat{A}$ has the same number of columns as $A$ . Referring to (A.22), because each block $H_{i}$ $(1 \leq i \leq N)$ in $A$ is of full column rank (otherwise $A$ would not be), $\hat{A}$ must contain at least m rows from each block. To be a square matrix, $\hat{A}$ must draw more rows than columns from some blocks, and the total number of these extra rows is exactly n₁ (the number of columns of $E$ ). This means that the number of blocks with more than m rows in $\hat{A}$ is somewhere between one and n₁. Correspondingly, we can divide $H_{i}$ $(1 \leq i \leq N)$ into two groups: group 1 consists of the blocks with exactly m rows in $\hat{A}$ , each of which forms an m-dimensional nonsingular submatrix; group 2 consists of the blocks with more than m rows in $\hat{A}$ . Hence, by permuting rows and columns, we can write $\hat{A}$ as

\hat{A} = (\begin{matrix} H_{1} & E_{1} & 0 \\ 0 & E_{2} & H_{2} \end{matrix}),

(A.24)

where $H_{1}$ is a diagonal block matrix, where each block is an m-dimensional nonsingular matrix, $H_{2}$ is also a diagonal block matrix with no more than n₁ blocks, where each block has more rows than columns, and $(\begin{matrix} E_{1} \\ E_{2} \end{matrix})$ is a submatrix of $E$ with rearranged rows. Clearly $\hat{A}$ is a FCBM itself.

Under these specifications, Lemma 7 holds when k = 1 if the following statement is true:

For any FCBM $A$ with characterization numbers $(m, n_{1})$ , where m and n₁ can be any positive integer, let $\hat{A}$ be a maximal nonsingular submatrix of $A$ . Let $V$ be the set of values of elements of $A$ . Let $u$ be any vector with the same number of components as the number of rows in $\hat{A}$ . Then there exists a constant κ, depending only on $V$ and $(m, n_{1})$ , such that

| | u | |_{1} \leq κ | | u \hat{A} | |_{1} .

(A.25)

To prove this statement, let $M_{1}$ be the set of indexes of nonsingular blocks in $H_{1}$ . We partition $u$ into $u^{1}$ and $u^{2}$ such that when multiplying $u$ with $\hat{A}$ , components of $u^{1}$ are multiplied with $(H_{1} E_{1})$ and components of $u^{2}$ are multiplied with $(H_{2} E_{2})$ . Similarly, we partition $u^{1}$ into $u_{i}^{1}$ ( $i \in M_{1}$ ) where components of $u_{i}^{1}$ are multiplied with block H_i in $H_{1}$ . For each $i \in M_{1}$ , H_i is nonsingular, thus

| | u^{1} | |_{1} = \sum_{i \in M_{1}} | | u_{i}^{1} | |_{1} = \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} H_{i}^{- 1} | |_{1} \leq κ_{1} \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} | |_{1} .

(A.26)

Here H_i $(i \in M_{1})$ are m-dimensional nonsingular matrices, with their components taking values from the finite set $V$ , and κ₁ is the maximum component value of all possible $H_{i}^{- 1}$ $(i \in M_{1}$ ).

Define $G_{2} ≐ (E_{2} H_{2})$ . Because both $\hat{A}$ and $H_{1}$ are nonsingular matrices, $G_{2}$ is also nonsingular. Each block in $H_{2}$ has m columns. For $G_{2}$ to be a square matrix, $H_{2}$ has n₁ more rows than columns, so the dimension of $G_{2}$ is between $n_{1} + m$ (if $H_{2}$ has one block with $n_{1} + m$ rows) and $(m + 1) n_{1}$ (if $H_{2}$ has n₁ blocks, each with m + 1 rows). Let κ₂ be the maximum component over all possible formations of $G_{2}^{- 1}$ , where $G_{2}$ is a nonsingular matrix with dimension between $n_{1} + m$ and $(m + 1) n_{1}$ and component values drawn from the finite set $V$ . Then

| | u^{2} | |_{1} = | | u^{2} G_{2} G_{2}^{- 1} | |_{1} \leq κ_{2} | | u^{2} G_{2} | |_{1} = κ_{2} (| | u^{2} H_{2} | |_{1} + | | u^{2} E_{2} | |_{1}) .

(A.27)

It follows that

\begin{array}{l} | | u \hat{A} | |_{1} & = \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} | |_{1} + | | u^{2} H_{2} | |_{1} + | | u^{1} E_{1} + u^{2} E_{2} | |_{1} \\ \geq \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} | |_{1} + | | u^{2} H_{2} | |_{1} + | | u^{2} E_{2} | |_{1} - | | u^{1} E_{1} | |_{1} \\ \geq \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} | |_{1} + | | u^{2} H_{2} | |_{1} + | | u^{2} E_{2} | |_{1} - n_{1} | | u^{1} | |_{1} \\ \geq \frac{| | u^{1} | |_{1}}{κ_{1}} + \frac{| | u^{2} | |_{1}}{κ_{2}} - n_{1} | | u^{1} | |_{1}, \end{array}

(A.28)

where the second inequality holds because components of $E_{1}$ are either 0 or –1, and the last inequality follows from (A.26) and (A.27). Also by (A.26),

| | u^{1} | |_{1} \leq κ_{1} \sum_{i \in M_{1}} | | u_{i}^{1} H_{i} | |_{1} \leq κ_{1} | | u \hat{A} | |_{1},

where the second inequality follows from the structure of $\hat{A}$ shown in (A.24). Thus, (A.28) implies that

| | u | |_{1} = | | u^{1} | |_{1} + | | u^{2} | |_{1} \leq (κ_{1} \lor κ_{2}) (| | u \hat{A} | |_{1} + n_{1} | | u^{1} | |_{1}) \leq (κ_{1} \lor κ_{2}) (1 + n_{1} κ_{1}) | | u \hat{A} | |_{1},

(A.29)

and (A.25) is satisfied with

κ = (κ_{1} \lor κ_{2}) (1 + n_{1} κ_{1}) .
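
The key feature of (A.25) is that the constant does not grow with the number of blocks N. The sketch below illustrates this on a hypothetical family with $m = n_{1} = 1$ and $A^{1} = (a)$ : blocks 1 to N - 1 contribute the single row "a z_i - y" (group 1), and block N contributes both its identity row and its coupling row (group 2). For simplicity the sketch uses the maximum absolute row sum of ${\hat{A}}^{- 1}$ as the constant, which is a valid but different choice from the κ constructed in the proof; the function names and the value a = 2 are assumptions made for illustration.

```python
from fractions import Fraction as F
import random

def build_a_hat(n_blocks, a):
    """Square matrix with columns (z_1, ..., z_N, y) for the family above."""
    size = n_blocks + 1
    rows = []
    for i in range(n_blocks - 1):        # group 1: row "a*z_i - y"
        row = [F(0)] * size
        row[i], row[-1] = a, F(-1)
        rows.append(row)
    ident = [F(0)] * size                # group 2: identity row of block N
    ident[n_blocks - 1] = F(1)
    coupling = [F(0)] * size             # group 2: row "a*z_N - y"
    coupling[n_blocks - 1], coupling[-1] = a, F(-1)
    return rows + [ident, coupling]

def inverse(mat):
    """Gauss-Jordan inverse over exact fractions (mat must be nonsingular)."""
    n = len(mat)
    aug = [list(r) + [F(int(i == j)) for j in range(n)] for i, r in enumerate(mat)]
    for col in range(n):
        piv = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[piv] = aug[piv], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [r[n:] for r in aug]

def kappa(mat):
    """Max absolute row sum of the inverse: ||u||_1 <= kappa * ||u mat||_1."""
    return max(sum(abs(v) for v in r) for r in inverse(mat))

def l1(vec):
    return sum(abs(v) for v in vec)

def row_times(u, mat):
    """Row vector u times matrix mat."""
    return [sum(u[i] * mat[i][j] for i in range(len(u))) for j in range(len(mat[0]))]

rng = random.Random(0)
kappas = []
for n_blocks in range(1, 7):
    a_hat = build_a_hat(n_blocks, F(2))
    k = kappa(a_hat)
    kappas.append(k)
    for _ in range(200):
        u = [F(rng.randint(-9, 9)) for _ in range(n_blocks + 1)]
        assert l1(u) <= k * l1(row_times(u, a_hat))
```

With a = 2 the computed constant equals 3 for every N from 1 to 6, because the inverse has the same three row patterns regardless of how many group 1 blocks are present.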

General Case ( $1 \leq k \leq K$ ). We can extend the proof of (107) to cases with $k = 1, \dots, K$ by induction. To do that, we first generalize the definition of FCBM with the following recursive characterizations:

A matrix $A^{0}$ is stage 0 FCBM with characterization number m if it is a finite-dimensional matrix with m columns and full column rank;
A matrix $A^{k}$ is a stage k FCBM $(k \geq 1)$ with characterization numbers $(m, n_{1}, \dots, n_{k})$ if it is of full column rank, and with necessary permutations of rows and columns, can be written as
$A^{k} = (H^{k}, E^{k}),$ (A.30)
where
- The matrix $H^{k}$ has nonzero entries only in diagonal blocks, the number of which, denoted by N_k, can take any finite value. Specifically,
  $H^{k} = (\begin{matrix} A_{1}^{k - 1} & & \\ & ⋱ & \\ & & A_{N_{k}}^{k - 1} \end{matrix});$ (A.31)
  where each block $A_{i}^{k - 1}$ $(1 \leq i \leq N_{k}$ ) is a stage- $(k - 1)$ FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 1})$ ; blocks might not be identical, but they all have the same number of columns;
- The matrix $E^{k}$ has a finite number of columns (denoted by n_k) and its components are either 0 or −1.

For $k = 1, \dots, K$ , the LHS constraint matrix of $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ is a FCBM. To show this, when k = 2, (94)–(97) specialize to

z (\bar{ω}) \leq x + {\underline{D}}_{M}^{2} (\bar{ω}), \bar{ω} \in Ω_{2}^{0},

(A.32)

A^{1} z (\bar{ω}) - y^{1} ({\bar{ω}}^{'}) \leq 0, {\bar{ω}}^{'} \in Ω_{2}^{1}, \bar{ω} \in Ω_{2}^{0}, {\bar{ω}}^{'} ⊏ \bar{ω},

(A.33)

A^{2} z (\bar{ω}) - y^{2} \leq 0, \bar{ω} \in Ω_{2}^{0},

(A.34)

A^{k} z (\bar{ω}) \leq y^{k}, \bar{ω} \in Ω_{2}^{0}, 2 < k \leq K .

(A.35)

With necessary permutations of rows and columns, its LHS constraint matrix can be written as

A^{2} = (H^{2}, E^{2}) = (\begin{matrix} A_{1}^{1} & & & \\ & ⋱ & & E^{2} \\ & & A_{N_{2}}^{1} & \end{matrix}) .

Here $E^{2}$ is composed of coefficients of $y^{2}$ and N₂ is the number of elements in $Ω_{2}^{1}$ . Each $A_{i}^{1}$ $(1 \leq i \leq N_{2})$ corresponds to a particular ${\bar{ω}}^{'} \in Ω_{2}^{1}$ , and

A_{i}^{1} = (\begin{matrix} A_{1}^{0} & & & \\ & ⋱ & & E^{1} \\ & & A_{N_{1}}^{0} & \end{matrix}),

where $E^{1}$ is composed of coefficients of $y^{1} ({\bar{ω}}^{'})$ , and N₁ is the number of elements $\bar{ω} \in Ω_{2}^{0}$ such that ${\bar{ω}}^{'} ⊏ \bar{ω}$ . Corresponding to a particular $\bar{ω}$ , $A_{i}^{0}$ ( $1 \leq i \leq N_{1}$ ) are the same as the $H_{i}$ in (A.23), and are composed of coefficients associated with $z (\bar{ω})$ . Thus, by definition, $A_{i}^{1}$ $(1 \leq i \leq N_{2})$ is a stage 1 FCBM with characterization numbers $(m, n_{1})$ . Moreover, because each $A_{i}^{0}$ $(1 \leq i \leq N_{1})$ contains an identity matrix (Constraints (A.32)), each $E^{1}$ (one for each $A_{i}^{1}$ , $1 \leq i \leq N_{2}$ ) contains a negative identity matrix (Constraints (A.33)), $E^{2}$ contains a negative identity matrix (Constraints (A.34)), and these submatrices have no overlapping rows, $A^{2}$ has full column rank. Hence, $A^{2}$ is a FCBM with characterization numbers $(m, n_{1}, n_{2})$ .

For general k $(1 \leq k \leq K)$ , the same can be shown by induction. Referring to (94)–(97), let N_k be the number of elements in $Ω_{k}^{k - 1}$ . Then we can make the induction assumption that the coefficients of $z (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{0})$ and $y^{k^{'}} ({\bar{ω}}^{'})$ $({\bar{ω}}^{'} ⊏ \bar{ω}, {\bar{ω}}^{'} \in Ω_{k}^{k^{'}}, \bar{ω} \in Ω_{k}^{0}, 1 \leq k^{'} < k)$ are given by N_k stage- $(k - 1)$ FCBMs with characterization numbers $(m, n_{1}, \dots, n_{k - 1})$ , which we denote by $A_{i}^{k - 1}$ $(1 \leq i \leq N_{k})$ . Coefficients of $y^{k}$ , which are either −1 or 0, are given by matrix $E^{k}$ , which has n_k columns. Thus, with necessary permutations of rows and columns, the LHS constraint matrix of $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ can be written as

A^{k} = (\begin{matrix} A_{1}^{k - 1} & & & \\ & ⋱ & & E^{k} \\ & & A_{N_{k}}^{k - 1} & \end{matrix}) .

Because every column in Constraints (94), (95), and (96) is covered by an identity or negative identity matrix and the rows of these matrices do not overlap, $A^{k}$ has full column rank and is, by definition, a FCBM with characterization numbers $(m, n_{1}, \dots, n_{k})$ .

Let ${\hat{A}}^{k}$ be a maximal nonsingular submatrix of $A^{k}$ $(1 \leq k \leq K)$ . Then following the same reasoning that leads to (A.24), ${\hat{A}}^{k}$ and $A^{k}$ must have the same number of columns because $A^{k}$ has full column rank. By (A.30) and (A.31), for $i = 1, \dots, N_{k}, {\hat{A}}^{k}$ contains either a maximal nonsingular submatrix of $A_{i}^{k - 1}$ (the rank of the submatrix equals the column rank of $A_{i}^{k - 1}$ ) or a submatrix that includes every column of $A_{i}^{k - 1}$ and has more rows than columns. The total number of extra rows is n_k for ${\hat{A}}^{k}$ to be nonsingular. Hence ${\hat{A}}^{k}$ can be written as

{\hat{A}}^{k} = (\begin{array}{ccc} H_{1}^{k} & E_{1}^{k} & 0 \\ 0 & E_{2}^{k} & H_{2}^{k} \end{array}),

(A.36)

where $H_{1}^{k}$ is a diagonal block matrix, with each block being a nonsingular submatrix of $A_{i}^{k - 1}$ for some i $(1 \leq i \leq N_{k})$ ; $H_{2}^{k}$ is a non-square diagonal block matrix, where each block is a submatrix of $A_{i}^{k - 1}$ for some other i $(1 \leq i \leq N_{k})$ and has more rows than columns; and $(\begin{matrix} E_{1}^{k} \\ E_{2}^{k} \end{matrix})$ is composed of a subset of rows in $E^{k}$ . The number of blocks in $H_{2}^{k}$ ranges from one to n_k.

Under the previous specifications, the lemma holds for all k $(1 \leq k \leq K)$ if the following statement is true:

For $k = 1, \dots, K$ , let $A^{k}$ be a stage-k FCBM with characterization numbers $(m, n_{1}, \dots, n_{k})$ , where $m, n_{1}, \dots, n_{k}$ can be any positive integers. Let ${\hat{A}}^{k}$ be a maximal nonsingular submatrix of $A^{k}$ . Let $V$ be the set of component values in $A^{k}$ . Let $u$ be any vector with the same number of components as the number of rows in ${\hat{A}}^{k}$ . Then there exists a constant κ, depending only on $V$ and $(m, n_{1}, \dots, n_{k})$ , such that

\| u \|_{1} \leq κ \| u {\hat{A}}^{k} \|_{1} .

(A.37)
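
For a single, fixed nonsingular matrix, a bound of the form (A.37) can be checked numerically. The sketch below relies on the elementary fact that $u = (u \hat{A}) {\hat{A}}^{-1}$, so the largest absolute row sum of ${\hat{A}}^{-1}$ is one admissible constant for that matrix. This per-matrix constant is only an illustration: the lemma asks for a constant that is uniform over all FCBMs with the same characterization numbers and component values, which this code does not establish. The matrix `a_hat` and all function names are hypothetical.

```python
from fractions import Fraction


def inverse(a):
    """Exact inverse of a square nonsingular matrix via Gauss-Jordan elimination."""
    n = len(a)
    m = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(a)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col and m[r][col] != 0:
                f = m[r][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [row[n:] for row in m]


def norm1(v):
    """The 1-norm of a vector."""
    return sum(abs(x) for x in v)


def row_times(u, a):
    """Product of the row vector u with the matrix a."""
    return [sum(u[i] * a[i][j] for i in range(len(u))) for j in range(len(a[0]))]


def kappa(a_hat):
    """Largest absolute row sum of the inverse: a valid constant for this a_hat only."""
    return max(sum(abs(x) for x in row) for row in inverse(a_hat))


# Hypothetical nonsingular matrix: an identity block plus one column with a -1 entry.
a_hat = [[1, 0, 0], [0, 1, 0], [1, 1, -1]]
k_const = kappa(a_hat)
for u in ([1, 1, -1], [2, -3, 5], [0, 0, 1]):
    assert norm1(u) <= k_const * norm1(row_times(u, a_hat))
```

The vector `[1, 1, -1]` attains the bound with equality for this matrix, which shows that the constant cannot be improved here.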

We have already shown in (A.29) that the previous statement is true for k = 1, and thus can use induction to prove the same for k > 1:

Assume that (A.37) holds for any maximal nonsingular submatrix of a stage- $(k - 1)$ FCBM (k > 1) with any finite and positive characterization numbers. Let $A^{k}$ be a stage-k FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 1}, n_{k})$ . Recall that by permuting rows and columns, a maximal nonsingular submatrix of $A^{k}$ can be written as

{\hat{A}}^{k} = (\begin{array}{ccc} H_{1}^{k} & E_{1}^{k} & 0 \\ 0 & E_{2}^{k} & H_{2}^{k} \end{array}),

(A.38)

where $H_{1}^{k}$ is a diagonal block matrix, where each block is a nonsingular stage- $(k - 1)$ FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 1})$ ; $H_{2}^{k}$ is a diagonal block matrix, where each block is also a stage- $(k - 1)$ FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 1})$ but has more rows than columns; and $(\begin{matrix} E_{1}^{k} \\ E_{2}^{k} \end{matrix})$ is a matrix with n_k columns whose components are either −1 or 0. Let blocks in $H_{1}^{k}$ be indexed and let $M_{1}^{k}$ be the index set. Denote these blocks by ${\hat{A}}^{k - 1, i}$ ( $i \in M_{1}^{k}$ ), where the first superscript shows the stage and the second one identifies a particular block.

We partition the vector $u$ into $u^{1}$ and $u^{2}$ such that in the product of $u$ and ${\hat{A}}^{k}$ , components of $u^{1}$ and $u^{2}$ are multiplied with rows in $(H_{1}^{k} E_{1}^{k})$ and $(E_{2}^{k} H_{2}^{k})$ , respectively. We also partition $u^{1}$ into $u_{i}^{1}$ $(i \in M_{1}^{k})$ such that components of $u_{i}^{1}$ are multiplied with rows in ${\hat{A}}^{k - 1, i}$ ( $i \in M_{1}^{k}$ ).

Because ${\hat{A}}^{k - 1, i}$ ( $i \in M_{1}^{k}$ ) is a nonsingular stage- $(k - 1)$ FCBM, its maximal nonsingular submatrix is the matrix itself. Therefore, by the induction assumption, there exists a constant κ₁, depending only on $(m, n_{1}, \dots, n_{k - 1})$ and $V$ , such that

\| u^{1} \|_{1} = \sum_{i \in M_{1}^{k}} \| u_{i}^{1} \|_{1} \leq κ_{1} \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} .

(A.39)

To bound $u^{2}$ , recall that $H_{2}^{k}$ is a diagonal block matrix that contains no more than n_k (non-square) blocks. Suppose that the actual number is $N^{'}$ $(1 \leq N^{'} \leq n_{k})$ . As noted previously, each block is a stage- $(k - 1)$ FCBM with full column rank. Let $M_{2}^{k}$ be the index set of these blocks and denote them by $A^{k - 1, i}$ ( $i \in M_{2}^{k}$ ). Following the definition in (A.30) and (A.31),

A^{k - 1, i} = (H^{k - 1, i} E^{k - 1, i}), i \in M_{2}^{k} .

For the ease of presentation, we let indexes in $M_{2}^{k}$ take integer values $i = 1, \dots, N^{'}$ . Thus, the lower right submatrix in (A.38) can be written as

(E_{2}^{k} H_{2}^{k}) = (\begin{array}{cccc} & A^{k - 1, 1} & & \\ E_{2}^{k} & & ⋱ & \\ & & & A^{k - 1, N^{'}} \end{array}) = (\begin{array}{cccccc} & H^{k - 1, 1} & E^{k - 1, 1} & & & \\ E_{2}^{k} & & & ⋱ & & \\ & & & & H^{k - 1, N^{'}} & E^{k - 1, N^{'}} \end{array}) .

Applying (A.30) and (A.31) recursively, every $H^{k - 1, i}$ is a diagonal block matrix, where each block is a stage- $(k - 2)$ FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 2})$ . Let M(i) be the number of blocks in $H^{k - 1, i}$ $(i \in M_{2}^{k})$ . Denote a block in $H^{k - 1, i}$ by $A_{j}^{k - 2, i}$ $(1 \leq j \leq M (i), i \in M_{2}^{k})$ . Then with necessary permutations of rows and columns, $(E_{2}^{k} H_{2}^{k})$ can be written in a more expanded form as

(\begin{array}{ccccccc|cccc} A_{1}^{k - 2, 1} & & & & & & & & & & \\ & ⋱ & & & & & & E^{k - 1, 1} & & & \\ & & A_{M (1)}^{k - 2, 1} & & & & & & & & \\ & & & ⋱ & & & & & ⋱ & & E_{2}^{k} \\ & & & & A_{1}^{k - 2, N^{'}} & & & & & & \\ & & & & & ⋱ & & & & E^{k - 1, N^{'}} & \\ & & & & & & A_{M (N^{'})}^{k - 2, N^{'}} & & & & \end{array}) .

The submatrix to the left of $|$ is a diagonal block matrix, where each block is a stage $(k - 2)$ FCBM. Every component of the submatrix on the right is either 0 or −1. Since $E_{2}^{k}$ has n_k columns, each $E^{k - 1, i}$ $(i \in M_{2}^{k})$ has $n_{k - 1}$ columns, and the number of elements of $M_{2}^{k}$ is $N^{'} \leq n_{k}$ , the latter submatrix has a finite number, $N^{'} n_{k - 1} + n_{k}$ , of columns. Also by (A.38), $(E_{2}^{k} H_{2}^{k})$ has full rank. Hence, $(E_{2}^{k} H_{2}^{k})$ fits the definition of a nonsingular stage $(k - 1)$ FCBM with characterization numbers $(m, n_{1}, \dots, n_{k - 2}, N^{'} n_{k - 1} + n_{k})$ .

By the induction assumption, there exists a constant κ₂, which depends only on $V$ and $(m, n_{1}, \dots, n_{k - 2}, N^{'} n_{k - 1} + n_{k})$ (and thus only on $V$ and $(m, n_{1}, \dots, n_{k - 1}, n_{k})$ because $1 \leq N^{'} \leq n_{k}$ ), such that

\| u^{2} \|_{1} \leq κ_{2} \| u^{2} (E_{2}^{k} H_{2}^{k}) \|_{1} = κ_{2} (\| u^{2} E_{2}^{k} \|_{1} + \| u^{2} H_{2}^{k} \|_{1}) .

(A.40)

The proof can then be completed by observing that

\begin{aligned} \| u {\hat{A}}^{k} \|_{1} & = \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} + \| u^{1} E_{1}^{k} + u^{2} E_{2}^{k} \|_{1} + \| u^{2} H_{2}^{k} \|_{1} \\ & \geq \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} + \| u^{2} E_{2}^{k} \|_{1} - \| u^{1} E_{1}^{k} \|_{1} + \| u^{2} H_{2}^{k} \|_{1} \\ & \geq \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} + \| u^{2} E_{2}^{k} \|_{1} - n_{k} \| u^{1} \|_{1} + \| u^{2} H_{2}^{k} \|_{1} \\ & \geq \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} + \| u^{2} E_{2}^{k} \|_{1} - n_{k} κ_{1} \sum_{i \in M_{1}^{k}} \| u_{i}^{1} {\hat{A}}^{k - 1, i} \|_{1} + \| u^{2} H_{2}^{k} \|_{1} \\ & \geq \frac{\| u^{1} \|_{1}}{κ_{1}} + \frac{\| u^{2} \|_{1}}{κ_{2}} - n_{k} κ_{1} \| u {\hat{A}}^{k} \|_{1}, \end{aligned}

where the second inequality holds because $E_{1}^{k}$ has n_k columns with components equal to 0 or −1, the third inequality comes from (A.39), and the last inequality is implied by (A.39), (A.40), and the first equality in the previous expression. Thus,

\| u \|_{1} = \| u^{1} \|_{1} + \| u^{2} \|_{1} \leq (κ_{1} \lor κ_{2}) (1 + n_{k} κ_{1}) \| u {\hat{A}}^{k} \|_{1} . □

Appendix B. Illustration of the Inventory Policy

We use a very simple example, the N system, to illustrate our inventory policy developed in Section 4. Figure 1 in Section 2 shows that the system has two products and two components. One unit of component 1 is used by both products, and one unit of component 2 is used by product 2 only. Components have different lead times and $L_{1} < L_{2}$ . Thus, K = 2. We also assume that $c_{1} > c_{2}$ .

The formulation of the SP (13) for this system specializes to

\begin{aligned} φ^{2} & = \min_{y_{2}} \{h_{2} y_{2} + E [φ^{1} (y_{2}, D^{2})]\}, \\ φ^{1} (y_{2}, x) & = \min_{y_{1}} \{h_{1} y_{1} + E [φ^{0} (y_{1}, y_{2}, x + D^{1})]\}, \\ φ^{0} (y_{1}, y_{2}, x) & = - \max_{z_{1}, z_{2}} \{c_{1} z_{1} + c_{2} z_{2} \mid z_{1} \leq x_{1}, z_{2} \leq x_{2}, z_{1} + z_{2} \leq y_{1}, z_{2} \leq y_{2}\} . \end{aligned}

Because $c_{1} > c_{2}$ , the maximum in $φ^{0} (y_{1}, y_{2}, x)$ is attained at

z_{1}^{*} = x_{1} \quad \text{and} \quad z_{2}^{*} = x_{2} \land (y_{1} - x_{1}) \land y_{2} .
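
The closed form can be compared against a brute-force search on small integer instances. The sketch below follows the last-stage problem exactly as printed above, which imposes no sign constraint on $z$, and assumes hypothetical cost values with $c_{1} > c_{2} > 0$. The function names and the search bound `low` are illustrative choices.

```python
def allocate(y1, y2, x1, x2):
    """Closed-form maximizer of the last-stage problem when c1 > c2 > 0."""
    z1 = x1
    z2 = min(x2, y1 - x1, y2)
    return z1, z2


def brute_force(y1, y2, x1, x2, c1, c2, low=-10):
    """Best objective value over integer points (z1, z2) with z1, z2 >= low."""
    best = None
    for z1 in range(low, x1 + 1):                # enforces z1 <= x1
        for z2 in range(low, min(x2, y2) + 1):   # enforces z2 <= x2 and z2 <= y2
            if z1 + z2 <= y1:
                value = c1 * z1 + c2 * z2
                if best is None or value > best:
                    best = value
    return best


c1, c2 = 5, 3  # hypothetical values with c1 > c2
z1, z2 = allocate(y1=4, y2=2, x1=3, x2=2)
assert c1 * z1 + c2 * z2 == brute_force(4, 2, 3, 2, c1, c2)
```

In this instance component 1 is fully committed to product 1 first, and product 2 receives only what remains, which is the priority structure discussed at the end of this appendix.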

Correspondingly, with y₂ and $x$ given, $y_{1}^{*}$ is chosen to minimize

h_{1} y_{1} - c_{1} E [x_{1} + D_{1}^{1}] - c_{2} E [(x_{2} + D_{2}^{1}) \land (y_{1} - x_{1} - D_{1}^{1}) \land y_{2}],

(B.1)

where the expectation is taken with respect to $D^{1} = (D_{1}^{1}, D_{2}^{1})$ . Substituting $x$ with the realization of $D^{2}$ in the resulting optimal objective value $φ^{1} (y_{2}, x)$ and taking the expectation over $D^{2}$ , $y_{2}^{*}$ is chosen to minimize

h_{2} y_{2} + E [φ^{1} (y_{2}, D^{2})] .

(B.2)
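
The nested minimizations in (B.1) and (B.2) can be sketched for discrete demand distributions by enumerating candidate targets on a finite grid. Everything numerical below is a hypothetical illustration: the probability mass functions `pmf1` and `pmf2`, the cost values, and the grid are assumptions, and a grid search is only a stand-in for whatever method is actually used to solve the SP.

```python
def phi0(y1, y2, x1, x2, c1, c2):
    """Last-stage value, using the closed-form allocation (requires c1 > c2)."""
    z1 = x1
    z2 = min(x2, y1 - x1, y2)
    return -(c1 * z1 + c2 * z2)


def phi1(y2, x, pmf1, h1, c1, c2, grid):
    """Return (optimal value, minimizing y1) of (B.1) over a finite grid."""
    best = None
    for y1 in grid:
        cost = h1 * y1 + sum(
            p * phi0(y1, y2, x[0] + d1, x[1] + d2, c1, c2)
            for (d1, d2), p in pmf1.items()
        )
        if best is None or cost < best[0]:
            best = (cost, y1)
    return best


def phi2(pmf2, pmf1, h1, h2, c1, c2, grid):
    """Return (optimal value, minimizing y2) of (B.2) over a finite grid."""
    best = None
    for y2 in grid:
        cost = h2 * y2 + sum(
            p * phi1(y2, d, pmf1, h1, c1, c2, grid)[0] for d, p in pmf2.items()
        )
        if best is None or cost < best[0]:
            best = (cost, y2)
    return best


# Hypothetical data: deterministic demands so the result is easy to check by hand.
pmf1 = {(1, 1): 1.0}   # D^1 = (1, 1) with probability one
pmf2 = {(0, 0): 1.0}   # D^2 = (0, 0) with probability one
value, y2_star = phi2(pmf2, pmf1, h1=1, h2=1, c1=5, c2=3, grid=range(0, 6))
```

With these inputs one unit of each product is demanded, so stocking two units of component 1 and one unit of component 2 serves both.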

It follows that the replenishment policy prescribed in Algorithm 1 specializes to the following actions:

1. Let $y_{2}^{*}$ be the (constant) inventory position target for component 2 and follow a base stock policy to keep the actual inventory position at that level.
2. At time t, where t is either $- L_{1}$ or any time afterward when there is a demand arrival:
  (a) choose $y_{1}^{*}$ to minimize (B.1) with $x = D (t - L_{2} + L_{1}, t)$ , set component 1’s inventory position target to
    $I P_{1} (t) = y_{1}^{*} - [D_{1} (t - L_{2} + L_{1}, t) + D_{2} (t - L_{2} + L_{1}, t)],$
    and order ${(⌈ I P_{1} (t) ⌉ - I P_{1}^{-} (t))}^{+}$ units of that component;
  (b) schedule a future update of the inventory position target and a possible new replenishment at time $t^{'} = t - L_{1} + L_{2}$ . Repeat Step 2(a) when the system reaches $t = t^{'}$ .
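
The order quantity for component 1 is simple arithmetic once $y_{1}^{*}$ is known. A minimal sketch, assuming that $y_{1}^{*}$ has already been computed and that the demand counts over the window $(t - L_{2} + L_{1}, t)$ are available; the function and argument names are hypothetical.

```python
import math


def component1_order(y1_star, d1_window, d2_window, ip_before):
    """Return (inventory position target, order quantity) for component 1.

    d1_window and d2_window are the demands for products 1 and 2 over the
    window (t - L2 + L1, t); ip_before is the inventory position just before
    ordering.
    """
    target = y1_star - (d1_window + d2_window)
    order = max(math.ceil(target) - ip_before, 0)
    return target, order


# Target 7.5 - (2 + 1) = 4.5 is rounded up to 5, so 2 units are ordered.
print(component1_order(7.5, 2, 1, 3))
# Target 4.0 - 3 = 1.0 is already below the current position, so nothing is ordered.
print(component1_order(4.0, 2, 1, 3))
```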

For the allocation policy, the procedure prescribed in Algorithm 2 specializes to the following actions:

At each time t, where t is either zero or any time afterward when there is a demand arrival or some previously ordered component is received:

1. Set backlog targets at
  $B_{1} (t) = 0$ and $B_{2} (t) = {(D_{1} (t - L_{1}, t) + D_{2} (t - L_{1}, t) - I P_{1} (t - L_{1}))}^{+} \lor {(D_{2} (t - L_{2}, t) - y_{2}^{*})}^{+}$ ,
  which correspond to the optimal solution to
  $\min_{B \geq 0} \{c_{1} B_{1} + c_{2} B_{2} \mid B_{1} + B_{2} \geq Q_{1} (t), B_{2} \geq Q_{2} (t)\},$
  using
  $Q_{1} (t) = D_{1} (t - L_{1}, t) + D_{2} (t - L_{1}, t) - I P_{1} (t - L_{1})$ and $Q_{2} (t) = D_{2} (t - L_{2}, t) - y_{2}^{*}$ .
2. Use component 1 to serve demand 1 until either the component runs out or $B_{1} (t) = 0$ . In the latter case, use the remaining amount of component 1 and the available amount of component 2 to serve as much demand 2 as possible.
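
The backlog targets can be checked against the small linear program they are said to solve. The sketch below compares the closed form with a brute-force search over integer points; the cost values, the search bound `upper`, and the function names are hypothetical, and $c_{1} > c_{2} > 0$ is assumed.

```python
def backlog_targets(q1, q2):
    """Closed-form targets: B1 = 0 and B2 = max(Q1, 0) v max(Q2, 0)."""
    return 0, max(q1, q2, 0)


def brute_force_targets(q1, q2, c1, c2, upper=20):
    """Cheapest integer (B1, B2) >= 0 with B1 + B2 >= Q1 and B2 >= Q2."""
    best = None
    for b1 in range(upper + 1):
        for b2 in range(upper + 1):
            if b1 + b2 >= q1 and b2 >= q2:
                cost = c1 * b1 + c2 * b2
                if best is None or cost < best[0]:
                    best = (cost, b1, b2)
    return best


c1, c2 = 5, 3  # hypothetical values with c1 > c2
for q1, q2 in [(3, 1), (-2, 4), (-1, -2)]:
    _, b1, b2 = brute_force_targets(q1, q2, c1, c2)
    assert (b1, b2) == backlog_targets(q1, q2)
```

Because backlogging product 2 is cheaper, any shortage of component 1 is assigned entirely to product 2, which is why $B_{1} (t)$ stays at zero.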

In this process, the backlog target $B_{1} (t)$ does not depend on system state and $B_{2} (t)$ is the higher shortage level of the two components. Thus, the previous procedure can be simplified to a priority policy, that is, use available components to serve as much demand as possible, and when there is not sufficient component 1 to serve both demands, use the component for product 1 first.

Appendix C. Formulation and Perturbation of LP (93)–(97)

The SP in (92) has K + 1 stages. Correspondingly, we develop a scenario tree with K + 1 levels to encode information available at each stage. The top level (k = K) has a single node, which is the root of the tree. A node at lower levels $k = K - 1, \dots, 1$ is the root node of a subtree that starts from that level. On that subtree, the path from the root node to a descendant node at level $k^{'}$ ( $0 \leq k^{'} < k$ ) is encoded by a string

\bar{ω} = ω^{k} \dots ω^{k^{'} + 1},

and $D_{M}^{l} (ω^{l})$ is a realization of demand $D_{M}^{l}$ ( $k^{'} < l \leq k$ ). Because $D_{M}^{l}$ are independent, this specification applies to all subtrees that start from level k. Let $Ω_{k}^{k^{'}}$ be the collection of these strings ( $0 \leq k^{'} < k \leq K$ ). Note that $Ω_{k}^{k^{'}}$ depends on M, but we suppress this to ease the notational burden. For $k^{'} < k$ , the probability associated with a path encoded by string $\bar{ω} \in Ω_{k}^{k^{'}}$ is

P (\bar{ω}) = P_{M}^{k} (ω^{k}) \times \dots \times P_{M}^{k^{'} + 1} (ω^{k^{'} + 1}),

(C.1)

where $P_{M}^{l} (ω^{l})$ is the probability that $D_{M}^{l} = D_{M}^{l} (ω^{l})$ ( $k^{'} < l \leq k$ ). For convenience, we also allow $k^{'} = k$ , where $Ω_{k}^{k}$ contains a single element, $\bar{ω}$ , corresponding to an empty string, and $P (\bar{ω}) = 1$ . The total demand realized on the path $\bar{ω} = ω^{k} \dots ω^{1}$ is then denoted by

{\underline{D}}_{M}^{k} (\bar{ω}) = D_{M}^{k} (ω^{k}) + \dots + D_{M}^{1} (ω^{1}) .

For any two strings ω₁ and ω₂, write $ω_{1} ⊏ ω_{2}$ if ω₁ is a prefix substring of string ω₂. On any subtree that starts from level k ( $1 \leq k \leq K$ ), let ${\bar{ω}}^{'}$ ( ${\bar{ω}}^{'} \in Ω_{k}^{k^{'}}$ ) encode a path between its root node and a descendant node at level $k^{'}$ ( $0 < k^{'} \leq k$ ). Let ${\bar{ω}}^{''}$ ( ${\bar{ω}}^{''} \in Ω_{k}^{k^{''}}$ ) encode a path between the root node and a descendant node at a lower level $k^{''}$ ( $0 \leq k^{''} < k$ ). Then the former path is a segment of the latter one if and only if ${\bar{ω}}^{'} ⊏ {\bar{ω}}^{''}$ .
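
The scenario-tree bookkeeping (strings, path probabilities as in (C.1), total demand along a path, and the prefix relation) can be sketched with a few helper functions. The two-level tree, the outcome labels `"lo"` and `"hi"`, the scalar demand values, and all function names are hypothetical and serve only to illustrate the definitions.

```python
from itertools import product


def scenario_strings(pmfs, k, k_prime):
    """Enumerate the strings (w^k, ..., w^{k'+1}) together with their path probabilities."""
    levels = range(k, k_prime, -1)
    out = {}
    for combo in product(*(pmfs[l].keys() for l in levels)):
        prob = 1.0
        for l, w in zip(levels, combo):
            prob *= pmfs[l][w]  # independence across levels, as in (C.1)
        out[combo] = prob
    return out


def total_demand(path, demand, k):
    """Sum of the demand realizations along a full path (w^k, ..., w^1)."""
    return sum(demand[l][w] for l, w in zip(range(k, 0, -1), path))


def is_prefix(short, long):
    """True if `short` is a prefix of `long`."""
    return long[:len(short)] == short


# Hypothetical two-level tree (K = 2) with two outcomes per level.
pmfs = {2: {"lo": 0.5, "hi": 0.5}, 1: {"lo": 0.25, "hi": 0.75}}
demand = {2: {"lo": 1, "hi": 3}, 1: {"lo": 0, "hi": 2}}

omega_2_0 = scenario_strings(pmfs, 2, 0)  # full paths down to level 0
omega_2_1 = scenario_strings(pmfs, 2, 1)  # paths down to level 1
omega_2_2 = scenario_strings(pmfs, 2, 2)  # the single empty string
```

When $k^{'} = k$ the enumeration returns the single empty tuple with probability one, matching the convention stated above.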

A tree starting from level k ( $0 \leq k \leq K$ ), as specified previously, and associated with data $(y^{k + 1}, \dots, y^{K}, x)$ ( $x \geq 0$ ), allows us to formulate $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ as the following LP (recall that $P (\bar{ω}) = 1$ for $\bar{ω} \in Ω_{k}^{k}$ as the set that contains a single element):

φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x) = \min_{y^{k}, \dots, y^{1}, z} {\sum_{k^{'} = 1}^{k} \sum_{\bar{ω} \in Ω_{k}^{k^{'}}} P (\bar{ω}) h^{k^{'}} \cdot y^{k^{'}} (\bar{ω}) - \sum_{\bar{ω} \in Ω_{k}^{0}} P (\bar{ω}) c \cdot z (\bar{ω})},

(C.2)

subject to

z (\bar{ω}) \leq x + {\underline{D}}_{M}^{k} (\bar{ω}), \bar{ω} \in Ω_{k}^{0},

(C.3)

A^{k^{'}} z (\bar{ω}) - y^{k^{'}} ({\bar{ω}}^{'}) \leq 0, \bar{ω} \in Ω_{k}^{0}, {\bar{ω}}^{'} \in Ω_{k}^{k^{'}}, {\bar{ω}}^{'} ⊏ \bar{ω}, 1 \leq k^{'} < k,

(C.4)

A^{k} z (\bar{ω}) - y^{k} \leq 0, \bar{ω} \in Ω_{k}^{0},

(C.5)

A^{k^{'}} z (\bar{ω}) \leq y^{k^{'}}, \bar{ω} \in Ω_{k}^{0}, k < k^{'} \leq K .

(C.6)

This is the same formulation as (93)–(97) given in Section 7.1. When k = K, (C.2)–(C.6) is the problem defined in (92). When k < K, it defines the subproblem of (92) at stage k ( $0 \leq k < K$ ), with $(y^{k + 1}, \dots, y^{K})$ given by decisions already taken at stages $K, \dots, k + 1$ , and $x$ representing the amounts of existing demands.

We perturb coefficients in (C.2) to keep the optimal solution of the LP unique. Let ${\tilde{h}}^{k}$ be the perturbed values of $h^{k}$ $(1 \leq k \leq K)$ , let $\tilde{b}$ be the perturbed value of $b$ , and, referring to (C.1), let ${\tilde{P}}_{M}^{k} (ω^{k})$ be the perturbed values of $P_{M}^{k} (ω^{k})$ $(1 \leq k \leq K)$ . Consequently,

\tilde{c} = \tilde{b} + \sum_{k = 1}^{K} {(A_{k})}^{'} {\tilde{h}}^{k} \quad \text{and} \quad \tilde{P} (\bar{ω}) = {\tilde{P}}_{M}^{k} (ω^{k}) \times \dots \times {\tilde{P}}_{M}^{k^{'} + 1} (ω^{k^{'} + 1})

are perturbed values of $c$ and $P (\bar{ω})$ ( $\bar{ω} \in Ω_{k}^{k^{'}}, 0 \leq k^{'} < k \leq K$ ), respectively.

In each system, we assign the same values to ${\tilde{h}}^{k}$ ( $1 \leq k \leq K$ ), $\tilde{b}$ , and $P_{M}^{k} (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{k^{'}}, 1 \leq k \leq K)$ . Hence, the same perturbed coefficients are used in $φ_{M}^{k} (y^{k + 1}, \dots, y^{K}, x)$ , independent of data $y^{k + 1}, \dots, y^{K}, x$ ( $0 \leq k \leq K$ ). These coefficients are chosen to satisfy the following conditions.

First, there are no two nonzero rational vectors $r^{l} (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{l}, 1 \leq l \leq k)$ and $r^{0} (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{0})$ such that

\sum_{l = 1}^{k} \sum_{\bar{ω} \in Ω_{k}^{l}} \tilde{P} (\bar{ω}) {\tilde{h}}^{l} \cdot r^{l} (\bar{ω}) + \sum_{\bar{ω} \in Ω_{k}^{0}} \tilde{P} (\bar{ω}) \tilde{c} \cdot r^{0} (\bar{ω}) = 0 .

(C.7)

To meet this condition, we can, for instance, add distinct irrational values to $h^{k}$ $(1 \leq k \leq K)$ , $b$ , and $P_{M}^{k} (\bar{ω})$ ( $\bar{ω} \in Ω_{k}^{k^{'}}, 0 \leq k^{'} < k \leq K$ ). These irrational values are chosen to keep the ratio of any two perturbed coefficients irrational, so that their weighted sum cannot be zero if weights are rational numbers. The condition guarantees that the optimal solution of (C.2)–(C.6) is unique, because if not, then the LP would have two extreme-point solutions, which must be rational-valued because $A^{k}$ and ${\underline{D}}_{M}^{k} (\bar{ω})$ ( $1 \leq k \leq K$ ) in (C.3)–(C.6) are integers. Thus, by induction from k = K downward, $y^{k^{'}}$ $(k < k^{'} \leq K)$ on the RHS of (C.6) are rational vectors. Because the two solutions yield the same objective value, (C.7) is satisfied if we let $r^{l} (\bar{ω})$ be the difference in $y^{l} (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{l}, 1 \leq l \leq k)$ and $r^{0} (\bar{ω})$ be the difference in $z (\bar{ω})$ $(\bar{ω} \in Ω_{k}^{0})$ , which is a contradiction.

Second, these values can be kept sufficiently small; that is, they can satisfy (98) for any choice of $Δ > 0$ . This is easily achievable by dividing the aforementioned irrational values by a sufficiently large integer before adding them to the original coefficients.

References

Agrawal N, Cohen MA (2001) Optimal material control in an assembly system with component commonality. Naval Res. Logist. 48:409–429.Google Scholar
Akçay Y, Xu SH (2004) Joint inventory replenishment and component allocation optimization in an assemble-to-order system. Management Sci. 50:99–116.Link, Google Scholar
Atan Z, Ahmadi T, Stegehuis C, De Kok T, Adan I (2017) Assemble-to-order systems: A review. Eur. J. Oper. Res. 261:866–879.Google Scholar
Atar R, Keslassy I, Mendelson G (2019) Replicate to the shortest queues. Queueing Systems 92(1):1–23.Google Scholar
Bell SL, Williams RJ (2001) Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: Asymptotic optimality of a threshold policy. Ann. Appl. Probability 11:608–649.Google Scholar
Bramson M (1998) State space collapse with application to heavy traffic limits for multiclass queueing networks. Queueing Systems 30:89–140.Google Scholar
Bu J, Gong X, Yao D (2020) Constant-order policies for lost-sales inventory models with random supply functions: Asymptotics and heuristic. Oper. Res. 68:1063–1073.Link, Google Scholar
DeValve L, Pekeč S, Wei Y (2020) A primal-dual approach to analyzing ATO systems. Management Sci. 66:5389–5407.Link, Google Scholar
Doğru MK, Reiman MI, Wang Q (2010) A stochastic programming based inventory policy for assemble-to-order systems with application to the W model. Oper. Res. 58:849–864.Link, Google Scholar
Doğru MK, Reiman MI, Wang Q (2017) Assemble-to-order inventory management via stochastic programming: Chained BOMs and the M-system. Production Oper. Management 26:446–468.Google Scholar
Goldberg DA, Reiman MI, Wang Q (2021) A survey of recent progress in the asymptotic analysis of inventory systems. Prod. Oper. Manag. 30:1718–1750.Google Scholar
Goldberg DA, Katz-Rogozhnikov DA, Lu Y, Sharma M, Squillante MS (2016) Asymptotic optimality of constant-order policies for lost sales inventory models with large lead times. Math. Oper. Res. 41:898–913.Link, Google Scholar
Harrison J (1988) Brownian models of queueing networks with heterogeneous customer populations. Inst. Math. Its Appl. 10:147.Google Scholar
Harrison J (1996) The bigstep approach to flow management in stochastic processing networks. Stochastic Networks Theory Appl. 4:147–186.Google Scholar
Harrison J, López M (1999) Heavy traffic resource pooling in parallel-server systems. Queueing Systems 33:339–368.Google Scholar
Harrison J, Wein L (1990) Scheduling networks of queues: Heavy traffic analysis of a two-station closed network. Oper. Res. 38:1052–1064.Link, Google Scholar
Harrison JM, van Mieghem JA (1997) Dynamic control of Brownian networks: State space collapse and equivalent workload formulations. Ann. Appl. Probability 7:747–771.Google Scholar
Hausman W, Lee H, Zhang A (1998) Order response time reliability in a multi-item inventory system. Eur. J. Oper. Res. 109:646–659.Google Scholar
Huang K, de Kok T (2015) Optimal FCFS allocation rules for periodic-review assemble-to-order systems. Naval Res. Logist. 62:158–169.Google Scholar
Huh WT, Rusmevichientong P (2009) A nonparametric asymptotic analysis of inventory planning with censored demand. Math. Oper. Res. 34:103–123.Link, Google Scholar
Huh WT, Janakiraman G, Muckstadt JA, Rusmevichientong P (2009) Asymptotic optimality of order-up-to policies in lost sales inventory systems. Management Sci. 55:404–420.Link, Google Scholar
Karlin S, Scarf H (1958) Inventory models of the Arrow-Harris-Marschak type with time lag. Stud. Math. Theory Inventory Production 1:155.Google Scholar
Lu Y, Song J-S (2005) Order-based cost optimization in assemble-to-order systems. Oper. Res. 53:151–169.Link, Google Scholar
Lu L, Song JS, Zhang H (2015) Optimal and asymptotically optimal policies for assemble-to-order N- and W-systems. Naval Res. Logist. 62:617–645.Google Scholar
Lu Y, Song J-S, Zhao Y (2010) No-holdback allocation rules for continuous-time assemble-to-order systems. Oper. Res. 58:691–705.Link, Google Scholar
Mangasarian OL, Shiau T-H (1987) Lipschitz continuity of solutions of linear inequalities, programs, and complementarity problems. SIAM J. Control Optim. 25:583–595.Google Scholar
Meyer CD (2000) Matrix Analysis and Applied Linear Algebra, vol. 71 (SIAM, Philadelphia).Google Scholar
Nadar E, Akan M, Scheller-Wolf A (2014) Optimal structural results for assemble-to-order generalized M-systems. Oper. Res. 62:571–579.Link, Google Scholar
Plambeck EL, Ward AR (2006) Optimal control of a high-volume assemble-to-order system. Math. Oper. Res. 31:453–477.Link, Google Scholar
Reiman MI (1984) Some diffusion approximations with state space collapse. Modelling and Performance Evaluation Methodology (Springer, Berlin), 207–240.Google Scholar
Reiman MI (2004) A new and simple policy for a continuous review lost-sales inventory model. Unpublished manuscript.Google Scholar
Reiman MI, Wang Q (2012) A stochastic program based lower bound for assemble-to-order inventory systems. Oper. Res. Lett. 40:89–95.Google Scholar
Reiman MI, Wang Q (2015) Asymptotically optimal inventory control for assemble-to-order systems with identical lead times. Oper. Res. 63:716–732.Link, Google Scholar
Reiman MI, Wan H, Wang Q (2016) On the use of independent base-stock policies in assemble-to-order inventory systems with nonidentical lead times. Oper. Res. Lett. 44:436–442.Google Scholar
Rosling K (1989) Optimal inventory policies for assembly systems under random demands. Oper. Res. 37:565–579.Link, Google Scholar
Schrijver A (1986) Theory of Linear and Integer Programming (John Wiley & Sons, Hoboken, NJ).Google Scholar
Shapiro A, Dentcheva D, Ruszczyński A (2009) Lectures on Stochastic Programming: Modeling and Theory (SIAM, Philadelphia).Google Scholar
Song J-S, Zipkin P (2003) Supply chain operations: Assemble-to-order systems. Handbook Oper. Res. Management Sci. 11:561–596.Google Scholar
Stolyar AL, Wang Q (2022) Exploiting random lead times for significant inventory cost savings. Oper Res. 70(4):2496–2516.Google Scholar
van Jaarsveld W, Scheller-Wolf A (2015) Optimization of industrial-scale assemble-to-order systems. INFORMS J. Comput. 27:544–560.Link, Google Scholar
Wei L, Jasin S, Xin L (2021) On a deterministic approximation of inventory systems with sequential service-level constraints. Oper. Res. 69:1057–1076.Link, Google Scholar
Xin L, Goldberg DA (2016) Optimality gap of constant-order policies decays exponentially in the lead time for lost sales models. Oper. Res. 64:1556–1565.Link, Google Scholar
Zhang AX (1997) Demand fulfillment rates in an assembleto-order system with multiple products and dependent demands. Production Oper. Management 6:309–324.Google Scholar
Zipkin PH (2000) Foundations of Inventory Management (McGraw-Hill, New York).Google Scholar
Zipkin P (2016) Some specially structured assemble-to-order systems. Oper. Res. Lett. 44:136–142.Google Scholar

Volume 13, Issue 1

March 2023

Pages 1-180

Information

Received:September 21, 2018
Accepted:July 26, 2022
Published Online:October 26, 2022

Cite as

Martin I. Reiman, Haohua Wan, Qiong Wang (2022) Asymptotically Optimal Inventory Control for Assemble-to-Order Systems. Stochastic Systems 13(1):128-180.

https://doi.org/10.1287/stsy.2022.0099

Acknowledgments

The authors are grateful to the associate editor and anonymous referees for constructive comments.
