Free Access

Compressed Smooth Sparse Decomposition

Shancong Mou
Shancong Mou
[email protected]
H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332
Search for more papers by this author
,
Jianjun Shi
Corresponding Author
Jianjun Shi
[email protected]
https://orcid.org/0000-0002-3774-9176
H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332
Search for more papers by this author

Shancong Mou

[email protected]

H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332

Search for more papers by this author

Jianjun Shi

Corresponding Author

Jianjun Shi

[email protected]

https://orcid.org/0000-0002-3774-9176

H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332

Search for more papers by this author

Published Online:7 Nov 2022https://doi.org/10.1287/ijds.2022.0023

Abstract

Image-based anomaly detection systems are of vital importance in various manufacturing applications. The resolution and acquisition rate of such systems are increasing significant in recent years under the fast development of image sensing technology. This enables the detection of tiny anomalies in real time. However, such a high resolution and a high acquisition rate of image data not only slow down the speed of image processing algorithms but also, increase data storage and transmission cost. To tackle this problem, we propose a fast and data-efficient method with theoretical performance guarantee that is suitable for sparse anomaly detection in images with a smooth background (smooth plus sparse signal). The proposed method, named compressed smooth sparse decomposition (CSSD), is a one-step method that unifies the compressive image acquisition- and decomposition-based image processing techniques. To further enhance its performance in a high-dimensional scenario, a Kronecker compressed smooth sparse decomposition (KronCSSD) method is proposed. Compared with traditional smooth and sparse decomposition algorithms, significant transmission cost reduction and computational speed boost can be achieved with negligible performance loss. Simulation examples and several case studies in various applications illustrate the effectiveness of the proposed framework.

History: Kwok-Leung Tsui served as the senior editor for this article.

Funding: This work is partially support by the National Science Foundation Division of Engineering Education and Centers [Grant 2052714].

Data Ethics & Reproducibility Note: The code capsule is available on Code Ocean at https://doi.org/10.24433/CO.6352310.v2 and in the e-Companion to this article (available at https://doi.org/10.1287/ijds.2022.0023).

1. Introduction

High-quality image sensing systems are widely used in manufacturing processes for product quality monitoring and fault diagnosis. The resolution and acquisition rate of such systems increase significantly benefiting from the rapid development of image sensing technology. For example, in a hot rolling process, an in situ image-based sensor can detect a micrometer-sized seam on a rolling bar at a speed of up to 225 miles per hour (Yan et al. 2018). For another example, to monitor solar activity, satellites can capture high-resolution solar images with a high acquisition rate, producing terabytes of data per day (Wang et al. 2018). To achieve real-time inspection, a large volume of high-resolution images needs to be transmitted and processed in real time. Such a large volume of high-resolution image data poses a big challenge not only on the speed of image processing algorithms but also, for the storage and transmission of the data itself.

Matrix decomposition-based image processing techniques are widely used in image-based process monitoring and anomaly detection. They achieve the goal by integrating the prior for background and anomaly components into the optimization problem. In terms of utilizing the low-rank and sparse property, robust principal component analysis was first proposed by Candès et al. (2011) to decompose a data matrix into low-rank and element-wise sparse components. One of its famous applications is dynamic foreground and static background separation (Bouwmans and Zahzah 2014). Following this approach, numerous algorithm variants have been proposed, including outlier pursuit (Xu et al. 2012), which aims to decompose the data matrix into a low-rank component and a column-wise sparse component, and low-rank plus compressed sparse decomposition (Mardani et al. 2013), which aims to decompose the data matrix into low-rank and compressed sparse components and so on. For utilizing smooth and sparse properties, smooth and sparse decomposition (SSD) methods (Minaee et al. 2015, Yan et al. 2017) are proposed for anomaly detection in images with smooth backgrounds. Following this approach, several explorations have been conducted, including spatiotemporal smooth sparse decomposition (ST-SSD) (Yan et al. 2018), additive tensor decomposition (Mou et al. 2021), and so on. By adopting the matrix decomposition approach, both the background and anomaly can be captured without detection time delay. However, because of the requirement of storage, transmission, and processing of the whole image signal, it cannot be applied in the scenario with low-transmission bandwidth but high processing speed requirements: for example, the solar flare detection application (Augusto et al. 2011).

To mitigate the data storage and transmission burden and improve sensing efficiency, compressive sensing (CS) (Candes et al. 2006) has been proposed, in which the data are directly collected in a compressed form and then, reconstructed accurately with high probability. More specifically, suppose that the original signal is a sparse vector $y \in R^{n}$ , and the main idea is to store and transmit a small set of compressive measurements $y' = Ay \in R^{p}$ , where $A \in R^{p \times n}$ is an underdetermined sensing matrix ( $p ≪ n$ ) satisfying specific properties. Then, the original signal can be reconstructed from its compressed form $y'$ , on which assorted image processing algorithms can be applied for defect detection and so on. For a comprehensive review of CS, please refer to Marques et al. (2018) and Rani et al. (2018). Even though promising, the naïve approach that tries to first reconstruct the image from the compressed measurement and then, apply matrix decomposition algorithms for anomaly detection has two issues.

For smooth plus sparse signals, the existence of such a sensing matrix $A$ satisfying specific properties is unknown.
The reconstruction process is usually computationally intensive (Marques et al. 2018), which on the other hand, slows down the overall computational speed of the defect detection algorithms.

Recently, to integrate the CS with matrix decomposition algorithms, Waters et al. (2011) proposed an SpaRCS method to recover low-rank and sparse matrices directly from compressive measurements, and Tanner and Vary (2020) gave a rigorous performance discussion. However, those methods do not consider the smooth plus sparse decomposition problems and are not efficient in dealing with high-order data.

In this paper, we discuss the possibility of adopting compressive data acquisition systems for image-based quality monitoring and fault detection in applications where the background is smooth, and anomalies are sparse. To achieve so, we propose a compressed smooth sparse decomposition (CSSD) framework. In this framework, the signal processing algorithms are directly applied to the compressed data, and no reconstruction step is needed. By doing so, a significant cost reduction in sensing, storage, and transmission as well as a boost in the speed of image processing algorithms can be achieved with negligible performance loss. We also established the theoretical foundation of adopting such a compressive data acquisition system for smooth plus sparse signals as well as the performance guarantee of the proposed algorithm. To further improve its performance in high-order scenarios, a Kronecker compressed smooth sparse sensing (KronCSSD) is proposed.

The remainder of this paper is organized as follows. In Section 2, we present the CSSD framework. In Section 3, we use simulation studies to validate the proposed framework. In Section 4, we demonstrate the proposed framework using several case studies. Finally, Section 5 concludes the paper.

2. Compressed Smooth Sparse Decomposition Framework

In this section, we will present the proposed CSSD framework. As mentioned in Section 1, we aim to design a fast and data-efficient method for sparse anomaly (sparse signal component) detection in signals with a smooth background (smooth signal component). For simplicity, we first discuss the methodology for one-dimensional (1D) signals and then, generalize it to n-dimensional images. Figure 1 provides an overview of the proposed methodology. There are three stages in the proposed methodology. (i) The signal is acquired in its compressed form through compressive measurement. (ii) Then, the compressed data are transmitted to the server. (iii) Finally, a decomposition algorithm will be applied directly to the compressed signal to decompose it into its corresponding smooth and sparse signal components.

**Figure 1. (Color online) An Overview of the CSSD/KronCSSD Framework**

Mathematically, let $y \in R^{n}$ be a smooth plus sparse signal (which will be defined formally in Section 2.1). We aim to store and transmit a small set of compressive measurements $y'$ (i.e., $y' = Ay$ ) and then, reconstruct the smooth and sparse signal components from $y'$ . To achieve so, there are three questions to be addressed.

What is a smooth plus sparse signal?
How do we compress such a signal?
How do we reconstruct such a signal from compressive measurements?

The remainder of this section is organized as follows to answer those three questions. We start with a formal definition of 1D smooth plus sparse signals in Section 2.1. Based on that, we introduce the compressive measurement method and discuss its theoretical properties for such signals in Section 2.2. In Section 2.3, we present the proposed CSSD framework that can directly reconstruct the smooth and sparse signal components from the compressive measurement by solving the following optimization problem:

\begin{matrix} \min_{θ, θ_{a}} ∥ θ_{a} ∥_{1} \\ s . t . ∥ A (B θ + B_{a} θ_{a}) - y' ∥_{2} \leq ϵ_{1}, \end{matrix}

(1)

where

B

and

B_{a}

are bases,

θ

and

θ_{a}

are corresponding coefficients, and

ϵ_{1}

is the bound for measurement error. The reconstruction accuracy is also characterized theoretically. Then, we generalize the CSSD algorithm to n dimensions using Kronecker compressive sensing (KCS) (Duarte and Baraniuk 2011) and propose a KronCSSD formulation in Section 2.4. Finally, in Section 2.5, we present the advantage of the proposed framework and give the strategy of selecting the compressive ratio, tuning parameters, and bases in practice.

2.1. The Set of Smooth Plus Sparse Signals

In this section, we define the set of smooth plus sparse signals mathematically. The smooth signals originate from the spline smoothing (De Boor and De Boor 1978, Eilers and Marx 1996), where the raw signal is approximated by a linear combination of a set of spline basis functions for smooth interpolation and denoising, and a spline regression technique is usually utilized. To improve the regression robustness with respect to outliers, outliers are explicitly accounted for in the regression model as a sparse component of the raw signal (Giannakis et al. 2011, Mateos and Giannakis 2011). This idea was further generalized to incorporate the special structure of outliers (Yan et al. 2017, 2018).

As a summary, a 1D signal $y \in R^{n}$ is defined as a smooth plus sparse signal if it can be decomposed into two signal components: (i) a smooth signal $m \in R^{n}$ in a low-dimensional subspace spanned by a set of smooth bases (i.e., $m = B θ$ , where $B \in R^{n \times r}$ is a basis matrix with $r ≪ n$ ) and (ii) a sparse signal $a \in R^{n}$ in a relatively high-dimensional subspace spanned by a set of predefined bases, of which the coefficients admit sparse property (i.e., $a = B_{a} θ_{a}$ , where $B_{a} \in R^{n \times q}$ is a basis matrix with $q \leq n$ and $θ_{a} \in R^{q}$ is an $s$ -sparse vector; i.e., $∥ θ_{a} ∥_{0} \leq s$ ). Given a smooth plus sparse signal $y$ , we define the aforementioned decomposition as SSD (i.e., $y = m + a$ ).

To ensure the uniqueness of SSD in a nontrivial case when $n \leq r + q$ , the following definition is introduced.

Definition 2.1.

The local support property of $B_{a}$ , $B_{a} \in R^{n \times q}$ . $B_{a}$ only has local support such that each column of $B_{a}$ only has nonzero values inside a specific interval. The length of this interval is defined as $l (B_{a})$ .

Notice that $l (B_{a}) \in {1, \dots, n}$ . The local support property with a small $l$ ensures the sparsity of $a$ . For example, the B-spline basis has a local support property (Unser 1999).

Definition 2.2.

The incoherence condition of $B, B \in R^{n \times r}$ . Let $B = UΣ V^{T}$ be the reduced singular value decomposition (SVD) of $B$ , where $U \in R^{n \times r}, Σ \in R^{r \times r}$ , and $V \in R^{r \times r}$ . Its incoherence condition parameter $μ (B)$ is defined as the smallest value such that

\max_{i \in {1, .., r}} ∥ U^{T} e_{i} ∥_{2} \leq \sqrt{\frac{μ (B) r}{n}},

where

e_{i}

is the

i

th standard basis vector in

R^{n}

Notice that $μ (B) \in [1, \sqrt{n / r}]$ . The incoherent condition with a small $μ$ ensures that the $m$ is not sparse (Candès et al. 2011).

The following theorem ensures the uniqueness of the SSD decomposition.

Theorem 2.1.

If $μ (B) < n {(2 r s l)}^{- 1}$ , then the SSD decomposition is unique with respect to $m$ and $a$ .

Theorem 2.1 gives the condition that the smooth plus sparse signal can be uniquely decomposed into a smooth part and a sparse part. The proof of Theorem 2.1 is in Appendix A.

Formally, we define the set of smooth plus sparse signals as follows.

Definition 2.3.

The set of smooth and sparse signals is defined as $M S_{r, s, μ, l} :$

\begin{array}{l} M S_{r, s, μ, l} = {y \in R^{n} | y = B θ + B_{a} θ_{a}, B_{a} \in R^{n \times q}, l (B_{a}) = l, θ_{a} \in R^{q}, ∥ θ_{a} ∥_{0} \leq s, B \in R^{n \times r}, \\ μ (B) = μ < n {(2 r s l)}^{- 1}, θ \in R^{r}} . \end{array}

For such a smooth plus sparse signal, how to compress it while ensuring the reconstruction performance will be discussed in the next section.

2.2. Compressive Sensing for Smooth Plus Sparse Signals

As mentioned in Section 1, to ensure the reconstruction performance, the sensing matrix $A$ has to satisfy the so-called restricted isometry property (RIP) (Candes 2008). It has been proven that a random matrix can satisfy the RIP property for the sparse signal (Candes 2008), the low-rank signal (Recht et al. 2010), and the rank plus sparse signal (Tanner and Vary 2020). However, the existence of such a matrix for the smooth plus sparse signal, which is the foundation of adopting compressed data acquisition techniques in applications with smooth background and sparse anomalies, is still unknown. In this section, we will discuss the existence of such a sensing matrix. Before stating the result, we will first present the relevant definitions that are necessary to derive the main result.

Definition 2.4.

RIP for $M S_{r, s, μ, l}$ . Let $A \in R^{p \times n}$ be a linear measurement matrix. For every quadruple $(r, s, μ, l)$ , define the restricted isometry constant (RIC) $δ_{r, s, μ, l}$ to be the smallest positive constant such that

(1 - δ_{r, s, μ, l}) ∥ y ∥_{2} \leq ∥ Ay ∥_{2} \leq (1 + δ_{r, s, μ, l}) ∥ y ∥_{2}, \forall y \in M S_{r, s, μ, l} .

If such a $δ_{r, s, μ, l} \in (0, 1)$ exists, we say that $A$ satisfies the RIP.

Theorem 2.2.

Suppose that $δ_{r, 2 s, μ, l} < 1$ for some integer $r, s, l \geq 1$ and positive numbers $μ < n {(2 r s l)}^{- 1}$ ; then, there is a $y_{0}$ in the set $M S_{r, s, μ, l}$ , which is the only solution for $A y_{0} = b .$

Theorem 2.2 guarantees the uniqueness of the smooth plus sparse signal that satisfies the sensing equation when $A$ satisfies the RIP. The proof of Theorem 2.2 is in Appendix B.

Next, we prove that for the set of smooth plus sparse signals, $M S_{r, s, μ, l}$ , there exists such a matrix $A$ satisfying the RIP property with RIC = $δ_{r, s, μ, l}$ with high probability.

Notice that the RIP for a matrix is difficult to verify. A suitable set of random matrices that obey the RIP for the set of sparse vectors with high probability (Recht et al. 2010, Tanner and Vary 2020) is defined as follows.

Definition 2.5.

Nearly isometric matrices (Baraniuk et al. 2008). Let $A \in R^{p \times n}$ be a random variable that takes values in linear maps from $R^{n}$ to $R^{p}$ ; then, for any $y \in R^{n}$ , $A$ is nearly isometrically distributed if

$E [∥ Ay ∥_{2}^{2}] = ∥ y ∥_{2}^{2}$ and
$P r (| ∥ Ay ∥_{2}^{2} - ∥ y ∥_{2}^{2} | \geq ϵ ∥ y ∥_{2}^{2}) \leq 2 e^{- p c_{0} (ϵ)}, 0 < ϵ < 1,$
where $c_{0} (ϵ)$ is a constant that only depends on $ϵ$ .

The $p \times n$ matrix with independent, identically distributed (i.i.d.) Gaussian entries satisfies those two properties (Baraniuk et al. 2008) $($ i.e., $A_{i j} \sim N (0, \frac{1}{p}),$ with $c_{0} (ϵ) = ϵ^{2} / 4 - ϵ^{3} / 6$ $)$ . There are also other distributions satisfying the nearly isometric property, such as the $p \times n$ matrix with i.i.d. Bernoulli entries and their related distribution (Baraniuk et al. 2008).

Then, the following theorem states that the nearly isometric matrices can also serve as the sensing matrix for smooth plus sparse signals and gives the magnitude of the number of linear measurements.

Theorem 2.3.

Let $A \in R^{p \times n}$ be a matrix from the families described in Definition 2.5. Furthermore, assume that $μ < n {(2 r s l)}^{- 1}$ and the basis matrix $B_{a}$ for the sparse signal component satisfies the RIP with RIC $δ_{B_{a}, s} \in (0, 1)$ : that is, $δ_{B_{a}, s}$ to be the smallest positive constant such that

(1 - δ_{B_{a}, s}) ∥ θ_{a} ∥_{2} \leq ∥ B_{a} θ_{a} ∥_{2} \leq (1 + δ_{B_{a}, s}) ∥ θ_{a} ∥_{2}, \forall θ_{a} \in {θ_{a} \in R^{q}, ∥ θ_{a} ∥_{0} \leq s} .

For a given $δ \in (0, 1)$ , there exists constants $c_{1}, c_{2} > 0$ depending only on $δ$ , such that the RIC for $M S_{r, s, μ, l}$ is upper bounded by $δ,$ with the probability of at least $1 - \exp (- c_{1} p)$ , whenever

p \geq c_{2} (\ln 2 + r \ln \frac{24}{δ} τ_{1} + s (1 + \ln \frac{24}{δ} τ_{0} + \ln \frac{n}{s})),

where

η = \sqrt{\frac{μ rsl}{n}}

τ_{0} = \frac{1}{\sqrt{(1 - δ_{B_{a}, s}) (1 - η^{2})}}

τ_{1} = ∥ B^{†} ∥_{2} (1 + \frac{1}{\sqrt{1 - η^{2}}})

, and

B^{†} = {(B^{T} B)}^{- 1} B^{T}

Theorem 2.3 states that if the nearly isometric matrix is selected as the sensing matrix, the RIC for $M S_{r, s, μ, l}$ is upper bounded with high probability. In practice, $B$ and $B_{a}$ are prespecified based on the understanding/engineering knowledge of the process (please refer to Section 2.5.4 for more detail). Theorem 2.3 also provides a guidance on the magnitude of linear measurements $p$ , which determines the compressive measurement matrix $A$ .

The proof of Theorem 2.3 is in Appendix C.

In this section, we answered two fundamental questions. (i) How do we compress the smooth plus sparse signal (design the compressive measurement matrix $A$ )? (ii) How many linear measurements are needed to preserve the information in smooth plus sparse signals with high probability?

In the next section, we will discuss the problem formulation to recover the smooth and sparse signal components simultaneously from the compressed signal using the CSSD framework.

2.3. Compressed Smooth Sparse Decomposition

As mentioned in Section 1, one way to recover the smooth component and sparse component is first reconstructing the compressed image and then, using the SSD algorithm. However, this will slow down the speed of the defect detection algorithm. Instead, we propose to solve the one-step convex relaxation Problem (1). Natural questions are if we can recover the smooth and sparse signals from the compressed measurement $y^{'}$ by solving Problem (1) and what the accuracy is. The following theorem guarantees the recovery performance of the proposed convex relaxation Problem (1).

Theorem 2.4.

Let $A \in R^{p \times n}$ be a matrix from the families described inDefinition 2.5. Let the signal $y =$ $m_{0} + a_{0} \in M S_{r, s, μ, l}$ , where $m_{0} = B θ_{0}$ and $a_{0} = B_{a} θ_{a 0}$ . Assume that Problem (1) is feasible, and let the optimal solution be $m^{*} = B θ^{*}$ and $a^{*} = B_{a} θ_{a}^{*}$ . Assume that the basis matrix $B_{a}$ for sparse signal component satisfies the RIP with RIC $δ_{B_{a}, 2 s} \in (0, 1)$ . Let $a = (1 + α_{1} + α_{2}) γ^{2} + 2 α_{2} + 2$ and c $= 1 - γ^{2} α_{1} α_{2} - α_{2}^{2}$ , where $α_{1} = \frac{η}{1 - η^{2}}$ , $α_{2} = \frac{\sqrt{2} η}{1 - 2 η^{2}}$ , $γ = \sqrt{\frac{1 + δ_{B_{a}}, 2 s}{1 - δ_{B_{a}, 2 s}}}$ , and $η = \sqrt{\frac{μ rsl}{n}}$ . Suppose that $r, s, l \in N$ and $μ < n {(2 r s l)}^{- 1}$ , such that $c > 0$ and $δ_{r, 3 s, μ, l} \in (0, c / a)$ ; then,

{∥ a_{0} - a^{*} ∥}_{2} = {∥ B_{a} θ_{a 0} - B_{a} θ_{a}^{*} ∥}_{2} \leq C_{a} ϵ_{1}

and

{∥ m_{0} - m^{*} ∥}_{2} = {∥ B θ_{0} - B θ^{*} ∥}_{2} \leq C_{m} ϵ_{1},

where

C_{a} = \frac{(1 + γ^{2}) (1 + α_{2}) \sqrt{1 + δ_{r, 3 s, μ, l}}}{c - a δ_{r, 3 s, μ, l}}

and

\begin{matrix} C_{m} = \frac{\sqrt{1 + δ_{r, 3 s, μ, l}} + (δ_{r, 3 s, μ, l} + \frac{γ^{2}}{1 + γ^{2}} α_{1} + \frac{1}{1 + γ^{2}} α_{2}) C_{a}}{(1 - δ_{r, 3 s, μ, l})} . \end{matrix}

Theorem 2.4 gives the conditions that the proposed convex relaxation Problem (1) can recover the true smooth and sparse signal up to a constant times the noise bound.

The proof of Theorem 2.4 is in Appendix D.

Notice that one advantage of the proposed CSSD framework is that it is compatible with existing decomposition algorithms. For example, for the SSD algorithm proposed by Yan et al. (2017), the problem formulation becomes

\begin{matrix} \min_{θ, θ_{a}} ∥ y' - A (B θ + B_{a} θ_{a}) ∥_{2}^{2} + λ ∥ θ_{a} ∥_{1}, \end{matrix}

(2)

which can be solved efficiently by using the algorithm proposed by Yan et al. (2017).

2.4. Kronecker Compressed Smooth Sparse Decomposition

In the previous section, we discussed the CSSD framework for the 1D signal. In this section, we will generalize the proposed CSSD to KronCSSD framework for high-order tensor data.

Let $Y \in R^{n_{1} \times \dots \times n_{d}}$ be the original signal and $y \in R^{N}$ be its corresponding vectorized signal (i.e., $y = vec (Y)$ and $N = \prod_{i = 1}^{d} n_{i}$ ). Let $A \in R^{p \times N}$ be a measurement matrix satisfying the RIP. Let $y' \in R^{p}$ be the compressed data, such that $y' = Ay$ . $B = \otimes_{i = 1}^{d} B_{i}$ and $B_{a} = \otimes_{i = 1}^{d} B_{a i}$ are the known bases for smooth and sparse components, respectively. Notice that problem formulation (1) can still be used for recovering the high-order smooth and sparse signal components from the compressive measurement. However, there are two issues. (i) In practice, the global CS measurements matrix $A$ is hard to realize using the CS device (Duarte and Baraniuk 2011). (ii) The resulting bases $B$ and $B_{a}$ can be extremely large as the dimension of the data increases, which will not only cause a big challenge in the storage of such a large matrix but also, result in the computational issue in handling such large matrices.

In reality, the high-dimensional tensor data usually have low-rank properties along each mode, which have been extensively exploited in tensor low-rank modeling techniques, such as CANDECOMP/PARAFAC (CP)/Tucker decompositions (Kolda and Bader 2009). This makes it possible for designing a sensing matrix for each mode, which is called Kronecker CS (Duarte and Baraniuk 2011). Inspired by the KCS method, we propose the KronCSSD framework formulation as follows.

Let $A_{i} \in R^{p_{i} \times n_{i}}$ be a measurement matrix with RIP for each mode of the tensor; then, we have the following formulation,

\begin{matrix} \min_{Θ, Θ_{a}} ∥ vec (Θ_{a}) ∥_{1} \\ s . t . ∥ vec (Y_{1} - Y^{'}) ∥_{2} \leq ϵ_{1}, \\ Y_{1} = Θ \times_{1} (A_{1} B_{1}) \times_{2} \dots \times_{d} (A_{d} B_{d}) + Θ_{a} \times_{1} (A_{1} B_{a 1}) \times_{2} \dots \times_{d} (A_{d} B_{a d}), \end{matrix}

(3)

where

Y^{'} = Y \times_{1} A_{1} \times_{2} \dots \times_{d} A_{d}

is the compressive measurement.

Θ \in R^{r_{1} \times \dots \times r_{d}}

and

Θ_{a} \in R^{q_{1} \times \dots \times q_{d}}

are the basis coefficients for the smooth and sparse signal components, respectively.

The proposed KronCSSD framework is a nontrivial generalization of the CSSD framework for 1D signals. Its performance will be shown empirically in simulation and case studies. The theoretical discussion of the KronSSD framework is left for future work.

For example, for a two-dimensional (2D) image $Y \in R^{n_{1} \times n_{2}}$ , when adopting the SSD algorithm (Yan et al. 2017), the problem formulation becomes

\begin{matrix} \min_{Θ, Θ_{a}} ∥ Y^{'} - A_{1} B_{1} Θ B_{2}^{T} A_{2}^{T} - A_{1} B_{a 1} Θ_{a} B_{a 2}^{T} A_{2}^{T} ∥_{2}^{2} + λ ∥ vec (Θ_{a}) ∥_{1}, \end{matrix}

(4)

where

Y'

is the compressed image (i.e.,

Y^{'} = A_{1} Y A_{2}^{T}

). It can be solved efficiently by using the algorithm proposed by Yan et al. (2017).

2.5. Discussion

2.5.1. Advantages of the Proposed CSSD/KronCSSD Methods.

In this section, we will give a brief discussion about the advantages of the proposed CSSD/KronCSSD methods. We define the compressive ratio as $c = \prod_{i = 1}^{d} p_{i} / n_{i}$ . The smaller the compressive ratio, the fewer data will be transmitted. The proposed CSSD/KronCSSD methods have the following characteristics.

We propose to directly acquire the compressed image, which not only reduces the sensing cost but also, reduces the data transmission and storage cost by $\prod_{i = 1}^{d} p_{i} / n_{i}$ times.
The smooth and sparse signal components can be recovered by solving a smaller-scale convex optimization problem with input data $\prod_{i = 1}^{d} p_{i} / n_{i}$ times smaller than that of the original problem, which significantly boosts the computation.

2.5.2. Compressive Ratio Selection in Practice.

Theorems 2.3 and 2.4 show that the 1D smooth plus sparse signal can be recovered with high probability from a compressive measurement if the compressive ratio is above a specific threshold. However, there are several parameters (such as $c_{1}, c_{2}$ ) that are difficult to obtain when calculating the threshold in practice. We propose a practical procedure of selecting the compressive ratio utilizing the historical data. Suppose some training signals with their real background and anomalies are available; the compressive ratio can be selected with the following guidelines.

If there is a requirement for reconstruction accuracy for smooth and sparse signal components, then $\hat{p}$ is chosen as the smallest value that satisfies such a requirement.
If there is no such requirement, we recommend choosing the compressive ratio corresponding to the sharp change point of the slope of the loss function-compressive ratio curve. This point exists because of the existence of such a threshold, after which the reconstruction with high probability is guaranteed by Theorems 2.3 and 2.4. We will demonstrate this in the simulation study in Section 3.1.

For high-order tensor data, the selection of the $p_{i}$ for each mode can be challenging. We provide some empirical guidelines as follows.

If the smoothness of the background is similar along different modes, a unified compressive ratio is recommended.
If the smoothness of the background is different, more sensing budget should be allocated to the mode along which the background is less smooth.
We recommend fixing the ratio among $p_{i}$ along different modes and adopting steps (i) and (ii) for 1D signal to determine the compressive ratio.

2.5.3. Tuning Parameter Selection in Practice.

Notice that in problem formulations (1) and (3), there is a hyperparameter $ϵ_{1}$ , indicating the bound for measurement noise. If the measurement error bound is known from the accuracy of the measurement device, it can be directly used here. Otherwise, a crossvalidation step using historical data is recommended. Similarly, there is a tuning parameter $λ$ in their corresponding Lagrangian form Equations (2) and (4), which controls the sparsity of the decomposed anomaly. Crossvalidation can be used to determine this parameter. For more detail, please refer to Yan et al. (2017).

2.5.4. Bases Selection in Practice.

The bases $B$ and $B_{a}$ are prespecified based on the understanding/engineering knowledge of the process. The selection of such bases is discussed in detail in section 3.4 of Yan et al. (2017). In general, any smooth basis, such as splines or kernels, can be used for the background. For sparse anomalies, such as small regions scattered over the background or in the form of thin lines, an identity basis is recommended. Linear (quadratic) B splines are recommended for anomalous regions with sharp corners (curved boundaries). However, to ensure the uniqueness of the smooth sparse decomposition, we do require the bases $B$ and $B_{a}$ to satisfy specific properties, such as those mentioned in Definitions 2.1 and 2.2 and Theorem 2.1, which should be checked for selected bases. We will demonstrate this in the simulation study in Section 3.1.

3. Simulation Study

In this section, we will demonstrate the proposed CSSD and KronCSSD framework with simulation studies. First, the CSSD method is applied on 1D signals in Section 3.1, and then, we apply the KronCSSD method on 2D images in Section 3.2.

3.1. CSSD on a 1D Signal

A 1D signal $(y \in R^{n})$ is assumed to be a superposition of the smooth signal component ( $m = B θ$ ), the sparse signal component $(a = B_{a} θ_{a})$ , and noise $e$ (i.e., $y = m + a + e$ ). To demonstrate the proposed CSSD framework, we will first conduct compressive data acquisition on the raw signal (i.e., $y' = Ay$ ). Then, the data reconstruction and decomposition are achieved in one step. The performance is evaluated by the relative error between the true signal $a$ and the reconstructed one $\hat{a}$ (i.e., ${∥ a - \hat{a} ∥_{2} / ∥ a ∥}_{2}$ and $∥ m - \hat{m} ∥_{2} / ∥ m ∥_{2}$ for the smooth background).

In the simulation study, we generate a 1D signal from $M S_{r, s, μ, l}$ and let $n = 1,000$ . The smooth background is generated from a random linear combination of B-spline bases with three knots $(r = 10)$ (i.e., $m = B θ$ , where $B \in R^{n \times r}$ and $θ \in R^{r}$ is a random vector such that $θ_{i} \sim N (0, 1)$ , $i \in {1, \dots, 4}$ ). The incoherence condition parameter $μ = μ (B) = 0.82$ . The sparse signal component is generated by a sparse random linear combination of degree 2 B-spline bases with 500 knots $(q = 500, l = 4)$ : that is, $a = B_{a} θ_{a}$ , where $B_{a} \in R^{n \times q}$ and $θ_{a} \in R^{q}$ is a four-sparse random vector ( $s = 4$ ) such that its nonzero elements follow i.i.d. standard normal distribution. Notice that the RIC, $δ_{B_{a}, s}$ for matrix $B_{a}$ , is hard to calculate in general. However, there is a loose upper bound that can be used, which is $δ_{B_{a}, s} \leq δ_{B_{a}, p} = \max {λ_{\max} - 1, 1 - λ_{\min}} = 0.60$ , where $λ_{\max}$ and $λ_{\min}$ are the maximum and minimum singular values of $B_{a}$ , respectively. The noise signal generated as a random vector $e \in R^{n}$ is a random vector such that $e_{i} \sim N (0, {0.001}^{2})$ , $i \in {1, \dots, n}$ . Figure 2 shows a sample raw signal and its smooth, sparse components.

Figure 2. (Color online) A Sample Raw Signal and Its Smooth and Sparse Components
*Notes*. (a) Raw signal. (b) Smooth signal component $m$ . (c) Sparse signal component $a$ .

Before we state the result, we first check the assumption of Theorems 2.3 and 2.4. For Theorem 2.3, it is easy to check that $0.82 = μ \leq \frac{n}{2 r s l} = \frac{1, 000}{2 \times 10 \times 4 \times 4} = 6.25$ and also, that $δ_{B_{a}, s} \leq 0.60 \in (0, 1)$ . For Theorem 2.4, $δ_{B_{a}, 2 s} \leq 0.60 \in (0, 1)$ , $c = 0.37 > 0$ , and $δ_{r, 3 s, μ, l} \in (0, 0.04)$ . Notice that the range for $δ_{r, 3 s, μ, l}$ is small in this case because of the loose upper bound for $δ_{B_{a}, 2 s}$ , which can be further improved. This indicates that the smooth and sparse signal can be recovered by the proposed algorithm with high probability provided that the compressive ratio is above a specific threshold, which is demonstrated by the following observation.

The simulation is repeated 100 times, and the average log relative error for the background and sparse signal components with respect to compressive ratio are shown in Figure 3. We can observe a large error at the beginning, and it drops very fast with an increase of the compressive ratio. Then, above a threshold (0.1 approximately) of compressive ratio, the error becomes small (below 3%), and the decrease of error becomes less significant, which demonstrates the effectiveness of the reconstruction algorithm. The threshold of 0.1 can be chosen as the compressive ratio mentioned in Section 2.5.2.

Figure 3. (Color online) Average Log Relative Reconstruction Error
*Notes*. (a) Log relative error for the smooth component. (b) Log relative error for the sparse component.

The reconstructed signal components in 1 of the 100 simulations are shown in Figure 4. We can observe that when adopting the compressive ratio of 0.1, both the smooth and sparse signal components can be reconstructed with high accuracy.

Figure 4. (Color online) Reconstructed Smooth and Sparse Signal Components
*Notes*. (a) Smooth signal component of compressive ratios 0.02 (left panel), 0.05 (center panel), and 0.1 (right panel). (b) Sparse signal component of compressive ratios 0.02 (left panel), 0.05 (center panel), and 0.1 (right panel).

We also vary the magnitude of sparse signals to examine the reconstruction performance of the proposed method. The result and analysis are provided in Appendix H.

3.2. CSSD on a 2D Image

In this simulation study, we aim to decompose an image into the smooth background, sparse anomalies, and noise. A $350 \times 350$ image with smooth background and the sparse anomaly is generated similar to Yan et al. (2017) (i.e., $Y = M + A + E$ , where $M$ is the smooth background, $A$ is the sparse anomalies, and $E$ is i.i.d. Gaussian noise such that $E_{i} \sim NID (0, σ^{2})$ ). The smooth background is generated from a linear combination of B-spline bases with $3 \times 3$ knots, and the anomalies are generated from a sparse linear combination of B-spline bases with $88 \times 88$ knots. The background and anomalies are shown in Figure 5.

**Figure 5. (Color online) True Background and Anomalies**

We can see that the anomaly size covers a large range. The mean absolute value of the background plus anomaly is $μ = 0.21$ . In this simulation, we first study the reconstruction performance under a noise-free scenario. Then, we increase the noise level to test the robustness of the algorithm.

3.2.1. Noise-Free Case.

In this section, we vary the compressive ratio from 4% to 100%. For each compressive ratio, we simulate 100 times and the average false-negative rate (FNR) and false-positive rate (FPR) are reported in Figure 6. The FPR is defined as the portion of normal pixels predicted as anomaly,

FPR = \frac{\sum_{i, j} (1 - I_{A \neq 0} {A (i, j)}) I_{\hat{A} \neq 0} {\hat{A} (i, j)}}{\sum_{i, j} (1 - I_{A \neq 0} {A (i, j)})},

and the FNR is defined as the portion of anomalous pixels predicted as normal background,

FNR = \frac{\sum_{i, j} I_{A \neq 0} {A (i, j)} (1 - I_{\hat{A} \neq 0} {\hat{A} (i, j)})}{\sum_{i, j} I_{A \neq 0} {A (i, j)}},

where

A

is the true anomaly,

\hat{A}

is the predicted anomaly, and

I_{Ω} (\cdot)

is the indicator function: that is,

I_{Ω} (x) = {\begin{matrix} 1, if x \in Ω \\ 0, otherwise . \end{matrix}

**Figure 6. (Color online) Average FNR and FPR**

From Figure 6, we can see that both the FPR and FNR ratios decrease as the compressive ratio increases. The FPR drops so fast that when the compressive ratio achieves 8%, there is no false alarm, which is desired for the anomaly detection algorithm. The FNR drops slower, and less than 5% of the anomaly pixels are ignored when the compressive ratio achieves 8%. However, it does not miss any cluster of the anomalies, even though some of them are small.

The recovered sparse signal components are shown in Figure 7 for compressive ratios 4% (Figure 7(a)), 8% (Figure 7(b)), 33% (Figure 7(c)), and 73% (Figure 7(d)). For comparison, we apply the SSD algorithm (Yan et al. 2017), of which the FNR and the FPR are zero and computation time is 0.16 seconds.

Figure 7. (Color online) The Recovered Sparse Signal Components
*Notes*. (a) Compressive ratio 4%. (b) Compressive ratio 8%. (c) Compressive ratio 33%. (d) Compressive ratio 73%.

We record the computation time in Figure 8. A significant boosting of the computation is observed when the compressive ratio is 8%, which speeds up the SSD algorithm by 4.3 times.

**Figure 8. (Color online) The Computation Time**

3.2.2. Noisy Case.

The signal-to-noise ratio $μ / ϵ \in [4, 40]$ is studied to evaluate the robustness of the algorithm. The three-dimensional plot of FPR, the signal-to-noise ratio, and the compressive ratio are shown in Figure 9.

**Figure 9. (Color online) The Contour Plot of FPR, Signal-to-Noise Ratio, and Compressive Ratio**

The purple line indicates the equipotential line of FPR = 0.01. We can see that with the increase of the signal-to-noise ratio, we are allowed to use a less compressive ratio to achieve satisfactory decomposition results. The majority of the area lies below the equipotential line of FPR = 0.01, which means that the proposed algorithm is robust.

4. Case Study

In this section, we use three real cases to demonstrate the effectiveness of the proposed CSSD/KronCSSD framework. For comparison, we also apply the SSD method in each case study. The compressive ratio ( $c$ ) and the average computational time ( $t$ ) for a single image are reported in Table 1. A significant transmission bandwidth reduction and computation boost can be observed with negligible performance degradation (Figures 10–12), compared with the vanilla SSD algorithm.

Table 1. Comparison Between KronCSSD and SSD

Table 1. Comparison Between KronCSSD and SSD

	Surface defect		Solar flare		Indentation
	$c$	$t$ /s	$c$	$t$ /s	$c$	$t$ /s
KronCSSD	54%	0.073	22%	0.034	48%	0.053
SSD	1	0.135	1	0.086	1	0.094

Figure 10. (Color online) Steel Rolling Images and Detected Anomalies
*Notes*. (a) Raw image. (b) KronCSSD result. (c) SSD result.

Figure 11. (Color online) Solar Activity Images and Detected Solar Flare
*Notes*. (a) Raw image. (b) KronCSSD result. (c) SSD result.

Figure 12. (Color online) Silicon Stress Map and Detected Indentation
*Notes*. (a) Raw image. (b) KronCSSD result. (c) SSD result.

4.1. Surface Defect Detection in a Steel Rolling Process

As mentioned in Section 1, vision sensors collect high-resolution images of the product surface with a high data acquisition rate in the rolling processes. This poses a challenge in data storage, transmission, and processing. One sample image of size $128 \times 512$ with typical anomalies is shown in Figure 10(a). The black scratches shown in the red block are surface anomalies. For a detailed description, the readers are encouraged to refer to Yan et al. (2018). The detected anomalies are shown in Figure 10, (b) and (c) by using the proposed KronCSSD algorithm and the SSD algorithm, respectively. The data set has 100 images, and more example results can be found in Appendix I.

The KronCSSD method achieves a similar anomaly detection performance with 54% bandwidth and is 1.8 times faster. By adopting the KronCSSD method, (i) we can achieve a faster anomaly detection and thus, reduce the loss through a timely intervention of the manufacturing process, and (ii) we can keep the manufacturing inspection information for a longer time period with the same storage capability, which is important for root cause analysis.

4.2. Solar Flare Detection

Another important application is solar flare detection from satellite images. A solar flare is defined as a sudden, transient, and intense variation in brightness over the sun’s surface. It has a significant influence on radio communication on Earth. Each second, thousands of high-resolution images are captured by a satellite, which poses a big challenge to real-time data transmission and processing (Yan et al. 2018). The data set has 300 images, and more example images can be found in Appendix I.

One sample image of size $232 \times 292$ with a typical solar flare is shown in Figure 11(a), where the yellowish bright region is the solar flare. The detected anomalies are shown in Figure 11, (b) and (c) by using the proposed KronCSSD and SSD algorithms, respectively. The KronCSSD method achieves a similar solar flare detection performance as the SSD method but with 22% bandwidth, and it is 2.5 times faster. Notice that the decomposed images can also be used for downstream tasks, such as control charts and so on, which are beyond the scope of this paper.

By adopting the KronCSSD method, we can improve the transmission rate under the same transmission bandwidth and thus, achieve almost five times faster solar flare detection, which is of vital importance for protecting radio communications, power grids, and navigation systems.

4.3. Silicon Surface Indentation Detection

The stress map of size $90 \times 550$ of a silicon surface laminate with surface indentation is shown in Figure 12, where clusters of high-stress areas indicate the surface indentation (Yan et al. 2017). We aim to detect those high-stress areas. The detected anomalies are shown in Figure 12, (b) and (c). We can see that the KronCSSD method achieves a similar detection performance with the SSD method but with 40% bandwidth, and it is 1.8 times faster, which significantly reduces the storage and transmission cost and improves the processing speed.

5. Conclusion

In this paper, we proposed a CSSD framework for efficient data acquisition, transmission, and processing for sparse anomaly detection in smooth backgrounds. To further enhance its computational efficiency, a KronCSSD framework is proposed for tensor data.

The contributions of this work are twofold. (i) Theoretically, we showed the feasibility of combining compressive sensing and smooth sparse decomposition. This enables the adoption of a compressive data acquisition approach. (ii) Practically, the proposed framework is compatible with many existing decomposition-based anomaly detection algorithms, such as SSD, ST-SSD, and so on, which achieve both a significant cost reduction in sensing, storage, and transmission and a boost in speed but with negligible loss in their performance.

In this article, we use a simulation study to demonstrate the effectiveness and robustness of the proposed CSSD/KronCSSD framework. Three case studies across different applications demonstrate the versatility of the proposed framework. The authors believe that the CSSD/KronCSSD framework can be applied in a wider range of applications toward more efficient data acquisition, transmission, and processing.

Further studies on the theoretical properties of the proposed KronCSSD can be a future direction.

Appendix A

Proof of Theorem 2.1.

We prove Theorem 2.1 by contradiction. Assume there exists two different decompositions for the same smooth plus sparse signal $y$ (i.e., $y = m_{1} + a_{1} = m_{2} + a_{2}$ , where $m_{1} \neq m_{2}$ and $a_{1} \neq a_{2}$ ). Then,

m_{1} - m_{2} = - a_{1} + a_{2} .

(A.1)

Because $m_{1} \neq m_{2}$ , we can normalize both side by $∥ m_{1} - m_{2} ∥_{2}$ and denote $\tilde{m} = (m_{1} - m_{2}) / {‖ m_{1} - m_{2} ‖}_{2}$ and $\tilde{a} = (- a_{1} + a_{2}) / {‖ m_{1} - m_{2} ‖}_{2}$ . Notice that $\tilde{m}$ is in the column space of $B$ , which is spanned by columns of the $U$ (recall that $B = U Σ V^{T}$ ; i.e., $\tilde{m} = Ux$ , where $x$ is the coefficient vector, $x \in R^{r}$ ). Because ${‖ \tilde{m} ‖}_{2} = 1$ , we have ${‖ x ‖}_{2} = 1$ . We can bound each element in $\tilde{m}$ as follows:

| {\tilde{m}}_{i} | = e_{i}^{T} Ux \leq ∥ e_{i}^{T} U ∥_{2} ∥ x ∥_{2} \leq \max_{i \in {1, .., r}} ∥ U^{T} e_{i} ∥_{2} \leq \sqrt{\frac{μ (B) r}{n}} < \sqrt{\frac{1}{2 l s}}, \forall i \in {1, \dots, n} .

According to Definition 2.1, $∥ a_{1} ∥_{0} \leq l s$ and $∥ a_{2} ∥_{0} \leq l s$ . Therefore, $∥ \tilde{a} ∥_{0} \leq 2 l s$ . Moreover, according to Equation (A.1), we conclude that $∥ \tilde{m} ∥_{0} \leq 2 l s$ . Denote the support of $\tilde{m}$ as $T_{\tilde{m}}$ ; we have $∥ \tilde{m} ∥_{2} = \sqrt{\sum_{_{i \in T_{\tilde{m}}}} {| {\tilde{m}}_{i} |}^{2}} < 1$ , which is a contradiction. □

Appendix B

Proof of Theorem 2.2.

We prove Theorem 2.2 with contradiction. Assume there exists another vector $y = B θ + B_{a} θ_{a} \in M S_{r, s, μ, l}$ such that $Ay = b$ and $y \neq y_{0}$ . Then, $z = y - y_{0} = B (θ - θ_{0}) + B_{a} (θ_{a} - θ_{a 0})$ is a nonzero vector. Because $∥ θ_{a} - θ_{a 0} ∥_{1} \leq ∥ θ_{a} ∥_{1} + ∥ θ_{a 0} ∥_{1} \leq 2 s$ , by Definition 2.3, we have that $z \in M S_{r, 2 s, μ, l}$ . Therefore, $0 = ∥ Az ∥_{2} \geq (1 - δ_{r, 2 s, μ, l}) ∥ z ∥_{2} > 0$ , which is a contradiction. Notice that the proof is inspired by the proof of lemma 3.1 in Candes and Tao (2005).

Appendix C

Proof of Theorem 2.3.

This proof is inspired by Baraniuk et al. (2008) and Tanner and Vary (2020).

We will first derive the RIC for a fixed subspace $M S_{r, T, μ, l}$ of $M S_{r, s, μ, l}$ when $θ_{a}$ is restricted in a fixed subspace $T$ with the fixed support such that the number of nonzero elements is $s$ :

\begin{array}{l} M S_{r, T, μ, l} = {y \in R^{n} | y = B θ + B_{a} θ_{a}, B_{a} \in R^{n \times q}, l (B_{a}) = l, θ_{a} \in T, \\ B \in R^{n \times r}, μ (B) = μ < n {(2 r s l)}^{- 1}, θ \in R^{r}} . \end{array}

Then, we use a covering argument that counts over all possible sparse subspaces $T$ with support less than or equal to $s$ . Finally, we can derive the RIC for $M S_{r, s, μ, l}$ .

The following lemma describes the RIC for a fixed subspace $M S_{r, T, μ, l}$ and is proved in Appendix E.

Lemma C.1.

RIC for a fixed subspace $M S_{r, T, μ, l}$ . Let $A \in R^{p \times n}$ be a matrix from the families described inDefinition 2.5. Furthermore, assume that $μ < n {(2 r s l)}^{- 1}$ and that the basis matrix $B_{a}$ for the sparse signal component satisfies the RIP with RIC $δ_{B_{a}, s} \in (0, 1)$ : that is, $δ_{B_{a}, s}$ is the smallest positive constant such that

(1 - δ_{B_{a}, s}) ∥ θ_{a} ∥_{2} \leq ∥ B_{a} θ_{a} ∥_{2} \leq (1 + δ_{B_{a}, s}) ∥ θ_{a} ∥_{2}, \forall θ_{a} \in {θ_{a} \in R^{q} | ∥ θ_{a} ∥_{0} \leq s} .

For a given $δ \in (0, 1)$ , there exists a constant $c_{0} > 0$ depending only on $δ$ , such that the RIC for $M S_{r, T, μ, l}$ is upper bounded by $δ$ with the probability of at least $1 - 2 {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s} e^{- p c_{0} (δ / 2)}$ , where $η = \sqrt{\frac{μ rsl}{n}}$ , $τ_{0} = \frac{1}{\sqrt{(1 - δ_{B_{a}, s}) (1 - η^{2})}}$ , $τ_{1} = ∥ B^{†} ∥_{2} (1 + \frac{1}{\sqrt{1 - η^{2}}})$ , and $B^{†} = {(B^{T} B)}^{- 1} B^{T}$ .

Notice that for a fixed subspace $M S_{r, T, μ, l}$ , the RIP will fail with probability less than or equal to $2 {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s} e^{- p c_{0} (δ / 2)} .$ Because there are $(\begin{matrix} n \\ s \end{matrix}) \leq {(\frac{e n}{s})}^{s}$ such subspaces, the probability to fail for $M S_{r, s, μ, l}$ , which is a combination of those $(\begin{matrix} n \\ s \end{matrix})$ subspaces, will be less than or equal to

(\begin{matrix} n \\ s \end{matrix}) 2 {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s} e^{- p c_{0} (\frac{δ}{2})} \leq \exp (- c_{0} (\frac{δ}{2}) p + \ln 2 + r \ln \frac{24}{δ} τ_{1} + s (1 + \ln \frac{24}{δ} τ_{0} + \ln \frac{n}{s})) .

Then, for any give $δ$ , there exist $c_{1}, c_{2} > 0$ , such that the probability to fail for $M S_{r, s, μ, l}$ is less than or equal to $\exp (- c_{1} p),$ provided that $p \geq c_{2} (\ln 2 + r \ln \frac{24}{δ} τ_{1} + s (1 + \ln \frac{24}{δ} τ_{0} + \ln \frac{n}{s})),$ where $c_{2} = {[c_{0} (\frac{δ}{2}) - c_{1}]}^{- 1} .$ This finishes the proof.

Appendix D

Proof of Theorem 2.4.

The proof of Theorem 2.4 is inspired by Candes et al. (2006) and Tanner and Vary (2020). Assume that in Problem (1), $ϵ_{1}$ is properly chosen such that Problem (1) is feasible. In the following discussion, we will use ${(\cdot)}^{*}$ to denote the optimal solution of Problem (1) and ${(\cdot)}_{0}$ to denote the signal we wish to recover. Let $R = X^{*} - X_{0} = R^{m} + R^{a}$ , where $R^{m} = m - m_{0} = B θ^{*} - B θ_{0}$ and $R^{a} = B_{a} θ_{a}^{*} - B_{a} θ_{a 0}$ are the residuals of the smooth and sparse signal components, respectively.

Let $h = θ_{a}^{*} - θ_{a 0} = h_{T_{0}} + h_{T_{0}^{c}}$ , where $T_{0}$ is the support of $θ_{a 0}$ , $h_{T_{0}}$ denotes the projection of $h$ onto $T_{0}$ such that

h_{T_{0}} (t) = {\begin{matrix} t, & if t \in T_{0} \\ 0, & otherwise, \end{matrix}

and

T_{0}^{c}

denotes the complementary set of

T_{0}

Because $θ_{a 0}$ is feasible and $θ_{a}^{*}$ is the optimal solution of Problem (1), we must have $∥ θ_{a}^{*} ∥_{1} \leq ∥ θ_{a 0} ∥_{1},$ which is equivalent to $∥ θ_{a 0} + h_{T_{0}} + h_{T_{0}^{c}} ∥_{1} \leq ∥ θ_{a 0} ∥_{1} .$ Because $T_{0}$ and $T_{0}^{c}$ are complementary to each other, we have $∥ θ_{a 0} + h_{T_{0}} ∥_{1} + ∥ h_{T_{0}^{c}} ∥_{1} \leq ∥ θ_{a 0} ∥_{1} .$ Because $∥ θ_{a 0} + h_{T_{0}} ∥_{1} \geq ∥ θ_{a 0} ∥_{1} - ∥ h_{T_{0}} ∥_{1}$ , we have $∥ θ_{a 0} ∥_{1} - ∥ h_{T_{0}} ∥_{1} + ∥ h_{T_{0}^{c}} ∥_{1} \leq ∥ θ_{a 0} ∥_{1} .$ Hence, $∥ h_{T_{0}^{c}} ∥_{1} \leq ∥ h_{T_{0}} ∥_{1} .$ Because $∥ h_{T_{0}} ∥_{1} \leq \sqrt{s} ∥ h_{T_{0}} ∥_{2}$ , we have

∥ h_{T_{0}^{c}} ∥_{1} \leq \sqrt{s} ∥ h_{T_{0}} ∥_{2} .

(D.1)

Similar to Candes et al. (2006), we order the elements of $T_{0}^{c}$ in decreasing order of their magnitude and enumerate $T_{0}^{c}$ as $v_{1}, \dots, v_{n - | T_{0} |}$ . Then, $T_{0}^{c}$ is divided into subsets $T_{i}^{c}$ of size $M$ , where

T_{i}^{c} = {v_{j} : (i - 1) M \leq j \leq i M} .

Let $h_{T_{i}^{c}}$ be the projection of $h$ onto $T_{i}^{c}$ ; we have

\begin{array}{l} ∥ h_{T_{i}^{c}} ∥_{0} \leq M, \forall i \geq 1 \\ T_{i}^{c} \cap T_{j}^{c} = Ø, \forall i \neq j \\ ∥ h_{T_{i + 1}^{c}} ∥_{2} \leq \frac{1}{\sqrt{M}} ∥ h_{T_{i}^{c}} ∥_{1}, \forall i \geq 1, \end{array}

(D.2)

where the last inequality comes from the fact that

T_{0}^{c}

is in decreasing order, such that

{| h_{T_{i + 1}^{c}} |}_{(v)} \leq \frac{1}{M} \sum_{j \in T_{i}^{c}} {| h_{T_{i}^{c}} |}_{(j)} . \forall v \in T_{i + 1}^{c} .

Define $R_{T_{i}^{c}}^{a} = B_{a} h_{T_{i}^{c}}$ and $R_{T_{0}}^{a} = B_{a} h_{T_{0}}$ , and combine Equations (D.1) and (D.2). Then, we have

\begin{array}{l} \sum_{j \geq 2} ∥ R_{T_{j}^{c}}^{a} ∥_{2} \leq \sum_{j \geq 2} \sqrt{1 + δ_{B_{a}, M}} ∥ h_{T_{j}^{c}} ∥_{2} \\ \leq_{(a)} \sum_{j \geq 1} \frac{\sqrt{1 + δ_{B_{a}, M}} ∥ h_{T_{j}^{c}} ∥_{1}}{\sqrt{M}} = \frac{\sqrt{1 + δ_{B_{a}, M}} ∥ h_{T_{0}^{c}} ∥_{1}}{\sqrt{M}} \\ \leq_{(b)} \frac{\sqrt{s} \sqrt{1 + δ_{B_{a}, M}} ∥ h_{T_{0}} ∥_{2}}{\sqrt{M}} \leq \frac{\sqrt{s} \sqrt{1 + δ_{B_{a}, M}} ∥ R_{T_{0}}^{a} ∥_{2}}{\sqrt{M} \sqrt{1 - δ_{B_{a}, M}}}, \end{array}

where (a) follows Equation (D.2), (b) follows Equation (D.1), and the RIP property of

B_{a}

is used because

∥ h_{T_{j}^{c}} ∥_{0} \leq M

Denote $γ = \sqrt{\frac{1 + δ_{B_{a}, M + s}}{1 - δ_{B_{a}, M + s}}}$ . (A tighter bound can be achieved by using $\sqrt{\frac{1 + δ_{B_{a}, M}}{1 - δ_{B_{a}, M}}}$ . However, we adopt $δ_{B_{a}, M + s}$ instead of $δ_{B_{a}, M}$ for simplicity in the following proof.) Then, we have

\underset{j \geq 2}{\sum^{​}} ∥ R_{T_{j}^{c}}^{a} ∥_{2} \leq \sqrt{\frac{s}{M}} γ ∥ R_{T_{0}}^{a} ∥_{2} .

(D.3)

Next, we derive the bounds for $R^{m}$ and $R^{a}$ .

Bound for $R^{m} :$

\begin{array}{l} ∥ A R^{m} ∥_{2}^{2} = | 〈 A R^{m}, A (R - R^{a}) 〉 | \\ = | 〈 A R^{m}, A (R - R^{a}) 〉 | \\ = | 〈 A R^{m}, AR 〉 + 〈 A R^{m}, - A R^{a} 〉 | \\ \leq | 〈 A R^{m}, AR 〉 | + | 〈 A R^{m}, - A R^{a} 〉 | \\ = | 〈 A R^{m}, AR 〉 | + | 〈 A R^{m}, - A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} + \sum_{j \geq 2} R_{T_{j}^{c}}^{a}) 〉 | \\ \leq | 〈 A R^{m}, AR 〉 | + | 〈 A R^{m}, A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) 〉 | + \sum_{j \geq 2} | 〈 A R^{m}, A R_{T_{j}^{c}}^{a} 〉 | . \end{array}

(D.4)

In the following discussion, we will bound those terms. According to the Cauchy–Schwarz inequality, the first term can be bounded as

| 〈 A R^{m}, AR 〉 | \leq ∥ A R^{m} ∥_{2} ∥ AR ∥_{2} \leq \sqrt{1 + δ_{r, s, μ, l}} ϵ_{1} ∥ R^{m} ∥_{2},

(D.5)

where the last inequality comes from the RIP property and the first constraint in Problem (1).

The third term can be bounded as follows. Denoting $z_{1} = R^{m} / ∥ R^{m} ∥_{2}$ and $z_{2} = R_{T_{j}^{c}}^{a} / ∥ R_{T_{j}^{c}}^{a} ∥_{2}$ , we have

\begin{array}{l} \frac{| 〈 A R^{m}, A R_{T_{j}^{c}}^{a} 〉 |}{∥ R^{m} ∥_{2} ∥ R_{T_{j}^{c}}^{a} ∥_{2}} = | 〈 A z_{1}, A z_{2} 〉 | = \frac{1}{4} | ∥ A (z_{1} + z_{2}) ∥_{2}^{2} - ∥ A (z_{1} - z_{2}) ∥_{2}^{2} | \\ \leq_{(a)} \frac{1}{4} \max {\begin{matrix} | (1 + δ_{r, M, μ, l}) ‖ z_{1} + z_{2} ‖_{2}^{2} - (1 - δ_{r, M, μ, l}) ‖ z_{1} - z_{2} ‖_{2}^{2} |, \\ | (1 + δ_{r, M, μ, l}) ‖ z_{1} - z_{2} ‖_{2}^{2} - (1 - δ_{r, M, μ, l}) ‖ z_{1} + z_{2} ‖_{2}^{2} | \end{matrix}} \\ = | δ_{r, M, μ, l} + 〈 z_{1}, z_{2} 〉 | \leq_{(b)} δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}, \end{array}

where

η_{1} = \sqrt{\frac{μ rMl}{n}};

inequality (a) follows the RIP property because

z_{1} + z_{2} \in M S_{r, M, μ, l}

and

z_{1} - z_{2} \in M S_{r, M, μ, l}

; and inequality (b) follows Equation (F.1) in the proof of Lemma E.1 and

∥ z_{1} ∥_{2} = ∥ z_{2} ∥_{2} = 1,

〈 z_{1}, z_{2} 〉 \leq \frac{η_{1}}{1 - η_{1}^{2}} ‖ z_{1} ‖_{2} ‖ z_{2} ‖_{2} = \frac{η_{1}}{1 - η_{1}^{2}}

, provided that

M \leq 2 s

Therefore,

| 〈 A R^{m}, A R_{T_{j}^{c}}^{a} 〉 | \leq (δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) ‖ R^{m} ‖_{2} ‖ R_{T_{j}^{c}}^{a} ‖_{2} .

(D.6)

Similarly, the second term can be bounded as

| 〈 A R^{m}, A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) 〉 | \leq (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R^{m} ‖_{2} ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2},

(D.7)

where

η_{2} = \sqrt{\frac{μ r (M + s) l}{n}}

, provided that

M \leq s

Plugging Equations (D.5)–(D.7) into Equation (D.4), we have

\begin{array}{l} ‖ A R^{m} ‖_{2}^{2} \leq ‖ R^{m} ‖_{2} (\sqrt{1 + δ_{r, s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} \\ + (δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) ‖ R_{T_{j}^{c}}^{a} ‖_{2}) \end{array}

\begin{array}{l} \leq_{(a)} ‖ R^{m} ‖_{2} (\sqrt{1 + δ_{r, s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} \\ + (δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) \sqrt{\frac{s}{M}} γ ‖ R_{T_{0}}^{a} ‖_{2}), \end{array}

where inequality (a) follows Equation (D.3).

According to the RIP property, we have

\begin{array}{l} (1 - δ_{r, s, μ, l}) ‖ R^{m} ‖_{2}^{2} \\ \leq ‖ R^{m} ‖_{2} (\sqrt{1 + δ_{r, s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} \\ + (δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) \sqrt{\frac{s}{M}} γ ‖ R_{T_{0}}^{a} ‖_{2}) . \end{array}

Consequently, we have

\begin{matrix} ‖ R^{m} ‖_{2} \leq \frac{\begin{array}{l} \sqrt{1 + δ_{r, s, μ, l}} ϵ_{1} + ((δ_{r, M, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) \sqrt{\frac{s}{M}} γ^{2} \\ + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}})) ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} \end{array}}{(1 - δ_{r, s, μ, l})}, \end{matrix}

(D.8)

where the inequality follows from

\begin{array}{l} ‖ R_{T_{0}}^{a} ‖_{2} = ‖ B_{a} h_{T_{0}} ‖_{2} \leq_{(a)} \sqrt{1 + δ_{B_{a}, s}} ‖ h_{T_{0}} ‖_{2} \leq_{(b)} \sqrt{1 + δ_{B_{a}, s}} ‖ h_{T_{0}} + h_{T_{1}^{c}} ‖_{2} \\ \leq_{(c)} \sqrt{\frac{1 + δ_{B_{a}, s}}{1 - δ_{B_{a}, M + s}}} ‖ B_{a} (h_{T_{0}} + h_{T_{1}^{c}}) ‖_{2} \leq γ ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2}, \end{array}

(D.9)

inequalities (a) and (c) follow the RIP property of

B_{a}

, and inequality (b) follows that

T_{0} \cap^{​} T_{1}^{c} = Ø .

Bound for $R^{a} :$

\begin{array}{l} ‖ A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) ‖_{2}^{2} = | 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} - R + R) 〉 | \\ = | 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), AR 〉 | + | 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), - A (R^{m} + \underset{j \geq 2}{\sum^{​}} R_{T_{j}^{c}}^{a}) 〉 | \\ \leq | 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), AR 〉 | + | 〈 A R^{m}, A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) 〉 | + \underset{j \geq 2}{\sum^{​}} | 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), A R_{T_{j}^{c}}^{a} 〉 | . \end{array}

(D.10)

In the following discussion, we will bound those terms. According to the Cauchy–Schwarz inequality, the first term can be bounded as

| 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), AR 〉 | \leq ‖ A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) ‖_{2} ‖ AR ‖_{2} \leq \sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2},

(D.11)

where the last inequality comes from the RIP property and the first constraint in Problem (1).

The third term can be bounded as follows. Denoting $z_{2} = R_{T_{j}^{c}}^{a} / ∥ R_{T_{j}^{c}}^{a} ∥_{2}, j \geq 2$ and $z_{3} = (R_{(T_{0})}^{a} + R_{(T_{1}^{c})}^{a}) / ‖ R_{(T_{0})}^{a} + R_{(T_{1}^{c})}^{a} ‖_{2},$ we have

\begin{array}{l} \frac{| 〈 A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}), A R_{T_{j}^{c}}^{a} 〉 |}{‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} ∥ R_{T_{j}^{c}}^{a} ∥_{2}} = | 〈 A z_{3}, A z_{2} 〉 | \\ = \frac{1}{4} | ‖ A (z_{3} + z_{2}) ‖_{2}^{2} - ‖ A (z_{3} - z_{2}) ‖_{2}^{2} | \\ \leq_{(a)} \frac{1}{4} \max {\begin{matrix} | (1 + δ_{r, 2 M + s, μ, l}) ‖ z_{3} + z_{2} ‖_{2}^{2} - (1 - δ_{r, 2 M + s, μ, l}) ‖ z_{3} - z_{2} ‖_{2}^{2} |, \\ | (1 + δ_{r, M + 2 s, μ, l}) ‖ z_{3} - z_{2} ‖_{2}^{2} - (1 - δ_{r, 2 M + s, μ, l}) ‖ z_{3} + z_{2} ‖_{2}^{2} | \end{matrix}} \\ = | δ_{r, 2 M + s, μ, l} + 〈 z_{3}, z_{2} 〉 | \\ \leq_{(b)} δ_{r, 2 M + s, μ, l}, \end{array}

where inequality (a) follows the RIP property because

z_{3} + z_{2} \in M S_{r, 2 M + s, μ, l}

and

z_{3} - z_{2} \in M S_{r, 2 M + s, μ, l}

. Inequality (b) comes from that

T_{i}^{c} \cap^{​} T_{j}^{c} = Ø, \forall i \neq j

and

T_{0} \cap T_{i}^{c} = Ø, \forall i

Therefore,

| 〈 A R^{m}, A R_{T_{j}^{c}}^{a} 〉 | \leq δ_{r, 2 M + s, μ, l} {‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖}_{2} ∥ R_{T_{j}^{c}}^{a} ∥_{2} .

(D.12)

Plugging Equations (D.7), (D.11), and (D.12) into Equation (D.10), we have

\begin{array}{l} {‖ A (R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a}) ‖}_{2}^{2} \\ \leq {‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖}_{2} (\sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ∥ R^{m} ∥_{2} + δ_{r, 2 M + s, μ, l} \sum_{j \geq 2} ∥ R_{T_{j}^{c}}^{a} ∥_{2}) \\ \leq_{(a)} ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} (\sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R^{m} ‖_{2} + δ_{r, 2 M + s, μ, l} \sqrt{\frac{s}{M}} γ ‖ R_{T_{0}}^{a} ‖_{2}) \\ \leq_{(b)} ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} (\sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R^{m} ‖_{2} \\ + δ_{r, 2 M + s, μ, l} \sqrt{\frac{s}{M}} γ^{2} ‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖), \end{array}

where inequalities (a) and (b) follows the same argument as deriving Equation (D.8).

According to the RIP property, we have

\begin{array}{l} (1 - δ_{r, M + s, μ, l}) ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2}^{2} \\ \leq ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2} (\sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R^{m} ‖_{2} + δ_{r, 2 M + s, μ, l} \sqrt{\frac{s}{M}} γ^{2} ‖ R_{T_{0}}^{a} ‖_{2}) . \end{array}

Consequently, we have

\begin{array}{l} ‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖_{2} \\ \leq \frac{\begin{array}{l} (\sqrt{1 + δ_{r, M + s, μ, l}} ϵ_{1} + (δ_{r, M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}) ‖ R^{m} ‖_{2} \\ + δ_{r, 2 M + s, μ, l} \sqrt{\frac{s}{M}} γ^{2} ‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖) \end{array}}{(1 - δ_{r, M + s, μ, l})} . \end{array}

(D.13)

Notice that Equations (D.8) and (D.13) still hold if we relax $δ_{r, M + s, μ, l}, δ_{r, s, μ, l}$ to $δ_{r, 2 M + s, μ, l}$ . For simplicity, here we replace $δ_{r, M + s, μ, l}, δ_{r, s, μ, l}$ with $δ_{r, 2 M + s, μ, l}$ in the following derivation.

Plugging Equation (D.8) into Equation (D.13) and letting $x \equiv ‖ R_{T_{0}}^{a} + R_{T_{1}^{c}}^{a} ‖_{2}, y \equiv ‖ R^{m} ‖_{2}$ , we have

(D_{1} - \frac{B_{1} B_{2}}{D_{2}} - C_{1}) x \leq A_{1} ϵ_{1} + \frac{B_{1}}{D_{2}} A_{2} ϵ_{1},

(D.14)

where

A_{1} = \sqrt{1 + δ_{r, 2 M + s, μ, l}}, B_{1} = (δ_{r, 2 M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}), C_{1} = δ_{r, 2 M + s, μ, l} \sqrt{\frac{s}{M}} γ^{2}, D_{1} = (1 - δ_{r, 2 M + s, μ, l}),

and

A_{2} = \sqrt{1 + δ_{r, 2 M + s, μ, l}}, B_{2} = (δ_{r, 2 M + s, μ, l} + \frac{η_{1}}{1 - η_{1}^{2}}) \sqrt{\frac{s}{M}} γ^{2} + (δ_{r, 2 M + s, μ, l} + \frac{η_{2}}{1 - η_{2}^{2}}), D_{2} = (1 - δ_{r, 2 M + s, μ, l}) .

Here, we require $D_{1} - \frac{B_{1} B_{2}}{D_{2}} - C_{1} > 0$ in Equation (D.14) and let $M = s$ , which is

1 - γ^{2} α_{1} α_{2} - α_{2}^{2} - ((1 + α_{1} + α_{2}) γ^{2} + 2 α_{2} + 2) δ_{r, 3 s, μ, l} > 0 .

(D.15)

Let $a = (1 + α_{1} + α_{2}) γ^{2} + 2 α_{2} + 2$ and c $= 1 - γ^{2} α_{1} α_{2} - α_{2}^{2}$ , where $α_{1} = \frac{η}{1 - η^{2}}$ , $α_{2} = \frac{\sqrt{2} η}{1 - 2 η^{2}}$ , $γ = \sqrt{\frac{1 + δ_{B_{a}, 2 s}}{1 - δ_{B_{a}, 2 s}}}$ , and $η = \sqrt{\frac{μ rsl}{n}}$ . If $c > 0,$ then there exist a $δ_{r, 3 s, μ, l} > 0$ such that Equation (D.15) is valid. Then, the denominator $- a δ_{r, 3 s, μ, l} + c > 0 \forall δ_{r, 3 s, μ, l} \in (0, c / a) .$ Consequently,

‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖_{2} \leq \frac{(1 + α_{2}) \sqrt{1 + δ_{r, 3 s, μ, l}}}{c - a δ_{r, 3 s, μ, l}} ϵ_{1} .

Notice that from Equations (D.1) and (D.9), we have

\sum_{j \geq 2} ‖ R_{T_{j}^{c}}^{a} ‖_{2} \leq γ ‖ R_{T_{0}}^{a} ‖_{2} \leq γ^{2} ‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖_{2} .

Therefore, $‖ R^{a} ‖_{2} \leq ‖ R_{T_{0}}^{a} ‖ + ‖ R_{T_{1}^{c}}^{a} ‖_{2} + \sum_{j \geq 2} ‖ R_{T_{j}^{c}}^{a} ‖_{2} \leq C_{a} ϵ_{1},$ where

C_{a} = \frac{(1 + γ^{2}) (1 + α_{2}) \sqrt{1 + δ_{r, 3 s, μ, l}}}{c - a δ_{r, 3 s, μ, l}} .

Similarly, we can bound $‖ R^{m} ‖_{2}$ as $‖ R^{m} ‖_{2} \leq C_{m} ϵ_{1},$ where

C_{m} = \frac{\sqrt{1 + δ_{r, 3 s, μ, l}} + (δ_{r, 3 s, μ, l} + \frac{γ^{2}}{1 + γ^{2}} α_{1} + \frac{1}{1 + γ^{2}} α_{2}) C_{a}}{(1 - δ_{r, 3 s, μ, l})} . □

Appendix E

Proof of Lemma C.1.

In this section, we will provide the proof for Lemma C.1. By linearity of the measurement matrix $A$ , without loss of generality, it is enough to prove this lemma when $‖ y_{2} ‖ = 1$ . The proof mainly has two steps. First, the bounds for $θ_{a}$ and $θ$ are derived, and a finite set of points to approximate the set $M S_{r, T, μ, l}$ to any accuracy in norm 2 sense can be found. Then, the concentration inequality can be applied through a union bound. This is a common approach in compressive sensing literature (Baraniuk et al. 2008, Tanner and Vary 2020).

To derive the upper bounds for $θ_{a}$ and $θ$ , we first derive the upper bounds for the signals $m = B θ$ and $a = B_{a} θ_{a}$ , which are given in the following lemma.

Lemma E.1.

The smooth signal component $m$ and sparse signal component $a$ of the signal $y$ in $M S_{r, T, μ, l}$ with $μ < \frac{n}{2 r s l}$ can be bounded as follows:

‖ m ‖_{2} = ‖ B θ ‖_{2} \leq \frac{‖ y ‖_{2}}{\sqrt{1 - η^{2}}},

(E.1)

‖ a ‖_{2} = ‖ B_{a} θ_{a} ‖_{2} \leq \frac{‖ y ‖_{2}}{\sqrt{1 - η^{2}}},

(E.2)

where

η = \sqrt{\frac{μ rsl}{n}}

The proof is presented in Appendix F.

According to the RIP for $B_{a}$ , we have

\sqrt{(1 - δ_{B_{a}, s})} ‖ θ_{a} ‖_{2} \leq ‖ B_{a} θ_{a} ‖_{2} \leq \sqrt{(1 + δ_{B_{a}, s})} ‖ θ_{a} ‖_{2} .

(E.3)

Combining Equations (E.2) and (E.3), we have

‖ θ_{a} ‖_{2} \leq \frac{1}{\sqrt{(1 - δ_{B_{a}, s})}} ‖ B_{a} θ_{a} ‖_{2} \leq \frac{‖ y ‖_{2}}{\sqrt{(1 - δ_{B_{a}, s}) (1 - η^{2})}} = \frac{1}{\sqrt{(1 - δ_{B_{a}, s}) (1 - η^{2})}} .

Denoting $τ_{0} = \frac{1}{\sqrt{(1 - δ_{B_{a}, s}) (1 - η^{2})}}$ , we have $‖ θ_{a} ‖_{2} \leq τ_{0} .$ Recall that $y = B θ + B_{a} θ_{a}$ ; we have $θ = B^{†} (y - B_{a} θ_{a}),$ where $B^{†} = {(B^{T} B)}^{- 1} B^{T}$ . Therefore, according to the triangle inequality and the Cauchy–Schwarz inequality, we have

‖ θ ‖_{2} \leq ‖ B^{†} ‖_{2} (‖ y ‖_{2} + ‖ B_{a} θ_{a} ‖_{2}) \leq ‖ B^{†} ‖_{2} (1 + \frac{1}{\sqrt{1 - η^{2}}}) .

Denoting $τ_{1} = ‖ B^{†} ‖_{2} (1 + \frac{1}{\sqrt{1 - η^{2}}})$ , we have $‖ θ ‖_{2} \leq τ_{1} .$

Because we have derived the bounds for $θ_{a}$ and $θ$ , the covering number of $M S_{r, T, μ, l}$ is given by the following lemma whose proof is in Appendix G.

Lemma E.2.

There exists a set $Q \in M S_{r, T, μ, l}$ , such that for all $y \in M S_{r, T, μ, l}$ , with $‖ y ‖_{2} = 1$ we have $\min_{q \in Q} ‖ q - y ‖_{2} \leq \frac{δ}{4}$ and $| Q | \leq {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s}$ , where $| Q |$ is its cardinality.

Next, we will prove the main result by applying the concentration inequality (Definition 2.5(ii)) with union bound. Let $ϵ = δ / 2$ ,

(1 - \frac{δ}{2}) ‖ q ‖_{2}^{2} \leq ‖ A q ‖_{2}^{2} \leq (1 + \frac{δ}{2}) ‖ q ‖_{2}^{2} \forall q \in Q,

(E.4)

with probability greater than

1 - 2 | Q | e^{- p c_{0} (δ / 2)}

Because $δ \in (0, 1),$ we have $1 - \frac{δ}{2} \leq \sqrt{1 - \frac{δ}{2}}$ and $\sqrt{1 + \frac{δ}{2}} \leq 1 + \frac{δ}{2}$ . Then, Equation (E.4) can be written as

(1 - \frac{δ}{2}) ‖ q ‖_{2} \leq ‖ A q ‖_{2} \leq (1 + \frac{δ}{2}) ‖ q ‖_{2} \forall q \in Q,

(E.5)

with probability greater than

1 - 2 | Q | e^{- p c_{0} (δ / 2)}

By the triangle inequality, we have

‖ A y ‖_{2} \leq ‖ A {(y - q) ‖}_{2} + ‖ A q ‖_{2} .

(E.6)

Define

U = \max_{y \in M S_{r, T, μ, l}, ‖ y ‖_{2} = 1} ‖ A y ‖_{2},

(E.7)

which is attainable because

M S_{r, T, μ, l}

is closed.

Combining Equations (E.5) and (E.6), we have $\forall y \in M S_{r, T, μ, l}, with ‖ y ‖_{2} = 1$ ; there exists a $q \in M S_{r, T, μ, l}$ , such that $‖ A y ‖_{2} \leq ‖ A {(q - y) ‖}_{2} + (1 + \frac{δ}{2}) ‖ q ‖_{2},$ with probability greater than $1 - 2 | Q | e^{- p c_{0} (δ / 2)}$ .

Because $q - y \in M S_{r, T, μ, l}$ , if $q - y = 0$ , we have $‖ A y ‖_{2} \leq (1 + \frac{δ}{2}) ‖ q ‖_{2} = 1 + \frac{δ}{2} .$

If $q - y \neq 0$ , we have $‖ A y ‖_{2} \leq ‖ A {\frac{(q - y)}{‖ q - y ‖_{2}} ‖}_{2} ‖ q - y ‖_{2} + (1 + \frac{δ}{2}) ‖ q ‖_{2} \leq \frac{δ}{4} U + 1 + \frac{δ}{2} .$

Notice that the second inequality comes from Equation (E.7) combined with $Q$ being a $\frac{δ}{4}$ covering of $M S_{r, T, μ, l}$ . In summary, we have $‖ A y ‖_{2} \leq \frac{δ}{4} U + 1 + \frac{δ}{2} .$

Because $U$ is attainable, according to Equation (E.7), we have $U \leq \frac{δ}{4} U + 1 + \frac{δ}{2} .$ Consequently, we have $U \leq 1 + \frac{3}{4 - δ} δ \leq 1 + δ$ because $δ < 1$ . Therefore, $‖ A y ‖_{2} \leq 1 + δ,$ with probability greater than $1 - 2 | Q | e^{- p c_{0} (δ / 2)}$ .

Similarly, we can prove that $‖ A y ‖_{2} \geq 1 - δ$ with probability greater than $1 - 2 | Q | e^{- p c_{0} (δ / 2)}$ .

Finally, according to Lemma E.2, we have that

1 - 2 | Q | e^{- p c_{0} (\frac{δ}{2})} \geq 1 - 2 {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s} e^{- p c_{0} (\frac{δ}{2})} .

This finishes the proof.

Appendix F

Proof of Lemma E.1.

To prove the result, we first derive a nontrivial upper bound for the inner produce between $m$ and $a$ . Let $B = U Σ V^{T}$ be the reduced SVD of $B$ ; then,

\begin{array}{l} | m^{T} a | = | θ^{T} B^{T} a | = | θ^{T} V Σ U^{T} a | = | θ^{T} V Σ U^{T} \sum_{i}^{n} a_{i} e_{i} | = | θ^{T} V Σ \sum_{i}^{n} a_{i} U^{T} e_{i} | \\ \leq ‖ θ^{T} V Σ ‖_{2} ‖ \sum_{i}^{n} a_{i} U^{T} e_{i}_{2} \\ \leq ‖ θ^{T} V Σ ‖_{2} \sum_{i}^{n} | a_{i} | ‖ U^{T} e_{i} ‖_{2} \\ \leq ‖ θ^{T} V Σ ‖_{2} ‖ \sum_{i}^{n} | a_{i} | \max_{j \in {1, \dots r}} ‖ U^{T} e_{j} ‖_{2} \\ \leq_{(a)} ‖ θ^{T} V Σ U^{T} ‖_{2} ‖ a ‖_{1} \sqrt{\frac{μ r}{n}} \\ \leq_{(b)} \sqrt{\frac{μ rsl}{n}} ‖ m ‖_{2} ‖ a ‖_{2}, \end{array}

(F.1)

where inequality (a) follows

‖ θ^{T} V Σ ‖_{2} = ‖ θ^{T} V Σ U^{T} ‖_{2}

because

U^{T} U = I

and Definition 2.2. Inequality (b) follows from

‖ a ‖_{1} \leq \sqrt{l s} ‖ a ‖_{2}

Let $η = \sqrt{\frac{μ rsl}{n}}$ because $μ < \frac{n}{rsl}$ ; we have $η < 1$ and

| m^{T} a | = \frac{| ‖ y ‖_{2}^{2} - ‖ m ‖_{2}^{2} - ‖ a ‖_{2}^{2} |}{2} \leq η ‖ m ‖_{2} a_{2} .

Therefore, we have

‖ m ‖_{2}^{2} + ‖ a ‖_{2}^{2} - ‖ y ‖_{2}^{2} \leq 2 η ‖ m ‖_{2} ‖ a ‖_{2} .

By completing the square, we have

{(‖ m ‖_{2} + η ‖ a ‖_{2})}^{2} + (1 - η^{2}) ‖ a ‖_{2}^{2} - ‖ y ‖_{2}^{2} \leq 0 .

Because ${(‖ m ‖_{2} + η ‖ a ‖_{2})}^{2} \geq 0$ , we have

‖ a ‖_{2} \leq \frac{1}{\sqrt{(1 - η^{2})}} ‖ y ‖_{2} .

Similarly, we can derive that

‖ m ‖_{2} \leq \frac{1}{\sqrt{(1 - η^{2})}} ‖ y ‖_{2} .

This finishes the proof.

Appendix G

Proof of Lemma E.2.

We first state results for the covering number of a set (Vershynin 2018). The covering number of a smallest $ϵ$ net for a unit $l_{2}$ norm ball in $d$ -dimensional space is ${(3 / ϵ)}^{d}$ .

Let $M = {m \in R^{n} | m = B θ, θ \in R^{r}, ‖ θ ‖_{2} \leq τ_{1}, μ (B) = μ}$ and $S = {a \in R^{n} | a = B_{a} θ_{a}, θ_{a} \in T, ‖ θ_{a} ‖_{2} \leq τ_{0}, l (B_{a}) = l}$ . There exist two finite $\frac{δ}{8}$ covering sets of $M$ and $S$ , which are $Q_{M} \subseteq M$ and $Q_{S} \subseteq S$ .

For all $q_{M} \in Q_{M}$ and for all $m \in M$ , we have

\min_{q_{M} \in Q_{M}} ‖ m - q_{M} ‖_{2} \leq \frac{δ}{8};

for all

q_{S} \in Q_{s}

and for all

a \in S

, we have

\min_{q_{S} \in Q_{s}} ‖ a - q_{S} ‖_{2} \leq \frac{δ}{8} .

Therefore, we have $| Q_{M} | \leq {(\frac{24}{δ} τ_{1})}^{r}$ and $| Q_{s} | \leq {(\frac{24}{δ} τ_{0})}^{s} .$

Define $Q_{M S} = {q_{M} + q_{S} | q_{M} \in Q_{M}, q_{S} \in Q_{S}} \subseteq M S_{r, T, μ, l}$ . Then, $\forall y \in M S_{r, T, μ, l}$ , there exists a pair $q_{M S} = q_{M} + q_{S} \in M S_{r, T, μ, l}$ , such that

‖ q_{M S} - y ‖_{2} = ‖ q_{M} - m + q_{S} - a ‖_{2} \leq ‖ q_{M} - m ‖_{2} + ‖ q_{S} - a ‖_{2} \leq \frac{δ}{4} .

Therefore, $Q_{M S}$ is a $δ / 4$ covering of $M S_{r, T, μ, l}$ and $| Q_{M S} | \leq {(\frac{24}{δ} τ_{1})}^{r} {(\frac{24}{δ} τ_{0})}^{s} .$ This finishes the proof.

Appendix H. Simulation Study by Varying the Magnitude of Sparse Signals

We adopt the same simulation data generation procedure as in Section 3.1 while changing the distribution of elements in $θ_{a} \in R^{q}$ such that the nonzero elements follow i.i.d. normal distribution with mean zero and standard deviation $σ_{s}$ in the range of ${0.065, 0.125, 0.25, 0.5}$ . The example signals and reconstruction performance are shown in Figure H.1.

Figure H.1. (Color online) Simulation Study by Varying $σ_{s}$
*Notes*. (a)–(d) Example signals corresponding to different $σ_{s}$ , (e) log relative error for the smooth component, and (f) log relative error for the sparse component. (a) $σ_{s} = 0.065$ . (b) $σ_{s} = 0.125$ . (c) $σ_{s} = 0.25$ . (d) $σ_{s} = 0.5$ . (e) Log relative error for the smooth component. (f) Log relative error for the sparse component

There are several observations.

The log relative error of the smooth component does not change with different magnitudes of $σ_{s}$ . This agrees with the theoretical result in Theorem 2.4, where the reconstruction error is bounded by a constant times the noise bound because the noise level is kept the same in all simulations.
The log relative error of the sparse component decreases as $σ_{s}$ increases. This also agrees with the theoretical result in Theorem 2.4. Because the log relative error of the sparse component is defined as $\log ‖ a - \hat{a} ‖_{2} / ‖ a ‖_{2}$ , according to Theorem 2.4, the reconstruction error term can be approximated by $C_{a} ϵ_{1}$ (i.e., $‖ a - \hat{a} ‖_{2} \sim C_{a} ϵ_{1}$ , where $C_{a}$ is independent of $θ_{a}$ ). $‖ a ‖_{2} = ‖ B_{a} θ_{a} ‖_{2}$ increases as the magnitude of elements in $θ_{a}$ increases. Therefore, $\log ‖ a - \hat{a} ‖_{2} / ‖ a ‖_{2}$ will decrease as $σ_{s}$ increases.
The 0.1 threshold (approximately) of the compressive ratio still holds with different magnitudes of sparse signal component.

Appendix I. Sample Case Study Images

Appendix I.1. Sample Images of Case Study 4.1

See Figure I.1.

Figure I.1. Sample Images of Surface Defect Detection in the Steel Rolling Process
*Notes*. The top row shows raw images. The middle row shows corresponding SSD results. The bottom row shows KronCSSD results.

Appendix I.2. Sample Images of Case Study 4.2

See Figure I.2.

Figure I.2. (Color online) Sample Images of Solar Flare Detection
*Notes*. The top row shows raw images. The middle row shows corresponding SSD results. The bottom row shows KronCSSD results.

References

Augusto CRA, Fauth AC, Navia CE, Shigeouka H, Tsui KH (2011) Connection among spacecrafts and ground level observations of small solar transient events. Experiment. Astronomy 31(2):177–197.Google Scholar
Baraniuk R, Davenport M, DeVore R, Wakin M (2008) A simple proof of the restricted isometry property for random matrices. Constructive Approximation 28(3):253–263.Google Scholar
Bouwmans T, Zahzah EH (2014) Robust PCA via principal component pursuit: A review for a comparative evaluation in video surveillance. Comput. Vision Image Understanding 122:22–34.Google Scholar
Candes EJ (2008) The restricted isometry property and its implications for compressed sensing. Competus Rendus Math. 346(9–10):589–592.Google Scholar
Candes EJ, Tao T (2005) Decoding by linear programming. IEEE Trans. Inform. Theory 51(12):4203–4215.Google Scholar
Candes EJ, Romberg JK, Tao T (2006) Stable signal recovery from incomplete and inaccurate measurements. Comm. Pure Appl. Math. 59(8):1207–1223.Google Scholar
Candès EJ, Li X, Ma Y, Wright J (2011) Robust principal component analysis? J. ACM 58(3):1–37.Google Scholar
De Boor C, De Boor C (1978) A Practical Guide to Splines, vol. 27 (Springer-Verlag, New York).Google Scholar
Duarte MF, Baraniuk RG (2011) Kronecker compressive sensing. IEEE Trans. Image Processing 21(2):494–504.Google Scholar
Eilers PH, Marx BD (1996) Flexible smoothing with B-splines and penalties. Statist. Sci. 11(2):89–121.Google Scholar
Giannakis GB, Mateos G, Farahmand S, Kekatos V, Zhu H (2011) USPACOR: Universal sparsity-controlling outlier rejection. Tichavsky P, Cernocky H, Prochazka A, eds. 2011 IEEE Internat. Conf. Acoustics Speech Signal Processing (IEEE, New York), 1952–1955.Google Scholar
Kolda TG, Bader BW (2009) Tensor decompositions and applications. SIAM Rev. 51(3):455–500.Google Scholar
Mardani M, Mateos G, Giannakis GB (2013) Recovery of low-rank plus compressed sparse matrices with application to unveiling traffic anomalies. IEEE Trans. Inform. Theory 59(8):5186–5205.Google Scholar
Marques EC, Maciel N, Naviner L, Cai H, Yang J (2018) A review of sparse recovery algorithms. IEEE Access 7:1300–1322.Google Scholar
Mateos G, Giannakis GB (2011) Robust nonparametric regression by controlling sparsity. 2011 IEEE Internat. Conf. Acoustics Speech Signal Processing (ICASSP).Google Scholar
Minaee S, Abdolrashidi A, Wang Y (2015) Screen content image segmentation using sparse-smooth decomposition. 2015 49th Asilomar Conf. Signals Systems Comput.Google Scholar
Mou S, Wang A, Zhang C, Shi J (2021) Additive tensor decomposition considering structural data information. IEEE Trans. Automation Sci. Engrg. 19(4):2904–2917.Google Scholar
Rani M, Dhok SB, Deshmukh RB (2018) A systematic review of compressive sensing: Concepts, implementations and applications. IEEE Access 6:4875–4894.Google Scholar
Recht B, Fazel M, Parrilo PA (2010) Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization. SIAM Rev. 52(3):471–501.Google Scholar
Tanner J, Vary S (2020) Compressed sensing of low-rank plus sparse matrices. Preprint, submitted July 18, https://arxiv.org/abs/2007.09457v1.Google Scholar
Unser M (1999) Splines: A perfect fit for signal and image processing. IEEE Signal Processing Magazine 16(6):22–38.Google Scholar
Vershynin R (2018) High-Dimensional Probability: An Introduction with Applications in Data Science, vol. 47 (Cambridge University Press, Cambridge, United Kingdom).Google Scholar
Wang A, Xian X, Tsung F, Liu K (2018) A spatial-adaptive sampling procedure for online monitoring of big data streams. J. Quality Tech. 50(4):329–343.Google Scholar
Waters AE, Sankaranarayanan AC, Baraniuk RG (2011) SpaRCS: Recovering low-rank and sparse matrices from compressive measurements. Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira F, Weinberger KQ, eds. Conf. Neural Inform. Processing Systems (Curran Associates Inc., Red Hook, NY), 1089–1097.Google Scholar
Xu H, Caramanis C, Sanghavi S (2012) Robust PCA via outlier pursuit. IEEE Trans. Inform. Theory 58(5):3047–3064.Google Scholar
Yan H, Paynabar K, Shi J (2017) Anomaly detection in images with smooth background via smooth-sparse decomposition. Technometrics 59(1):102–114.Google Scholar
Yan H, Paynabar K, Shi J (2018) Real-time monitoring of high-dimensional functional data streams via spatio-temporal smooth sparse decomposition. Technometrics 60(2):181–197.Google Scholar

cover image INFORMS Journal on Data Science

Volume 2, Issue 1

April-June 2023

Pages 1-98, C2

Article Information

Supplemental Material

Metrics

Information

Received:January 11, 2022
Accepted:September 02, 2022
Published Online:November 07, 2022

Cite as

Shancong Mou, Jianjun Shi (2022) Compressed Smooth Sparse Decomposition. INFORMS Journal on Data Science 2(1):60-80.

https://doi.org/10.1287/ijds.2022.0023

Keywords

PDF download

Available Issues

Available Issues

Compressed Smooth Sparse Decomposition

Abstract

1. Introduction

2. Compressed Smooth Sparse Decomposition Framework

2.1. The Set of Smooth Plus Sparse Signals

2.2. Compressive Sensing for Smooth Plus Sparse Signals

2.3. Compressed Smooth Sparse Decomposition

2.4. Kronecker Compressed Smooth Sparse Decomposition

2.5. Discussion

2.5.1. Advantages of the Proposed CSSD/KronCSSD Methods.

2.5.2. Compressive Ratio Selection in Practice.

2.5.3. Tuning Parameter Selection in Practice.

2.5.4. Bases Selection in Practice.

3. Simulation Study

3.1. CSSD on a 1D Signal

3.2. CSSD on a 2D Image

3.2.1. Noise-Free Case.

3.2.2. Noisy Case.

4. Case Study

4.1. Surface Defect Detection in a Steel Rolling Process

4.2. Solar Flare Detection

4.3. Silicon Surface Indentation Detection

5. Conclusion

Appendix A

Appendix B

Appendix C

Appendix D

Appendix E

Appendix F

Appendix G

Appendix H. Simulation Study by Varying the Magnitude of Sparse Signals

Appendix I. Sample Case Study Images

Appendix I.1. Sample Images of Case Study 4.1

Appendix I.2. Sample Images of Case Study 4.2

References

Volume 2, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords