Open Access

On Solving MAX-SAT Using Sum of Squares

Lennart Sinjorgo
Corresponding Author
[email protected]
https://orcid.org/0000-0003-0516-6360
CentER, Department of Econometrics and Operations Research, Tilburg University, 5037 AB Tilburg, Netherlands
Renata Sotirov
[email protected]
https://orcid.org/0000-0002-3298-7255
CentER, Department of Econometrics and Operations Research, Tilburg University, 5037 AB Tilburg, Netherlands

Published Online: 7 Nov 2023, https://doi.org/10.1287/ijoc.2023.0036

Abstract

We consider semidefinite programming (SDP) approaches for solving the maximum satisfiability (MAX-SAT) problem and weighted partial MAX-SAT. It is widely known that SDP is well-suited to approximate (MAX-)2-SAT. Our work shows the potential of SDP also for other satisfiability problems by being competitive with some of the best solvers in the yearly MAX-SAT competition. Our solver combines sum of squares (SOS)–based SDP bounds and an efficient parser within a branch-and-bound scheme. On the theoretical side, we propose a family of semidefinite feasibility problems and show that a member of this family provides the rank-two guarantee. We also provide a parametric family of semidefinite relaxations for MAX-SAT and derive several properties of monomial bases used in the SOS approach. We connect two well-known SDP approaches for (MAX)-SAT in an elegant way. Moreover, we relate our SOS-SDP relaxations for partial MAX-SAT to the known SAT relaxations.

History: Accepted by Andrea Lodi, Area Editor for Design & Analysis of Algorithms – Discrete.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/ijoc.2023.0036.

1. Introduction

In this paper, we investigate semidefinite programming (SDP) approaches for the satisfiability (SAT) problem, maximum satisfiability (MAX-SAT) problem, and their variants. Given a logical proposition built from a conjunction of clauses, SAT asks whether there exists a truth assignment to the variables such that all clauses are satisfied. The optimization variant of SAT, known as MAX-SAT, is to determine a truth assignment that satisfies the largest number of clauses.

SAT is a central problem in mathematical logic and computer science and finds various applications, including software or hardware verification (Marques-Silva and Sakallah 2000) and planning in artificial intelligence (Kautz and Selman 1999). Cook (1971) proves that SAT is NP-complete. SAT is the first problem proven to be NP-complete, which implies that any problem contained in the complexity class NP can be efficiently recast as a SAT instance. Thus, algorithms for SAT can also solve a wide variety of other problems, such as timetabling (Asín Achá and Nieuwenhuis 2014, Gattermann et al. 2016) and product line engineering (Mendonca et al. 2009).

SDP approaches to SAT are first proposed by de Klerk et al. (2000) and later extended by Anjos (2004a, b; 2005; 2006; 2007). Goemans and Williamson (1995) are the first to apply SDP to MAX-SAT. They show that, for a specific class of MAX-SAT instances known as MAX-2-SAT (in MAX-k-SAT, each clause is a disjunction of at most k variables), MAX-SAT is equivalent to optimizing a multivariate quadratic polynomial, which is naturally well-suited for semidefinite relaxations. In the same paper, Goemans and Williamson (1995) propose a 0.878-approximation algorithm for MAX-2-SAT based on SDP. This result is later improved to 0.940 in Lewin et al. (2002). Further, Karloff and Zwick (1997) obtain an optimal 7/8-approximation algorithm for MAX-3-SAT, and Halperin and Zwick (2001) obtain a nearly optimal approximation algorithm for MAX-4-SAT. In van Maaren et al. (2008), the authors exploit sum of squares (SOS) optimization to compute bounds for MAX-SAT.

Despite the great success in designing approximation algorithms using SDP, most modern MAX-SAT solvers do not exploit SDP. A possible reason for this is the fact that medium- to large-size SDP problems are computationally challenging to solve. Interior point methods, the conventional approach for solving SDPs, suffer from large memory requirements and prohibitive computation time per iteration already for medium-size SDPs. Recently, first-order methods, such as the alternating direction method of multipliers (Gabay and Mercier 1976, Boyd et al. 2011) and the Peaceman–Rachford splitting method (PRSM) (Peaceman and Rachford 1955), have shown great success in solving SDPs (see, e.g., Oliveira et al. 2018, de Meijer and Sotirov 2021, Graham et al. 2022). Motivated by these results, we design a MAX-SAT solver that incorporates SDP bounds and the PRSM within a branch-and-bound (B&B) scheme.

In particular, we further exploit the SOS approach from van Maaren et al. (2008) to derive SOS-based SDP relaxations that provide strong upper bounds to the optimal MAX-SAT solution. The derived SDP relaxations are strengthened SDP duals of the Goemans and Williamson (1995) MAX-SAT relaxation. The strength of the upper bounds and the required time to compute the relaxations depend on the chosen monomial basis. We experiment with different monomial bases and propose a class of bases that provide good trade-offs between these effects. Moreover, we derive several properties of monomial bases that are exploited in the design of our solver. We extend the SOS approach to weighted partial MAX-SAT, a variant of MAX-SAT in which clauses are divided into soft and hard clauses. Here, the goal is to maximize the weighted sum of soft clauses, satisfying all the hard clauses. We strengthen SDP bounds for weighted partial MAX-SAT using the SAT resolution rule. To the best of our knowledge, we are the first to exploit SDP for solving weighted partial MAX-SAT.

We show that the Peaceman–Rachford splitting method is well-suited for exploiting the structure of the SOS-based SDP relaxations. Therefore, we implement the PRSM to (approximately) solve large-scale SDP relaxations and obtain upper bounds for the (weighted partial) MAX-SAT. The resulting algorithm is very efficient; for example, it can compute upper bounds with matrix variables of order 1,800 in less than two minutes and for matrices of order 2,400 in less than four minutes. Our numerical results show that the upper bounds are strong, in particular, when larger monomial bases are used. We also exploit the output of the PRSM to efficiently compute lower bounds for the MAX-SAT problem.

We design an SOS-SDP–based MAX-SAT solver (named SOS-MS) that exploits SOS-based SDP relaxations and the PRSM. SOS-MS is one of the first SDP-based MAX-SAT solvers. The only alternative SDP-based solver is the MIXSAT algorithm (Wang and Zico Kolter 2019), which is designed to solve MAX-2-SAT instances. SOS-MS is able to solve (weighted partial) MAX-k-SAT instances for $k \leq 3$. To solve a MAX-SAT instance, SOS-MS has to approximately solve multiple SDP subproblems. A crucial component of SOS-MS is, therefore, its ability to quickly construct the program parameters of the required SDPs, that is, the process of parsing. We design an efficient parsing method, which is also applicable to other problems and publicly available. Another efficient feature of our solver is its use of warm starts. Namely, our solver uses the approximate PRSM solution at a node as a warm start for the corresponding child nodes. We are able to solve a variety of MAX-SAT instances in a reasonable time, solving some instances faster than the best solvers in the 11th Evaluation of Max-SAT Solvers (MSE-2016). Moreover, we solve three previously unsolved MAX-3-SAT instances from the MSE-2016. To our knowledge, we are the first to use SDP for solving weighted partial MAX-SAT. Our results provide new perspectives on solving MAX-SAT and all its variants by using SDPs.

Our paper provides various theoretical results. We propose a family of semidefinite feasibility problems and show that one member of this family provides the rank-two guarantee. That is, whenever the semidefinite relaxation admits a feasible matrix of rank two or less, the underlying SAT instance is satisfiable. This result relates to a similar rank-two guarantee result by Anjos (2004a). The rank value can be seen as a measure of the strength of the relaxation. We also provide a parametric family of semidefinite relaxations for the (weighted partial) MAX-SAT. This parameter can be finely tuned to determine the strength of the relaxation, and any such relaxation can easily be incorporated within SOS-MS. This allows the solver to be adapted per (class of) problem instances.

Further, we show how the SOS approach to MAX-SAT of van Maaren et al. (2008) is related to the moment relaxations of SAT from Anjos (2004a, b; 2005; 2006; 2007), which we generalize here. This is done by exploiting the duality theory of the moment and SOS approaches. Our result generalizes a result by van Maaren et al. (2008), who show the connection between the two approaches only for restricted cases. By exploiting duality theory, we also relate the SOS relaxations for the partial MAX-SAT problem to SAT relaxations from Anjos (2004a).

Finally, we investigate MAX-SAT resolution, a powerful technique used by many MAX-SAT solvers (Abramé and Habet 2014), in relation to the SDP approach to MAX-SAT. Standard MAX-SAT solvers use resolution to determine upper bounds on the MAX-SAT solution, whereas SOS-MS determines upper bounds through SDP. We show how resolution is related to the monomial basis. We also show how SAT resolution can be exploited for weighted partial MAX-SAT.

This paper is organized as follows. We provide notation and preliminaries in Section 1.1 and assumptions in Section 1.2. Section 2 provides an overview of the Goemans and Williamson (1995) approach to MAX-SAT. Section 3 first outlines previous SDP approaches to SAT and then generalizes them. Section 4 provides the details of the SOS theory applied to MAX-SAT. We also derive various properties of monomial bases in that section. Section 5 concerns the combination of MAX-SAT resolution and SOS. In Section 6, we show how two SDP approaches to (MAX-)SAT (i.e., Anjos 2004a, van Maaren et al. 2008) are connected. Section 7 introduces the PRSM for SOS.¹ We extend the SOS approach to weighted partial MAX-SAT and connect the resulting program to the relaxations in Anjos (2004a) in Section 8. Online Appendix E provides an overview and pseudocode of our solver SOS-MS. Online Appendix F presents numerical results that include SOS-SDP bounds and performance of SOS-MS. Concluding remarks are given in Section 9.

1.1. Preliminaries and Notation

For any $n \in N$, we write $[n] = \{1, \dots, n\}$. We denote by $ϕ$ a propositional formula in variables $x_{1}$ up to $x_{n}$ and assume that $ϕ$ is in conjunctive normal form (CNF). That is, $ϕ$ is given by a conjunction of m clauses,

ϕ = ⋀_{j = 1}^{m} C_{j} .

(1)

We mostly use n to refer to the number of variables and m to refer to the number of clauses. Each clause C_j is a disjunction of (possibly negated) variables. We define each clause C_j as a subset of $[n]$ , indicating the variables appearing in C_j. Moreover, we define $I_{j}^{+} \subseteq C_{j}$ as the set of unnegated variables appearing in C_j. Similarly, $I_{j}^{-} \subseteq C_{j}$ is defined as the set of negated variables appearing in C_j. Thus, the clause associated with C_j reads

\underset{i \in I_{j}^{+}}{⋁} x_{i} \lor \underset{i \in I_{j}^{-}}{⋁} \neg x_{i} .

(2)

We refer to both x_i and $\neg x_{i}$ as literals. For example, the literal $\neg x_{i}$ is true if x_i is false. We denote the length of a clause by $ℓ_{j}$; thus, $ℓ_{j} ≔ | C_{j} |$. We say that $ϕ$ constitutes a (MAX-)k-SAT instance if $\max_{j \in [m]} ℓ_{j} = k$.

The SAT problem is to decide, given $ϕ$, whether a satisfying truth assignment to the variables x_i, $i \in [n]$, exists. The MAX-SAT problem is to find an assignment that satisfies the largest number of clauses. We associate to each clause a vector $a_{j} \in \{0, \pm 1\}^{n}$, having entries $a_{j, i}$ according to

a_{j, i} = \begin{cases} - 1, & \text{if } i \in I_{j}^{-}, \\ 0, & \text{if } i \notin I_{j}^{+} \cup I_{j}^{-}, \\ 1, & \text{if } i \in I_{j}^{+}. \end{cases}

(3)
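As a small illustration of (3), the sketch below builds the vector $a_{j}$ from a clause. The clause encoding (a list of signed, 1-based integers in the style of the DIMACS format, where `-3` stands for $\neg x_{3}$) and the function name `clause_vector` are assumptions made for this example only; they are not taken from the paper.

```python
def clause_vector(clause, n):
    """Return a_j as in (3) for a clause given as signed 1-based literals.

    Hypothetical encoding: +i stands for x_i, -i stands for (not x_i).
    """
    a = [0] * n
    for lit in clause:
        a[abs(lit) - 1] = 1 if lit > 0 else -1
    return a


# Clause (x_1 or not x_3 or x_4) on n = 5 variables.
a_j = clause_vector([1, -3, 4], 5)
clause_length = sum(abs(entry) for entry in a_j)  # number of nonzeros, i.e. l_j = |C_j|
```

The number of nonzero entries of $a_{j}$ recovers the clause length $ℓ_{j}$, which is the quantity $a_{j}^{⊤} a_{j}$ that appears in the relaxations later on.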

We write $S^{n}$ for the set of symmetric n × n matrices and $S_{+}^{n}$ for the cone of symmetric positive semidefinite matrices of size n × n. If the context is clear, the superscript n is omitted. We denote by $1_{n} \in R^{n}$ and $0_{n} \in R^{n}$ vectors of all ones and zeroes, respectively (subscripts are omitted when the context is clear). The identity matrix of order n is written as I_n. Matrix $0_{m \times n}$ denotes the zero matrix of m rows and n columns. For $X, Y \in S$ , we define the trace inner product as $〈 X, Y 〉 ≔ Tr (X Y)$ . We write $X ⪰ Y$ if and only if $X - Y \in S_{+}$ . The Frobenius norm of a symmetric matrix X is given by ${‖ X ‖}_{F} = \sqrt{〈 X, X 〉}$ . For $X \in S$ and $M \subseteq S$ , we set

P_{M} (X) ≔ \underset{Z \in M}{\arg\min} {‖ Z - X ‖}_{F},

(4)

as the projection of X onto $M$. For $X \in R^{n \times n}$, $diag (X) \in R^{n}$ denotes the vector equal to the diagonal of X. For $x \in R^{n}$, we write $‖ x ‖ ≔ \sqrt{x^{⊤} x}$ and

x^{α} ≔ \prod_{i \in α} x_{i},

(5)

for some $α \subseteq [n]$. We evaluate the empty product as one. Note that (5) differs from the more common notation in SOS literature, in which the α are considered as vectors in $N^{n}$; see, for example, Lasserre (2007).

1.2. Assumptions

In the rest of this work, we assume that all logical propositions $ϕ$ on n variables and m clauses satisfy the following three properties:

1. For all $j \in [m], | C_{j} | \geq 2$.
2. For all $j \in [m], I_{j}^{+} \cap I_{j}^{-} = \emptyset$.
3. Each variable is contained in at least two clauses along with $ϕ$ being in CNF.

Property 1 can be assumed for SAT instances. If a SAT instance contains a clause C_j with $| C_{j} | = 1$ (such a clause is known as a unit clause), the literal in C_j must be satisfied. The variable corresponding to this literal can, thus, be given an appropriate truth value, and $ϕ$ can be reduced (such a reduction of $ϕ$ is referred to as unit resolution). For MAX-SAT instances, it is possible that an optimal truth assignment might leave unit clauses unsatisfied. We note, however, that the MAX-SAT benchmark instances we consider satisfy all these assumptions.

In Online Appendix B, we show that properties 2 and 3 can be assumed without loss of generality (w.l.o.g.).

2. MAX-SAT Formulation and Relaxation

We outline the approach of Goemans and Williamson (1995) for formulating MAX-SAT as a polynomial optimization problem. We also present their SDP relaxation for MAX-2-SAT.

Let $x_{1}, x_{2}, \dots, x_{n}$ be the variables of the MAX-SAT instance given by a logical proposition $ϕ$. We, thus, assume that $ϕ$ is given in conjunctive normal form, see (1), and contains m clauses. As customary in the SDP SAT literature, we associate +1 with true and –1 with false. An assignment of the x_i values in $\{\pm 1\}$ is referred to as a truth assignment. As proposed by Goemans and Williamson (1995), we define a truth function $v : \{\pm 1\}^{n} \to \{0, 1\}$ such that, given a logical proposition $ϕ'$ evaluated for some truth assignment, $v (ϕ') = 1$ if and only if $ϕ'$ is satisfied and zero otherwise. This property uniquely determines v; that is, $v (x_{i}) = \frac{1 + x_{i}}{2}$ and $v (\neg x_{i}) = \frac{1 - x_{i}}{2}$. In general, for a clause $C_{j} \subseteq [n]$ of length $ℓ_{j}$, we have

v (C_{j}) ≔ 1 - \prod_{i \in I_{j}^{+}} v (\neg x_{i}) \prod_{i \in I_{j}^{-}} v (x_{i}) = 1 - 2^{- ℓ_{j}} \left( \sum_{γ \subseteq C_{j}} {(- 1)}^{| γ |} a_{j}^{γ} x^{γ} \right),

(6)

for a_j as in (3) and both $a_{j}^{γ}$ and $x^{γ}$ as in (5). The last equality in (6) follows from the product expansion of $v (C_{j})$ as shown in proposition 1 in Anjos (2006). In Goemans and Williamson (1995), an extra variable $x_{0} \in \{\pm 1\}$ is defined with the purpose of deciding the truth value; that is, $ϕ'$ is true if and only if $v (ϕ') = x_{0}$. We set $x_{0} = 1$ without loss of generality for the sake of clarity. The MAX-SAT problem given by $ϕ$ is to maximize the following polynomial:

v_{ϕ} ≔ \sum_{j \in [m]} v (C_{j}) = \sum_{α \subseteq [n]} v_{ϕ}^{α} x^{α},

(7)

subject to $x_{i} \in \{\pm 1\}$ for all i and for appropriate $v_{ϕ}^{α} \in R$ and $x^{α}$ as in (5). Observe that $v_{ϕ}$ is a kth degree polynomial if $ϕ$ represents a MAX-k-SAT instance. A MAX-2-SAT instance, thus, corresponds to a quadratic polynomial and is, therefore, well-suited for approximation by SDP. We return to $v_{ϕ}$ in Section 6.
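The two expressions for $v (C_{j})$ in (6) can be compared numerically on a small clause. The following is a minimal sketch, assuming a clause is encoded as a list of signed 1-based integers (`-2` meaning $\neg x_{2}$) and a truth assignment as a tuple of ±1 values; the helper names are hypothetical and chosen for this illustration.

```python
from itertools import combinations, product
from math import prod


def sign(lit):
    """a_{j,i} for a signed literal: +1 if unnegated, -1 if negated."""
    return 1 if lit > 0 else -1


def v_product(clause, x):
    """v(C_j) as one minus the product of the 'literal is false' indicators."""
    return 1 - prod((1 - sign(lit) * x[abs(lit) - 1]) / 2 for lit in clause)


def v_expanded(clause, x):
    """v(C_j) via the sum over subsets gamma of C_j on the right-hand side of (6)."""
    total = 0
    for size in range(len(clause) + 1):
        for gamma in combinations(clause, size):
            a_gamma = prod(sign(lit) for lit in gamma)
            x_gamma = prod(x[abs(lit) - 1] for lit in gamma)
            total += (-1) ** size * a_gamma * x_gamma
    return 1 - total / 2 ** len(clause)


def is_satisfied(clause, x):
    """Direct logical check: some literal of the clause is true under x."""
    return any(x[abs(lit) - 1] == sign(lit) for lit in clause)


clause = [1, -2, 3]  # x_1 or (not x_2) or x_3
checks = [
    v_product(clause, x) == v_expanded(clause, x) == int(is_satisfied(clause, x))
    for x in product([-1, 1], repeat=3)
]
```

All eight assignments are enumerated, so `checks` records whether the product form, the expanded form, and the logical definition agree everywhere on $\{\pm 1\}^{3}$ for this clause.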

Assuming now that $ϕ$ represents a MAX-2-SAT instance on n variables, the corresponding MAX-2-SAT can be formulated as

\begin{array}{ll} \max & 〈 W, X 〉 \\ \text{s.t.} & diag (X) = 1, \ X ⪰ 0, \ X \in \{\pm 1\}^{(n + 1) \times (n + 1)}, \end{array}

(8)

where $〈 W, X 〉$ describes the quadratic polynomial $v_{ϕ}$. Observe that the constraints of (8) enforce X to satisfy $X = x x^{⊤}$ for some $x \in \{\pm 1\}^{n + 1}$. The size of this vector x is one more than the number of variables n to account for the additional truth value variable x₀.
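To make the role of W concrete, the sketch below builds one possible W for a MAX-2-SAT instance and checks by enumeration that $〈 W, x x^{⊤} 〉$ counts the satisfied clauses when $x_{0} = 1$. The construction (placing the constant on the $(0, 0)$ entry and splitting every other coefficient evenly over the two symmetric positions) is one choice among many and is an assumption of this example, as are the clause encoding and the function names.

```python
from itertools import product


def build_W(clauses, n):
    """Symmetric (n+1) x (n+1) matrix W with <W, x x^T> = v_phi(x) when x_0 = 1.

    Each clause is a pair of signed 1-based literals on two distinct variables.
    """
    W = [[0.0] * (n + 1) for _ in range(n + 1)]
    for p, q in clauses:
        i, k = abs(p), abs(q)
        a_i, a_k = (1 if p > 0 else -1), (1 if q > 0 else -1)
        # v(C) = 3/4 + a_i x_i / 4 + a_k x_k / 4 - a_i a_k x_i x_k / 4
        W[0][0] += 0.75  # constant term; diag(X) = 1 makes any diagonal entry valid
        for row, col, coeff in ((0, i, a_i / 4), (0, k, a_k / 4), (i, k, -a_i * a_k / 4)):
            W[row][col] += coeff / 2  # split each coefficient over the
            W[col][row] += coeff / 2  # two symmetric positions
    return W


def inner(W, x):
    """Trace inner product <W, x x^T>, which equals x^T W x."""
    return sum(W[r][c] * x[r] * x[c] for r in range(len(x)) for c in range(len(x)))


def satisfied(clauses, x):
    """Number of satisfied clauses; x[0] is the auxiliary x_0, x[i] is x_i."""
    return sum(any(x[abs(lit)] == (1 if lit > 0 else -1) for lit in cl) for cl in clauses)


clauses = [(1, 2), (-1, 3), (-2, -3), (1, -3)]
n = 3
W = build_W(clauses, n)
matches = [
    inner(W, (1,) + x) == satisfied(clauses, (1,) + x)
    for x in product([-1, 1], repeat=n)
]
```

Because every entry of W is a multiple of 1/8, the floating-point comparison in `matches` is exact for instances of this size.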

A semidefinite relaxation of (8) is obtained by omitting the integrality constraint or, equivalently, nonconvex rank-one constraint. This constitutes the well-known Goemans and Williamson (1995) SDP relaxation of MAX-2-SAT. That is,

\begin{array}{ll} \max & 〈 W, X 〉 \\ \text{s.t.} & diag (X) = 1, \ X \in S_{+}^{n + 1}. \end{array}

(9)

Goemans and Williamson (1995) show that the optimal matrix for (9) can be used to obtain a 0.878-approximation algorithm for MAX-2-SAT. Assuming $P \neq NP$, for any $ε > 0$ there exists no $(\frac{21}{22} + ε)$-approximation algorithm for MAX-2-SAT (Håstad 2001). Karloff and Zwick (1997) introduce a canonical way of obtaining SDP relaxations for any MAX-SAT problem, which is exploited to obtain approximation algorithms for MAX-3-SAT and MAX-4-SAT in Karloff and Zwick (1997) and Halperin and Zwick (2001), respectively. To solve MAX-2-SAT instances exactly, rather than approximately, Wang and Zico Kolter (2019) propose the MIXSAT algorithm, which combines (9) with a B&B scheme.

3. SAT as a Semidefinite Feasibility Problem

In this section, we first present a brief overview of the work done by de Klerk et al. (2000) and Anjos (2004a, b; 2005; 2006; 2007). Their relaxations of SAT involve semidefinite feasibility problems: infeasibility of the semidefinite program implies unsatisfiability of the corresponding SAT instance. The difference between relaxations is the size of the SDP variable and the method of encoding the structure of the SAT instance in the SDP relaxation. Then, we propose a family of semidefinite feasibility problems that contains relaxations from Anjos (2004a, b; 2005; 2006; 2007) and de Klerk et al. (2000) as special cases and show that a particular member of the family provides a rank-two guarantee; see Theorem 1.

We reconsider first Program (9), which attempts to satisfy the maximum number of clauses through its objective function. For SAT specifically, one might move the clause satisfaction part from the objective to the feasible set of a semidefinite program. This idea is first proposed by de Klerk et al. (2000) and later extended by Anjos (2004a). To be precise: de Klerk et al. (2000) propose the so-called GAP relaxation, or GAP for short, which is a semidefinite feasibility problem, given by

\begin{array}{ll} \text{find} & Y \in S_{+}^{n}, \ y \in R^{n} \\ \text{s.t.} & a_{j}^{⊤} Y a_{j} - 2 a_{j}^{⊤} y \leq ℓ_{j} (ℓ_{j} - 2), \ \forall j \in [m], \\ & diag (Y) = 1, \ Y ⪰ y y^{⊤}, \end{array}

(GAP)

for a_j as in (3). It is noted in de Klerk et al. (2000) that, for $ℓ_{j} \leq 2$, the corresponding inequalities in the GAP relaxation may be changed to equalities. The GAP relaxation is suited for instances that contain a clause of length two. If $ℓ_{j} \geq 3, \forall j \in [m]$, then $(Y, y) = (I, 0)$ is always feasible for GAP whether the underlying SAT instance is satisfiable or not.
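The feasibility of $(Y, y) = (I, 0)$ and the fact that GAP is a relaxation of SAT can both be checked by a short calculation. The derivation below is a sketch added for illustration, assuming that a truth assignment $x \in \{\pm 1\}^{n}$ is embedded as $(Y, y) = (x x^{⊤}, x)$; it is not reproduced from de Klerk et al. (2000).

```latex
% Embedding of a truth assignment: Y = x x^T and y = x,
% so that diag(Y) = 1 and Y - y y^T = 0 is positive semidefinite.
\begin{align*}
a_j^{\top} Y a_j - 2 a_j^{\top} y
  &= (a_j^{\top} x)^2 - 2\, a_j^{\top} x
   = (a_j^{\top} x - 1)^2 - 1, \\
\ell_j (\ell_j - 2) &= (\ell_j - 1)^2 - 1 .
\end{align*}
% Hence the j-th constraint holds iff |a_j^T x - 1| <= l_j - 1.
% Since a_j^T x takes values in {-l_j, -l_j + 2, ..., l_j}, the constraint
% fails only for a_j^T x = -l_j, that is, when every literal of C_j is false.
%
% The point (Y, y) = (I, 0): diag(I) = 1, I - 0 0^T = I is positive
% semidefinite, and a_j^T a_j counts the nonzeros of a_j, so
\begin{align*}
a_j^{\top} I a_j - 2 a_j^{\top} 0 = \ell_j \le \ell_j (\ell_j - 2)
  \iff \ell_j \ge 3 .
\end{align*}
```

The first part shows that every satisfying assignment yields a feasible point of GAP; the second part shows why clauses of length at least three carry no information in this relaxation.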

We now state the SDP relaxations of SAT by Anjos (2004a, b; 2005; 2006; 2007), which place no restriction on the lengths of the clauses. Let $ϕ$ be a proposition on n variables and m clauses and $x \in \{\pm 1\}^{n}$ the truth assignment to the variables. Consider a family of subsets $F = \{α_{1}, \dots, α_{s}\}$ of $[n]$, let $x = {(x^{α_{1}}, \dots, x^{α_{s}})}^{⊤}$, and define $Y ≔ x x^{⊤}$. It is clear that $rank (Y) = 1$, $diag (Y) = 1$, and $Y ⪰ 0$. Later, to obtain a semidefinite relaxation of SAT, we omit the rank-one constraint.

We index the matrix Y with the elements of $F$ and define for all subsets γ contained in some clause of $ϕ$ the expression

Y (γ) ≔ Y_{α, β}, \quad \text{for some } α, β \in F \text{ jointly contained in a single clause s.t. } α △ β = γ,

(10)

where $△$ is the symmetric difference operator, which is induced by the fact that, for $x \in \{\pm 1\}^{n}$, we have $Y_{α, β} = x^{α} x^{β} = x^{α △ β} = x^{γ}$; see (5). In general, $Y (\emptyset)$ refers to a diagonal entry of Y; hence, $Y (\emptyset) = 1$. We may have $Y (γ) = Y_{\emptyset, γ}$, and we assume that, for all γ contained in a clause of $ϕ$, we can always find α and β as in (10).

The expression $Y (γ)$ can refer to multiple entries of Y. By construction of Y, these entries are equal. Stated formally, we have $Y \in \cap_{j \in [m]} Δ_{j}$ , where

Δ_{j} ≔ \{ Y \in S \mid Y_{α_{1}, β_{1}} = Y_{α_{2}, β_{2}} \ \forall (α_{1}, α_{2}, β_{1}, β_{2}) \in F \text{ s.t. } α_{1} △ β_{1} = α_{2} △ β_{2} \subseteq C_{j} \}.

(11)

Observe that the sets Δ_j do not capture all equalities present in Y because of the restriction $α_{1} △ β_{1} = α_{2} △ β_{2} \subseteq C_{j}$ . In this section, we choose to include only the equalities captured by Δ_j. This keeps our work in line with previous relaxations by Anjos (2004a), and these equalities suffice to prove the main theorem in this section; see Theorem 1. In Section 6, we consider an SDP relaxation of SAT that considers all equalities present in Y.

If x is a satisfying assignment to $ϕ$ , then $v (C_{j}) = 1$ (see (6)) for all $j \in [m]$ . We can rewrite this constraint in terms of $Y (γ)$ ; see (10). We now omit the rank-one constraint on Y to obtain the following semidefinite feasibility program, denoted by $R_{F} (ϕ)$ :

\begin{array}{ll} \text{find} & Y \in S_{+}^{| F |} \\ \text{s.t.} & \sum_{γ \subseteq C_{j}} {(- 1)}^{| γ |} a_{j}^{γ} Y (γ) = 0, \ \forall j \in [m], \\ & diag (Y) = 1, \ Y \in \bigcap_{j \in [m]} Δ_{j}. \end{array}

($R_{F} (ϕ)$)

The program $R_{F} (ϕ)$ contains both the GAP relaxation and the relaxations proposed by Anjos (2004b) as special cases. Specifically, one obtains the GAP relaxation from $R_{F} (ϕ)$ by taking $F = \{α \subseteq [n] \mid | α | \leq 1\}$. de Klerk et al. (2000) prove that, whenever GAP is feasible for a 2-SAT instance, the 2-SAT instance is satisfiable. Note that 2-SAT is decidable in linear time (Aspvall et al. 1979) unlike the NP-complete k-SAT with $k \geq 3$.

The GAP relaxation can be considered as a semidefinite program in the first level of the well-known Lasserre (2001) hierarchy. Anjos (2004b) proposes semidefinite relaxations of SAT in approximately levels two and three of the Lasserre (2001) hierarchy by only adding a subset of products of variables to the moment relaxation. For example, Anjos (2004a) proposes the R₂ relaxation, which can be obtained from $R_{F} (ϕ)$ by taking

F = \{ α \mid α \subseteq C_{j} \text{ for some } j, \ | α | \text{ odd, or } α = \emptyset \}.

(12)

Anjos (2004a) proves that the R₂ relaxation attains a rank-two guarantee on 3-SAT instances: whenever the SDP admits a feasible matrix of rank two or lower, the corresponding SAT instance is satisfiable. We now prove that, for a different $F$ than (12), the resulting relaxation $R_{F} (ϕ)$ provides the same rank-two guarantee.

Theorem 1.

Let $ϕ$ be a 3-SAT instance and $F = \{α \subseteq [n] \mid α \subseteq C_{j} \text{ for some } j, \ | α | \neq 1\}$. Let Y be the matrix variable of $R_{F} (ϕ)$ indexed by elements of $F$. If $R_{F} (ϕ)$ admits a feasible rank-two matrix, then $ϕ$ is satisfiable.

Proof.

See Online Appendix A. □

4. Sum of Squares and MAX-SAT

In Section 4.1, we first provide an overview of the approach of van Maaren et al. (2008) for deriving relaxations for MAX-SAT. Their approach exploits SOS optimization, which has received much attention in the literature (see, e.g., Lasserre 2007, Laurent 2009, Scheiderer 2009, Parrilo and Thomas 2020). Relaxations depend on a basis that is used to compute them. We introduce a parametric family of monomial bases with increasing complexity. In Section 4.2, we derive several properties of monomial bases that are later used in our computations.

4.1. General Overview

For a given logical proposition $ϕ$ on n variables and m clauses, the value

F_{ϕ} (x) ≔ \sum_{j = 1}^{m} \frac{1}{2^{ℓ_{j}}} \prod_{i \in C_{j}} (1 - a_{j, i} x_{i}),

(13)

for $a_{j, i}$ as in (3) equals the number of clauses left unsatisfied by the truth assignment $x \in \{\pm 1\}^{n}$. Hence, we are interested in minimizing $F_{ϕ}$ over $\{\pm 1\}^{n}$, on which $F_{ϕ}$ is nonnegative. Let $R [x]$ be the set of real polynomials in x. We define

V ≔ \{ f \mid f \equiv \sum_{j = 1}^{k} f_{j}^{2} \mod I, \ f_{j} \in R [x] \ \forall j \in [k], \ k \in N \}

(14)

as the set of SOS polynomials modulo $I$, where $I$ is the vanishing ideal of $\{\pm 1\}^{n}$. That is,

I = 〈 1 - x_{1}^{2}, 1 - x_{2}^{2}, \dots, 1 - x_{n}^{2} 〉 .

(15)

By Putinar's Positivstellensatz (Putinar 1993), $V$ is the set of nonnegative polynomials on $\{\pm 1\}^{n}$. Generally, optimization over $V$ is intractable because of its size, which is why we consider

V_{x} ≔ \{ f \mid f \equiv x^{⊤} M x \mod I, \ M ⪰ 0 \},

(16)

where x is some monomial basis. Because $M ⪰ 0$, it follows that all polynomials of $V_{x}$ are nonnegative on $\{\pm 1\}^{n}$. Therefore, $V_{x} \subseteq V$, and we may approximate the minimum of $F_{ϕ}$ as

\min_{x \in \{\pm 1\}^{n}} F_{ϕ} = \sup \{ λ \mid F_{ϕ} - λ \in V \} \geq \sup \{ λ \mid F_{ϕ} - λ \in V_{x} \}.

(17)

The description of $V_{x}$ shows that computing this lower bound can be done via SDP.

It is important to note that, in the quotient ring of $R [x]$ modulo $I$ , all terms $x_{i}^{2} \equiv 1$ , and thus, it suffices to consider only monomials in x for which the highest power is at most one. Thus, we can write

F_{ϕ} (x) = \sum_{α \subseteq [n]} p_{ϕ}^{α} x^{α},

(18)

where $p_{ϕ}^{α} \in R$ for all $α \subseteq [n]$ and $x^{α}$ as in (5). For the constant term of $F_{ϕ} (x)$, we have

p_{ϕ}^{\emptyset} = \sum_{j = 1}^{m} \frac{1}{2^{ℓ_{j}}}.

(19)
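The coefficients $p_{ϕ}^{α}$ in (18) can be obtained by expanding each product in (13) and collecting terms. The sketch below does this for a small instance and checks that the resulting multilinear polynomial counts unsatisfied clauses on $\{\pm 1\}^{n}$. The clause encoding (signed 1-based integers), the use of `frozenset` keys for the index sets α, and all function names are assumptions of this illustration.

```python
from collections import defaultdict
from itertools import combinations, product
from math import prod


def coefficients(clauses):
    """Return {alpha: p_phi^alpha} as in (18), with alpha a frozenset of variable indices."""
    p = defaultdict(float)
    for clause in clauses:
        weight = 1 / 2 ** len(clause)
        for size in range(len(clause) + 1):
            for gamma in combinations(clause, size):
                # Expanding prod_i (1 - a_{j,i} x_i) selects the factor -a_{j,i} for i in gamma.
                coeff = prod(-1 if lit > 0 else 1 for lit in gamma)
                p[frozenset(abs(lit) for lit in gamma)] += weight * coeff
    return dict(p)


def evaluate(p, x):
    """Evaluate sum_alpha p^alpha x^alpha for x given as a tuple (x_1, ..., x_n)."""
    return sum(c * prod(x[i - 1] for i in alpha) for alpha, c in p.items())


def unsatisfied(clauses, x):
    """Number of clauses in which every literal is false under x."""
    return sum(all(x[abs(lit) - 1] != (1 if lit > 0 else -1) for lit in cl) for cl in clauses)


clauses = [[1, 2], [-1, 2], [1, -2, 3], [-2, -3]]
p = coefficients(clauses)
constant_term = p[frozenset()]  # equals the sum of 2^{-l_j}; compare with (19)
agree = [evaluate(p, x) == unsatisfied(clauses, x) for x in product([-1, 1], repeat=3)]
```

For this instance the constant term is 1/4 + 1/4 + 1/8 + 1/4, in line with (19), and all coefficients are multiples of 1/8, so the equality tests in `agree` involve no rounding.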

We say that monomial basis x represents a logical proposition $ϕ$ if matrix $X \equiv x x^{⊤} \mod I$ contains all monomials $x^{α}$ for which $p_{ϕ}^{α} \neq 0$ . We index this matrix X and the matrix M from (16) with subsets $α \subseteq [n]$ for which $x^{α} \in x$ . Note that for such $α, β \subseteq [n]$ , we have

X_{α, β} \equiv x^{α △ β} \mod I .

(20)

For $α \subseteq [n]$ , we write $x^{α} \in X$ if X has an entry equal to $x^{α}$ (modulo $I$ ). van Maaren et al. (2008) propose multiple monomial bases x, among them basis SOS_p, given by

x = \{1\} \cup \{ x_{i} \mid i \in [n] \} \cup \{ x_{i} x_{j} \mid i \text{ and } j \text{ appear together in a clause} \}.

(21)

It is stated in van Maaren et al. (2008) that SOS_p represents 2-SAT and 3-SAT instances. Whereas this is true, this basis also represents 4-SAT instances (see Lemma 2). We additionally define for $Q \in N$ , as an extension to SOS_p, the basis

SOS_{p}^{Q} ≔ SOS_{p} \cup \{ x_{i} x_{j} \mid i \text{ and } j \text{ are both in the top } Q \text{ appearing variables} \}.

(22)

Basis $SOS_{p}^{Q}$ takes basis SOS_p and adds all the $\binom{Q}{2}$ quadratic terms of the Q variables appearing in the highest number of clauses of $ϕ$. Any basis x is considered to have duplicate monomials removed, and so, for small values of Q, bases $SOS_{p}^{Q}$ and SOS_p might coincide.

We also define the basis $SOS_{s}^{θ}$ for $θ \in [0, 1]$, which is suited for (MAX-)2-SAT instances. This basis consists of all the monomials of degrees one and zero plus a percentage θ of all quadratic monomials appearing in SOS_p. The included quadratic monomials are those that appear in SOS_p and attain the highest monomial weight w, which is defined as $w (x^{α}) ≔ \sum_{i \in α} w (i)$, where $w (i) ≔ | \{C \in ϕ \mid i \in C\} |$ for $i \in [n]$. This results in the following chain of inclusions:

\{x^{α} \mid | α | \leq 1\} = SOS_{s}^{0} \subseteq SOS_{s}^{θ} \subseteq SOS_{s}^{1} = SOS_{p} = SOS_{p}^{0} \subseteq SOS_{p}^{Q} \subseteq SOS_{p}^{n} = \{x^{α} \mid | α | \leq 2\}.

(23)
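The bases (21) and (22) are straightforward to enumerate. The following sketch represents each monomial $x^{α}$ by the `frozenset` α, so duplicates are removed automatically. The clause encoding, the function names, and in particular the tie-breaking rule used to pick the top Q variables are assumptions of this example; the text does not specify how ties are resolved.

```python
from collections import Counter
from itertools import combinations


def variables(clause):
    """Sorted variable indices of a clause given as signed 1-based literals."""
    return sorted({abs(lit) for lit in clause})


def sos_p(clauses, n):
    """Basis (21): the constant 1, all x_i, and x_i x_j for i, j sharing a clause."""
    basis = {frozenset()} | {frozenset([i]) for i in range(1, n + 1)}
    for clause in clauses:
        basis |= {frozenset(pair) for pair in combinations(variables(clause), 2)}
    return basis


def sos_p_q(clauses, n, Q):
    """Basis (22): SOS_p plus all pairs among the Q most frequently appearing variables."""
    counts = Counter(i for clause in clauses for i in variables(clause))
    # Assumption: ties are broken in favour of the smaller variable index.
    top = sorted(range(1, n + 1), key=lambda i: (-counts[i], i))[:Q]
    return sos_p(clauses, n) | {frozenset(pair) for pair in combinations(top, 2)}


clauses = [[1, 2, 3], [3, 4], [1, -4, 5], [-2, 5]]
n = 5
base = sos_p(clauses, n)
small = sos_p_q(clauses, n, 2)
full = sos_p_q(clauses, n, n)
```

On this instance `small` coincides with `base`, because the two selected variables already share a clause, while `full` contains every monomial of degree at most two, matching the two ends of the chain (23).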

We now define, for all $γ \subseteq [n]$ such that $x^{γ} \in X$ , a set of ordered pairs as follows:

x^{γ} ≔ \{ (α, β) \subseteq {[n]}^{2} \mid α △ β = γ, \ x^{α} \in x, \ x^{β} \in x \}.

(24)

Set $x^{γ}$ contains the index pairs $(α, β)$ such that $X_{α, β} \equiv x^{γ}$ . Therefore, $F_{ϕ} \equiv x^{⊤} M x$ if and only if

\sum_{(α, β) \in x^{γ}} M_{α, β} = p_{ϕ}^{γ}, \forall γ such that x^{γ} \in X .

(25)

Constraints of this form are sometimes referred to as coefficient matching conditions in SOS literature (Zheng et al. 2017). We define

M_{ϕ} ≔ \{ M \in S^{| x |} \mid \sum_{(α, β) \in x^{γ}} M_{α, β} = p_{ϕ}^{γ}, \ \forall γ \neq \emptyset \text{ such that } x^{γ} \in X \},

(26)

as the set of matrices satisfying the coefficient matching conditions for all monomials except $x^{\emptyset}$. Note that M is constrained to be symmetric, which is reflected in the definition of $x^{γ}$, because $(α, β) \in x^{γ}$ if and only if $(β, α) \in x^{γ}$. Moreover, $x^{\emptyset}$ contains the index pairs of the diagonal entries of M, which correspond to zero-degree monomials in X. Hence, if $F_{ϕ} - λ \equiv x^{⊤} M x$, then $p_{ϕ}^{\emptyset} - λ = 〈 I, M 〉$; see (17) and (19). To maximize the lower bound on $F_{ϕ}$ (see (17)), we maximize λ, which is, thus, equivalent to minimizing $〈 I, M 〉$. We can, therefore, compute this lower bound by solving the following SDP:

\begin{array}{l} min 〈 I, M 〉 s . t . M \in M_{ϕ} \cap S_{+} . \end{array}

(P_ϕ)

We note that, for the purpose of solving $P_{ϕ}$ through interior point methods, program $P_{ϕ}$ is strictly feasible: for any feasible matrix M, matrix M + I is strictly feasible. The existence of any such feasible matrix M follows from the nonnegativity of $F_{ϕ}$ on ${\pm 1}^{n}$ . We postpone the derivation of the dual of $P_{ϕ}$ to Section 6, in which we also show its strict feasibility in Theorem 2.

4.2. Properties of $S O S_{p}^{Q}$

We provide several properties of monomial bases that are exploited within the PRSM; see Section 7.

Denote by $| x^{γ} |$ the cardinality of the set $x^{γ}$; see (24). Because of the symmetry of X (see (20)), for nonempty γ, $| x^{γ} |$ is an even number and greater than or equal to two. In particular, when $| x^{γ} | = 2$, say $x^{γ} = \{(α, β), (β, α)\}$, we have

M_{α, β} + M_{β, α} = p_{ϕ}^{γ} \text{ and } M_{α, β} = M_{β, α} \Rightarrow M_{α, β} = M_{β, α} = p_{ϕ}^{γ} / 2 .

(27)

Thus, whenever $| x^{γ} | = 2$ , the constraint involving $x^{γ}$ in $M_{ϕ}$ (see (26)), simply fixes two entries of M. van Maaren et al. (2008) refer to these constraints arising from $| x^{γ} | = 2$ as unit constraints. In section 7 of van Maaren et al. (2008), the authors empirically show that a high percentage of the constraints of $M_{ϕ}$ are unit constraints. van Maaren et al. (2008) propose as future work the development of an SDP solver that is able to exploit the large number of unit constraints. We propose an algorithm for approximately solving $P_{ϕ}$ in Section 7 that is able to do so.

The following lemma describes the subsets γ that induce unit constraints.

Lemma 1.

Let $ϕ$ be a (MAX-)SAT instance on n variables and m clauses and x a monomial basis that contains at least all of the monomials induced by the SOS_p basis; see (21). Then, for all $γ \subseteq [n]$ , we have $| x^{γ} | = 2 \Rightarrow p_{ϕ}^{γ} = 0$ , where $p_{ϕ}^{γ}$ is a coefficient of $F_{ϕ} (x)$ ; see (18).

Proof.

See Online Appendix A. □

Lemma 1 implies that, in an implementation that uses the SOS_p basis, it is not required to store the coefficients corresponding to unit constraints (because these all equal zero), but only the indices restricted by the unit constraints. The converse of Lemma 1 is generally not true. That is, there can exist many subsets $γ \subseteq [n]$ for which $p_{ϕ}^{γ} = 0$ , but $| x^{γ} | > 2$ .
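The statement of Lemma 1 can be checked numerically on a small instance. The sketch below is only an illustration, not the authors' implementation: it assumes that the SOS_p basis consists of the constant monomial, all $x_{i}$, and all products $x_{i} x_{j}$ of variables that occur together in a clause, and it uses the instance $ϕ_{6}$ of (28) as test input. The helper names (`phi_k`, `coefficients`, `sos_p_basis`, `index_sets`) are hypothetical.

```python
import itertools

def phi_k(k):
    """Clauses of phi_k from (28); +i encodes x_i and -i its negation."""
    clauses = [[-1], [1, -2], [2, 3]]
    clauses += [[-j, j + 1] for j in range(3, k)]
    clauses.append([-k])
    return clauses

def coefficients(clauses):
    """Expand F = sum_j 2^{-l_j} prod_i (1 - a_{j,i} x_i) into {gamma: p^gamma}."""
    p = {}
    for clause in clauses:
        for r in range(len(clause) + 1):
            for lits in itertools.combinations(clause, r):
                gamma = frozenset(abs(l) for l in lits)
                # Each positive literal contributes a factor -x_i, each negative one +x_i.
                sign = (-1) ** sum(1 for l in lits if l > 0)
                p[gamma] = p.get(gamma, 0.0) + sign / 2 ** len(clause)
    return p

def sos_p_basis(clauses, n):
    """Assumed SOS_p basis: 1, all x_i, and x_i x_j for pairs sharing a clause."""
    basis = {frozenset()} | {frozenset([i]) for i in range(1, n + 1)}
    for clause in clauses:
        for a, b in itertools.combinations(sorted({abs(l) for l in clause}), 2):
            basis.add(frozenset([a, b]))
    return sorted(basis, key=lambda s: (len(s), sorted(s)))

def index_sets(basis):
    """Map gamma to the ordered pairs (alpha, beta) whose symmetric difference is gamma."""
    x = {}
    for alpha in basis:
        for beta in basis:
            x.setdefault(alpha ^ beta, []).append((alpha, beta))
    return x

k = 6
clauses = phi_k(k)
p = coefficients(clauses)
x = index_sets(sos_p_basis(clauses, k))
# Unit constraints are the gamma with exactly two index pairs.
unit = [gamma for gamma, pairs in x.items() if len(pairs) == 2]
unit_coefficients = [p.get(gamma, 0.0) for gamma in unit]
```

For this instance, every entry of `unit_coefficients` is zero, so only the index pairs of the unit constraints would need to be stored.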

Lemma 2.

Let $ϕ$ be an SAT instance on n variables and x its monomial basis according to $S O S_{p}^{Q}$ for some $Q \in N$ . Let $γ \subseteq [n]$ . Then,

$| γ | \in {1, 2} \Rightarrow | x^{γ} | \leq 2 n$ .
$| γ | \in {3, 4} \Rightarrow | x^{γ} | \leq 6$ .
$| γ | > 4 \Rightarrow | x^{γ} | = 0$ .

Proof.

See Online Appendix A. □

Part 3 of Lemma 2 shows that the $S O S_{p}^{Q}$ bases are only suited for (MAX-)k-SAT when $k \leq 4$ .
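The cardinality bounds of Lemma 2 can be reproduced by direct enumeration. The following sketch assumes the largest basis of this type, namely all monomials of degree at most two in n variables (an assumption made for illustration; an actual $S O S_{p}^{Q}$ basis is a subset of it), and records the largest $| x^{γ} |$ for each value of $| γ |$.

```python
import itertools

def max_index_set_sizes(n):
    """Return {|gamma|: max |x^gamma|} for the basis of all monomials of degree <= 2."""
    variables = range(1, n + 1)
    basis = [frozenset(c) for r in range(3)
             for c in itertools.combinations(variables, r)]
    counts = {}
    for alpha in basis:
        for beta in basis:
            gamma = alpha ^ beta          # symmetric difference
            counts[gamma] = counts.get(gamma, 0) + 1
    largest = {}
    for gamma, count in counts.items():
        largest[len(gamma)] = max(largest.get(len(gamma), 0), count)
    return largest

n = 6
sizes = max_index_set_sizes(n)
```

With n = 6, the bounds 2n and 6 are attained, and no γ with more than four elements appears among the keys.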

5. Resolution and Monomial Bases

In this section, we consider resolution in combination with the SOS approach to MAX-SAT. Resolution is a technique from mathematical logic and widely employed by MAX-SAT solvers (Prasad et al. 2005). Resolution takes as inputs two clauses of a proposition $ϕ$ and returns a set of new clauses named the resolvent clauses. The resolvent clauses transform $ϕ$ into $ϕ'$ by either replacing the original clauses or adding the resolvent clauses to $ϕ$ (depending on which resolution rule is used). We show in this section that the MAX-SAT resolution rule might not be beneficial for the SOS approach applied to MAX-SAT and can even decrease its effectiveness. However, in Section 8.2, we show how to benefit from the SAT resolution rule in partial MAX-SAT.

We show this using an example. For $k \geq 3$ , we define the following proposition on k variables:

ϕ_{k} ≔ \begin{cases} \neg x_{1} \land (x_{1} \lor \neg x_{2}) \land (x_{2} \lor x_{3}) \land \neg x_{3} & \text{if } k = 3, \\ \neg x_{1} \land (x_{1} \lor \neg x_{2}) \land (x_{2} \lor x_{3}) \land [⋀_{j = 3}^{k - 1} (\neg x_{j} \lor x_{j + 1})] \land \neg x_{k} & \text{else.} \end{cases}

(28)

It is clear that $ϕ_{k}$ is unsatisfiable. If one satisfies the initial two unit clauses and performs unit resolution, more unit clauses appear. Repeating this process leads to an all-false truth assignment, leaving clause $x_{2} \lor x_{3}$ unsatisfied. Therefore, any truth assignment leaves at least one clause unsatisfied, and hence, ${min}_{x \in {\pm 1}^{k}} F_{ϕ_{k}} = F_{ϕ_{k}} (- 1_{k}) = 1$ for $F_{ϕ_{k}}$ as in (13). In the following lemma, we show that the SOS_p basis (see (21)), suffices for proving optimality of this assignment.
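For small k, the claim on the minimum of $F_{ϕ_{k}}$ can be confirmed by brute force. The sketch below evaluates F as the number of falsified clauses, using the product form that also appears in (51) with unit weights; the encoding of clauses as signed integers and the function names are choices made here for illustration.

```python
import itertools

def phi_k(k):
    """Clauses of phi_k from (28); +i encodes x_i and -i its negation."""
    clauses = [[-1], [1, -2], [2, 3]]
    clauses += [[-j, j + 1] for j in range(3, k)]
    clauses.append([-k])
    return clauses

def F(clauses, x):
    """Number of clauses falsified by x, where x[i] is +1 (true) or -1 (false)."""
    total = 0.0
    for clause in clauses:
        term = 1.0
        for lit in clause:
            a = 1 if lit > 0 else -1
            term *= (1 - a * x[abs(lit)]) / 2   # equals 1 iff the literal is false
        total += term
    return total

def min_F(k):
    """Minimum of F over all 2^k assignments in {-1, +1}^k."""
    clauses = phi_k(k)
    values = []
    for signs in itertools.product((-1, 1), repeat=k):
        x = dict(zip(range(1, k + 1), signs))
        values.append(F(clauses, x))
    return min(values)

all_false = {i: -1 for i in range(1, 6)}
value_at_all_false = F(phi_k(5), all_false)
```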

Lemma 3.

For all $k \geq 3$ , we have $max {λ | F_{ϕ_{k}} - λ \in V_{x}} = 1$ , where $x = {SOS}_{p} (ϕ_{k})$ , and $V_{x}$ as in (16).

Proof.

See Online Appendix A. □

Let us now present the MAX-SAT resolution rule (see, e.g., Abramé and Habet 2014). For clauses C₁ and C₂ of some proposition $ϕ$ , on literals x, z_i, $i \in [s]$ and y_i, $i \in [t]$ construct the clauses below the horizontal line:

\begin{array}{c} \frac{C_{1} = [x \lor z_{1} \lor \dots \lor z_{s}], C_{2} = [\neg x \lor y_{1} \lor \dots \lor y_{t}]}{z_{1} \lor \dots \lor z_{s} \lor y_{1} \lor \dots \lor y_{t},} \\ [C_{1} \lor \neg y_{1} \lor y_{2} \lor \dots \lor y_{t}], [C_{1} \lor \neg y_{2} \lor y_{3} \lor \dots \lor y_{t}], \dots, [C_{1} \lor \neg y_{t}], \\ [C_{2} \lor \neg z_{1} \lor z_{2} \lor \dots \lor z_{s}], [C_{2} \lor \neg z_{2} \lor z_{3} \lor \dots \lor z_{s}], \dots, [C_{2} \lor \neg z_{s}] . \end{array}

(29)

The MAX-SAT resolution rule states that one may replace clauses C₁ and C₂ in $ϕ$ with a subset of the $1 + s + t$ resolvent clauses below the horizontal line. Namely, clauses that are trivially satisfied, such as $x \lor \neg x$ , are not part of this subset. We refer to the resulting new proposition, obtained after resolution, as $ϕ'$ . In Bonet et al. (2007), theorem 4, it is proven that $F_{ϕ} = F_{ϕ'}$ ; see (13). Note that, in Bonet et al. (2007), definition 6, function $F_{ϕ}$ is stated in terms of ${0, 1}$ variables.
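Rule (29) is mechanical enough to sketch in code. The example below is an illustrative implementation under the assumption that clauses are sets of signed integers (+i for $x_{i}$, -i for its negation) and that the literals $z_{i}$ and $y_{i}$ are ordered by variable index; it applies the rule to three consecutive implications $\neg x_{1} \lor x_{2}$, $\neg x_{2} \lor x_{3}$, $\neg x_{3} \lor x_{4}$ and lets one compare the number of falsified clauses before and after.

```python
import itertools

def maxsat_resolve(c1, c2, x):
    """Apply (29) to c1 (containing literal x) and c2 (containing -x)."""
    z = sorted(c1 - {x}, key=abs)
    y = sorted(c2 - {-x}, key=abs)
    out = [frozenset(z) | frozenset(y)]
    for i in range(len(y)):
        out.append(c1 | {-y[i]} | frozenset(y[i + 1:]))
    for i in range(len(z)):
        out.append(c2 | {-z[i]} | frozenset(z[i + 1:]))
    # Trivially satisfied clauses (containing a literal and its negation) are dropped.
    return [c for c in out if not any(-lit in c for lit in c)]

def num_unsatisfied(clauses, assignment):
    """Count falsified clauses; assignment maps variable index to True/False."""
    return sum(1 for c in clauses
               if not any(assignment[abs(l)] == (l > 0) for l in c))

c_a, c_b, c_c = frozenset({-1, 2}), frozenset({-2, 3}), frozenset({-3, 4})
first = maxsat_resolve(c_a, c_b, 2)          # three resolvents
second = maxsat_resolve(first[2], c_c, 3)    # resolve the third resolvent again
new_clauses = first[:2] + second             # six clauses replacing the original three
```

Comparing `num_unsatisfied` on the original and on the new clauses over all 16 assignments gives identical counts, in line with the cited result $F_{ϕ} = F_{ϕ'}$.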

For standard MAX-SAT solvers, one of the goals of resolution is to create new unit clauses, which are used to compute upper bounds on the MAX-SAT solution (Abramé and Habet 2014). For our SDP approach, assuming a fixed monomial basis, the sets $M_{ϕ}$ and $M_{ϕ'}$ (see (26)), depend only on the coefficients of $F_{ϕ}$ and $F_{ϕ'}$ . Because $F_{ϕ} = F_{ϕ'}$ , these coefficients are equal, and thus, $M_{ϕ} = M_{ϕ'}$ . Note that the feasible set of $P_{ϕ}$ is defined in terms of $M_{ϕ}$ . Hence, it follows that, if given the same basis, program $P_{ϕ}$ equals program $P_{ϕ'}$ . This equivalence of programs suggests that MAX-SAT resolution does not change our approach; however, we find that, in general, ${SOS}_{p} (ϕ) \neq {SOS}_{p} (ϕ')$ ; see (21). We investigate the effect of this difference.

Returning to the example of $ϕ_{k}$ in (28), let us define $C_{q} = \neg x_{q} \lor x_{q + 1}$ . Observe that, for $3 \leq q \leq k - 1, C_{q} \in ϕ_{k}$ (assuming k > 3). Let us fix some q, $3 \leq q \leq k - 3$ , and consider the clauses $C_{q}, C_{q + 1}, C_{q + 2} \in ϕ_{k}$ . We perform resolution as

\frac{C_{q} = [\neg x_{q} \lor x_{q + 1}], C_{q + 1} = [\neg x_{q + 1} \lor x_{q + 2}]}{[\neg x_{q} \lor x_{q + 2}], [\neg x_{q} \lor x_{q + 1} \lor \neg x_{q + 2}], [x_{q} \lor \neg x_{q + 1} \lor x_{q + 2}] .}

(30)

We perform resolution again on the third new clause obtained in (30) and $C_{q + 2}$ to obtain

\begin{array}{c} \frac{[x_{q} \lor \neg x_{q + 1} \lor x_{q + 2}], C_{q + 2} = [\neg x_{q + 2} \lor x_{q + 3}]}{[x_{q} \lor \neg x_{q + 1} \lor x_{q + 3}], [x_{q} \lor \neg x_{q + 1} \lor x_{q + 2} \lor \neg x_{q + 3}],} \\ [\neg x_{q} \lor \neg x_{q + 1} \lor \neg x_{q + 2} \lor x_{q + 3}], [x_{q + 1} \lor \neg x_{q + 2} \lor x_{q + 3}], \end{array}

(31)

and, here, the resolution rule states that one may replace the original clauses $C_{q}$, $C_{q + 1}$, and $C_{q + 2}$ with the six new resolvent clauses obtained from (30) and (31) (the third resolvent from (30) is not counted because it is replaced in the resolution in (31)).

Observe that the SOS_p basis generates six quadratic monomials for the new resolvent clauses, whereas originally, only three quadratic monomials are generated for C_q, $C_{q + 1}$ , and $C_{q + 2}$ . We now define $ϕ_{k}^{'}$ for $k \geq 6$ as the logical proposition, obtained by taking $ϕ_{k}$ and performing resolution as in (30) and (31) for each triple of clauses ${C_{q}, C_{q + 1}, C_{q + 2}}$ for each $q \in {3, 6, 9, \dots, k - 3}$ (let us assume here that k is a multiple of three). Note that proposition $ϕ_{k}^{'}$ constitutes a MAX-4-SAT instance, and therefore, basis SOS_p is applicable. Let us compare the sizes of the resulting SOS_p bases, denoted as $| {SOS}_{p} |$ . We have $| {SOS}_{p} (ϕ_{k}) | = 2 k < 3 k - 3 = | {SOS}_{p} (ϕ_{k}^{'}) | .$
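The two basis sizes can be recounted with a few lines of code. The sketch assumes, as an illustration, that the SOS_p basis consists of the constant monomial, all $x_{i}$, and one quadratic monomial per pair of variables sharing a clause; the six resolvent clauses of (30) and (31) are written out by hand for each triple.

```python
import itertools

def phi_k(k):
    """Clauses of phi_k from (28); +i encodes x_i and -i its negation."""
    clauses = [[-1], [1, -2], [2, 3]]
    clauses += [[-j, j + 1] for j in range(3, k)]
    clauses.append([-k])
    return clauses

def phi_k_prime(k):
    """phi_k with each triple C_q, C_{q+1}, C_{q+2}, q = 3, 6, ..., k-3, resolved."""
    assert k >= 6 and k % 3 == 0
    clauses = [[-1], [1, -2], [2, 3]]
    for q in range(3, k - 2, 3):
        a, b, c, d = q, q + 1, q + 2, q + 3
        clauses += [[-a, c], [-a, b, -c],                 # kept from (30)
                    [a, -b, d], [a, -b, c, -d],           # from (31)
                    [-a, -b, -c, d], [b, -c, d]]
    clauses.append([-k])
    return clauses

def sos_p_size(clauses, n):
    """Size of the assumed SOS_p basis: 1 + n + number of co-occurring pairs."""
    pairs = set()
    for clause in clauses:
        pairs.update(itertools.combinations(sorted(abs(l) for l in clause), 2))
    return 1 + n + len(pairs)

sizes = {k: (sos_p_size(phi_k(k), k), sos_p_size(phi_k_prime(k), k))
         for k in (6, 9, 12)}
```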

Thus, compared with ${SOS}_{p} (ϕ_{k})$ , basis ${SOS}_{p} (ϕ_{k}^{'})$ adds approximately k monomials. None of these monomials strengthens the bound because ${SOS}_{p} (ϕ_{k})$ is already sufficient for proving optimality by Lemma 3. It is clear that having a larger basis without offering a stronger bound is inefficient because solving $P_{ϕ}$ requires more time for larger matrices.

The example of $ϕ_{k}^{'}$ and $ϕ_{k}$ shows that not all monomials are (equally) useful in determining bounds. It also shows that resolution can decrease the effectiveness of the SOS approach to MAX-SAT by providing “bad” monomial bases or it can occur that the SOS_p basis misses “good” monomials. Our proposed basis $S O S_{p}^{Q}$ (see (22)) attempts to solve this issue.

6. Relating Sum of Squares and Method of Moments

In this section, we show how the SOS-SDP relaxation of van Maaren and van Norden (2005) and van Maaren et al. (2008) and moment relaxations of Anjos (2004a, b; 2005; 2006; 2007) are related. The relaxations of Anjos as described in Section 3 are first introduced in Anjos (2004a) and can be considered as extensions of the GAP relaxation via the well-known Lasserre (2001) hierarchy. van Maaren and van Norden (2005) propose the SOS approach to (MAX-)SAT. Subsequently, van Maaren and van Norden (2005) show that the SOS relaxation, using monomial basis SOS_pt that is larger than SOS_p (see (21)) outperforms the R₃ relaxation of Anjos (2005) in deciding on the satisfiability of 3-SAT instances. The R₃ relaxation is known to dominate the R₂ relaxation; see (12). Anjos (2007) strengthened his R₃ relaxation further and left it as future work to determine which SDP relaxation was the strongest.

We complete that work here by showing a simple relation between the two approaches. In particular, Anjos’ relaxations can be considered as method of moments in the Lasserre (2001) hierarchy. It is well-known that method of moments is dual to SOS optimization (see Lasserre 2001), and we work out the details here. Let us first derive the dual of the SOS program $P_{ϕ}$ (see Theorem 2) and then relate it to the here proposed strengthened version of Anjos’ relaxations.

To this end, we require the following intermediate result on $v_{ϕ}$ ; see (7).

Lemma 4.

Let $ϕ = ⋀_{j = 1}^{m} C_{j}$ be a logical proposition, $v_{ϕ} = \sum_{α \subseteq [n]} v_{ϕ}^{α} x^{α}$ (see (7)), and $F_{ϕ} = \sum_{α \subseteq [n]} p_{ϕ}^{α} x^{α}$ ; see (18). Then, $v_{ϕ} = m - F_{ϕ}$ and $v_{ϕ}^{α} = - p_{ϕ}^{α}$ for all nonempty $α \subseteq [n]$ .

Proof.

See Online Appendix A. □

Let x be a given monomial basis, $S \in S^{| x |}$ . Matrix S is indexed by all $α \subseteq [n]$ for which $x^{α} \in x$ . To simplify the comparison between the SOS approach and the relaxations of Anjos (2004a), we define the set

X_{ϕ} ≔ \{S \in S \mid diag (S) = 1, \ S_{α, β} = S_{α', β'} \ \forall (α, β, α', β') \subseteq [n] \ \text{s.t.} \ α △ β = α' △ β'\},

(32)

for a proposition $ϕ$ on n variables and m clauses. Note that $X_{ϕ} \subseteq \cap_{j \in [m]} Δ_{j}$ (see (11)) because Δ_j only restricts entries $S_{α, β}$ whenever α and β are jointly contained in a single clause. We use $X_{ϕ}$ in the following theorem.

Theorem 2.

Let $ϕ$ be a logical proposition and x a monomial basis. The SOS program $P_{ϕ}$ defined by $ϕ$ and x is equivalent to

\max \ 〈 C, S 〉 \quad \text{s.t.} \quad S \in X_{ϕ} \cap S_{+},

(33)

where $X_{ϕ}$ is given by (32) and $C \in S^{| x |}$ , indexed by the subsets $α \subseteq [n]$ for which $x^{α} \in x$ , is any matrix that satisfies $\sum_{(α, β) \in x^{γ}} C_{α, β} = v_{ϕ}^{γ}, \forall γ \neq \emptyset, x^{γ} \neq \emptyset$ for $v_{ϕ}^{γ}$ as in (7). Moreover, (33) is strictly feasible.

Proof.

We rewrite program $P_{ϕ}$ by splitting the matrix variable M as follows:

v ≔ \min \ 〈 I, M 〉 \quad \text{s.t.} \quad M = Z, \ M \in M_{ϕ}, \ Z \in S_{+},

(34)

where $M_{ϕ}$ is as in (26). We dualize the constraint M = Z, and set

g (S) ≔ \min_{M \in M_{ϕ}, Z ⪰ 0} 〈 I, M 〉 + 〈 S, M - Z 〉,

for some $S \in S$ . Clearly, $g (S) \leq v$ for all S, and we, thus, look to maximize g(S); that is,

\begin{array}{l} max_{S} g (S) = max_{S} [min_{M \in M_{ϕ}} 〈 I + S, M 〉 + min_{Z ⪰ 0} 〈 S, - Z 〉] = max_{S ⪯ 0} min_{M \in M_{ϕ}} 〈 I + S, M 〉 \\ = max_{S ⪰ 0} min_{M \in M_{ϕ}} 〈 I - S, M 〉 . \end{array}

(35)

We now determine the set $X_{ϕ}$ such that, whenever $S \in X_{ϕ}$ , the minimization over $M \in M_{ϕ}$ in (35) is bounded. Observe that $M_{ϕ}$ places no restrictions on the diagonal. To guarantee a bounded minimum, set $X_{ϕ}$ should restrict $diag (I - S) = 0$ . Each off-diagonal element of a matrix in $M_{ϕ}$ is restricted by a single constraint of the form (25). Therefore, solving (35) for M can be done by considering separately the elements of M restricted by a single constraint. That is,

\min_{M \in S} \sum_{γ \in X} \sum_{(α, β) \in x^{γ}} - S_{α, β} M_{α, β} \quad \text{s.t.} \quad \sum_{(α, β) \in x^{γ}} M_{α, β} = p_{ϕ}^{γ},

(36)

where $x^{γ}$ and X are defined in (24) and (20), respectively. This minimization problem is bounded if and only if

S_{α, β} = S_{α', β'}, \forall (α, β), (α', β') \in x^{γ} \forall x^{γ} \in X,

(37)

or equivalently, $S_{α, β} = S_{α', β'}$ for all possible index pairs $(α, β)$ and $(α', β')$ that satisfy $α △ β = α' △ β'$ . It follows that $X_{ϕ}$ is given by (32). Now, for fixed $S \in X_{ϕ} \cap S_{+}$ , any matrix $M \in M_{ϕ}$ obtains the same value in (35). Note also that, w.l.o.g., we may fix $M = P_{M_{ϕ}} (0)$ , that is, the projection of the zero matrix onto $M_{ϕ}$ (see Lemma 5), which has zero diagonal. This yields the equivalent program of the form (33) for $C = - M = - P_{M_{ϕ}} (0)$ . Written explicitly, $C_{α, β} = - p_{ϕ}^{γ} / | x^{γ} |$ for all $α, β \subseteq [n]$ such that $α △ β = γ$ (i.e., $(α, β) \in x^{γ}$ ). This, combined with Lemma 4, proves the claim on matrix C. Finally, observe that the identity matrix of appropriate size is strictly feasible for (33). □

We define, for $S \in X_{ϕ}$ and each clause C_j, the function $v^{SDP} (S, C_{j})$ , which is obtained by taking (6) and replacing each $x^{γ}$ by $S_{α, β}$ for some $(α, β) \in x^{γ}$ . By (37), we are allowed to pick any such $(α, β)$ . By Lemma 4, for any nonempty $γ \subseteq [n], S \in X_{ϕ}$ and C as in Theorem 2, we have

\sum_{(α, β) \in x^{γ}} C_{α, β} S_{α, β} = \sum_{(α, β) \in x^{γ}} \frac{- p_{ϕ}^{γ}}{| x^{γ} |} S_{α, β} = - p_{ϕ}^{γ} S_{α, β} = v_{ϕ}^{γ} S_{α, β} .

(38)

Hence, maximizing $〈 C, S 〉$ is equivalent to maximizing the semidefinite relaxation of $v_{ϕ}$ (see (7)), which equals $\sum_{j \in [m]} v^{SDP} (S, C_{j})$ .

Moreover, in the relaxations of Anjos (2004a) outlined in Section 3, the matrix variable is restricted to satisfy $v^{SDP} (S, C_{j}) = 1$ . Now, we can easily observe the difference between the SOS-SDP relaxations and those proposed by Anjos. We present the equivalent dual formulation of the SOS approach on the left-hand side and the latter (in slightly adapted form) on the right.

v^{*} = \max \ \sum_{j \in [m]} v^{SDP} (S, C_{j}) \quad \text{s.t.} \quad S \in X_{ϕ} \cap S_{+} .

(39)

\text{find} \ S \in X_{ϕ} \cap S_{+} \quad \text{s.t.} \quad v^{SDP} (S, C_{j}) = 1, \ \forall C_{j} .

(40)

Note again the difference between (40) and the relaxations described in Section 3, resulting from using set $X_{ϕ}$ instead of the intersection of the Δ_j; see (11). Thus, we compare the SOS approach with a strengthened variant of the relaxation proposed by Anjos. In Section 8.3, we determine the dual of (40).

Program (39) proves the unsatisfiability of $ϕ$ if $v^{*} < m$ (with some margin of error because of numerical precision), whereas (40) does so whenever the program is infeasible. These programs are not equivalent in this sense: we empirically find instances $ϕ$ for which $v^{*} \geq m$ , whereas (40) is infeasible. Neither program can directly prove satisfiability. However, solutions to both programs can be used to guide the search toward satisfying assignments (should they exist); see Online Appendix C.

If (40) admits a feasible matrix $S^{*}$ , then matrix $S^{*}$ is clearly also feasible for (39) and attains an objective value of m. Consequently, in this case, we have $v^{*} \geq m$ . Thus, if (40) does not prove unsatisfiability of $ϕ$ , then neither does (39). In Online Appendix F, we show that (39) can be computed efficiently by applying the PRSM to its dual.² It is currently unclear whether a good algorithm for solving (40) exists, and if so, how efficient it would be. Previous numerical experiments on (40) use general purpose SDP solvers. An immediate improvement might be to use an SDP feasibility problem solver (see Henrion and Malick 2011, Drusvyatskiy et al. 2017).

Finally, the objective value of (39) is more useful for MAX-SAT: if the underlying instance is unsatisfiable, $v^{*}$ provides an upper bound on the number of satisfiable clauses, which is useful in a B&B scheme. Program (40) might also show unsatisfiability of the same instance, but its infeasibility offers no additional information as to how unsatisfiable the instance is.

7. The Peaceman–Rachford Splitting Method for MAX-SAT

In this section, we introduce the Peaceman–Rachford splitting method (Peaceman and Rachford 1955) for solving SOS-SDP problems and apply it to the MAX-SAT SOS program $P_{ϕ}$ . Conventionally, interior point methods are used to solve SDP problems. However, for medium- and large-size instances, interior point methods suffer from a large computation time and memory demand, which has recently motivated researchers to consider first order methods, such as the PRSM. For recent applications of PRSM to SDP, see, for example, de Meijer and Sotirov (2021) and Graham et al. (2022).

Section 7.2 and Online Appendix C provide details on obtaining valid upper and lower bounds from the output of the PRSM algorithm.

7.1. The PRSM for SOS Relaxations of MAX-SAT

We start from the reformulation of $P_{ϕ}$ given in (34). The augmented Lagrangian function of (34) with respect to (w.r.t.) the constraint M = Z for a penalty parameter $β > 0$ is

L_{β} (Z, M, S) = 〈 I, M 〉 + 〈 S, M - Z 〉 + \frac{β}{2} {‖ M - Z ‖}_{F}^{2} .

(41)

Here, $S \in S^{| x |}$ is the Lagrange multiplier, which has the same order as M and Z, and ${‖ \cdot ‖}_{F}$ denotes the Frobenius matrix norm; see Section 1.1.

The PRSM now entails iteratively optimizing over the variables Z and M separately and updating S twice per cycle. We write superscript k to denote the value of the variable at iteration k:

\begin{cases} Z^{k + 1} = \underset{Z ⪰ 0}{\arg\min} \ L_{β} (Z, M^{k}, S^{k}) = P_{S_{+}} (M^{k} + \frac{1}{β} S^{k}), \\ S^{k + 1 / 2} = S^{k} + γ_{1} β (M^{k} - Z^{k + 1}), \\ M^{k + 1} = \underset{M \in M_{ϕ}}{\arg\min} \ L_{β} (Z^{k + 1}, M, S^{k + 1 / 2}) = P_{M_{ϕ}} (Z^{k + 1} - \frac{1}{β} [I + S^{k + 1 / 2}]), \\ S^{k + 1} = S^{k + 1 / 2} + γ_{2} β (M^{k + 1} - Z^{k + 1}) . \end{cases}

(42)

Here, $M_{ϕ}$ is as in (26), and $P$ is the projection operator as in (4). We use that

\begin{aligned} \underset{Z ⪰ 0}{\arg\min} \ L_{β} (Z, M, S) & = \underset{Z ⪰ 0}{\arg\min} \ 〈 I, M 〉 - \frac{1}{2 β} {‖ S ‖}_{F}^{2} + \frac{β}{2} {‖ Z - (M + \frac{1}{β} S) ‖}_{F}^{2} \\ & = \underset{Z ⪰ 0}{\arg\min} \ \frac{β}{2} {‖ Z - (M + \frac{1}{β} S) ‖}_{F}^{2}, \end{aligned}

(43)

see, for example, Oliveira et al. (2018). In an implementation of (42), one should not store matrix S^k directly but, rather, the matrix $\frac{1}{β} S^{k}$ ; see Online Appendix G. When $X \in S$ has eigenvalues λ_i and corresponding eigenvectors v_i, it is well-known that the projection onto the positive semidefinite cone is given by

P_{S_{+}} (X) = \sum_{{i | λ_{i} > 0}} λ_{i} v_{i} v_{i}^{⊤} = X - \sum_{{i | λ_{i} < 0}} λ_{i} v_{i} v_{i}^{⊤} .

(44)

Depending on the number of positive eigenvalues of X, one of these expressions is cheaper to compute. The next lemma shows how to compute a projection onto $M_{ϕ}$ .
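Both expressions in (44) can be sketched with NumPy as follows. This is a minimal illustration, not the authors' code: it computes a full eigendecomposition with `numpy.linalg.eigh`, so the choice between the two branches only mirrors the formula; an actual saving would require a partial eigensolver that computes only the positive or only the negative eigenpairs.

```python
import numpy as np

def project_psd(X):
    """Projection of a symmetric matrix onto the PSD cone, following (44)."""
    lam, V = np.linalg.eigh(X)
    pos, neg = lam > 0, lam < 0
    if pos.sum() <= neg.sum():
        # Few positive eigenvalues: sum lambda_i v_i v_i^T over lambda_i > 0.
        return (V[:, pos] * lam[pos]) @ V[:, pos].T
    # Few negative eigenvalues: subtract their contribution from X.
    return X - (V[:, neg] * lam[neg]) @ V[:, neg].T

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
X = (A + A.T) / 2            # random symmetric test matrix
P = project_psd(X)
```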

Lemma 5.

Let $M, \hat{M} \in S$ be matrices indexed by subsets of $[n]$ such that $\hat{M} = P_{M_{ϕ}} (M)$ , where the projection operator $P_{M_{ϕ}} (\cdot)$ is given by (4). Then, $diag (\hat{M}) = diag (M)$ and

{\hat{M}}_{δ, μ} = M_{δ, μ} - \frac{1}{| x^{γ} |} (\sum_{(α, β) \in x^{γ}} M_{α, β} - p_{ϕ}^{γ}),

(45)

for $(δ, μ) \in x^{γ}, γ \neq \emptyset$ . In particular, when $| x^{γ} | = 2$ , (45) reduces to ${\hat{M}}_{δ, μ} = {\hat{M}}_{μ, δ} = p_{ϕ}^{γ} / 2$ .

Proof.

See Online Appendix A. □
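The projection of Lemma 5 amounts to spreading the residual of each constraint evenly over the entries it involves. The sketch below illustrates this on a hypothetical five-element basis (1, $x_{1}$, $x_{2}$, $x_{3}$, $x_{1} x_{2}$) with made-up coefficients `p`, which are not derived from any clause set; the index sets $x^{γ}$ are built from symmetric differences.

```python
import itertools
import numpy as np

# Hypothetical basis: 1, x1, x2, x3, x1*x2, stored as index sets.
basis = [frozenset(s) for s in ([], [1], [2], [3], [1, 2])]

# x^gamma: all index pairs (a, b) whose symmetric difference is gamma (nonempty).
groups = {}
for a, b in itertools.product(range(len(basis)), repeat=2):
    gamma = basis[a] ^ basis[b]
    if gamma:
        groups.setdefault(gamma, []).append((a, b))

def project_affine(M, groups, p):
    """Projection (45): enforce sum_{(a,b) in x^gamma} M[a,b] = p^gamma for every gamma."""
    M_hat = M.copy()
    for gamma, pairs in groups.items():
        rows, cols = (list(t) for t in zip(*pairs))
        shift = (M[rows, cols].sum() - p.get(gamma, 0.0)) / len(pairs)
        M_hat[rows, cols] -= shift      # diagonal entries are never touched
    return M_hat

# Made-up coefficients p^gamma, for illustration only; missing keys mean zero.
p = {frozenset([1]): -0.5, frozenset([3]): 0.5, frozenset([1, 2]): 0.25}

rng = np.random.default_rng(1)
A = rng.standard_normal((5, 5))
M = (A + A.T) / 2
M_hat = project_affine(M, groups, p)
```

For groups with exactly two index pairs, the projected entries equal $p_{ϕ}^{γ} / 2$ regardless of the input matrix, which is why unit constraints cost almost nothing in each iteration.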

Because of the presence of many unit constraints (see (27)), these projections are computationally cheap to compute, and hence, the PRSM is well-suited to exploit this. Finally, it is proven (He et al. 2016) that (42) converges for $(γ_{1}, γ_{2}) \in D$ , where

D = {(γ_{1}, γ_{2}) | γ_{1} + γ_{2} > 0, | γ_{1} | < 1, 0 < γ_{2} < \frac{1 + \sqrt{5}}{2}, | γ_{1} | < 1 + γ_{2} - γ_{2}^{2}} .

(46)

The values that we choose for $(γ_{1}, γ_{2})$ and other parameters are given in Online Appendix F.
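Membership in the region D of (46) is easy to test before running the method. The helper below is a hypothetical convenience function written for this illustration; the sample step sizes are arbitrary and are not the values used in the paper.

```python
import math

def in_convergence_region(g1, g2):
    """Check whether (g1, g2) lies in the set D defined in (46)."""
    golden = (1 + math.sqrt(5)) / 2
    return (g1 + g2 > 0
            and abs(g1) < 1
            and 0 < g2 < golden
            and abs(g1) < 1 + g2 - g2 ** 2)

checks = {
    (0.0, 1.0): in_convergence_region(0.0, 1.0),   # a single multiplier update per cycle
    (0.5, 1.0): in_convergence_region(0.5, 1.0),
    (1.0, 1.0): in_convergence_region(1.0, 1.0),   # violates |g1| < 1
    (0.9, 1.5): in_convergence_region(0.9, 1.5),   # violates |g1| < 1 + g2 - g2^2
}
```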

7.2. Upper Bounds, Lower Bounds, and Early Stopping

After each PRSM iterate k, we obtain a triple $(Z^{k}, M^{k}, S^{k})$ and the resulting $〈 I, M^{k} 〉 .$ Although this value converges to the optimal value of the SDP, the convergence is (typically) not monotonic, and therefore, this value does not necessarily provide a valid upper bound for the problem. In this section, we describe how to obtain a valid upper bound from the output of the PRSM.

Observe that the feasible set of $P_{ϕ}$ depends on the chosen monomial basis x through $V_{x}$ ; see (16). Hence, by (17), we have

p_{ϕ}^{\emptyset} - \min_{M \in M_{ϕ} \cap S_{+}} 〈 I, M 〉 = \sup_{λ \in R} \{λ \mid F_{ϕ} - λ \in V_{x}\} \leq \min_{x \in \{\pm 1\}^{n}} F_{ϕ},

(47)

for $p_{ϕ}^{\emptyset}$ as in (19). From this, it follows that the maximum number of satisfiable clauses of $ϕ$ is bounded from above by

m - p_{ϕ}^{\emptyset} + \min_{M \in M_{ϕ} \cap S_{+}} 〈 I, M 〉,

(48)

for m equal to the number of clauses in $ϕ$ . Because the number of satisfied clauses is an integer, the bound (48) can be improved by rounding down the result.

Ideally, the PRSM algorithm (42) computes the upper bound (48) by finding the optimal M in the set $M_{ϕ} \cap S_{+}$ (up to some given numerical precision). However, in practice, one terminates the PRSM algorithm before this optimal M is found. Let matrix M^k then be defined as in (42) and let $λ_{min} (M^{k})$ be its smallest eigenvalue. Note that

{\tilde{M}}^{k} = M^{k} - λ_{min} (M^{k}) I \in M_{ϕ} \cap S_{+},

(49)

and so ${\tilde{M}}^{k}$ is feasible for $P_{ϕ}$ . Thus, a valid upper bound at iteration k is obtained as follows:

⌊ m - p_{ϕ}^{\emptyset} + 〈 I, {\tilde{M}}^{k} 〉 ⌋ .

(50)
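To make the interplay of (42), (45), (49), and (50) concrete, the following sketch runs the PRSM on the four-clause instance $ϕ_{3}$ of (28) and then applies the eigenvalue shift to obtain a valid bound. It is a toy illustration under several assumptions: the basis is taken to be 1, all $x_{i}$, and $x_{i} x_{j}$ for variables sharing a clause; the parameters β = 1, $γ_{1} = 0.5$, $γ_{2} = 1$ and the iteration count are arbitrary choices inside the region (46), not the values of the paper; and a small tolerance is added before rounding down to guard against floating-point error.

```python
import itertools
import math
import numpy as np

clauses = [[-1], [1, -2], [2, 3], [-3]]   # phi_3; +i is x_i, -i its negation
n, m = 3, len(clauses)

# Coefficients p^gamma of F_phi = sum_j 2^{-l_j} prod_i (1 - a_{j,i} x_i).
p = {}
for clause in clauses:
    for r in range(len(clause) + 1):
        for lits in itertools.combinations(clause, r):
            gamma = frozenset(abs(l) for l in lits)
            sign = (-1) ** sum(l > 0 for l in lits)
            p[gamma] = p.get(gamma, 0.0) + sign / 2 ** len(clause)

# Assumed basis: 1, x_i, and x_i x_j for variables occurring in a common clause.
basis = [frozenset()] + [frozenset([i]) for i in range(1, n + 1)]
basis += sorted({frozenset(map(abs, pair)) for c in clauses
                 for pair in itertools.combinations(c, 2)}, key=sorted)
N = len(basis)

# Index sets x^gamma, stored as (row list, column list) for nonempty gamma.
groups = {}
for a, b in itertools.product(range(N), repeat=2):
    gamma = basis[a] ^ basis[b]
    if gamma:
        rows, cols = groups.setdefault(gamma, ([], []))
        rows.append(a)
        cols.append(b)

def project_affine(M):
    """Projection onto M_phi as in (45)."""
    M = M.copy()
    for gamma, (rows, cols) in groups.items():
        M[rows, cols] -= (M[rows, cols].sum() - p.get(gamma, 0.0)) / len(rows)
    return M

def project_psd(X):
    """Projection onto the PSD cone as in (44)."""
    lam, V = np.linalg.eigh((X + X.T) / 2)
    return (V * np.maximum(lam, 0.0)) @ V.T

def prsm(iters=2000, beta=1.0, g1=0.5, g2=1.0):
    """Iteration (42); returns the last M-iterate, which lies in M_phi."""
    I = np.eye(N)
    M, S = project_affine(np.zeros((N, N))), np.zeros((N, N))
    for _ in range(iters):
        Z = project_psd(M + S / beta)
        S = S + g1 * beta * (M - Z)
        M = project_affine(Z - (I + S) / beta)
        S = S + g2 * beta * (M - Z)
    return M

def safe_upper_bound(M):
    """Shift M as in (49) and evaluate the rounded bound (50)."""
    M_tilde = M - np.linalg.eigvalsh(M).min() * np.eye(N)
    return math.floor(m - p[frozenset()] + np.trace(M_tilde) + 1e-9)

upper_bound = safe_upper_bound(prsm())
```

Because the shifted matrix is feasible for $P_{ϕ}$ at every iteration, the returned value is a valid upper bound on the number of satisfiable clauses even if the iteration is stopped early; for $ϕ_{3}$ at most three of the four clauses can be satisfied.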

In Online Appendix C, we outline a method for determining lower bounds when using the PRSM.

8. Weighted Partial MAX-SAT

In this section, we extend the SOS approach from MAX-SAT to weighted partial MAX-SAT. We also show that the dual formulation of the SOS program for certain partial MAX-SATs equals the relaxations by Anjos (2005).

In weighted MAX-SAT, each clause is given a weight, and the objective is to maximize the sum of the weights of the satisfied clauses. In partial MAX-SAT, clauses are divided into soft and hard clauses. The aim is to maximize the number of satisfied soft clauses while satisfying all the hard clauses. Combining both variants yields weighted partial MAX-SAT (Li and Manya 2021).

Consider again a logical proposition $ϕ$ . Let $w_{j} \in R$ be the weight associated to clause C_j. The generalization of (13) for (unweighted) MAX-SAT to weighted MAX-SAT follows by setting

F_{ϕ}^{W} (x) = \sum_{j = 1}^{m} \frac{w_{j}}{2^{ℓ_{j}}} \prod_{i \in C_{j}} (1 - a_{j, i} x_{i}),

(51)

and then minimizing $F_{ϕ}^{W}$ for $x \in {\pm 1}^{n}$ . This minimization can be approximated by SOS optimization, using directly the semidefinite program $P_{ϕ}$ .
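The weighted objective (51) equals the total weight of the falsified clauses, which the following sketch verifies by enumeration on a small made-up instance. The clauses and weights are hypothetical and chosen only for illustration; clauses are lists of signed integers, and an assignment maps each variable index to +1 (true) or -1 (false).

```python
import itertools

def F_weighted(clauses, weights, x):
    """Evaluate (51): sum_j w_j 2^{-l_j} prod_{i in C_j} (1 - a_{j,i} x_i)."""
    total = 0.0
    for clause, w in zip(clauses, weights):
        prod = 1.0
        for lit in clause:
            a = 1 if lit > 0 else -1
            prod *= 1 - a * x[abs(lit)]
        total += w * prod / 2 ** len(clause)
    return total

def falsified_weight(clauses, weights, x):
    """Total weight of the clauses in which every literal is false under x."""
    return sum(w for clause, w in zip(clauses, weights)
               if all(x[abs(l)] != (1 if l > 0 else -1) for l in clause))

# Hypothetical instance: (x1 or x2), (not x1), (not x2 or x3), (not x3).
clauses = [[1, 2], [-1], [-2, 3], [-3]]
weights = [3.0, 1.0, 2.0, 1.5]

assignments = [dict(zip((1, 2, 3), signs))
               for signs in itertools.product((-1, 1), repeat=3)]
best = min(F_weighted(clauses, weights, x) for x in assignments)
```

Here the minimum is attained by setting $x_{1}$ to true and the other variables to false, which falsifies only the clause of weight one.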

For weighted partial MAX-SAT, consider a logical proposition $ϕ$ on n variables, m soft clauses C_j, and q hard clauses $C_{p}^{H}$ . To each hard clause $C_{p}^{H}, p \in [q]$ , we associate the polynomial $f_{p} = \prod_{i \in [n]} (1 - a_{p, i} x_{i})$ , similar to (51). Note that f_p vanishes for all truth assignments that satisfy clause $C_{p}^{H}$ . Similar to (14) and (16), we define the sets

\begin{array}{l} H_{x} ≔ {\sum_{p \in [q]} c_{p} f_{p} \mod I | c_{p} \in R \forall p \in [q]} \subseteq H ≔ {\sum_{p \in [q]} g_{p} f_{p} \mod I | g_{p} \in R [x] \forall p \in [q]} . \end{array}

(52)

Let $S A T \subseteq {\pm 1}^{n}$ be the set of all truth assignments satisfying the hard clauses (which we assume to be nonempty). From Putinar (1993), it follows that

min_{x \in S A T} F_{ϕ}^{W} (x) = \sup {λ | F_{ϕ}^{W} - λ \in V + H} \geq \sup {λ | F_{ϕ}^{W} - λ \in V_{x} + H_{x}},

(53)

where “+” denotes the Minkowski sum of sets. We proceed by writing the lower bound in (53) as an explicit SDP for which we introduce the following sets:

H^{γ} ≔ \{p \in [q] \mid γ \subseteq C_{p}^{H}\}

for $γ \subseteq [n]$ , where a hard clause is identified with the set of variables it contains. Set $H^{γ}$ contains all p for which f_p, when expanded, contains the term $\pm x^{γ}$ . The sign here is determined by the parity of $| γ \cap I_{p}^{+} |$ ; see (2). Additionally, we define as analogue to $M_{ϕ}$ (see (26)), the set $M_{ϕ}^{H}$ . This set contains all matrices M and vectors c such that $F_{ϕ}^{W} - λ \equiv x^{⊤} M x + \sum_{p \in [q]} c_{p} f_{p} \mod I$ . It is, therefore, defined as

M_{ϕ}^{H} ≔ \{(M, c) \in S \times R^{q} \mid \sum_{(α, β) \in x^{γ}} M_{α, β} + \sum_{p \in H^{γ}} {(- 1)}^{| γ \cap I_{p}^{+} |} c_{p} = p_{ϕ}^{γ}, \ \forall γ \neq \emptyset \ \text{s.t.} \ x^{γ} \in X\} .

(54)

This allows us to adapt $P_{ϕ}$ to weighted partial MAX-SAT as follows:

\min \ 〈 I, M 〉 + \sum_{p \in [q]} c_{p} \quad \text{s.t.} \quad (M, c) \in M_{ϕ}^{H}, \ M \in S_{+} .

(55)

We approximately solve (55) by the PRSM; see Section 8.1. Let us elaborate on how to adapt the monomial bases to (weighted) partial MAX-SAT. We make no distinction between soft and hard clauses for the SOS_p basis; see (21). For basis $S O S_{s}^{θ}$ (see (23)), we determine the variable weights as $w (i) ≔ \sum_{{j | i \in C_{j}}} w_{j} + \sum_{{p | i \in C_{p}^{H}}} \bar{w}$ for $\bar{w}$ the mean of all soft clause weights w_j. For basis $S O S_{p}^{Q}$ , we add all $(\begin{matrix} Q \\ 2 \end{matrix})$ quadratic terms of the Q variables that attain the highest value of w(i). For unweighted partial MAX-SAT instances, we consider all w_j to equal one.

8.1. The PRSM for SOS Relaxations of Weighted Partial MAX-SAT

We show here how to solve (55) by the PRSM. We first rewrite (55) by introducing the matrix variable Z (see also (34)):

\min \ 〈 I, M 〉 + \sum_{p \in [q]} c_{p} \quad \text{s.t.} \quad M = Z, \ (M, c) \in M_{ϕ}^{H}, \ Z \in S_{+} .

(56)

Then, the augmented Lagrangian function of (56) w.r.t. Z = M and for a penalty parameter $β > 0$ is $L_{β} (Z, M, S, c) ≔ L_{β} (Z, M, S) + 1^{⊤} c$ ; see (41). The PRSM is iteratively and separately optimizing over $(M, c) \in M_{ϕ}^{H}$ and $Z \in S_{+}$ and updating S twice per cycle, similarly to (42). However, in this case, the M-subproblem from (42) is replaced by the (M, c)-subproblem.

We now show that minimization over $(M, c) \in M_{ϕ}^{H}$ can be performed efficiently. By derivations similar to (43), we have

\underset{(M, c) \in M_{ϕ}^{H}}{arg min} L_{β} (Z, M, S, c) = \underset{(M, c) \in M_{ϕ}^{H}}{arg min} {‖ M - X ‖}_{F}^{2} + \frac{2}{β} 1^{⊤} c,

(57)

where $X ≔ Z - (S + I) / β$ . This is a convex quadratic program (QP) that we solve in two steps. First, consider the matrix entries $M_{α, β}$ with $(α, β) \in x^{γ}$ (see (24)) and $H^{γ} = \emptyset$ . Because $H^{γ} = \emptyset$ , these entries are unaffected by the c_p variables. This implies that these $M_{α, β}$ variables are not coupled with the other entries of M, and one can minimize separately over such $M_{α, β}$ . This separate minimization problem can be solved by applying Lemma 5. Second, the remaining QP

\min \ \sum_{\{γ \mid H^{γ} \neq \emptyset\}} \sum_{(α, β) \in x^{γ}} {(M_{α, β} - X_{α, β})}^{2} + \frac{2}{β} 1^{⊤} c,

(58)

can be simplified by the following observation. If $M^{*}$ is an optimal solution to (57), then $(α, β), (α', β') \in x^{γ} \Rightarrow M_{α, β}^{*} - X_{α, β} = M_{α', β'}^{*} - X_{α', β'}$ . Hence, (58) can be simplified by substituting each term $\sum_{(α, β) \in x^{γ}} {(M_{α, β} - X_{α, β})}^{2}$ with a single squared variable. We solve the resulting QP either by solving the Karush–Kuhn–Tucker conditions using the lower-upper decomposition or via MOSEK ApS (2023). The solving method depends on the underlying QP.

8.2. Strengthening the Bounds

We demonstrate a simple technique for improving the upper bounds given by Program (55). This technique is based on the SAT resolution rule, which is given as follows. For two hard clauses of some proposition $ϕ$ , on literals x, z_i, $i \in [s]$ and y_i, $i \in [t]$ , construct the clause below the horizontal line:

\frac{[x \lor z_{1} \lor \dots \lor z_{s}], [\neg x \lor y_{1} \lor \dots \lor y_{t}]}{z_{1} \lor \dots \lor z_{s} \lor y_{1} \lor \dots \lor y_{t} .}

(59)

In contrast to the MAX-SAT resolution rule (29), the SAT resolution rule states that one may add the clause below the horizontal line to $ϕ$ without changing its (un)satisfiability (we say that the new clause is implied by the original two clauses). We may apply this SAT resolution rule to the hard clauses of a partial MAX-SAT instance to generate more hard clauses. As each new clause induces a new variable c_p, the bound of Program (55) can only improve. One may also regard SAT resolution as extending the set $H_{x}$ (see (52)), by including terms of the form $c_{p} x^{α} f_{p}$ for some $α \subseteq [n]$ , where $c_{p} \in R$ .
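The SAT resolution rule (59) is a one-line set operation. The sketch below, with hypothetical hard clauses encoded as sets of signed integers, forms the resolvent and allows one to confirm by enumeration that every assignment satisfying both parent clauses also satisfies the resolvent; in the example the resolvent is a hard unit clause.

```python
import itertools

def sat_resolve(c1, c2, x):
    """Resolvent of c1 (containing literal x) and c2 (containing -x), as in (59)."""
    if x not in c1 or -x not in c2:
        raise ValueError("c1 must contain x and c2 must contain -x")
    return (c1 - {x}) | (c2 - {-x})

def satisfies(clause, assignment):
    """True if some literal of the clause is true; assignment maps index to bool."""
    return any(assignment[abs(l)] == (l > 0) for l in clause)

# Hypothetical hard clauses (x1 or x2) and (not x1 or x2); resolving on x1 gives (x2).
h1, h2 = frozenset({1, 2}), frozenset({-1, 2})
resolvent = sat_resolve(h1, h2, 1)

implied = all(
    satisfies(resolvent, a)
    for bits in itertools.product([False, True], repeat=2)
    for a in [dict(zip((1, 2), bits))]
    if satisfies(h1, a) and satisfies(h2, a)
)
```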

Additionally, SAT resolution can generate hard unit clauses. This is advantageous because hard unit clauses reduce the number of variables in MAX-SAT; see Section 1.2.

8.3. Duality in Partial MAX-SAT

Now, we consider a partial MAX-SAT with only hard clauses. Solving such instances is, thus, equivalent to determining the satisfiability of the given hard clauses. We show that, by taking the dual of the resulting SOS program, one obtains (a stronger version of) the relaxations of Anjos (2004a) given by (40).

We define, for $A \in S^{n}, vec (A) \in R^{n^{2}}$ the vector whose entries are the columns of A stacked together. We start from Program (55) and perform variable splitting on M, similar to (34). We take the dual g(S) of this formulation, similar to (35), and consider the problem

max_{S} g (S) = max_{S \in X_{ϕ} \cap S_{+}} min_{(M, c) \in M_{ϕ}^{H}} 〈 S, - M 〉 + 1^{⊤} c,

(60)

for $X_{ϕ}$ as in (32). The steps that show $S \in X_{ϕ} \cap S_{+}$ is necessary for the expression to be finite are provided in the proof of Theorem 2. We rewrite the inner minimization problem in (60) as

min_{_{(M, c) \in M_{ϕ}^{H}}} {[\begin{matrix} vec (S) \\ 1 \end{matrix}]}^{⊤} [\begin{matrix} vec (- M) \\ c \end{matrix}],

(61)

and proceed to show under which conditions this value is bounded. Observe that the coefficients $p_{ϕ}^{γ} = 0$ (see (54)) because there are no soft clauses. Moreover, the set $M_{ϕ}^{H}$ places only linear constraints on the entries of M and c. Therefore, there exists a matrix D that satisfies

(M, c) \in M_{ϕ}^{H} \Leftrightarrow D [\begin{matrix} vec (- M) \\ c \end{matrix}] = 0 .

(62)

Hence, (61) is bounded if and only if $[\begin{matrix} vec {(S)}^{⊤} 1^{⊤} \end{matrix}]$ is contained in the row space of D. This is precisely the requirement that $v^{SDP} (S, C_{p}^{H}) = 1, \forall p \in [q]$ as in (40). We provide one example of this claim in Online Appendix D.

Thus, (61) is bounded if $[\begin{matrix} vec {(S)}^{⊤} 1^{⊤} \end{matrix}] \in row (D)$ , in which case, the value equals zero. Hence, Program (60) is equivalent to (40).

9. Conclusions and Future Work

In this paper, we consider SOS optimization for solving MAX-SAT and weighted partial MAX-SAT. We design an SOS-SDP–based exact MAX-SAT solver called SOS-MS. Our solver is competitive with the best known solvers on solving various (weighted partial) MAX-SAT instances. We are also the first to compute SDP bounds for weighted partial MAX-SAT.

In Section 3, we propose a family of semidefinite feasibility problems $R_{F} (ϕ)$ and show that one member of this family provides the rank-two guarantee; see Theorem 1. That is, the existence of a feasible rank-two matrix implies satisfiability of the corresponding SAT instance. In Section 4, we outline the SOS approach to MAX-SAT from van Maaren et al. (2008) and propose new bases. We introduce the $S O S_{s}^{θ}$ and $S O S_{p}^{Q}$ bases (see (23)) and provide several theoretical results related to these bases; see Lemmas 1 and 2. Clearly, the strength of the SOS-SDP–based relaxations and the required time to compute them depend on the chosen monomial basis. The SOS-SDP relaxation for MAX-SAT is denoted by $P_{ϕ}$ . We consider MAX-SAT resolution in Section 5 and show that resolution might not be beneficial for the SOS approach applied to MAX-SAT.

In Section 6, we elegantly show a connection between the SOS approach to MAX-SAT and the family of semidefinite feasibility problems $R_{F} (ϕ)$. This is done by deriving the dual problem to $P_{ϕ}$; see Theorem 2. In Section 7, we propose the PRSM for solving $P_{ϕ}$. We show that the PRSM is well-suited for exploiting the structure of $P_{ϕ}$, in particular, the unit constraints; see (27). We thus provide an affirmative answer to the key question posed by van Maaren et al. (2008), namely whether SDP software can be developed that deals with unit constraints efficiently.

We extend the SOS approach for MAX-SAT to weighted partial MAX-SAT in Section 8. Here, the variables are restricted to satisfy a set of hard clauses. We show that such hard clauses can be incorporated in the SOS program $P_{ϕ}$ by adding scalar variables. We show in Section 8.1 that the resulting Program (55) is also well-suited for the PRSM.

In Online Appendix E, we provide implementation details of our SOS-SDP–based MAX-SAT solver, whose pseudocode is given in Online Algorithm 1. SOS-MS is a B&B algorithm and has two crucial components. The first is the use of warm starts for program $P_{ϕ}$ in order to quickly obtain strong bounds. The second is its ability to quickly parse $P_{ϕ}$, as outlined in Online Appendix E.1. Our algorithm parses a basis that contains 4,439,449 monomials in approximately one second.

In Online Appendix F, we provide extensive numerical results that verify the efficiency of our exact solver SOS-MS and the quality of the SOS upper bounds. We show that SOS-MS can solve a variety of MAX-SAT instances in reasonable time, solving some instances faster than the best solvers in MSE-2016. We show that the $SOS_{p}^{Q}$ bases (22) are able to prove optimality of some MAX-SAT instances and that the parameter Q provides the option to adjust the trade-off between the quality of the bounds and computation time. We also test our B&B algorithm on (weighted) partial MAX-SAT instances in Online Appendices F.2 and F.4. Our solver is able to solve many (weighted) partial MAX-SAT instances in reasonable time.

This paper demonstrates the strong performance of SOS-MS on (weighted partial) MAX-SAT instances from the MSE random track. In the future, we hope to also use SOS-MS to solve instances from the so-called industrial and crafted tracks. These tracks currently pose two challenges for SOS-MS. First, these instances induce prohibitively large $SOS_{p}$ bases, which hinders the computation of strong bounds. To solve this, we require a more sophisticated method for choosing a smaller, manageable basis, such as $SOS_{s}^{θ}$. Second, these instances can possess clauses of length k, where $k \geq 4$. This is problematic in the current setting because $F_{ϕ}$ (see (13)) is then a kth degree polynomial, which requires a large basis to be represented. One possible way to overcome these challenges is through exploiting the structure present in these instances. For example, the function $F_{ϕ}$ might have few nonzero coefficients, which allows for finding SOS decompositions with small monomial bases, using methods proposed in Wang et al. (2021); see also Ahmadi et al. (2017).
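To illustrate why long clauses are costly, the sketch below expands the polynomial contributed by a single clause of length k = 4. It assumes the common ±1 encoding, in which each variable takes values in {−1, +1} and a clause contributes the product of (1 − x_i)/2 over its positive literals and (1 + x_i)/2 over its negated literals, so that the product equals 1 exactly when the clause is unsatisfied. The precise form of (13) may differ in sign or scaling, and the function names and the DIMACS-style clause representation are hypothetical choices made for this example.

```python
from fractions import Fraction
from itertools import combinations


def clause_polynomial(clause):
    """Expand the product over literals of (1 -/+ x_i)/2 into monomials.

    `clause` lists nonzero integers: 3 stands for x_3 and -3 for NOT x_3.
    Each variable is assumed to occur at most once in the clause.
    Returns a dict mapping a frozenset of variable indices (a multilinear
    monomial) to its rational coefficient.
    """
    k = len(clause)
    poly = {}
    for r in range(k + 1):
        for subset in combinations(clause, r):
            coeff = Fraction(1, 2 ** k)
            for lit in subset:
                # Positive literal contributes -x_i, negated literal +x_i.
                coeff *= -1 if lit > 0 else 1
            poly[frozenset(abs(lit) for lit in subset)] = coeff
    return poly


def evaluate(poly, x):
    """Evaluate the polynomial at an assignment x: variable index -> -1 or +1."""
    total = Fraction(0)
    for mono, coeff in poly.items():
        term = coeff
        for i in mono:
            term *= x[i]
        total += term
    return total


clause = [1, -2, 3, 4]                  # x1 OR (NOT x2) OR x3 OR x4
poly = clause_polynomial(clause)
degree = max(len(mono) for mono in poly)

print("degree:", degree)
print("number of monomials:", len(poly))
# The only falsifying assignment: x1, x3, x4 false and x2 true.
print("value at falsifying assignment:",
      evaluate(poly, {1: -1, 2: 1, 3: -1, 4: -1}))
```

Under this encoding a single clause of length k already produces 2^k monomials of degree up to k, which indicates how quickly the required monomial basis can grow, and why exploiting sparsity in the coefficients is attractive.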

Endnotes

¹ Our PRSM implementation is available at https://github.com/LMSinjorgo/SOS-SDP_MAXSAT.

² Program (39) can also be directly solved with the PRSM as projecting onto $X_{ϕ}$ is computationally cheap.

References

Abramé A, Habet D (2014) Ahmaxsat: Description and evaluation of a branch and bound Max-SAT solver. J. Satisfiability Boolean Model. Comput. 9(1):89–128.
Ahmadi AA, Hall G, Papachristodoulou A, Saunderson J, Zheng Y (2017) Improving efficiency and scalability of sum of squares optimization: Recent advances and limitations. Astolfi A, chairman. 2017 IEEE 56th Annual Conf. Decision Control (IEEE, Piscataway, NJ), 453–462.
Anjos MF (2004a) On semidefinite programming relaxations for the satisfiability problem. Math. Methods Oper. Res. 60(3):349–367.
Anjos MF (2004b) Proofs of unsatisfiability via semidefinite programming. Ahr D, Fahrion R, Oswald M, Reinelt G, eds. Operations Research Proceedings 2003 (Springer-Verlag, Berlin).
Anjos MF (2005) An improved semidefinite programming relaxation for the satisfiability problem. Math. Programming 102(3):589–608.
Anjos MF (2006) Semidefinite optimization approaches for satisfiability and maximum-satisfiability problems. J. Satisfiability Boolean Model. Comput. 1(1):1–47.
Anjos MF (2007) An extended semidefinite relaxation for satisfiability. J. Satisfiability Boolean Model. Comput. 4(1):15–31.
Asín Achá R, Nieuwenhuis R (2014) Curriculum-based course timetabling with SAT and MaxSAT. Ann. Oper. Res. 218(1):71–91.
Aspvall B, Plass MF, Tarjan RE (1979) A linear-time algorithm for testing the truth of certain quantified Boolean formulas. Inform. Processing Lett. 8(3):121–123.
Bonet ML, Levy J, Manya F (2007) Resolution for Max-SAT. Artificial Intelligence 171(8–9):606–618.
Boyd S, Parikh N, Chu E, Peleato B, Eckstein J (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations Trends Machine Learning 3(1):1–122.
Cook SA (1971) The complexity of theorem-proving procedures. Lewis PM, chairman. Proc. Third Annual ACM Sympos. Theory Comput. (Association for Computing Machinery, New York), 151–158.
de Klerk E, van Maaren H, Warners J (2000) Relaxations of the satisfiability problem using semidefinite programming. J. Automatic Reasoning 24(1):37–65.
de Meijer F, Sotirov R (2021) SDP-based bounds for the quadratic cycle cover problem via cutting-plane augmented Lagrangian methods and reinforcement learning: Informs journal on computing meritorious paper awardee. INFORMS J. Comput. 33(4):1262–1276.
Drusvyatskiy D, Li G, Wolkowicz H (2017) A note on alternating projections for ill-posed semidefinite feasibility problems. Math. Programming 162(1):537–548.
Gabay D, Mercier B (1976) A dual algorithm for the solution of nonlinear variational problems via finite element approximations. Comput. Math. Appl. 2(7):17–40.
Gattermann P, Großmann P, Nachtigall K, Schöbel A (2016) Integrating passengers’ routes in periodic timetabling: A SAT approach. Goerigk M, Werneck RF, eds. 16th Workshop Algorithmic Approaches Transportation Model. Optim. Systems (Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, Wadern, Germany), 3:1–3:15.
Goemans MX, Williamson DP (1995) Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. J. ACM 42(6):1115–1145.
Graham N, Hu H, Im J, Li X, Wolkowicz H (2022) A restricted dual Peaceman-Rachford splitting method for a strengthened DNN relaxation for QAP. INFORMS J. Comput. 34(4):2125–2143.
Halperin E, Zwick U (2001) Approximation algorithms for MAX 4-SAT and rounding procedures for semidefinite programs. J. Algorithms 40(2):184–211.
Håstad J (2001) Some optimal inapproximability results. J. ACM 48(4):798–859.
He B, Ma F, Yuan X (2016) Convergence study on the symmetric version of ADMM with larger step sizes. SIAM J. Imaging Sci. 9(3):1467–1501.
Henrion D, Malick J (2011) Projection methods for conic feasibility problems: Applications to polynomial sum-of-squares decompositions. Optim. Methods Software 26(1):23–46.
Karloff H, Zwick U (1997) A 7/8-approximation algorithm for MAX 3SAT? Torwick I, ed. Proc. 38th Annual Sympos. Foundations Comput. Sci. (IEEE, Piscataway, NJ), 406–415.
Kautz H, Selman B (1999) Unifying SAT-based and graph-based planning. Dean T, ed. Proc. Sixteenth Internat. Joint Conf. Artificial Intelligence—Volume 1 (Morgan Kaufmann Publishers Inc., San Francisco), 318–325.
Lasserre JB (2001) Global optimization with polynomials and the problem of moments. SIAM J. Optim. 11(3):796–817.
Lasserre JB (2007) A sum of squares approximation of nonnegative polynomials. SIAM Rev. 49(4):651–669.
Laurent M (2009) Sums of squares, moment matrices and optimization over polynomials. Putinar M, Sullivant S, eds. Emerging Applications of Algebraic Geometry (Springer, New York), 157–270.
Lewin M, Livnat D, Zwick U (2002) Improved rounding techniques for the MAX 2-SAT and MAX DI-CUT problems. Cook WJ, Schulz AS, eds. Internat. Conf. Integer Programming Combin. Optim. (Springer), 67–82.
Li CM, Manya F (2021) MaxSAT, hard and soft constraints. Biere A, Heule M, van Maaren H, Walsh T, eds. Handbook of Satisfiability, Frontiers in Artificial Intelligence and Applications, vol. 336 (IOS Press, Amsterdam), 903–927.
Marques-Silva JP, Sakallah KA (2000) Boolean satisfiability in electronic design automation. De Micheli G, chairman. Proc. 37th Annual Design Automation Conf. (Association for Computing Machinery, New York), 675–680.
Mendonca M, Wąsowski A, Czarnecki K (2009) SAT-based analysis of feature models is easy. McGregor JD, Muthig D, eds. Proc. 13th Internat. Software Product Line Conf. (Carnegie Mellon University, Pittsburgh), 231–240.
MOSEK ApS (2023) The MOSEK optimization toolbox for MATLAB manual, Version 10.0.20. Accessed January 2023, https://docs.mosek.com/10.0/toolbox/index.html.
Oliveira DE, Wolkowicz H, Xu Y (2018) ADMM for the SDP relaxation of the QAP. Math. Programming Comput. 10(4):631–658.
Parrilo PA, Thomas RR, eds. (2020) Sum of Squares: Theory and Applications. Proc. Sympos. Appl. Math., vol. 77 (American Mathematical Society, Providence, RI).
Peaceman DW, Rachford HH Jr (1955) The numerical solution of parabolic and elliptic differential equations. J. Soc. Indust. Appl. Math. 3(1):28–41.
Prasad MR, Biere A, Gupta A (2005) A survey of recent advances in SAT-based formal verification. Internat. J. Software Tools Tech. Transfer 7(2):156–173.
Putinar M (1993) Positive polynomials on compact semi-algebraic sets. Indiana Univ. Math. J. 42(3):969–984.
Scheiderer C (2009) Positivity and sums of squares: A guide to recent results. Putinar M, Sullivant S, eds. Emerging Applications of Algebraic Geometry (Springer, New York), 271–324.
van Maaren H, van Norden L (2005) Sums of squares, satisfiability and maximum satisfiability. Bacchus F, Walsh T, eds. Internat. Conf. Theory Appl. Satisfiability Testing (Springer, New York), 294–308.
van Maaren H, van Norden L, Heule MJ (2008) Sums of squares based approximation algorithms for MAX-SAT. Discrete Appl. Math. 156(10):1754–1779.
Wang J, Magron V, Lasserre JB (2021) Chordal-TSSOS: A moment-SOS hierarchy that exploits term sparsity with chordal extension. SIAM J. Optim. 31(1):114–141.
Wang PW, Zico Kolter J (2019) Low-rank semidefinite programming for the MAX2SAT problem. Van Hentenryck P, Zhou Z-H, program chairs. Proc. Conf. AAAI Artificial Intelligence, vol. 33 (AAAI Press, Palo Alto, CA), 1641–1649.
Zheng Y, Fantuzzi G, Papachristodoulou A (2017) Exploiting sparsity in the coefficient matching conditions in sum-of-squares programming using ADMM. IEEE Control Systems Lett. 1(1):80–85.

INFORMS Journal on Computing

Volume 36, Issue 2

March-April 2024

Pages 305-704, C2

Information

Received: February 13, 2023
Accepted: August 07, 2023
Published Online: November 07, 2023

Cite as

Lennart Sinjorgo, Renata Sotirov (2023) On Solving MAX-SAT Using Sum of Squares. INFORMS Journal on Computing 36(2):417-433.

https://doi.org/10.1287/ijoc.2023.0036
