Succinct representations of Boolean functions and the Circuit-SAT problem
by
c Shadab Romani
A thesis submitted to the School of Graduate Studies
in partial fulfilment of the requirements for the degree of
Master of Science
Department of Computer Science Memorial University of Newfoundland
April 2016
St. John’s Newfoundland
Abstract
We study the question whether there is a computational advantage in deciding properties
of Boolean functions given a succinct description of the function (such as a Boolean circuit)
as opposed to black-box access to the function. We argue that a significant computational
advantage for a large class of properties implies a non-trivial algorithm for the Circuit Sat-
isfiability (Circuit-SAT) problem. In particular, we show that if there is a property with
strong black-box lower bounds yet decidable in BPP, which also has a highly sensitive in-
stance computable by a small circuit, then there is a non-uniform sub-exponential algorithm
for the Circuit-SAT problem. Additionally, we analyze variants of this question for other
computational models.
Acknowledgements
I am greatly indebted to my supervisor Antonina Kolokolova for giving me the oppor- tunity to study complexity theory. I have learned a lot from her. Antonina is unbelievably kind, generous and patient with students. Remembering all my struggles at the beginning of grad school, without her help, I couldn’t finish grad school!
Working on this thesis wouldn’t have been possible without the guidance of Russell Impagliazzo, Valentine Kabanets, and Pierre Mckenzie. In particular, this thesis is based on a beautiful conjecture by Russell Impagliazzo. I was really privileged to collaborate with them.
And finally, I dedicate this thesis to my parents for their unconditional Love and support.
Contents
Abstract ii
Acknowledgements iii
1 Introduction 1
2 Preliminaries 6
2.1 Boolean Functions and Types of Access . . . . 6 2.2 Boolean Function Lower-Bounds in Black-Box Model . . . . 9 2.3 The Original Rice’s Theorem: A Brief Look . . . . 11
3 Previous Work 13
3.1 Approach Based on Counting Problems . . . . 13 3.2 Approach Based on Obfuscation . . . . 15
4 Black-Box Hypothesis and Its Variants 18
4.1 Black-box Hypothesis in Other Models . . . . 19
4.1.1 Read-once Models . . . . 19
4.2 Black-Box hypothesis in Other Complexity Classes . . . . 22
5 Connections with Circuit-SAT 26
5.1 Properties of Easy Functions . . . . 28
5.2 Properties with Easy Sensitive Instances . . . . 30
5.3 Sensitivity of Symmetric Properties . . . . 33
5.4 Eliminating Randomness for High Sensitivity . . . . 35
5.5 Sensitivity and Black-Box Lower Bounds . . . . 37
6 Conclusions 41
Bibliography 44
Chapter 1 Introduction
Solving a problem aside from the implications of the problem itself can reveal several facts about the solver. The complexity of a task shows the ability to perform tasks of lesser or equal complexity. The celebrated Rice’s Theorem, proved by Henry Gordon Rice[Ric53], gives such a picture about Turing machines (TM) as problem solvers.
Rice’s Theorem states that any non-trivial semantic property of TMs is undecidable.
Semantic properties indicate something about the functionality of a TM and are independent of machine’s syntactic description. Undecidability of semantic properties implies that the only useful thing that one can do with the description of a TM is simply running it.
Rice’s Theorem elegantly reduces the Halting Problem to any non-trivial semantic prop- erty of TMs. Therefore, before deciding anything interesting about the functionality of programs, we must face the Halting Problem as if it is the simplest undecidable problem!
Rice’s theorem cannot be straightforwardly extended to models other than TMs. For
instance, a polynomial time TM is guaranteed to use a finite amount of time in its compu-
tation. Therefore, halting is not a problem for such machines and the idea of reduction from
the Halting Problem is not valid for them. Understanding the functionality of these models becomes easier, in the sense that it is decidable. But, we face the next barrier: intractability!
An analog of Rice’s Theorem can be particularly useful for generalizing and abstracting proofs of hardness in models of finite computation. Proofs of hardness for circuit analysis problems use a great variety of techniques from all areas of mathematics. For the TM model, Rice’s Theorem eliminates the need for different proofs of undecidability by presenting a general clean template.
Boolean circuits are a powerful model of finite computation. If a property about the underlying Boolean functions is hard, then no efficient way is known to benefit from the circuit description. Essentially brute-force is the best algorithm we can design to decide such properties. Hard circuit problems show a similarity with undecidable problems in TM Model. In the sense that, for none of these problems the syntax of a program helps in understanding its semantics.
Additionally in the circuit model, the status of satisfiability among other hard problems resembles that of the Halting Problem among other undecidable problems. As if it is the simplest hard problem! Obviously, NP-completeness of the satisfiability problem means that if any other problem NP-complete has an efficient algorithm so does satisfiability. But by
”simplest hard problem”, we mean a more fine-grained perspective of the complexity of the satisfiability problem and its relationship with other NP-hard problems.
TMs are succinct representations of languages and Rice’s Theorem rules out the possibil-
ity of any advantage provided by a succinct representation for deciding semantic properties
of TMs. Circuits are succinct representations of Boolean functions. We study whether there
is an advantage in looking into the circuit description compared to accessing it in a black-box
manner for deciding semantic properties of circuits. Informally, black box is a type of access
in which we know the size of the circuit, and we can ask input-output queries.
This thesis is based on the line of research initiated in the 90’s ([BS96, BGI
+01b]) study- ing the existence of Rice’s Theorem analogues for Boolean circuits, the first of which was formulated by Borchert and Stephan[BS96]. They focused on counting properties, that is, properties that only depend on the number of solutions, and have shown the following;
“Any non-empty counting property is UP-hard with respect to polynomial time Turing reductions. A property is a counting property if it only depends on the number of solutions. ”
In the early 2000’s, Barak, Goldreich, Impagliazzo, Rudich, Sahai, Vadhan, and Yang [BGI
+01a] suggested a different analogue of Rice’s Theorem for Boolean circuits. Moti- vated by cryptographic applications, they differentiated between algorithms analysing cir- cuit descriptions and algorithms accessing circuits as a black-box. They made the following conjecture which they called ”Scaled-down Rice’s Theorem”;
“Every property of Boolean functions F that can be computed in BPP given a circuit for function f, can be computed in comparable probabilistic polynomial time and with comparable success probability by a black-box algorithm with an oracle access to f and an upper bound on its circuit complexity. ”
If Scaled-down Rice’s Theorem holds, then any property decidable in BPP, has at most polynomial randomized query complexity in black-box. Contrapositively, assuming a super- polynomial randomized query complexity in black-box, a BPP algorithm doesn’t exist. Here, we will work with the following equivalent statement which we call ”black-box hypothesis”
:
1“Black-box Hypothesis: Let F be a property of Boolean functions such that even a randomized algorithm has to examine a large fraction of bits of the truth table of a function f to decide if f ∈ F . Then the complexity of deciding F is not significantly less when a circuit computing f is given as an input. ”
Barak et al. disproved a variant of this conjecture for promise-BPP under the assumption that one-way functions exist. However, the main statement remains open. The Scaled-down
1
We are using the word ”hypothesis” at Russell Impagliazzo’s suggestion, since both proving and dis-
proving it would have interesting consequences.
Rice’s Theorem is more general than the approach based on counting properties and can explain the hardness of those properties as well.
Our Results
We study the implications of the black-box hypothesis for the hardness of Circuit-SAT. Our result supports the intuition of satisfiability being the simplest NP-hard problem.
“Main Theorem: We show that Circuit-SAT has a non-trivial algorithm if for some circuit analysis problem (satisfying a very general condition) there is a super-polynomial gap in complexity between the algorithms accessing the circuit description and the algorithms treating circuits as black-box. ”
Additionally, we show that in restricted models such as read-once branching programs, algorithm that accesses a description may gain a super-polynomial advantage over any algo- rithm that access the read-once branching programs as black-box.
More specifically, our main result is as follows. Let F be a property of Boolean functions.
Let succinct-F be a language L
F= {hCi | [C] ∈ F }, where [C] is a function computed
by a circuit C. If F is hard for black-box algorithms, but L
F∈ BP P , then Circuit-SAT
is solvable by a sub-exponential non-uniform family of circuits, provided that F either 1)
contains only easy functions (i.e. functions computable by circuits of ≈ Size(2
o(n))), or 2) F
is sensitive on some easy inputs. In other words, there exists an easy Boolean function f such
that there is a significant number boolean functions f
0s disagreeing with f on a single point
such that F (f ) 6= F (f
0). In particular, non-empty counting properties satisfy the second
condition, with an easy sensitive string of the form tt(f ) = 0 . . . 01 . . . 1 such that adding or
removing a 1 flips the value of F (f). If that decision tree complexity is polynomially related
to sensitivity, the second condition can be replaced by a lower bound on the decision tree
complexity for F (on easy inputs), however it is not clear how to do that. In particular, it
seems to involve giving a positive answer to a (generalization of) the well-studied Sensitivity Conjecture.
Organization
• Chapter 2 contains the necessary definitions and notational conventions for introducing the complexity theoretic analogues of Rice’s Theorem.
• In Chapter 3 we overview the two previous approaches for translating Rice’s Theorem to circuit complexity theory.
• In Chapter 4, we discuss several ways to formulate analogues of Rice’s Theorem with respect to different computational models and complexity classes. We will also present the black-box hypothesis which can be viewed as an analogue of Rice’s Theorem for general Boolean circuits and BPP time.
• Chapter 5 discusses the implications of the black-box hypothesis for hardness of Circuit- SAT.
• In the last chapter, we present our conclusions, intuitions, and future work directions.
Chapter 2
Preliminaries
In the world of circuits, we can make a distinction between circuit as syntax and the computed Boolean function as semantics. White-box access to a program informally means accessing the syntax of a program. The syntax of a circuit is a directed acyclic graph encoded as a binary string. On the other hand, black-box access informally means using only the semantics of a program by asking oracle queries. Our goal is correlating the hardness of circuit decision problems in black-box and white-box. To make a meaningful comparison between these two settings, we must define the circuit representation and the type of oracle access carefully. So first, we need several basic definitions from circuit complexity theory about encoding and representation in white-box and black-box.
2.1 Boolean Functions and Types of Access
An n-ary Boolean function is a function that maps the domain {0, 1}
nto the co-domain {0, 1}. Notation: tt(f ) denotes the truth table of a Boolean function f.
Definition 2.1. A property of Boolean functions F is a subset of Boolean functions, where
each slice F
n⊆ {0, 1}
2nis a subset of (truth tables of) Boolean functions of n variables. F is non-trivial if for every (large enough) n, both F
nand its complement F
nare non-empty.
Definition 2.2 (Boolean Circuit). A Boolean circuit is a directed acyclic graph with m nodes g
1, g
2, . . . , g
m(in some fixed topological order) called gates, such that the first n nodes g
1, g
2, . . . , g
ntake their value from the input g
1= x
1, g
2= x
2, . . . , g
n= x
n. Each of the subsequent gates g
icomputes a Boolean function from a set of Boolean functions B called basis gates which take their argument from the set of previously evaluated gates i.e.,
g
i= φ(g
i1, g
i2, . . . , g
id<i),
where φ is a function in B and d is the number of its arguments. The last gate g
mis distinguished as the output gate.
A circuit C computes a Boolean function f : {0, 1}
n→ {0, 1} if for every input string x
1. . . x
n∈ {0, 1}
n, f (x
1. . . x
n) is equal to the value of gate g
min C(x
1. . . x
n). [C] denotes the Boolean function computed by circuit C. We equate a circuit (or an algorithm) outputting 1 with accepting and outputting 0 with rejecting. A circuit is satisfiable if there is at least one input on which it outputs 1. We refer to the language of all satisfiable circuits as Circuit-SAT.
|C| refers to the size of the circuit. In general, there are several related ways to define circuit size: size of the string describing the circuit, number of edges, number of gates or the sum of the number of edges and gates. Note that all these measures are polynomially related. The number of edges is at most square of the number of gates, and description length is:
O((#edges + #gates) log(#edges + #gates))
Here we take |C| to be the length of the description of the circuit using the direct connection language of [Ruz81] denoted desc(C). The size of a circuit on n inputs is at least O(n log n).
The circuit size of a Boolean function f , denoted by CircuitSize(f ), is the size of the smallest circuit computing f .
In this thesis, we mainly discuss circuits defined over the following basis, which we refer to as general Boolean circuits: all the possible Boolean functions defined over two bits, the constant 0, the constant 1 and the NOT gate. Since we are working with general circuits, we assume constant fan-in without loss of generality.
Definition 2.3 (Semantic Property of Circuits). A semantic property F of circuits is a property of Boolean functions that circuits compute. A language of circuits that satisfy a semantic property F is called a metalanguage i.e. a metalanguage L
Ffor F is:
L
F= {desc(C) | [C] ∈ F }
Based on the distinction between a circuit and the function it computes, two access types are defined:
Definition 2.4. An algorithm A decides property F in white-box if A decides the corre- sponding metalanguage L
F. That is, given as input a string desc(C) it accepts iff [C] ∈ F .
An algorithm in black-box model accesses the circuit using an oracle and the knowledge
of the size of the circuit description. An oracle O
Lfor a language L is an external device that
is capable of reporting whether any string w is a member of L. An oracle Turing machine
M
OLis a modified TM that has the additional capability of querying an oracle. Whenever
M
OLwrites a string on a special oracle tape, it is informed whether that string is a member
of A in a single computational step. An oracle for a Boolean function f is the language
O
f: {x | f (x) = 1}. To simplify the notation, we often write M
L, M
fto mean M
OL, M
Of.
Definition 2.5. An algorithm A decides F in black-box if A
f(1
n, 1
m) accepts iff f ∈ F , where f : {0, 1}
n→ {0, 1}, m is an upper bound on circuit size of f and O
fis an oracle for the Boolean function f. Here, 1
nand 1
mrepresent n and m in unary.
We will be comparing the complexity of algorithms in white-box vs. black-box. As the inputs to the black-box algorithm are given in unary, the size of the input to black-box and white-box algorithms are essentially the same. Note that given desc(C), it is easy to compute number of inputs n. In particular, we compare metalanguages in BPP with their black-box counterparts.
Definition 2.6. A language L is in BPP if there exists a randomized algorithm A running in probabilistic polynomial-time and such that for every x ∈ {0, 1}
n,
x ∈ L ⇒ P r[A(x) accepts ] ≥ 2/3 x / ∈ L ⇒ P r[A(x) rejects] ≥ 2/3
We say that a property F is decidable in BPP in the white-box setting iff L
F∈ BPP.
A property F is in black-box-BPP if there exists a randomized polynomial time algorithm A, such that for every n, f : {0, 1}
n→ {0, 1} and m ≥ CircuitSize(f),
f ∈ F ⇒ P r[A
f(1
n, 1
m) accepts] ≥ 2/3 f / ∈ F ⇒ P r[A
f(1
n, 1
m) rejects] ≥ 2/3.
2.2 Boolean Function Lower-Bounds in Black-Box Model
A Boolean function is a point function if it outputs 1 on exactly one input. We refer to the
Boolean function that outputs zero on every input as constant Zero function.
Proposition 2.7. Checking satisfiability of n-ary Boolean functions needs 2
n−1oracle queries by a randomized black-box algorithm.
Proof. This can be proved by a simple adversary argument. Alice thinks of a point function f and Bob asks for values of f on different inputs. In the worst case, Alice answers 0 to all the 2
n− 1 first queries and 1 to the last query. To distinguish a point function from the constant zero function with probability p > 1/2, at least half of the inputs must be queried.
Circuit complexity of point functions and all zero function is linear in n. Even though the black-box algorithm is given the upper-bound cn log n for constant c on the size of the circuit, it still needs to make exponentially many queries.
For a thorough discussion on lower bounds using different types of oracle queries see [Ang88].
The query complexity of randomized and deterministic algorithms are polynomially re- lated. So the black-box lower bound of Proposition 2.7 also implies a lower bound for deterministic query complexity. Decision trees are a standard model for proving oracle lower bounds. Presentation below follows [BDW02].
Definition 2.8 (Decision tree). A decision tree is a rooted ordered binary tree T . Each internal node of T is labeled with a variable x
iand each leaf is labeled with a value 0 or 1.
Given an input x ∈ {0, 1}
n, the tree is evaluated as follows. Start at the root. If this is a leaf then stop. Otherwise, query the variable x
ithat labels the root. If x
i= 0, then recursively evaluate the left sub-tree, if x
i= 1 then recursively evaluate the right sub-tree. The output of the tree is the value of the leaf that is reached eventually.
Definition 2.9 (Randomized decision tree). A randomized decision tree is defined like a
decision tree but each internal node is associated with a bias probability p ∈ [0, 1]. On input
x
1x
2. . . x
n, tree evaluation is similar to a regular decision tree, but when an internal node is reached, we follow the edge whose label agrees with x
iwith probability p.
Definition 2.10. Deterministic query complexity D(f ) is the minimal depth of a decision tree computing f . Randomized query complexity R
2(f) is the minimal depth of a randomized decision tree for f , where the subscript 2 indicates 2-sided error.
R
2(f) and D(f) are polynomially related.
Theorem 2.11. [Nis91] D(f) < 27R
2(f)
3Therefore, randomized algorithms cannot do significantly better than deterministic algo- rithms in the black-box setting.
Another important algorithm is Circuit Evaluation. In black-box, we only need one oracle query. In white-box, there exists a linear time algorithm.
Proposition 2.12. Let F be the property of outputting 1 on the all zero input (00 . . . 0).
Then the complexity of white-box and black-box is O(m), where n is the number of inputs and m is the size of the given circuit abd m ≥ n.
Proof. The white-box algorithm evaluates the given circuit on 00 . . . 0. Circuit evaluation takes linear time with respect to the size of the circuit. The black-box algorithm also takes O(m) (to write the query) time because one oracle query in enough.
2.3 The Original Rice’s Theorem: A Brief Look
Rice’s Theorem [Ric53] gives a general test for determining the undecidability of certain
properties of TMs. Here, we only outline the proof to make an analogy with the black-box
hypothesis. For the sake of simplicity we use a different terminology from the original proof.
To see a detailed proof [HU79] is a good source.
A property of languages is defined as a set of languages. Semantic properties of TMs are properties of languages they accept. A semantic property is non-trivial if there is a TM that satisfies the property and one that violates the property. Informally, Rice’s Theorem shows that every property about the language recognized by a TM is either trivial or undecidable.
Theorem 2.13 (Rice’s Theorem). Let P be any non-trivial semantic property of TMs. Then, the following language is undecidable;
L
P= {hM i : L(M) ∈ P}
The idea is a classic proof by contradiction using a clever reduction from the Halting Problem. Suppose that P is decidable. Since P is non-trivial there must be some machine M
Lthat has this property. Without loss of generality, assume that P does not include the empty language: otherwise, consider complement of P . Given machine M and input x, we want to decide if M halts on x. We construct another machine R using M
L:
R description: On input w do the following steps:
1. Run M on x
2. Run M
Lon input w
Now R satisfies P if and only if M halts on x. Because only in this case R behaves like M
Land if M doesn’t halt then R only accepts the empty language which by assumption does
not satisfy P .
Chapter 3
Previous Work
The naive way to formulate an analogue of Rice theorem is by simply replacing the words
”Turing machine” with ”circuit” and the word ”undecidable” with ”intractable”;
“Every nontrivial semantic property of Boolean circuits is intractable. ”
This formulation is wrong. Some nontrivial semantic properties are easy, such as the property of outputting zero on the all zero input (see Proposition 2.12). Therefore, any ana- logue must present a more sophisticated characterization of hard semantic properties. There are two lines of work for constructing a complexity theoretic analogue of Rice’s Theorem for the circuit model: one based on hardness of Counting problems and one based on the notion of obfuscation. In the next two sections, we overview these approaches.
3.1 Approach Based on Counting Problems
The quest for an analogue of Rice’s Theorem for circuits began in the 90s by Borchert
and Stephan[BS96] and was further developed by Hemaspaandra, Rothe and Thakur[HT01,
HR00]. In this line of research, Counting Properties were the main focus to prove some analogue of Rice’s Theorem. These properties only depend on the number of satisfying assignments of the circuit–not their particular distribution. Generally, three specific types of counting properties are defined:
Definition 3.1. Counting Properties: The notation #
1(and #
0) denotes the total num- ber of satisfying (respectively falsifying) assignments of a circuit.
• Absolute Counting: Let S be a subset of N
Absolute − Counting(S) = {C | #
1(C) = n, n ∈ S}
• Gap Counting: Let S be a subset of Z
Gap − Counting(S) = {C | #
1(C) − #
0(C) = n, n ∈ S}
• Relative Counting: Let S be a subset of D (dyadic numbers):
RelativeCounting(S) = {C |
# #1(C)1(C)+#0(C)