On Computing Multilinear Polynomials Using Multi-r -ic Depth Four Circuits

(1)

HAL Id: hal-02366791

https://hal.archives-ouvertes.fr/hal-02366791

Submitted on 25 Dec 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Distributed under a Creative CommonsAttribution - ShareAlike| 4.0 International

On Computing Multilinear Polynomials Using Multi-r -ic Depth Four Circuits

Suryajith Chillara

To cite this version:

Suryajith Chillara. On Computing Multilinear Polynomials Using Multi-r -ic Depth Four Circuits.

37th International Symposium on Theoretical Aspects of Computer Science (STACS 2020), LIRMM, CNRS, Mar 2020, Montpellier, France. �hal-02366791�

(2)

On Computing Multilinear Polynomials Using Multi- r -ic Depth Four Circuits

Suryajith Chillara^∗ University of Haifa November 16, 2019

Abstract

In this paper, we are interested in understanding the complexity of computing multilinear polynomials using depth four circuits in which polynomial computed at every node has a bound on the individual degree ofr (referred to as multi-r-ic circuits). The goal of this study is to make progress towards proving superpolynomial lower bounds for general depth four circuits computing multilinear polynomials, by proving better and better bounds as the value ofr increases.

Recently, Kayal, Saha and Tavenas (Theory of Computing, 2018) showed that any depth four arithmetic circuit of bounded individual degreer computing a multilinear polynomial onn^O(1) variables and degreed=o(n), must have size at least _rⁿ1.1

Ωq

d r

whenr iso(d)and is striclty less thann^1.1. This bound however deteriorates with increasingr. It is a natural question to ask if we can prove a bound that does not deteriorate with increasingror a bound that holds for alarger regime ofr.

We here prove a lower bound which does not deteriorate withr, however for a specific instance ofd =d(n)but for awiderrange ofr. Formally, we show that there exists an explicit polynomial onn^O(1) variables and degreeΘ(log²n)such that any depth four circuit of bounded individual degreer<n^0.2must have size at least exp Ω log²n

. Thisimprovementis obtained by suitably adapting the complexity measure of Kayal et al. (Theory of Computing, 2018). This adaptation of the measure is inspired by the complexity measure used by Kayal et al. (SIAM J. Computing, 2017).

1 Introduction

One of the major focal points in the area of algebraic complexity theory is to show that certain polynomials are hard to compute syntactically. Here, the hardness of computation is quantified by the number of arithmetic operations that are needed to compute the target polynomial. Instead of the standard turing machine model, we consider arithmetic circuits and formulas as models of computation.

Arithmetic circuits are directed acyclic graphs such that the leaf nodes are labeled by variables or constants from the underlying field, and every non-leaf node is labeled either by a+or×. Every node computes a polynomial by operating on its inputs with the operand given by its label. The flow of computation flows from the leaf to the output node. We refer the readers to the standard resources[SY10,Sap19]for more information on arithmetic formulas and arithmetic circuits.

Valiant conjectured that the Permanent does not have polynomial sized arithmetic circuits[Val79].

Working towards that conjecture, we aim to prove superpolynomial circuit size lower bounds. However,

∗suryajith@cmi.ac.in. This work was done while the author was a post doctoral fellow at IIT Bombay.

(3)

the best known circuit size lower bound isΩ(nlogn), for a power symmetric polynomial, due to Baur and Strassen[Str73, BS83], and, the best known formula size lower bound is Ω(n²), due to Kalorkoti[Kal85]. Due to the slow progress towards proving general circuit/formula lower bounds, it is natural to study some restricted class of arithmetic circuits and formulas.

Since most of the polynomials of interest such as Determinant, Permanent, etc., are multilinear polynomials, it is natural to consider the restriction where every intermediate computation is in fact multilinear. Due to the phenomenal work in the last two decades[NW97,Raz06,Raz04,RY08,RY09, HY11,RSY08,AKV18,CLS19,CELS18,CLS18], the complexity of multilinear formulas and circuits is better understood than that of general formulas and circuits.

Backed with this progress it is natural to try to extend these results to a circuit model where the individual degree of every variable in the polynomial computed at every node in the circuit isr. We refer to these circuits as multi-r-ic circuits. Whenr =1, the circuit model is multilinear.

Kayal and Saha[KS17]first studied multi-r-ic circuits of depth three and proved exponential lower bounds. Kayal, Saha and Tavenas[KST18]have extended this and proved exponential lower bounds at depth three and depth four. These circuits that were considered were syntactically multi-r-ic . That is, at any product node, any variable appears in the support of at most r many operands, and the total of the individual degrees is also at most r. Henceforth, all the multi-r-ic depth four circuits that we talk about shall be syntactically multi-r-ic .

Recently, Kumar, Oliviera and Saptharishi[KdOS19]showed that there is a chasm¹for multi-r-ic circuits too. Formally, they showed that any polynomial sized (sayn^c for a fixed constantc) multi-r-ic circuit of arbitrary depth computing a polynomial onnvariables can be depth reduced to syntactical multi-r-ic depth four circuits of size exp(O(p

r nlogn)). This provides us a motivation to study multi-r-ic depth four circuits and prove strong lower bounds against them.

Kayal, Saha and Tavenas[KST18]proved an exponential size lower bound against multi-r-ic depth four circuits computing a variant of the iterated matrix multiplication polynomial. They achieved this bound using a measure that is inspired by the methodShifted Partial Derivatives[Kay12,GKKS14]and the method ofSkew Partial Derivatives[KNS16]. They referred to this new technique as the method ofShifted Skew Partial Derivatives. Hegde and Saha[HS17]improved upon[KST18]and showed a near-optimalsize lower bound. However, the "best known" lower bounds are for polynomials that are multi-r-ic.

Motivation for this work

Raz and Yehudayoff[RY09]showed a lower bound of exp(Ωp

dlogd

)against multilinear depth four circuits which compute a multilinear polynomial overnvariables and degreed n(cf.[KST18, Footnote 9]). Kayal, Saha and Tavenas [KST18] have shown a lower bound of _rⁿ1.1

ΩÇ

d r

for a multilinear polynomial overn^O⁽¹⁾variables and degreed that is computed by a multi-r-ic depth four circuit. This lower bound degrades with the increasing value ofr and is superpolynomial only when r iso(d)and is strictly less thann^1.1. This raises a natural question if the dependence onr could be improved upon.

In this work, we show that for a certain regime ofd, we can prove a lower bound that does not deteriorate with increasing values ofr.

1Agrawal and Vinay[AV08], Koiran, and Tavenas[Tav15]showed that any general circuit can be depth reduced to a depth four circuit of non-trivial size.

(4)

Theorem 1. Let n and r be integers such that r<n^0.2. There exists an explicit polynomial Q_nof degree Θ(log²n), over n^O(1)variables such that any syntactically multi-r -ic depth four circuit computing it must have sizeexp Ω(log²n)

.

The explicit polynomial that we consider is a close variant of the polynomial considered by Kayal et al.[KST18]. By substituting these parameters into the bound from[KST18], their bound evaluates ton^Ω

logn pr

. Note that this bound is superpolynomial only when r =o(log²n). Our lower bound is quantitatively better in this regime of parameters.

If we can show superpolynomial size lower bounds against multi-r-ic depth four circuits forr =n^c for any constantc, then we indeed have superpolynomial circuit size lower bounds against depth four circuits. We believe that by building on the work of[KST18,HS17],Theorem 1is a step towards that direction.

In a subsequent work to this, we observed that by careful analysis and by building upon this work and work of Kumar and Saraf[KS14], we can in fact prove exponential lower bounds against depth four multi-r-ic circuits computing the Iterated Matrix Multiplication polynomial.

Proof overview:

Analogous to the work of Fournier et al.[FLMS15], and Kumar and Saraf[KS14], we first consider multi-r-ic depth four circuits of low bottom support²and prove lower bounds against them.

LetT₁,T₂, . . . ,T_sbe the terms corresponding to the product gates feeding into the output sum gate.

The output polynomial is obtained by adding the termsT₁,T₂, . . . ,T_s. Note that each of theseT_i’s is a product of low support polynomialsQ_i,j, that is, every monomial in theseQ_i,_j’s is supported on a small set of variables (sayµmany). One major observation at this point is to see that there can be at mostN·r many factors in any of theT_i’s.

Kayal et al.[KST18]observed that the measure of shifted partial derivates[KSS14,FLMS15]does not yield any non-trivial lower bound if the number of factors is much larger than the number of variables itself. They worked around this obstacle by defining ahybridcomplexity measure (refered to asShifted Skew Partial Derivatives) where they first split the variables into two disjoint and unequal sets Y andZ such that|Y| |Z|, then considered theloworder partial derivatives with respect to only the Y variables and subsequently set all theY variables to zero in the partial derivatives obtained. This effectively reduces the number of factors in a partial derivative to at most|Z| ·r. They then shift these polynomials by monomials inZ variables and look at the dimension of the polynomials thus obtained.

This measure gave them a size lower bound of _rⁿ1.1

ΩÇ

d r

against multi-r-ic depth four circuits computing an explicit polynomial onn^O⁽¹⁾variables and degreed=o(n)whenr =o(d). To improve the dependence on r in the lower bound, we consider a variant ofShifted Skew Partial Derivativesthat we callProjected Shifted Skew Partial Derivatives. Here, we project down the space of Shifted Skew Partials and only look at the multilinear terms. Since the polynomial of interest is multilinear, it makes sense to only look at the multilinear terms obtained after the shifts of the skew partial derivatives. This is analogous to the method employed by Kayal et al.[KLSS17]to prove exponential lower bounds for Homogeneous depth four circuits, through the measure ofProjected Shifted Partial Derivatives.

We first show that the dimension of Projected Shifted Skew Partial derivatives is not too large for small multi-r-ic depth four circuits of low bottom support. We then show that there exists an explicit

2That is, all the product gates at the bottom are supported on small set of variables.

(5)

polynomial whose dimension of Projected Shifted Skew Partial derivatives is large and thus cannot be computed by small multi-r-ic depth four circuits. We then lift this result to multi-r-ic depth four circuits for suitable set of parameters.

2 Preliminaries

Notation:

• For a polynomial f, we use∂_Y⁼^k(f)to refer to the space of partial derivatives of orderk of f with respect to monomials of degreekinY.

• We usez^=`andz^≤`to refer to the set of all the monomials of degree equal to`and at most`, respectively, inZ variables.

• We usez^≤`_MLto refer to the set of all the multilinear monomials of degree at most`inZvariables.

• We usez^≤`_NonMLto refer to the set of all the non-multilinear monomials of degree at most`inZ variables.

• For a monomialmwe use|MonSupp(m)|to refer to the size of the set of variables that appear in it.

• For a polynomial f, we use |MonSupp(f)|to refer to the maximum |MonSupp(m)| over all monomials in f.

Depth four circuits: A depth four circuit (denoted by ΣΠΣΠ) over a field F and variables {x₁,x₂, . . . ,x_n} computes polynomials which can be expressed in the form of sums of products of polynomials. That is,

Xs i=1

d_i

Y

j=1

Q_i,_j(x₁, . . . ,x_n)for some d_i’s. A depth four circuit is said to have a bottom support oft (denoted byΣΠΣΠ^{t}) if it is a depth four circuit and all the monomials in each polynomialQ_i,j are supported on at mostt variables.

Multi-r-ic arithmetic circuits:

Definition 2(multi-r-ic circuits). Letr= (r₁,r₂,· · ·,r_n). An arithmetic circuitΦis said to be a syntactically multi-r-ic circuit if for all product gates u∈Φand u =u₁×u₂× · · · ×u_t, each variable x_i can appear in at most r_imany of the u_i’s (i∈[t]). Further the total degree with respect to every variable over all the polynomials computed at u₁,u₂,· · ·,u_t, is bounded by r , i.e.P

j∈[t]deg_x

i(f_u

j)≤r_ifor all i∈[n].

Ifr= (r,r,· · ·,r), then we simply refer to them as multi-r -ic circuits.

Complexity Measure: We shall now describe our complexity measure which we shall henceforth refer to as Dimension of Projected Shifted Skew Partial Derivatives. This is a natural extension of the Dimension of Shifted Skew Partial Derivatives as used by[KST18].

This is analogous to the work of[KLSS14]where they study ashifted partials inspiredmeasure calledShifted Projected Partial derivativesand then[KLSS17]where they studyProjected Shifted Partial derivatives.

(6)

Since the polynomial of interest is multilinear, it does make sense for us to only look at those shifts of the partial derivatives, that maintain multilinearity. At the same time, since the individual degree of the intermediate computations in the multi-r-ic depth four circuit is large and non-multilinear terms

"cancel" out to generate the multilinear polynomial, we can focus on the multilinear terms generated after the shifts by projecting our linear space of polynomials down to them. We describe this process formally, below.

Let the variable set X be partitioned into two fixed, disjoint setsY andZ such that |Y| is a magnitude larger than|Z|. Letσ_Y:F[YtZ]7→F[Z]be a map such that for any polynomial f(Y,Z), σ_Y(f)∈F[Z]is obtained by setting every variable inY to zero by it and leavingZ variables untouched.

Let mult :F[Z]7→F[Z]be a map such that for any polynomialf(Y,Z), mult(f)∈F[Z]is obtained by setting the coeficients of all the non-multilinear monomials in f to 0 and leaving the rest untouched. We usez^≤`·σ_Y(∂_Y⁼^kf)to refer to the linear span of polynomials obtained by multiplying each polynomial inσ_Y(∂_Y⁼^kf)with monomials of degree at most`inZ variables. We will now define our complexity measure, Dimension of Projected Shifted Skew Partial Derivatives (denoted byΓ_k,_`) as follows.

Γ_k,_`(f) =dim

F-span¦ mult

z^≤`·σ_Y

∂_Y⁼^kf©

This is a natural generalization of Shifted Skew Partial Derivatives of[KST18]. The following proposition is easy to verify.

Proposition 3(Sub-additivity). Let k and`be integers. Let the polynomials f,f₁,f₂be such that f =f₁+f₂. Then,Γ_k,_`(f)≤Γ_k,_`(f₁) +Γ_k,_`(f₂).

Monomial Distance: We recall the following definition of distance between monomials from [CM19].

Definition 4(Definition 2.7,[CM19]). Let m₁,m₂be two monomials over a set of variables. Let S₁ and S₂be the multisets of variables corresponding to the monomials m₁and m₂respectively. The distance dist(m₁,m₂)between the monomials m₁and m₂is themin{|S1| − |S1∩S₂|,|S2| − |S1∩S₂|}where the cardinalities are the order of the multisets.

For example, let m₁ = x₁²x₂x₃²x₄ and m₂ = x₁x₂²x₃x₅x₆. ThenS₁ = {x₁,x₁,x₂,x₃,x₃,x₄}, S₂ = {x₁,x₂,x₂,x₃,x₅,x₆},|S₁|=6,|S₂|=6 and dist(m₁,m₂) =3. It is important to note that two distinct monomials could have distance 0 between them if one of them is a multiple of the other and hence the triangle inequality does not hold.

The following beautiful lemma (from[GKKS14]) is key to the asymptotic estimates required for the lower bound analyses.

Lemma 5 (Lemma 6,[GKKS14]). Let a(n),f(n),g(n):Z_≥0→Z_≥0be integer valued functions such that(f +g) =o(a). Then,

ln(a+f)!

(a−g)!= (f +g)lna±O

(f +g)² a

We use the following strengthening of the Principle of Inclusion and Exclusion in our proof.

Lemma 6(Strong Inclusion-Exclusion[KS14]). Let W₁,W₂,· · ·,W_t be subsets of a finite set W . For a parameterλ≥1, letP

i,j∈[t]

i6=j

W_i∩W_j ≤λP

i∈[t]|W_i|.Then,

∪_i∈[t_]W_i

≥ _4λ¹ P

i∈[t]|W_i|.

(7)

Polynomial Families: Let n,α,k be positive integers. We define the polynomial family P_n,_α_,k _n,_α_,k≥0as follows. Let the variable setX ={x₁, . . . ,x_N

0}be partitioned into two fixed, and disjoint setsY andZ. We first define the polynomial family f_n,_α_,k(Y,Z)as follows (as it was defined in [KST18]).

f_n,_α_,k=Y^k

i=1

g_i(Y_i,Z_i)where g_i(Y_i,Z_i) = X

a,b∈[n]

y_a,b⁽ⁱ⁾ Y

c∈[α]

z_a,c⁽^i,1⁾z_c⁽^i,2_+α⁾_,b.

It is easy to see that|Y|is n²k and|Z|is 2αnk. We shall henceforth usemto refer to|Z|. Thus, N₀=|X|=|Y|+|Z|=k(n²+2αn).

Letc be a fixed constant in(0, 1). We shall now define another polynomial familyP_n,_α_,kbased on the definition of f_n,_α_,k. Let p=N₀^−c. Let ˆX ={ˆx_1,1, ˆx_1,2, . . . , ˆx_1,t, . . . , ˆx_N₀_,1, ˆx_N₀_,2, . . . , ˆx_N₀_,t}be a variable set distinct fromX such thatt= ^N⁰^log_p ^N⁰. Let Lin_p:X 7→Xˆbe a linear map such thatx_i7→P_t

j=1ˆx_i,_j for alli∈[N₀]. Then the polynomialP_n,α,k(Xˆ)be defined to be f_n,α,k◦Lin_p(Xˆ). That is,

P_n,α,k=f_n,α,k

t

X

j=1

ˆx_1,j,

t

X

j=1

ˆx_2,_j,· · ·,

t

X

j=1

xˆ_N₀_,j

!

wheret= N₀logN₀

p .

Note thatP_n,α,kis a polynomial onN=N₀²⁺^clogN₀many variables.

We will now recall the following lemma which in the mentioned form is due to Kumar and Saptharishi[Sap19].

Lemma 7(Analogous to Lemma 20.5,[Sap19]). Letρbe a random restriction on the variable setX thatˆ sets each variable to zero with a probability of at least(1−p)where p=N₀^−c for some constant c∈(0, 1). Then f_n,_α_,k(X)is a projection ofρ(P_n,_α_,k(Xˆ))with a probability of at least(1−2^−N⁰).

3 Multi-r -ic Depth Four Circuits of Low Bottom Support

For some carefully chosen parameterskand`, we shall first show that if a multi-r-ic depth four circuit C of bottom supportµissmallthenΓ_k,`(C)is not too large. We shall then show thatΓ_k,`(f_n,α,k)is large and thus it cannot be computed byC.

3.1 Upper Bound onΓ_k,_`(C)

Recall thatC is a sum of at mosts many products of polynomialsT⁽¹⁾+· · ·+T⁽^s⁾where eachT_i is a syntactically multi-r-ic product of polynomials of low monomial support.

We shall first prove a bound onΓ_k,`(T_i)for an arbitraryT_i and derive a bound onΓ_k,`(C)by using sub-additivity of the measure (cf.Theorem 3).

LetT be a syntactic multi-r-ic product of polynomials ˜Q₁(Y,Z)·Q˜₂(Y,Z)·. . .·Q˜_D(Y,Z)·R(Y) such that

MonSupp(Q˜_i)

≤µ. We will first argue thatD is not too large sinceT is a syntactically multi-r-ic product. We shall first pre-process the productT by doing the following procedure.

Repeat this process until all but at most one of the factors inT (exceptR) have aZ-support of at least^µ₂.

(8)

1. Pick two factors ˜Q_i₁ and ˜Q_i,2such that they have the smallestZ-support amongstQ₁,· · ·,Q_D. 2. If both of them have support strictly less than ^µ₂, merge these factors to obtain a new factor. Else,

stop.

In the afore mentioned procedure, it is important to note that the monomial support post merging will still be at most beµsince the factors being merged are of support strictly less than^µ₂. Henceforth, W.L.O.G we shall consider that every product gate at the top, in any multi-r-ic depth four circuit to be in a processed form.

LetT =Q₁(Y,Z)·Q₂(Y,Z)·. . .·Q_t(Y,Z)·R(Y)be the product obtained after the preprocessing.

Each of theQ_i has aZ-support of at least ^µ₂. The totalZ-support is at most|Z|r =m r sinceT is a syntactically multi-r-ic product. Thust could at most be^{2m r}_µ .

Lemma 8. Let n,k,r,`andµbe positive integers such that`+kµ <^m₂. Let T be a processed syntactic multi-r -ic product of polynomials Q₁(Y,Z)·Q₂(Y,Z)·. . .·Q_t(Y,Z)·R(Y)such that|MonSupp(Q_i)| ≤µ.

Then,Γ_k,_`(T)is at most _k^t

· _`+^m_k_µ

·(`+kµ).

Before provingTheorem 8, we shall first use it to show an upper bound on the dimension of the space of Projected Shifted Skew Partial derivatives of a depth four multi-r-ic circuit of low bottom support.

Lemma 9. Let n,k,r,`andµbe positive integers such that`+kµ < ^m₂. Let C be a processed syntactic multi-r -ic depth four circuit of bottom supportµand size s. Then,Γ_k,`(C)is at most s·

2m r µ

k

· _`+^m_k_µ

·(`+kµ).

Proof. W.L.O.G we can assume thatC be expressed asP_s

iT⁽ⁱ⁾such thatT⁽ⁱ⁾is a processed syntactically multi-r-ic product of bottom support polynomials at mostµ. FromTheorem 3, we get thatΓ_k,_`(C)≤ P_s

i=1Γ_k,_`(T⁽ⁱ⁾). From the afore mentioned discussion we know that the number of factors inT⁽ⁱ⁾with a non-zeroZ-support is at most ^{2m r}_µ . FromTheorem 8, we get thatΓ_k,`(T⁽ⁱ⁾)is at most

2m r µ

k

· _`+^m_k_µ

· (`+kµ). By putting all of this together, we get that

Γ_k,`(C)≤s· ^{2m r}

µ

k

· m

`+kµ

·(`+kµ).

Proof ofTheorem 8. We will first show by induction onk, the following.

∂_Y^=kT ⊆F-span





 [

S∈(t−k^t )

¨ Y

i∈S

Q_i(Y,Z)·z^≤kµ_ML ·F[Y]

«





 [

F-span





 [

S∈(t−k^t )

¨ Y

i∈S

Q_i(Y,Z)·z^{≤k rµ}_NonML·F[Y]

«







The base case of induction fork =0 is trivialT is already in the required form. Let us assume the induction hypothesis for all derivatives of order<k. That is,∂_Y^=k−1T can be expressed as a linear combination of terms of the form

h(X,Y) =Y

i∈S

Q_i(Y,Z)·h₁(Z)·h₂(Y)

(9)

whereSis a set of sizet−(k−1),h₁(Z)is a polynomial inZvariables of degree at most(k−1)rµ, andh₂(Y)is some polynomial inY variables. In fact,h₁(Z)can be expressed as a linear combination of multilinear monomials of degree at most(k−1)µ, and non-multilinear monomials of degree at most (k−1)rµ.

For someu∈[|Y|]and some fixedi₀inS,

∂h(Y,Z)

∂y_u =





 X

j∈S

Y

i∈Si6=j

Q_i(Y,Z)·∂Q_j(Y,Z)

∂y_u ·h₁(Z)·h₂(Y)





 +

Q

i∈SQ_i Q_i₀ ·Q_i

0(Y,Z)·h₁(Z)·∂h₂(Y)

∂y_k

∈F-span





 Y

i∈Si6=j

Q_i(Y,Z)·∂Q_j(Y,Z)

∂y_u ·h₁(Z)·F[Y]|j ∈[S]





 [F-span

¨ Q

i∈SQ_i

Q_i₀ ·Q_i₀(Y,Z)·h₁(Z)·F[Y]

«

⊆F-span





 [

T∈(|S|−1^S )

¨ Y

i∈T

Q_i(Y,Z)·z^µ_ML·h₁(Z)·F[Y]

«





 [





 [

T∈(|S|−1^S )

¨ Y

i∈T

Q_i(Y,Z)·z_NonML^≤rµ ·h₁(Z)·F[Y]

«







⊆F-span





 [

T∈(t−k^t )

¨ Y

i∈T

Q_i(Y,Z)·z^≤kµ_ML ·F[Y]

«





 [

F-span





 [

T∈(t−k^t )

¨ Y

i∈T

Q_i(Y,Z)·z^{≤k r}_NonML^µ ·F[Y]

«







The last inclusion follows from the fact thath₁(Z)is a linear combination of multilinear monomials of degree at most(k−1)µ, and non-multilinear monomials of degree at most(k−1)rµ. From the discussion above we know that any polynomial in∂_Y^k(T)can be expressed as a linear combination of polynomials of the form _∂^∂_y^h

u. Further every polynomial of the form _∂^∂_y^h

u belongs to the set W =F-span





 [

T∈(t−k^t )

¨ Y

i∈T

Q_i(Y,Z)·z^≤k_ML^µ·F[Y]

«





 [

F-span





 [

T∈(t−k^t )

¨ Y

i∈T

Q_i(Y,Z)·z^{≤k r}_NonML^µ ·F[Y]

«





 .

Thus, we get that∂⁼^kT is a subset ofW. This completes the proof by induction.

(10)

From the afore mentioned discussion, we can now derive the following expressions.

σ_Y

∂_Y⁼^kT

⊆F-span





 [

S∈(_t−k^[^t^])

¨ Y

i∈S

σ_Y(Q_i)·z^≤k_ML^µ

«





 [

F-span





 [

S∈(_t−k^[t])

¨ Y

i∈S

σ_Y(Q_i)·z^{≤k r}_NonML^µ

«







z^≤`σ_Y

∂_Y^=kT

⊆F-span





 [

S∈(t−k^[t])

¨ Y

i∈S

σ_Y(Q_i)·z^≤`+kµ_ML

«





 [F-span





 [

S∈(_t−k^[^t^])

¨ Y

i∈S

σ_Y(Q_i)·z^≤`+_NonML^{k r}^µ

«







=⇒ F-span¦ mult

z^≤`·σ_Y

∂_Y⁼^kT©

⊆F-span





 [

S∈(_t−k^[^t^])

¨ Y

i∈S

mult(σ_Y(Q_i))·z^≤k_ML^µ+`

«





 .

Thus we get that dim F-span

mult z^≤`·σ_Y(∂_Y⁼^kT) is at most

dim



F-span





 [

S∈(t−k^[t])

¨ Y

i∈S

mult(σ_Y(Q_i))·z^≤kµ+`_ML

«













≤dim



F-span





 [

S∈(t−k^t )

¨ Y

i∈S

mult(σ_Y(Q_i))

«









·dim

F-span¦

z^≤kµ+`_ML ©

≤ t

t−k

·

kµ+`

X

i=0

m i

≤ t

k

· m

`+kµ

·(`+kµ) (Since`+kµ <m/2).

3.2 Lower Bound onΓ_k,_`(f_n,_α_,k)

First we recall the generalized Hamming bound[GRS19, Section 1.7].

Claim 10. Let the vectorsa= (a₁,a₂, . . . ,a_k)andb = (b₁,b₂, . . . ,b_k)correspond to the indices of the Y -monomial y_a⁽¹⁾

1,b₁y_a⁽²⁾

2,b₂· · ·y_a⁽^k⁾

k,b_k that is used to derive f_n,_α_,k with. For every ∆₀ < k, there is a subset P_∆₀⊂[n]^2kof size ⁿ^2k−∆0

∆0(_∆0^2k) such that for all(a,b),(a⁰,b⁰)∈ P,dist((a,b),(a⁰,b⁰))≥∆₀.

(11)

Observation 11. It is important to note that ^∂^k^f^n,α,k

y_a⁽¹⁾

1,b1y_a⁽²⁾

2,b2···y^(k)

ak,bk

for any choice of(a,b)∈[n]^2kis a multilinear monomial over just the Z variables.

Claim 12. Let (a,b) and (a⁰,b⁰) be such that dist((a,b),(a⁰,b⁰)) ≥ ∆₀. Then dist

∂_(a,b)^k f_n,_α_,k,∂_(a^k₀_,b₀₎f_n,_α_,k

≥α∆₀.

Let m₁,m₂, . . . ,m_t be the monomials in the set M₀(= ∂_P⁼^k

∆0

f_n,α,k), over Z variables such that dist(m_i,m_j)≥ ∆ ≥α∆₀for alli 6= j. Further,σ_Y(M₀) =M₀. LetM be the set of mutlilinear monomials of the formm_im⁰overZ-variables for some 1≤i≤t wherem⁰is a monomial of length`.

It is important to note that the setM now corresponds to the setz^=`·σ_Y

∂_P⁼^k

∆0

f_n,_α_,k

. We shall now show that the cardinality of the setM is large enough for a suitable setting of parametersα,∆₀andk. Lemma 13. Let m,k,d,r,∆₀,∆,`andµbe positive integers such that`+kµ < ^m₂,(d−k)²=o(m),∆²= o(m),∆₀=δk and`= ^m₂(1−")for some fixed constantsδand". Then,|M | ≥¹₂ ₂

1−"

_δαk

· ^m−(_`^d−k⁾ . Proof. For alli ∈[t], LetB_i be the set of multilinear monomials of the formm_im⁰where m⁰ is a multilinear monomial of degree`. From the previous discussion, it follows that|M |=

∪^t_i=1B_i . Using the principle of Inclusion and Exclusion, we get that

∪^t_i₌₁B_i ≥

Xt i=1

|B_i| − Xt i6=j∈[t]

B_i∩B_j . Claim 14. For all i∈[t],|B_i|= ^m−(d−k)_`

.

Proof. Since deg(m_i)is equal tod−k, the cardinality ofB_i is equal to ^m−(_`^d−k⁾ . Claim 15. For all i,j ∈[t]such that i6= j ,

B_i∩B_j

≤ ^{m−(d−k)−∆}_`−∆

.

Proof. Consider any two monomials ˆm_i and ˆm_j fromB_i andB_j respectively. For ˆm_i and ˆm_j to be identical, ˆm_i should contain at least∆variables from ˆm_j\mˆ_iand similarly ˆm_j should contain at least

∆variables from ˆm_i\mˆ_j. The rest of the at most(`−∆)many variables should be the same both in ˆ

m_i and ˆm_j. The number of such multilinear monomials overZ variables is at most ^m−(_`−∆^d−k^)−∆

. PuttingTheorem 14andTheorem 15together, we get the following.

|M |=

∪^t_i₌₁B_i ≥t

m−(d−k)

`

− t² 2

m−(d−k)−∆

`−∆

.

LetT₁=t ^m−(d−k)_`

andT₂= ^t₂² ^{m−(d−k)−∆}_`−∆

. Let us consider the case whereT₂=λT₁whereλ≥1 for some setting of the parameters∆,α,`andk.

λ= T₂ T₁=

t²

2 m−(d−k)−∆

`−∆

t ^m−(_`^d−k⁾

(12)

= t

2· (m−(d−k)−∆)!

(`−∆)!(m−`−(d−k))!·(m−`−(d−k))!`!

(m−(d−k))!

= t

2·(m−(d−k)−∆)!

(m−(d−k))! · `!

(`−∆)!

= t

2·(m−(d−k)−∆)!

m! · m!

(m−(d−k))!· `!

(`−∆)!

= t

2· m⁽^d−k⁾·`^∆

m⁽^d−k^)+∆ (UsingTheorem 5)

= t 2·

` m

∆

.

The math block above crucially uses the fact that∆²=o(`)and(d−k)²=o(m)while invoking Theorem 5. Sinceλ≥1, we get that ^t₂·_`

m

_∆

≥1. For some suitably fixed constantsδand", let∆₀be set toδk and`be set to ^m₂(1−"). Recall that for a fixed∆₀,t = ⁿ^2k−∆⁰

∆0(∆^2k0) and∆=α∆₀=δαk. Thus, n^2k−∆⁰

2 _∆^2k

0

·

` m

∆

≥1 n^2k−∆⁰≥2∆₀m

` _∆

2k

∆₀

n^2k−∆⁰≥

2 1−"

_∆ 2k

∆₀ _∆₀

n^(2−δ)k≥

2 1−"

_αδ_k 2k

∆₀ _δk

and hence,

α≤(2−δ)logn−δlog₂

δ

δlog ₂

1−"

. InvokingTheorem 6with the previous discussion, we get that

|M | ≥T₁

4λ = t ^m−(_`^d^−k⁾

4λ = 1

2 m

` _∆

·

m−(d−k)

`

=1 2

2 1−"

_δαk

·

m−(d−k)

`

.

Lemma 16. Let m,k,d,r,`andµbe positive integers such that`+kµ < ^m₂,∆₀=δk and`= ^m₂(1−") for some fixed constantsδand". Then,Γ_k,`(f_n,α,k)≥ |M | ≥¹₂ ₂

1−"

_δαk

· ^m−(d_`^−k) . Proof. Recall thatM corresponds to the setz^=`·σ_Y

∂_P⁼^k

∆0

f_n,_α_,k

. SinceM is a bag of multilinear monomials over just theZ variables

|M |=dim

F-spann mult

z^=`·σ_Y

∂_P⁼^k

∆0

f_n,_α_,ko

(13)

Since∂_P⁼^k

∆0

f_n,_α_,k⊆∂⁼^kf_n,_α_,k andz^=`⊆z^≤`, we get that dim

F-span

n mult

z^=`·σ_Y

∂_P⁼^k

∆0f_n,_α_,k o

≤dim

F-span¦ mult

z^≤`·σ_Y

=Γ_k,_`(f_n,_α_,k). Thus,Γ_k,_`(f_n,_α_,k)≥ |M | ≥¹₂ ₂

1−"

_δαk

· ^m−(_`^d^−k⁾ .

We shall now a size lower bound on the depth four multi-r-ic circuits of low bottom support that compute f_n,_α_,k.

Theorem 17. Letδ =0.25 and"=0.8. Let n,r,α,k andµbe positive integers such that r ≤ n^0.2, µ≤^log₅₀ⁿ andα= ⁽^2−δ)^logⁿ

0.9δlog_1−"² .Let C be a depth four multi-r -ic circuit of low bottom supportµand size s.

If C computes the polynomial f_n,_α_,k then s must beexp(Ω(klogn)).

Proof. Recall that the polynomialf_n,α,kis defined on the variable setsYandZsuch that|Z|=m=2αnk. Let`be an integer such that`= ^m₂(1−")and`+kµ < ^m₂. Let∆₀=δk. Let us assume that the polynomial f_n,_α_,k is computed by a depth four multi-r-ic circuitC of low bottom supportµand sizes. Then it must be the case thatΓ_k,`(f_n,α,k) =Γ_k,`(C).

s≥

1 2

₂

1−"

_δαk

· ^m−(_`^d−k⁾

2m rµ

k

· _`+kµ^m

·(`+kµ)

≥ 1

2(`+kµ)·

2 1−"

_δαk

· kµ

2e m r k

·

m−(d−k)

`

`+mkµ

= 1

2(`+kµ)·

2 1−"

_δαk

· kµ

2e m r _k

·(m−(d−k))!

m! · (m−`−kµ)!

(m−`−(d−k))!·(`+kµ)!

`!

≥ 1

2(`+kµ)·

2 1−"

_δαk

· kµ

2e m r k

· `^k^µ

m⁽^d^−k⁾·(m−`)^(d^−k)−kµ

= 1

2(`+kµ)·

2 1−"

_δαk

· kµ

2e m r k

·

` m−`

kµ

·

m−` m

d−k

= 1

2(`+kµ)·

2 1−"

_δαk

· µ

4eαn r k

·

1−"

1+"

kµ

·

1+"

2

d−k

= 1

2(`+kµ)·

2 1−"

_δαk

· µ

4eαn r _k

·

1−"

1+"

_kµ

·

1+"

2

_2αk

= 1

2(`+kµ)·

2 1−"

_δ

·

1+"

2

2_αk

· µ 4eαn r

k

·

1−"

1+"

kµ

= exp

αklog ₂

1−"

_δ

·₁_+"

2

−klogn−klogr−klog^4e_µ^α+kµlog^1−"₁_+"

2(`+kµ) .

In the above math block, we use Theorem 5 to simplify the terms along with the fact that k²µ²= o(m−`), (d−k)² =o(m)andk²µ²= o(`). To get a meaningful lower bound, we need

(14)

αlog ₂

1−"

_δ

·_1+"

2

to be strictly greater than(logn+logr+log^4e_µ^α). Let us setαto ⁽^2−δ)^logⁿ

0.9δlog_1−"² . This reduces to showing that there exist constantsδ,"andνsuch that

(2−δ)·log ₂

1−"

_δ

·₁_+"

2

0.9δlog_1−"² −1≥ν. (1)

Let us fix the constants as follows: "=0.8,δ=0.25 andν=0.23. Through some calculations, it can be verified thatEquation 1gets satisfied. Thus,

s≥exp k

0.23 logn−logr−log^4eα_µ +µlog^1−"₁_+"

2(`+kµ) Ifµ≤^log₅₀ⁿ and r ≤n^0.2, we get thats≥exp(Ω(klogn)).

4 Multi-r -ic Depth Four Circuits

To prove the main theorem, we also need the following lemma.

Lemma 18(Analogous to Lemma 20.4,[Sap19]). Let P be a multi-r -ic polynomial that is computed by a syntactically multi-r -ic depth 4 circuit C of size s≤N^γµfor someγ >0. Letρbe a random restriction that sets each variable to zero independently with probability(1−N⁻²^γ). Then with probability at least (1−N^−γµ)the polynomialρ(P)is computed by a multi-r -ic depth four circuit C⁰of bottom support at mostµand size s.

Proof ofTheorem 1. Letnbe a large positive integer. Let us set some relevant parameters in terms ofn or otherwise as follows.

µ=logn

50 , "=0.8,

δ=0.25, α=(2−δ)logn

0.9δlog_1−"² , k= c

100logn N₀=k(n²+2αn),

N =N₀²⁺^clogN₀, γis a small constant such thatN²^γuN₀^c.

Let ˆX ={ˆx_1,1, ˆx_1,2, . . . , ˆx_1,t, . . . , ˆx_N₀_,1, ˆx_N₀_,2, . . . , ˆx_N₀_,t}be a set of variables over which the polynomial P_n,_α_,kis defined. Letρ: ˆX 7→ {0,∗}be a random restriction such that a variable is set to zero with a probability of(1−N^−2γ), and is left untouched otherwise. LetC be a syntactically multi-r-ic depth four circuit of sizes≤N^γµthat computesP_n,_α_,k.Theorem 18tells us thatC⁰=ρ(C)is a multi-r-ic depth four circuit of sizes and bottom support at mostµwith a probability of at least(1−N^−γµ).

Conditioned on this probability,ρ(P_n,_α_,k)has a multi-r-icΣΠΣΠ^{µ}size at mosts.

Theorem 7tells us that f_n,α,kis a p-projection ofρ(P_n,α,k)with a probability of(1−2^−N⁰)and hence f_n,_α_,kalso has a multi-r-icΣΠΣΠ^{µ}of size at mostswith a probability of at least(1−N^−γµ−2^−N⁰). In other words, there is a random restrictionσsuch that f_n,_α_,kis a p-projection ofρ(P_n,_α_,k)andC⁰=ρ(C) is a multi-r-icΣΠΣΠ^{µ}circuit.

(15)

On the other hand, fromTheorem 17we know that any multi-r-icΣΠΣΠ^{µ}circuit that computes f_n,_α_,kmust be of size exp(Ω(klogn)). Thus, it must be that exp(Ω(klogn))≤s≤N^γµ. We can choose cto be a small enough constant such that the aforementioned expression is satisfied. Thus,smust at least be exp Ω(log²n)

. The explicit polynomialQ_nisP_n,_α_,kwhereα= ⁽^2−δ)^logⁿ

0.9δlog_1−"² andk= ₁₀₀^c logn.

References

[AKV18] Noga Alon, Mrinal Kumar, and Ben Lee Volk. Unbalancing sets and an almost quadratic lower bound for syntactically multilinear arithmetic circuits. InCCC, volume 102 ofLIPIcs, pages 11:1–11:16. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2018.

[AV08] Manindra Agrawal and V. Vinay. Arithmetic circuits: A chasm at depth four. Inproceedings of Foundations of Computer Science (FOCS), pages 67–75, 2008. doi:10.1109/FOCS.2008.

32.

[BS83] Walter Baur and Volker Strassen. The complexity of partial derivatives. Theor. Comput.

Sci., 22:317–330, 1983.

[CELS18] Suryajith Chillara, Christian Engels, Nutan Limaye, and Srikanth Srinivasan. A near- optimal depth-hierarchy theorem for small-depth multilinear circuits. InFOCS, pages 934–945. IEEE Computer Society, 2018.

[CLS18] Suryajith Chillara, Nutan Limaye, and Srikanth Srinivasan. A quadratic size-hierarchy theorem for small-depth multilinear formulas. InICALP, volume 107 of LIPIcs, pages 36:1–36:13. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2018.

[CLS19] Suryajith Chillara, Nutan Limaye, and Srikanth Srinivasan. Small-depth multilinear formula lower bounds for iterated matrix multiplication with applications.SIAM J. Comput., 48(1):70–92, 2019.

[CM19] Suryajith Chillara and Partha Mukhopadhyay. Depth-4 lower bounds, determinantal complexity: A unified approach. computational complexity, May 2019. URL: https:

//doi.org/10.1007/s00037-019-00185-4,doi:10.1007/s00037-019-00185-4. [FLMS15] Hervé Fournier, Nutan Limaye, Guillaume Malod, and Srikanth Srinivasan. Lower bounds

for depth-4 formulas computing iterated matrix multiplication.SIAM J. Comput., 44(5):1173–

1201, 2015.

[GKKS14] Ankit Gupta, Pritish Kamath, Neeraj Kayal, and Ramprasad Saptharishi. Approaching the chasm at depth four. Journal of the ACM (JACM), 61(6):33, 2014.

[GRS19] Venkatesan Guruswami, Atri Rudra, and Madhu Sudan. Essential coding theory. 2019.

URL:https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/. [HS17] Sumant Hegde and Chandan Saha. Improved lower bound for multi-r-ic depth four circuits

as a function of the number of input variables.Proceedings of the Indian National Science Academy, 83(4):907–922, 2017.

[HY11] Pavel Hrubeš and Amir Yehudayoff. Homogeneous formulas and symmetric polynomials.

Computational Complexity, 20(3):559–578, 2011.