
Proof of the generalized von Neumann theorem

In this section we prove Theorem 4.5. In a nutshell, our argument here will be a rigorous implementation of the following scheme:

polynomial average
→ weighted linear average $+\,o(1)$ (van der Corput)
→ weighted parallelopiped average $+\,o(1)$ (weighted gen. von Neumann)
→ unweighted parallelopiped average $+\,o(1)$ (Cauchy–Schwarz).

The argument is based upon that used to prove [18, Proposition 5.3], namely repeated application of the Cauchy–Schwarz inequality to replace various functions $g_j$ by $\nu$ (or $\nu+1$), followed by application of the polynomial forms condition (Definition 3.6) to replace the resulting polynomial averages of $\nu$ by $1+o(1)$. The major new ingredient in the argument compared to [18] will be the polynomial exhaustion theorem (PET) induction scheme (used for instance in [6]) in order to estimate the polynomial average in (25) by a linear average similar to that treated in [18, Proposition 5.3]. After using PET induction to achieve this linearization, the rest of the proof is broadly similar to that in [18, Proposition 5.3], except for the fact that the shift parameters are restricted to be of size $M$ or $\sqrt{M}$ rather than $N$, and that there is also some additional averaging over short shift parameters of size $O(H)$.

The arguments are elementary and straightforward, but will require a rather large amount of new notation in order to keep track of all the weights and factors created by applications of the Cauchy–Schwarz inequality. Fortunately, none of this notation will be needed in any other section; indeed, this section can be read independently of the rest of the paper (although it of course relies on the material in earlier sections, and also on Appendix A).

We begin with some simple reductions. First observe (as in [18]) that Lemma 3.14 allows us to replace the hypotheses $|g_j|\leq 1+\nu$ by the slightly stronger hypotheses $|g_j|\leq\nu$, at the (acceptable) cost of worsening the implicit constant in (25) by a factor of $2^k$. Next, we claim that it suffices to find $d$, $t$, $c$ and $Q\in\mathbf{Z}[\mathbf{h}_1,\ldots,\mathbf{h}_t,W]^d$ for which we have the weaker estimate
$$\Big|\mathbf{E}_{m\in[M]}\int_X T^{P_1(Wm)/W}g_1\cdots T^{P_k(Wm)/W}g_k\Big|\ll\|g_1\|_{U^{Q([H]^t,W)}_{\sqrt{M}}}^{c}+o(1)\tag{30}$$
(i.e. we only control the average using the norm of $g_1$, rather than the best norm of all of the $g_j$'s). Indeed, if we could show this, then by symmetry we could find $d_j$, $t_j$, $c_j$ and $Q_j\in\mathbf{Z}[\mathbf{h}_1,\ldots,\mathbf{h}_{t_j},W]^{d_j}$ such that
$$\Big|\mathbf{E}_{m\in[M]}\int_X T^{P_1(Wm)/W}g_1\cdots T^{P_k(Wm)/W}g_k\Big|\ll\|g_j\|_{U^{Q_j([H]^{t_j},W)}_{\sqrt{M}}}^{c_j}+o(1)$$
whenever $j\in[k]$ and $\nu$ is pseudorandom. The claim then follows by using Lemma A.3 to obtain a local Gowers norm $U^{Q([H]^t,W)}_{\sqrt{M}}$ which dominates each of the individual norms $U^{Q_j([H]^{t_j},W)}_{\sqrt{M}}$, and taking $c:=\min_{j\in[k]}c_j$. (Note that the pointwise bound $|g_j|\leq\nu$ and the polynomial forms condition easily imply that the $U^{Q_j([H]^{t_j},W)}_{\sqrt{M}}$ norm of $g_j$ is $O(1)$.)

It remains to prove (30). It should come as no surprise to the experts that this type of “generalized von Neumann” theorem will be proven via a large number of applications of van der Corput's lemma and the Cauchy–Schwarz inequality. In order to keep track of the intermediate multilinear expressions which arise during this process, it is convenient to prove a substantial generalization of this estimate. We first need the notion of a polynomial system, and averages associated with such systems.

Definition 5.1. (Polynomial system) A polynomial system $S$ consists of the following objects:

• An integer $D\geq 0$, which we call the number of fine degrees of freedom;

• A non-empty finite index set $A$ (the elements of which we shall refer to as nodes of the system);

• A polynomial $R_\alpha\in\mathbf{Z}[\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,W]$ in $D+2$ variables attached to each node $\alpha\in A$;

• A distinguished node $\alpha_0\in A$;

• A (possibly empty) collection $A_0\subset A\setminus\{\alpha_0\}$ of inactive nodes. The nodes in $A\setminus A_0$ will be referred to as active. Thus for instance the distinguished node $\alpha_0$ is always active.

We say that a node $\alpha\in A$ is linear if $R_\alpha-R_{\alpha_0}$ is at most linear in $\mathbf{m}$; thus the distinguished node is always linear. We say that the entire system $S$ is linear if every active node is linear. We make the following non-degeneracy assumptions:

• If $\alpha$ and $\beta$ are distinct nodes in $A$, then $R_\alpha-R_\beta$ is not constant in $\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D$.

• If $\alpha$ and $\beta$ are distinct linear nodes in $A$, then $R_\alpha-R_\beta$ is not constant in $\mathbf{m}$.

Given any two nodes $\alpha$ and $\beta$, we define the distance $d(\alpha,\beta)$ between the two nodes to be the $\mathbf{m}$-degree of the polynomial $R_\alpha-R_\beta$ (which is non-zero by hypothesis); thus this distance is symmetric, non-negative, and obeys the non-Archimedean triangle inequality
$$d(\alpha,\gamma)\leq\max(d(\alpha,\beta),d(\beta,\gamma)).$$
Note that $\alpha$ is linear if and only if $d(\alpha,\alpha_0)\leq 1$, and furthermore we have $d(\alpha,\beta)=1$ for any two distinct linear nodes $\alpha$ and $\beta$.

Example 5.2. Take $D:=0$, $A:=\{1,2,3\}$, with $R_1:=0$, $R_2:=\mathbf{m}$ and $R_3:=\mathbf{m}^2$, and let 3 be the distinguished node. Then the node 3 is linear and the other two are non-linear. (If the distinguished node was 1 or 2, the situation would be reversed.)
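The distance of Definition 5.1 can be checked mechanically. The following sketch is ours, not part of the paper: polynomials in the single variable $\mathbf{m}$ are represented by coefficient lists, and the helper names are illustrative. It computes $d$ for Example 5.2 and confirms the non-Archimedean triangle inequality $d(1,3)\leq\max(d(1,2),d(2,3))$.

```python
# Illustrative sketch (ours): polynomials in m are coefficient lists
# [c0, c1, c2, ...]; d(a, b) is the m-degree of R[a] - R[b], as in
# Definition 5.1.

def m_degree(coeffs):
    """Degree in m of a polynomial given by its coefficient list (-1 if 0)."""
    deg = -1
    for i, c in enumerate(coeffs):
        if c != 0:
            deg = i
    return deg

def distance(R, a, b):
    """d(a, b) = m-degree of R[a] - R[b] (lists padded to a common length)."""
    p, q = R[a], R[b]
    n = max(len(p), len(q))
    diff = [(p[i] if i < len(p) else 0) - (q[i] if i < len(q) else 0)
            for i in range(n)]
    return m_degree(diff)

# Example 5.2: R_1 = 0, R_2 = m, R_3 = m^2.
R = {1: [0], 2: [0, 1], 3: [0, 0, 1]}
print(distance(R, 1, 2), distance(R, 1, 3), distance(R, 2, 3))  # 1 2 2
```

In particular $d(1,3)=2\leq\max(1,2)$, as the ultrametric inequality predicts.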

Remark 5.3. The non-Archimedean semi-metric is naturally identifiable with a tree whose leaves are the nodes ofS, and whose intermediate nodes are balls with respect to this semi-metric; the distance between two nodes is then the height of their join. It is this tree structure (and the distinction of nodes into active, inactive and distinguished) that shall implicitly govern the dynamics of the PET induction scheme which we shall shortly perform. We will however omit the details, as we shall not explicitly use this tree structure in this paper.

Definition 5.4. (Realizations and averages) Let $S$ be a polynomial system and $\nu$ be a measure. We define a $\nu$-realization $f=(f_\alpha)_{\alpha\in A}$ of $S$ to be an assignment of a function $f_\alpha\colon X\to\mathbf{R}$ to each node $\alpha$ with the following properties:

• for any node $\alpha$, we have the pointwise bound $|f_\alpha|\leq\nu$;

• for any inactive node $\alpha$, we have $f_\alpha=\nu$.

We refer to the function $f_{\alpha_0}$ attached to the distinguished node $\alpha_0$ as the distinguished function. We define the average $\Lambda_S(f)\in\mathbf{R}$ of a system $S$ and its $\nu$-realization $f$ to be the quantity
$$\Lambda_S(f):=\mathbf{E}_{h_1,\ldots,h_D\in[H]}\mathbf{E}_{m\in[M]}\int_X\prod_{\alpha\in A}T^{R_\alpha(m,h_1,\ldots,h_D,W)}f_\alpha.$$

Example 5.5. If $S$ is the system in Example 5.2, then
$$\Lambda_S(f)=\mathbf{E}_{m\in[M]}\int_X f_1\,T^m f_2\,T^{m^2}f_3.\tag{31}$$

Example 5.6. The average
$$\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}\int_X\nu\,T^{m+h}f_2\,T^{m+h'}f_2\,T^{(m+h)^2}f_3\,T^{(m+h')^2}f_3$$
can be written in the form $\Lambda_S(f)$ with distinguished function $f_3$, where $S$ is a system with $D:=2$ and $A:=\{1,2,2',3,3'\}$, with the node 1 inactive and distinguished node 3, with $R_1:=0$, $R_2:=\mathbf{m}+\mathbf{h}_1$, $R_{2'}:=\mathbf{m}+\mathbf{h}_2$, $R_3:=(\mathbf{m}+\mathbf{h}_1)^2$ and $R_{3'}:=(\mathbf{m}+\mathbf{h}_2)^2$, and $f$ is given by $f_1:=\nu$, $f_{2'}:=f_2$ and $f_{3'}:=f_3$.

Example 5.7. (Base example) Let $S$ be the system with $D:=0$, $A:=\{1,\ldots,k\}$, $\alpha_0:=1$, $A_0=\varnothing$ (thus all nodes are active) and $R_j:=P_j(W\mathbf{m})/W$. We observe from (3) that this is indeed a system. Then $f:=(g_1,\ldots,g_k)$ is a $\nu$-realization of $S$ with distinguished function $g_1$, and
$$\Lambda_S(f)=\mathbf{E}_{m\in[M]}\int_X T^{P_1(Wm)/W}g_1\cdots T^{P_k(Wm)/W}g_k.$$
This system $S$ is linear if and only if the polynomials $P_j-P_{j'}$ are all linear.

Remark 5.8. (Translation invariance) Given a polynomial system $S$ and a polynomial $R\in\mathbf{Z}[\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,W]$, we can define the shifted polynomial system $S-R$ by replacing each of the polynomials $R_\alpha$ by $R_\alpha-R$; it is easy to verify that this does not affect any of the characteristics of the system, and in particular we have $\Lambda_S(f)=\Lambda_{S-R}(f)$ for any $\nu$-realization $f$ of $S$ (and hence of $S-R$). This translation invariance gives us the freedom to set any single polynomial $R_\alpha$ of our choosing to equal 0; indeed, we shall exploit this freedom whenever we wish to use van der Corput's lemma or the Cauchy–Schwarz inequality to deactivate any given node.
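This invariance can be seen concretely in a finite toy model (ours, not the paper's setting): take $X=\mathbf{Z}/N$ with the shift $Tx=x+1$ and uniform measure. Subtracting a common polynomial $R$ from every exponent $R_\alpha$ amounts to the measure-preserving substitution $x\mapsto x+R(m)$, so the average is unchanged.

```python
# Toy model (ours): X = Z/N with shift T x = x + 1 and uniform measure.
# Subtracting a common polynomial from every exponent leaves the average
# unchanged, by the substitution x -> x + R(m) on Z/N.
import random

random.seed(2)
N, M = 101, 50
f1 = [random.random() for _ in range(N)]
f2 = [random.random() for _ in range(N)]

def avg(Rs):
    # E_{m in [M]} E_{x in Z/N} f1(x + R_1(m)) f2(x + R_2(m))
    total = 0.0
    for m in range(M):
        for x in range(N):
            total += f1[(x + Rs[0](m)) % N] * f2[(x + Rs[1](m)) % N]
    return total / (M * N)

R = [lambda m: m, lambda m: m * m]
Rshift = [lambda m: 0, lambda m: m * m - m]  # subtract R = m from both nodes
print(abs(avg(R) - avg(Rshift)) < 1e-9)  # True
```

Here the second system is the first one with the polynomial $R(\mathbf m)=\mathbf m$ subtracted from each node, which "deactivates" the first exponent exactly as in the remark.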

The estimate (30) then follows immediately from the following proposition.

Proposition 5.9. (Generalized von Neumann theorem for polynomial systems) Let $S$ be a polynomial system with distinguished node $\alpha_0$. Then, if $\eta_1$ is sufficiently small depending on $S$ and $\alpha_0$, there exist $d,t\geq 0$, $Q\in\mathbf{Z}[\mathbf{h}_1,\ldots,\mathbf{h}_t,W]^d$ and $c>0$ depending only on $S$ and $\alpha_0$ such that one has the bound
$$|\Lambda_S(f)|\ll_S\|f_{\alpha_0}\|_{U^{Q([H]^t,W)}_{\sqrt{M}}}^{c}+o_S(1)$$
whenever $\nu$ is a pseudorandom measure and $f$ is a $\nu$-realization of $S$ with distinguished function $f_{\alpha_0}$.

Indeed, one simply applies this proposition to Example 5.7 to conclude (30).

It remains to prove Proposition 5.9. This will be done in three stages. The first is the “linearization” stage, in which a weighted form of van der Corput's lemma and the polynomial forms condition are applied repeatedly (using the PET induction scheme) to reduce the proof of Proposition 5.9 to the case where the system $S$ is linear. The second stage is the “parallelopipedization” stage, in which one uses a weighted variant of the “Cauchy–Schwarz–Gowers inequality” to estimate the average $\Lambda_S(f)$ associated with a linear system $S$ by a weighted average of the distinguished function $f_{\alpha_0}$ over parallelopipeds. Finally, there is a relatively simple “Cauchy–Schwarz” stage in which the polynomial forms condition is used one last time to replace the weights by the constant 1, at which point the proof of Proposition 5.9 is complete. We remark that the latter two stages (dealing with the linear system case) also appeared in [18]; the new feature here is the initial linearization step, which can be viewed as a weighted variant of the usual polynomial generalized von Neumann theorem (see e.g. [6]). This linearization step is also the step which shall use the fine shifts $h=O(H)$ (for reasons which will be clearer in §6); this should be contrasted with the parallelopipedization step, which relies on coarse-scale shifts $m=O(\sqrt{M})$.

5.10. PET induction and linearization

We now reduce Proposition 5.9 to the linear case. We shall rely heavily here on van der Corput’s lemma, which in practical terms allows us to deactivate any given node at the expense of duplicating all the other nodes in the system. Since this operation tends to increase the number of active nodes in the system, it is not immediately obvious that iterating this operation will eventually simplify the system. To make this more clear we need to introduce the notion of a weight vector.

Definition 5.11. (Weight vector) A weight vector is an infinite vector $\vec{w}=(w_1,w_2,\ldots)$ of non-negative integers $w_j$, with only finitely many of the $w_j$ being non-zero. Given two weight vectors $\vec{w}=(w_1,w_2,\ldots)$ and $\vec{w}'=(w_1',w_2',\ldots)$, we say that $\vec{w}<\vec{w}'$ if there exists $k\geq 1$ such that $w_k<w_k'$, and such that $w_j=w_j'$ for all $j>k$. We say that a weight vector is linear if $w_j=0$ for all $j\geq 2$.

It is well known that the space of all weight vectors forms a well-ordered set; indeed, it is isomorphic to the ordinal $\omega^\omega$. In particular, we may perform strong induction on this space. The space of linear weight vectors forms an order ideal; indeed, a weight vector is linear if and only if it is less than $(0,1,0,\ldots)$.
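The ordering of Definition 5.11 is simply comparison at the highest index where two vectors differ. The sketch below is ours (not the paper's); weight vectors are truncated to finite tuples, all later entries being zero.

```python
# Illustrative sketch (ours) of the order on weight vectors from
# Definition 5.11: w < w' iff w_k < w'_k for some k and w_j = w'_j
# for every j > k.

def less(w, wp):
    n = max(len(w), len(wp))
    w = tuple(w) + (0,) * (n - len(w))
    wp = tuple(wp) + (0,) * (n - len(wp))
    for k in reversed(range(n)):  # scan from the highest index down
        if w[k] != wp[k]:
            return w[k] < wp[k]
    return False

def is_linear(w):
    return all(x == 0 for x in w[1:])

# The weight (0,1) of the system in Example 5.6 is smaller than the
# weight (1,1) of the system in Example 5.2 (cf. Example 5.13):
print(less((0, 1), (1, 1)))      # True
# Any linear weight vector lies below (0,1,0,...):
print(less((10**6, 0), (0, 1)))  # True
```

Note that the first component may grow enormously while the vector still decreases in this order, which is exactly why the van der Corput step (which doubles nodes at small distances) is compatible with the induction.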

Definition 5.12. (Weight) Let $S$ be a polynomial system, and let $\alpha$ be a node in $S$ (in practice this will not be the distinguished node $\alpha_0$). We say that two nodes $\beta$ and $\gamma$ in $S$ are equivalent relative to $\alpha$ if $d(\beta,\gamma)<d(\alpha,\beta)$. This is an equivalence relation on the nodes of $S$, and the equivalence classes have a well-defined distance from $\alpha$. We define the weight vector $\vec{w}_\alpha(S)$ of $S$ relative to $\alpha$ by setting the $j$th component, for any $j\geq 1$, to equal the number of equivalence classes of active nodes at distance $j$ from $\alpha$.

Example 5.13. Consider the system in Example 5.2. The weight of this system relative to the node 1 is $(1,1,0,\ldots)$, whereas the weight of the system in Example 5.6 relative to the node 2 is $(0,1,0,\ldots)$ (note that the inactive node 1 is not relevant here, nor is the node $2'$, which has distance 0 from 2); this is a smaller weight than that of the previous system.
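The weight computation can be mechanized. In the sketch below (ours, not the paper's: the finite-difference degree test and the fixed generic values $h_1=7$, $h_2=13$ are illustrative devices, and the keys 21 and 31 stand for the nodes $2'$ and $3'$) we recover the two weights of Example 5.13.

```python
# Illustrative sketch (ours) of Definition 5.12, truncated to 4 components.

def m_degree(f, hs, max_deg=4):
    """m-degree of f(m, h1, h2), read off from finite differences in m."""
    vals = [f(m, *hs) for m in range(max_deg + 2)]
    deg = -1
    for j in range(max_deg + 1):
        if any(v != 0 for v in vals):
            deg = j  # the j-th difference of a degree-< j polynomial vanishes
        vals = [vals[i + 1] - vals[i] for i in range(len(vals) - 1)]
    return deg

def weight(R, active, alpha, max_deg=4):
    hs = (7, 13)  # fixed generic integer values for h1, h2
    def d(a, b):
        return m_degree(lambda m, *h: R[a](m, *h) - R[b](m, *h), hs, max_deg)
    w = [0] * max_deg
    for j in range(1, max_deg + 1):
        classes = []  # classes of active nodes: beta ~ gamma iff d(beta,gamma) < j
        for b in (b for b in active if d(alpha, b) == j):
            for cl in classes:
                if d(b, cl[0]) < j:
                    cl.append(b)
                    break
            else:
                classes.append([b])
        w[j - 1] = len(classes)
    return tuple(w)

# Example 5.2 (D = 0; the h-arguments are unused):
R52 = {1: lambda m, *h: 0, 2: lambda m, *h: m, 3: lambda m, *h: m * m}
print(weight(R52, [1, 2, 3], 1))  # (1, 1, 0, 0)

# Example 5.6, active nodes only (node 1 is inactive):
R56 = {2: lambda m, h1, h2: m + h1, 21: lambda m, h1, h2: m + h2,
       3: lambda m, h1, h2: (m + h1) ** 2, 31: lambda m, h1, h2: (m + h2) ** 2}
print(weight(R56, [2, 21, 3, 31], 2))  # (0, 1, 0, 0)
```

The nodes 3 and $3'$ merge into a single class at distance 2 because $d(3,3')=1<2$, which is the mechanism that keeps the weight from growing under the duplication step below.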

The key inductive step in the reduction to the linear case is then the following.

Proposition 5.14. (Inductive step of linearization) Let $S$ be a polynomial system with distinguished node $\alpha_0$ and a non-linear active node $\alpha$. If $\eta_1$ is sufficiently small depending on $S$, $\alpha_0$ and $\alpha$, then there exists a polynomial system $S'$ with distinguished node $\alpha_0'$ and an active node $\alpha'$ with $\vec{w}_{\alpha'}(S')<\vec{w}_\alpha(S)$, with the following property: given any pseudorandom measure $\nu$ and any $\nu$-realization $f$ of $S$, there exists a $\nu$-realization $f'$ of $S'$ with the same distinguished function (thus $f_{\alpha_0}=f'_{\alpha_0'}$) such that
$$|\Lambda_S(f)|^2\ll\Lambda_{S'}(f')+o_S(1).\tag{32}$$

Indeed, given this proposition, a strong induction on the weight vector $\vec{w}_\alpha(S)$ immediately implies that, in order to prove Proposition 5.9, it suffices to do so for linear systems (since these, by definition, are the only systems without non-linear active nodes).

Before we prove this proposition in general, it is instructive to give an example.

Example 5.15. Consider the expression (31) with $f_1$, $f_2$ and $f_3$ bounded pointwise by $\nu$. We rewrite this expression as
$$\Lambda_S(f)=\int_X f_1\,\mathbf{E}_{m\in[M]}T^m f_2\,T^{m^2}f_3$$
and thus, by the Cauchy–Schwarz inequality (94),
$$|\Lambda_S(f)|^2\leq\Big(\int_X\nu\Big)\Big(\int_X\nu\,\big|\mathbf{E}_{m\in[M]}T^m f_2\,T^{m^2}f_3\big|^2\Big).$$
By (15), the first factor is $1+o(1)$. Also, from van der Corput's lemma (Lemma A.1), we have
$$\big|\mathbf{E}_{m\in[M]}T^m f_2\,T^{m^2}f_3\big|^2\leq\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}T^{m+h}f_2\,T^{m+h'}f_2\,T^{(m+h)^2}f_3\,T^{(m+h')^2}f_3+o(1).$$
We may thus conclude a bound of the form (32), where $\Lambda_{S'}(f')$ is the quantity studied in Example 5.6. Note from Example 5.13 that $S'$ has a smaller weight than $S$ relative to suitably chosen nodes.
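The two analytic moves in this example, shifting the averaging parameter and Cauchy–Schwarz, can be sanity-checked numerically for scalar sequences. The sketch below is ours: the error constant $4H/M$ comes from the crude bounds $|\mathbf{E}_m a(m)-\mathbf{E}_m a(m+h)|\leq 2H/M$ and $|x^2-y^2|\leq 2|x-y|$ for 1-bounded quantities, and the double $h,h'$ average on the right-hand side has been collapsed to $\mathbf{E}_m(\mathbf{E}_h a(m+h))^2$ by expanding the square.

```python
# Toy check (ours) of the van der Corput inequality
#   |E_{m in [M]} a(m)|^2 <= E_{h,h' in [H]} E_{m in [M]} a(m+h) a(m+h') + O(H/M)
# for a 1-bounded sequence a.  The double average over h, h' equals
# E_m (E_h a(m+h))^2 after expanding the square.
import random

random.seed(0)
M, H = 2000, 40
a = [random.uniform(-1.0, 1.0) for _ in range(M + H)]

lhs = (sum(a[:M]) / M) ** 2
rhs = sum((sum(a[m + h] for h in range(H)) / H) ** 2 for m in range(M)) / M

# The shift-and-Cauchy-Schwarz argument loses at most 4H/M here.
assert lhs <= rhs + 4 * H / M
print(round(lhs, 6), round(rhs, 6))
```

The same two moves, performed with the weight $\nu$ carried along, are exactly what the proof of Proposition 5.14 below does in general.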

Proof of Proposition 5.14. By using translation invariance (Remark 5.8), we may normalize $R_\alpha=0$. We split $A=A_0\cup A_1$, where $A_0:=\{\beta\in A:d(\alpha,\beta)=0\}$ and $A_1:=A\setminus A_0$. Since $\alpha$ is non-linear, the distinguished node $\alpha_0$ lies in $A_1$. Then $R_\beta$ is independent of $\mathbf{m}$ for all $\beta\in A_0$:
$$R_\beta(\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,W)=R_\beta(\mathbf{h}_1,\ldots,\mathbf{h}_D,W).$$
We can then write
$$\Lambda_S(f)=\mathbf{E}_{h_1,\ldots,h_D\in[H]}\int_X F_{h_1,\ldots,h_D}\,\mathbf{E}_{m\in[M]}G_{m,h_1,\ldots,h_D},$$
where
$$F_{h_1,\ldots,h_D}:=\prod_{\beta\in A_0}T^{R_\beta(h_1,\ldots,h_D,W)}f_\beta$$
and
$$G_{m,h_1,\ldots,h_D}:=\prod_{\beta\in A_1}T^{R_\beta(m,h_1,\ldots,h_D,W)}f_\beta.$$
Since each $|f_\beta|$ is bounded pointwise by $\nu$, we have $|F_{h_1,\ldots,h_D}|\leq H_{h_1,\ldots,h_D}$, where
$$H_{h_1,\ldots,h_D}:=\prod_{\beta\in A_0}T^{R_\beta(h_1,\ldots,h_D,W)}\nu,\tag{33}$$

and thus, by the Cauchy–Schwarz inequality,
$$|\Lambda_S(f)|^2\leq\Big(\mathbf{E}_{h_1,\ldots,h_D\in[H]}\int_X H_{h_1,\ldots,h_D}\Big)\Big(\mathbf{E}_{h_1,\ldots,h_D\in[H]}\int_X H_{h_1,\ldots,h_D}\,\big|\mathbf{E}_{m\in[M]}G_{m,h_1,\ldots,h_D}\big|^2\Big).$$
Since $\nu$ is pseudorandom and thus obeys the polynomial forms condition, we see from Definition 3.6 and (33) (taking $\eta_1$ sufficiently small) that
$$\mathbf{E}_{h_1,\ldots,h_D\in[H]}\int_X H_{h_1,\ldots,h_D}=1+o_S(1)$$
(note by hypothesis that the $R_\beta-R_{\beta'}$ are not constant in $\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D$). Since we are always assuming $N$ to be large, the $o_S(1)$ error is bounded. Thus we reduce to showing that
$$\mathbf{E}_{h_1,\ldots,h_D\in[H]}\int_X H_{h_1,\ldots,h_D}\,\big|\mathbf{E}_{m\in[M]}G_{m,h_1,\ldots,h_D}\big|^2\ll\Lambda_{S'}(f')+o_S(1)$$
for some suitable $S'$ and $f'$. But by the van der Corput lemma (Lemma A.1) (using (16) to get the upper bounds on $G_{m,h_1,\ldots,h_D}$), we have
$$\big|\mathbf{E}_{m\in[M]}G_{m,h_1,\ldots,h_D}\big|^2\ll\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}G_{m+h,h_1,\ldots,h_D}\,G_{m+h',h_1,\ldots,h_D}+o(1),$$
and so, to finish the proof, it suffices to verify that the expression
$$\mathbf{E}_{h_1,\ldots,h_D,h,h'\in[H]}\mathbf{E}_{m\in[M]}\int_X H_{h_1,\ldots,h_D}\,G_{m+h,h_1,\ldots,h_D}\,G_{m+h',h_1,\ldots,h_D}\tag{34}$$
is of the form $\Lambda_{S'}(f')$ for some suitable $S'$ and $f'$. By inspection, we see that we can construct $S'$ and $f'$ as follows:

• We have $D+2$ fine degrees of freedom, which we label $\mathbf{h}_1,\ldots,\mathbf{h}_D,\mathbf{h}$ and $\mathbf{h}'$.

• The nodes $A'$ of $S'$ are $A':=A_0\cup A_1\cup A_1'$, where $A_1'$ is another copy of $A_1$ (disjoint from $A_0\cup A_1$), with distinguished node $\alpha_0':=\alpha_0\in A_1$.

• We choose the node $\alpha'$ to be an active node of $S$ in $A_1$ which has minimal distance from $\alpha$. (Note that $A_1$ always contains at least one active node, namely $\alpha_0$.)

• If $\beta\in A_0$, then $\beta$ is inactive in $S'$, with
$$R'_\beta(\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,\mathbf{h},\mathbf{h}',W):=R_\beta(\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,W)\quad\text{and}\quad f'_\beta:=\nu.$$

• If $\beta\in A_1$, then $\beta$ is inactive in $S'$ if and only if it is inactive in $S$, with
$$R'_\beta(\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,\mathbf{h},\mathbf{h}',W):=R_\beta(\mathbf{m}+\mathbf{h},\mathbf{h}_1,\ldots,\mathbf{h}_D,W)\quad\text{and}\quad f'_\beta:=f_\beta.$$

• If $\beta'\in A_1'$ is the counterpart of some node $\beta\in A_1$, then $\beta'$ is inactive in $S'$ if and only if $\beta$ is inactive in $S$, with
$$R'_{\beta'}(\mathbf{m},\mathbf{h}_1,\ldots,\mathbf{h}_D,\mathbf{h},\mathbf{h}',W):=R_\beta(\mathbf{m}+\mathbf{h}',\mathbf{h}_1,\ldots,\mathbf{h}_D,W)\quad\text{and}\quad f'_{\beta'}:=f_\beta.$$

It is then straightforward to verify that $S'$ is a polynomial system, that $f'$ is a realization of $S'$, and that (34) is equal to $\Lambda_{S'}(f')$. It remains to show that $\vec{w}_{\alpha'}(S')<\vec{w}_\alpha(S)$. Let $d$ be the degree in $\mathbf{m}$ of $R_\alpha-R_{\alpha'}$, and thus $d\geq 1$. One easily verifies that the $j$th component of $\vec{w}_{\alpha'}(S')$ is equal to that of $\vec{w}_\alpha(S)$ for $j>d$, and equal to one less than that of $\vec{w}_\alpha(S)$ when $j=d$ (basically due to the deactivation of all of the nodes in $A_0$). The claim follows. (The behavior of these weight vectors for $j<d$ is much more complicated, but is fortunately not relevant due to our choice of ordering on weight vectors.)

5.16. Parallelopipedization

By the preceding discussion, we see that to prove Proposition 5.9 it suffices to do so in the case where S is linear. To motivate the argument, let us first work through an unweighted example (withν=1).

Example 5.17. (Unweighted linear case) Consider the linear average
$$\Lambda_S(f)=\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}\int_X f_0\,T^{hm}f_1\,T^{h'm}f_2,$$
with distinguished function $f_0$, and with $|f_0|$, $|f_1|$ and $|f_2|$ bounded pointwise by 1. We introduce some new coarse-scale shift parameters $m_1,m_2\in[\sqrt{M}]$. By shifting $m$ to $m-m_1-m_2$, one can express the above average as
$$\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}\int_X\mathbf{E}_{m_1,m_2\in[\sqrt{M}]}f_0\,T^{h(m-m_1-m_2)}f_1\,T^{h'(m-m_1-m_2)}f_2+o(1),$$
and then, shifting the integral by $T^{hm_1+h'm_2}$, we obtain
$$\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}\int_X\mathbf{E}_{m_1,m_2\in[\sqrt{M}]}T^{hm_1+h'm_2}f_0\,T^{h(m-m_2)+h'm_2}f_1\,T^{h'(m-m_1)+hm_1}f_2+o(1).$$

The point is that $T^{h(m-m_2)+h'm_2}f_1$ does not depend on $m_1$, while $T^{h'(m-m_1)+hm_1}f_2$ does not depend on $m_2$. One can then use the Cauchy–Schwarz–Gowers inequality (see e.g. [19, Corollary B.3]) to estimate this expression by
$$\mathbf{E}_{h,h'\in[H]}\mathbf{E}_{m\in[M]}\Big(\int_X\mathbf{E}_{m_1,m_1',m_2,m_2'\in[\sqrt{M}]}T^{hm_1+h'm_2}f_0\,T^{hm_1'+h'm_2}f_0\,T^{hm_1+h'm_2'}f_0\,T^{hm_1'+h'm_2'}f_0\Big)^{1/4}+o(1).$$
The main term here can then be recognized as a local Gowers norm (24).

Now we return to the general linear case. Here we will need to address the presence of many additional weights which are all shifted versions of the measureν, which requires the repeated use of weighted Cauchy–Schwarz inequalities. See [18, §5] for a worked example of this type of computation. Our arguments here shall instead follow those of [19, Appendix C], in particular relying on the weighted generalized von Neumann inequality from that paper (reproduced here as Proposition A.2).

We turn to the details. To simplify the notation we write $\vec{h}:=(h_1,\ldots,h_D)$ and $\vec{\mathbf{h}}:=(\mathbf{h}_1,\ldots,\mathbf{h}_D)$. We use the translation invariance (Remark 5.8) to normalize $R_{\alpha_0}=0$. We then split $A=\{\alpha_0\}\cup A_{\mathrm{l}}\cup A_{\mathrm{nl}}$, where $A_{\mathrm{l}}$ consists of all the remaining linear nodes, and $A_{\mathrm{nl}}$ of all the non-linear (and hence inactive) nodes. By the non-degeneracy assumptions in Definition 5.1, we may write
$$R_\alpha=b_\alpha\mathbf{m}+c_\alpha$$
for all $\alpha\in A_{\mathrm{l}}$ and some $b_\alpha,c_\alpha\in\mathbf{Z}[\vec{\mathbf{h}},W]$ with the $b_\alpha$'s all distinct and non-zero. We can then write
$$\Lambda_S(f)=\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{m\in[M]}\int_X f_{\alpha_0}\prod_{\alpha\in A_{\mathrm{nl}}}T^{R_\alpha(m,\vec{h},W)}\nu\prod_{\alpha\in A_{\mathrm{l}}}T^{b_\alpha m+c_\alpha}f_\alpha.$$

We introduce some new coarse-scale shift parameters $m_\alpha\in[\sqrt{M}]$ for $\alpha\in A_{\mathrm{l}}$, and thus the vector $\vec{m}:=(m_\alpha)_{\alpha\in A_{\mathrm{l}}}$ lies in $[\sqrt{M}]^{A_{\mathrm{l}}}$. We shift $m$ to $m-\sum_{\alpha\in A_{\mathrm{l}}}m_\alpha$ and observe that
$$\mathbf{E}_{m\in[M]}x_m=\mathbf{E}_{m\in[M]}x_{m-\sum_{\alpha\in A_{\mathrm{l}}}m_\alpha}+o(1)$$
whenever $m_\alpha\in[\sqrt{M}]$ and $x_m\ll_\varepsilon N^\varepsilon$ for all $\varepsilon>0$. Averaging this in $\vec{m}$ (cf. (96)), we obtain
$$\mathbf{E}_{m\in[M]}x_m=\mathbf{E}_{\vec{m}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\mathbf{E}_{m\in[M]}x_{m-\sum_{\alpha\in A_{\mathrm{l}}}m_\alpha}+o(1).$$

Applying this (and (16)), and shifting the integral by $T^{Q_0(\vec{h},\vec{m},W)}$, where
$$Q_0(\vec{h},\vec{m},W):=\sum_{\beta\in A_{\mathrm{l}}}b_\beta(\vec{h},W)m_\beta,$$
we may write the factor attached to each linear node $\alpha\in A_{\mathrm{l}}$ as a function $f_{\alpha,m,\vec{h},\vec{m},W}$ which is independent of the coarse-scale parameter $m_\alpha$. Also observe the pointwise bound
$$|f_{\alpha,m,\vec{h},\vec{m},W}|\leq\nu_{\alpha,m,\vec{h},\vec{m},W},$$
where
$$\nu_{\alpha,m,\vec{h},\vec{m},W}:=T^{b_\alpha(\vec{h},W)m+c_\alpha(\vec{h},W)+\sum_{\beta\in A_{\mathrm{l}}}(b_\beta(\vec{h},W)-b_\alpha(\vec{h},W))m_\beta}\nu.$$

By applying the weighted generalized von Neumann inequality (Proposition A.2) in the $\vec{m}$ variables, we may thus bound $|\Lambda_S(f)|$ by a weighted local Gowers average (36) of the distinguished function $f_{\alpha_0}$, multiplied by weighted box norms(26) of the measures $\nu_{\alpha,m,\vec{h},\cdot,W}$, as defined in Appendix A. We now claim the estimate
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{m\in[M]}\big\|\nu_{\alpha,m,\vec{h},\cdot,W}\big\|_{\square_{A_{\mathrm{l}}\setminus\{\alpha\}}}^{2^{|A_{\mathrm{l}}|-1}}\ll 1\tag{37}$$
for each $\alpha\in A_{\mathrm{l}}$. Indeed, the left-hand side can be expanded as an average, over $\vec{h}\in[H]^D$, over pairs $\vec{m}^{(0)},\vec{m}^{(1)}$ of coarse-scale parameters and over $m\in[M]$, of an integral of a product of polynomial shifts of $\nu$. The distinctness of the $b_\beta$'s ensures that the polynomial shifts of $\nu$ here are all distinct, and so, by the polynomial forms condition (Definition 3.6), we obtain the claim (taking $\eta_1$ suitably small).

(26) These norms will only make a brief appearance here; they are not used elsewhere in the main argument.

In view of (36), (37) and Hölder's inequality, we see that $|\Lambda_S(f)|$ is controlled, up to $o_S(1)$ errors, by a power of the weighted local Gowers average of $f_{\alpha_0}$ appearing in (36). Thus, to prove Proposition 5.9, it suffices to express that quantity as a weighted average of $f_{\alpha_0}$ over parallelopipeds, or more precisely as
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\int_X\Big(\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}f_{\alpha_0}\Big)\,w(\vec{h},\vec{m}^{(0)},\vec{m}^{(1)}),\tag{39}$$
where $\vec{m}^{(\omega)}:=(m^{(\omega_\alpha)}_\alpha)_{\alpha\in A_{\mathrm{l}}}$ and $w$ is a certain explicit weight formed from shifted copies of $\nu$ attached to the non-linear nodes.

5.18. The final Cauchy–Schwarz inequality

Let us temporarily drop the weight $w$ in (39) and consider the unweighted average
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\int_X\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}f_{\alpha_0}.$$
This expression is essentially a local Gowers norm of $f_{\alpha_0}$ (cf. (24)), since the $b_\alpha$'s are all distinct and non-zero.(27) Thus it suffices to show that
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\int_X\Big(\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}f_{\alpha_0}\Big)\big(w(\vec{h},\vec{m}^{(0)},\vec{m}^{(1)})-1\big)=o_S(1).$$

(27) There is the (incredibly unlikely) possibility that $D=0$ or $D=1$, but by using the monotonicity of the Gowers norms (Lemma A.3), one can easily increase $D$ to avoid this.

Applying the Cauchy–Schwarz inequality (94) with the pointwise bound $|f_{\alpha_0}|\leq\nu$, we reduce to showing that
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\int_X\Big(\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}\nu\Big)\big(w(\vec{h},\vec{m}^{(0)},\vec{m}^{(1)})-1\big)^j=0^j+o_S(1)$$
for $j=0,2$ (with the usual convention $0^0=1$), which in turn follows if we show that
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\int_X\Big(\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}\nu\Big)\,w(\vec{h},\vec{m}^{(0)},\vec{m}^{(1)})^j=1+o_S(1)$$

for $j=0,1,2$. Let us just demonstrate this in the hardest case $j=2$, as it will be clear from the proof that the same argument also works for $j=0,1$ (as they involve fewer factors of $\nu$). We expand the left-hand side as
$$\mathbf{E}_{\vec{h}\in[H]^D}\mathbf{E}_{\vec{m}^{(0)},\vec{m}^{(1)}\in[\sqrt{M}]^{A_{\mathrm{l}}}}\mathbf{E}_{m,m'\in[M]}\int_X\Big(\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)}\nu\Big)\prod_{\omega\in\{0,1\}^{A_{\mathrm{l}}}}\prod_{\alpha\in A_{\mathrm{nl}}}T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)+R_\alpha(m-\sum_{\beta\in A_{\mathrm{l}}}m^{(\omega)}_\beta,\vec{h},W)}\nu\cdot T^{Q_0(\vec{h},\vec{m}^{(\omega)},W)+R_\alpha(m'-\sum_{\beta\in A_{\mathrm{l}}}m^{(\omega)}_\beta,\vec{h},W)}\nu.$$

One can then invoke the polynomial forms condition (Definition 3.6) one last time (again taking $\eta_1$ small enough) to verify that this is indeed $1+o_S(1)$. Note that as every node in $A_{\mathrm{nl}}$ is non-linear, the polynomials $R_\alpha$ have degree at least 2, which ensures that the polynomials used to shift $\nu$ here are all distinct. This concludes the proof of Proposition 5.9 in the linear case, and hence in general, and Theorem 4.5 follows.

Remark 5.19. One can define polynomial systems and weights (Definitions 5.1 and 5.12) for systems of multivariable polynomials $R_\alpha\in\mathbf{Z}[\mathbf{m}_1,\ldots,\mathbf{m}_r,\mathbf{h}_1,\ldots,\mathbf{h}_D,W]$ (see for example [23]). Following the steps of the PET induction (§5.10) and parallelopipedization (§5.16), one can prove a multivariable version of the polynomial generalized von Neumann theorem (Theorem 4.5).