• Aucun résultat trouvé

A POSITIVITY PRESERVING INVERSE ITERATION FOR FINDING THE PERRON PAIR OF AN IRREDUCIBLE NONNEGATIVE THIRD ORDER TENSOR

N/A
N/A
Protected

Academic year: 2022

Partager "A POSITIVITY PRESERVING INVERSE ITERATION FOR FINDING THE PERRON PAIR OF AN IRREDUCIBLE NONNEGATIVE THIRD ORDER TENSOR"

Copied!
22
0
0

Texte intégral

(1)

FINDING THE PERRON PAIR OF AN IRREDUCIBLE NONNEGATIVE THIRD ORDER TENSOR

CHING-SUNG LIU, CHUN-HUA GUO, AND WEN-WEI LIN

Abstract. We propose an inverse iterative method for computing the Perron pair of an irre- ducible nonnegative third order tensor. The method involves the selection of a parameterθkin the kth iteration. For every positive starting vector, the method converges quadratically and is positivity preserving in the sense that the vectors approximating the Perron vector are strictly positive in each iteration. It is also shown thatθk= 1 near convergence. The computational work for each iteration of the proposed method is less than four times (three times if the tensor is symmetric in modes two and three, and twice if we also take the parameter to be 1 directly) that for each iteration of the Ng–Qi–Zhou algorithm, which is linearly convergent for essentially positive tensors.

Key words. inverse iteration, nonnegative tensor,M-matrix, nonnegative matrix, positivity preserving, quadratic convergence, Perron vector, Perron root

AMS subject classifications.65F15, 65F50

1. Introduction. A real-valuedmth-order n-dimensional tensor A consists of nmentries in R, and takes the form

A= (Ai1i2...im), Ai1i2...im ∈R, 1≤i1, i2, . . . , im≤n.

A tensor A is called nonnegative (positive) if Ai1i2...im ≥ 0 (Ai1i2...im > 0) for all i1, . . . , im. For various applications of tensors, nonnegative tensors in particular, see [13].

For an n-dimensional column vector x = [x1, x2, . . . , xn]T ∈ Rn, we define an n-dimensional column vector:

Axm1:=

 Xn i2,...,im=1

Ai i2...imxi2. . . xim

1in

. (1.1)

Definition 1.1 ([3, 16]). Let Abe anmth-ordern-dimensional tensor andCbe the set of all complex numbers. Assume thatAxm−1 is not identically zero. We say that (λ,x)∈C×(Cn\{0})is an eigenpair (eigenvalue-eigenvector) ofAif

Axm−1=λx[m−1], (1.2)

wherex[m1]= [xm−1 1, xm−2 1, . . . , xmn1]T.

For an irreducible nonnegative tensorA, the Perron–Frobenuis theorem [3, The- orem 1.4] states that there are scalarλ >0 and unit vectorx >0 satisfying (1.2), and that |λ| ≤ λ for every eigenvalue λ of A. The number λ is then called the

Department of Applied Mathematics, National Chiao Tung University, Hsinchu 300, Taiwan (chingsungliu@gmail.com). This author was supported in part by the Ministry of Science and Technology.

Department of Mathematics and Statistics, University of Regina, Regina, SK S4S 0A2, Canada (Chun-Hua.Guo@uregina.ca). This author was supported in part by an NSERC Discovery Grant and ST Yau Center at Chiao-Da in Taiwan.

Department of Applied Mathematics, National Chiao Tung University, Hsinchu 300, Taiwan (wwlin@math.nctu.edu.tw). This author was supported in part by the Ministry of Science and Technology, the National Center for Theoretical Sciences, and ST Yau Center at Chiao-Da in Taiwan.

1

(2)

spectral radius of A, denoted by ρ(A), and is also called the Perron root ofA. The corresponding positive unit vectorx is unique [3] and is called the Perron vector of A. The Perron pair (ρ(A),x) is needed in several applications. In particular, it is related to measuring higher order connectivity in hypergraphs [7, 8], and determines the probability distribution of higher-order Markov chains [16]. By computingρ(A) for a suitable irreducible nonnegative tensor A, we can also determine whether an irreducibleZ-tensor is an M-tensor [21]. The problem of computing the Perron pair has attracted much attention in recent years.

In 2009, Ng, Qi, and Zhou [16] presented a power method for computingρ(A) for a nonnegative tensor. Later on, Chang, Pearson, and Zhang [2] proved its convergence for primitive tensors. Linear convergence of the algorithm was then proved in [20]

for essentially positive tensors, with a particular starting vector. Without giving the detailed definitions, we simply mention that essentially positive tensors are primitive tensors, and primitive tensors are irreducible nonnegative tensors. Liu, Zhou, and Ibrahim [14] noted that for any irreducible nonnegative tensor A, one can apply the Ng–Qi–Zhou (NQZ) algorithm to B =A+sI, where sis a positive scalar and I is the unit tensor. Convergence of the algorithm is then guaranteed by [2] since B is primitive. For a primitive tensor, starting with any positive vector, the NQZ algorithm also produces approximations to the Perron vector that are positive vectors. So the algorithm is positivity preserving. But it is noted in [20] that the rate of convergence could be worse than linear if the tensor is primitive, but not essentially positive.

The Perron pair can also be found by Newton’s method [15], which has local quadratic convergence, but is not positivity preserving. Global convergence may be achieved through a line search [15], but this requires additional assumptions and the resulting algorithm is more complicated and is still not positivity preserving. The positivity of approximations is important in applications; if the approximations lose positivity then they may be meaningless and could not be interpreted.

For irreducible nonnegative second order tensors (i.e., matrices), there are fast- converging and positivity preserving methods [17, 5, 9] for computing the Perron pair. Our goal in this paper is to propose a fast-converging and positivity preserving algorithm for computing the Perron pair of an irreducible nonnegative third order tensorA, with a detailed convergence analysis. Third order tensors, as the immediate generalization of matrices, have received special attention [11, 12, 18].

In 1971, Noda [17] introduced a positivity preserving method for the nonnegative matrix eigenvalue problem, which has quadratic convergence [5] and is now called the Noda iteration. In this paper, we propose a positivity preserving iteration for nonnegative third order tensors by combining the idea of Newton’s method with the idea of the Noda iteration. We, therefore, call the iteration a Newton–Noda iteration (NNI). NNI is an inverse iterative method with variable shifts, and naturally preserves the strict positivity of the Perron vector in its approximations at all iterations for a positive starting vector. The major advantage of NNI is that, for any positive initial vector, it converges quadratically and computes the desired eigenpair correctly. Fur- thermore, NNI always generates a monotonically decreasing sequence of approximate eigenvalues, converging quadratically toρ(A), and the computational work (in terms of flop counts) each iteration is less than four times (and sometimes just twice) that for each iteration of the NQZ algorithm.

The paper is organized as follows. In section 2, we introduce some preliminaries and motivation. In section 3, we present a Newton–Noda iteration, and prove some basic properties for it. In section 4, we establish its convergence theory, and derive

(3)

the asymptotic convergence rate precisely. Finally, in section 5 we present some numerical examples illustrating the convergence theory and the effectiveness of NNI, and we make some concluding remarks in section 6.

2. Preliminaries, notation and motivation. A real matrixA= [Aij]∈Rn×k is called nonnegative (positive) ifAij ≥0 (Aij>0) for alliand j. For real matrices A and B of the same size, if A−B is nonnegative (positive), we write A ≥ B (A > B). A real square matrixAis called aZ-matrix if all its off-diagonal elements are nonpositive. AnyZ-matrixAcan be written assI−B withB ≥0; it is called a nonsingularM-matrix ifs > ρ(B), and a singularM-matrix ifs=ρ(B), whereρ(·) is the spectral radius.

In addition, we denote|A|= [|Aij|], and the superscriptT denotes the transpose of a vector or matrix. From now on we use v(i) (instead of vi) to represent the ith element of a vectorv, since the notationvi may be confused with a vector sequence vi. Throughout the paper, we use the 2-norm for vectors and matrices. All vectors are realn-vectors and all matrices are realn×nmatrices, unless specified otherwise.

The following result is well known (see [1, Theorems 6.2.3 and 6.2.7] for example).

Theorem 2.1. For aZ-matrixA, the following are equivalent:

(i) A is a nonsingularM-matrix.

(ii) A1≥0.

(iii) Av>0 for some vectorv>0.

An irreducible Z-matrix is a nonsingularM-matrix if and only if for some v>0 the vectorAv is nonnegative and nonzero.

The irreducibility of a tensor is a natural generalization of the irreducibility of a matrix.

Definition 2.2 ([3, 16]). An mth-order n-dimensional tensor A is called re- ducible if there exists a nonempty proper index subset S⊂ {1,2, . . . , n} such that

Ai1i2...im = 0, ∀ i1∈S, ∀ i2, . . . , im∈/S.

If Ais not reducible, then we callA irreducible.

For vectors v = [v(1),v(2), . . . ,v(n)]T and w = [w(1),w(2), . . . ,w(n)]T, with v(i) 6= 0 for all i, we define wv to be the n-vector whose ith element is wv(i)(i), and then define

maxw v

= max

i

w(i) v(i)

, minw v

= min

i

w(i) v(i)

.

Theorem 2.3 ([3, Theorems 1.4 and 4.2]). Let A be an irreducible nonnegative tensor of ordermand dimensionn. Then there existλ>0and a unit vectorx>0 such that

Axm1x[m 1].

If λis an eigenvalue of A, then |λ| ≤λ. Denoteλ by ρ(A). If λ is an eigenvalue with a nonnegative unit eigenvectorx, thenλ=ρ(A)andx=x. Moreover, for any v>0

min

Avm1 v[m1]

≤ρ(A)≤max

Avm1 v[m1]

.

(4)

For a third ordern-dimensional tensor, (1.1) can be written as

Ax2:=



xTA1x ... xTAnx

, (2.1)

where then×nmatricesAiare given byA(i,:,:), using the Matlab multi-dimensional array notation. From (2.1), the nonnegative tensor eigenvalue problem (1.2) can be written as a nonlinear eigenvalue problem, i.e.,

A(x)x=λx, where

A(x) =D(x)1

 xTA1

... xTAn

, D(x) =

 x(1)

. ..

x(n)

. (2.2)

We will also need

B(x) =D(x)1



xT A1+AT1 ... xT An+ATn

. (2.3)

Note that for eachx>0,A(x) andB(x) are nonnegative and Ax2

x[2] = A(x)x

x , B(x)x= 2A(x)x. (2.4)

Lemma 2.4. Let v be a positive vector and A be an irreducible nonnegative third order n-dimensional tensor. Then A(v) and B(v) are irreducible nonnegative matrices.

Proof. IfA(v) is a reducible matrix, then there exists a nonempty proper index subsetS⊂ {1,2, . . . , n}such that

(A(v))ij = 0, ∀i∈S, ∀j /∈S. (2.5) Becausevis a positive vector, from (2.2) and (2.5), it follows that

(D(v)A(v))ij =

 vTA1

... vTAn



ij

= 0, ∀i∈S, ∀j /∈S. (2.6)

On the other hand,

(D(v)A(v))ij = Xn k=1

Aikjv(k). (2.7)

SinceA ≥0 andv>0, by combining (2.6) and (2.7), it follows that Aikj= 0, ∀i∈S, ∀j /∈S, k= 1, . . . , n,

(5)

which contradicts the fact thatAis irreducible. Hence,A(v) is an irreducible matrix.

SinceB(v)≥A(v),B(v) is also an irreducible matrix.

Theorem 2.5. For any positive unit vector v, let λ= max

Av2 v[2]

. If v 6=x (wherexis the Perron vector ofA) , thenλI−A(v)and2λI−B(v)are nonsingular M-matrices. If v = x, then λI−A(v) and 2λI−B(v) are singular M-matrices, i.e., ρ(A(x)) =ρ(A)and ρ(B(x)) = 2ρ(A). Moreover, if(2ρ(A)I−B(x))q≥0 for a unit vectorq, then q=±x.

Proof. We have by (2.4)

λ= max Av2

v[2]

= max

A(v)v v

≥ρ(A(v)).

Moreover, λ = maxA(v)v

v

= ρ(A(v)) if and only if A(v)v = ρ(A(v))v (see [1, Exercise 2.2.12]), i.e., Av2 = ρ(A(v))v[2], which holds if and only if v = x by Theorem 2.3. This proves the statements aboutA(v).

Similarly, we have by (2.4) 2λ= max

2Av2 v[2]

= max

B(v)v v

≥ρ(B(v)), and 2λ= max

B(v)v v

=ρ(B(v)) if and only ifv=x.

For the irreducible singularM-matrixM = 2ρ(A)I−B(x), we havev>0 such that Mv= 0. Given Mq≥0 for a unit vectorq. SupposeMq6= 0. Then fors >0 large enough, w = sv+q > 0 is such that Mw ≥ 0 and Mw 6= 0. Thus M is a nonsingular M-matrix by Theorem 2.1, a contradiction. Therefore, Mq = 0 and thus B(x)q = 2ρ(A)q. SinceB(x)x = 2ρ(A)x, it follows from by the Perron–

Frobenius theorem thatq=±x.

2.1. Motivation. We define two vector valued functionsr:Rn+1+ →Rn and f :Rn+1+ →Rn+1 as follows:

r(x,λ) =λx[2]− Ax2, f(x, λ) =

−r(x,λ)

1

2 1−xTx

. (2.8)

Then the Jacobian off(x, λ) is given by Jf(x, λ) =−

Jxr(x,λ) x[2]

xT 0

. (2.9)

HereJxr(x,λ) is the matrix of partial derivatives ofr(x, λ) with respect tox, i.e., Jxr(x,λ) = 2λD(x)−G(x), (2.10) whereD(x) is defined by (2.2) and

G(x) =



xT A1+AT1 ... xT An+ATn

 (2.11)

withAi=A(i,:,:).

(6)

Note thatAx2= 12G(x)x. It follows from (2.10) that r(x, λ) =λx[2]− Ax2= 1

2Jxr(x, λ)x. (2.12) We now consider using Newton’s method to solvef(x,λ) = 0. Given an approxi- mation (bxk,bλk), Newton’s method produces the next approximation (bxk+1,bλk+1) as follows:

Jf(xbk,bλk) dk

δk

=

λbkxb[2]k − Axb2k

1

2 xbTkbxk−1

, (2.13)

b

xk+1=bxk +dk, (2.14)

k+1=bλkk. (2.15)

Using elimination in (2.13), we find

δk=

1

2 bxTkbxk−1

−xbTk

Jxr(bxk,bλk)1

λbkxb[2]k − Axb2k b

xTk

Jxr(bxk,bλk)1

b x[2]k

. (2.16)

By (2.12) we can simplify (2.16) to

δk= −1

2xbTk

Jxr(bxk,bλk)1

b x[2]k

. (2.17)

From the first equation of (2.13) we have, using (2.8), (2.9), (2.14) and (2.12), 0 =Jxr(bxk,λbk) (xbk+1−xbk) +r(bxk,bλk) +δkbx[2]k

=Jxr(bxk,λbk)bxk+1−1

2Jxr(xbk,bλk)xbkkxb[2]k

=Jxr(bxk,λbk)

b xk+1−1

2xbk

kxb[2]k . Hence, we have the following linear system

Jxr(bxk,bλk)wbk =bx[2]k , (2.18) where

b

wk = −1 δk

b

xk+1−1 2bxk

, i.e., xbk+1= 1

2bxk−δkwbk. (2.19) This means that bxk+1 is a linear combination of bxk and wbk. Suppose we already have bxk > 0. We would like to guarantee bxk+1 > 0. What is needed here is that Jxr(bxk,bλk) is a nonsingularM-matrix. In this case,wbk>0 by (2.18) andδk <0 by (2.17), and thusxbk+1>0.

Whenbxk >0, the matrixJxr(bxk,bλk) is an irreducibleZ-matrix by Lemma 2.4.

By (2.12) and Theorem 2.1, it is a nonsingularM-matrix ifbλkbx[2]k −Abx2kis nonnegative and nonzero. This suggests takingbλk = max

Abx2k b x[2]k

,which is precisely the idea of

(7)

the Noda iteration [17]. Newton’s method does not determine λbk in this way, and it is unlikely that Jxr(bxk,bλk) will be a nonsingularM-matrix when (xbk,bλk) is close to (x, ρ(A)) since Jx(x, ρ(A)) is a singular M-matrix. Indeed, we have examples showing that the sequence{bxk}produced by Newton’s method can fail to be positive.

We are thus motivated to present a new algorithm that combines the idea of Newton’s method with the idea of the Noda iteration.

3. The Newton–Noda iteration and some basic properties. In this sec- tion, we will propose a Newton–Noda iteration (NNI) for computing the spectral radiusρ(A) and the associated eigenvector of an irreducible nonnegative third order tensorA, and then we prove a number of basic properties of NNI, which will be used to establish its convergence theory in section 4.

3.1. Newton–Noda iteration. Based on (2.18)–(2.19) and the Noda iteration, we propose a Newton–Noda iteration (NNI) which is an inverse iteration, and each iteration consists of four steps

Jxr(xk, λk)wk=x[2]k , yk =wk/kwkk, (3.1) e

xk+1=xkkyk, (3.2)

xk+1=xek+1/kexk+1k, (3.3) λk+1= max Ax2k+1

x[2]k+1

!

, (3.4)

whereθk >0 is to be defined later by (3.11).

The following lemma shows that the parameterθk >0 in (3.2) naturally preserves the strict positivity ofxk at all iterations.

Lemma 3.1. LetAbe an irreducible nonnegative third order tensor. Given a unit vectorxk >0, ifxk 6=x andθk >0, thenyk,xk+1>0and

λk+1k−min hkk) e x[2]k+1

!

, (3.5)

where

hk(θ) = θx[2]k

kwkk +θ2r ykk

+r xkk

. (3.6)

Proof. By (2.10),Jxr(xkk) = 2λkD(xk)−G(xk) =D(xk) 2λkI−B(xk) . So the vectorwk in (3.1) satisfies

kI−B(xk)

wk=xk. (3.7)

Sinceλk= max

Ax2k x[2]k

andxk6=x, we know by Theorem 2.5 that 2λkI−B(xk) is a nonsingularM-matrix. Thus

wk = 2λkI−B(xk)1

xk>0.

Thenyk >0, andxk+1 >0 since θk>0.

(8)

By (3.1),

Jxr(xkk)yk= 2λkD(xk)−G(xk)

yk= x[2]k

kwkk. (3.8)

Therefore,

kD(xk)yk− x[2]k kwkk

=G(xk)yk =



xTk A1+AT1 yk ...

xTk An+ATn yk

=



xTkA1yk ... xTkAnyk

+



yTkA1xk ... yTkAnxk

. (3.9)

From (3.2) and (3.9), we have

Axe2k+1=

 e

xTk+1A1xek+1 ... e

xTk+1Anexk+1

=Ax2kk



xTkA1yk ... xTkAnyk

+θk



yTkA1xk ... yTkAnxk

+θk2Ay2k

=Ax2k+ 2λkθkD(xk)yk−θkx[2]k

kwkk +θ2kAy2k

=

λkx[2]k + 2λkθkD(xk)ykkθ2kyk[2]

−θkx[2]k kwkk +θk2Ay2k−λkθk2y[2]k +Ax2k−λkx[2]k

kex[2]k+1−θkx[2]k kwkk +θk2

Ay2k−λky[2]k

+Ax2k−λkx[2]k . Therefore,

Aex2k+1kex[2]k+1−hkk), (3.10) where

hk(θ) = θx[2]k

kwkk +θ2r ykk

+r xkk

.

From (3.10), it follows that λk+1= max Ax2k+1

x[2]k+1

!

= max λkxe[2]k+1−hkk) e

x[2]k+1

!

k−min hkk) e x[2]k+1

! .

We next show that{λk}is strictly decreasing for suitableθk, unlessxk =x for somek, in which case NNI terminates withλk =ρ(A).

Theorem 3.2. Let Abe an irreducible nonnegative third order tensor andη >0 be a fixed constant. Given a unit vector xk >0, suppose xk 6= x and θk in (3.2) satisfies

θk = (

1 ifhk(1)≥ x

[2]

k

(1+η)kwkk;

ηk otherwise, (3.11)

(9)

where for each kwith hk(1)< x

[2]

k

(1+η)kwkk,

ηk = η

(1 +η)kwkk µk−λk

min x[2]k y[2]k

!

with µk = max Ay2k

y[2]k

. Then 0 < ηk <1 whenever it is defined,xk+1 >0 in (3.3), and

λk > λk+1≥ρ(A). (3.12)

Proof. By Lemma 3.1, we have

λk+1k−min hkk) e x[2]k+1

! . We need to provehkk)>0.

From (3.6), we have hk(θ) = θx[2]k

kwkk +θ2r ykk

+r xkk

= θx[2]k

kwkk +θ2r(ykk) +r xkk

2 λk−µk

y[2]k (3.13)

= θx[2]k

(1 +η)kwkk+ θηx[2]k (1 +η)kwkk +θ2r(ykk) +r xkk

2 λk−µk

y[2]k , (3.14)

whereµk = max

Ayk2 y[2]k

.Whenµk≤λk,

hk(1)≥ x[2]k

kwkk+r(ykk) +r xkk

(by (3.13))

≥ x[2]k

(1 +η)kwkk >0.

Wheneverhk(1)≥ x

[2]

k

(1+η)kwkk, we haveλk+1< λk withθk = 1. Ifhk(1)< x

[2]

k

(1+η)kwkk, thenµk > λk, and ηk>0 is defined. Suppose ηk ≥1. Then

η

(1 +η)kwkkmin x[2]k y[2]k

!

≥ µk−λk

,

and thus

ηx[2]k

(1 +η)kwkk+ λk−µk

y[2]k ≥0.

(10)

It follows from (3.14) thathk(1) ≥ (1+η)x[2]kkw

kk, a contradiction. Soηk <1. We now have

θkk = η

(1 +η)kwkk µk−λk

min x[2]k y[2]k

!

, (3.15)

which ensures the inequality θkηx[2]k

(1 +η)kwkk ≥θ2k µk−λk

y[2]k . (3.16)

Substituting (3.16) into (3.14), we obtain hkk) =

"

θkx[2]k

(1 +η)kwkk +θ2kr(ykk) +r xkk# +

"

θkηx[2]k

(1 +η)kwkk +θ2k λk−µk

y[2]k

#

≥ θkx[2]k

(1 +η)kwkk +θ2kr(ykk) +r xkk

.

Therefore,

hkk)≥ θkx[2]k

(1 +η)kwkk >0 (3.17)

and then

λk+1k−min hkk) e x[2]k+1

!

< λk,

By Theorem 2.3 we haveλk+1≥ρ(A).

Algorithm 3.1Newton–Noda iteration (NNI) 1. Givenx0>0 withkx0k= 1, λ0= max

Ax20 x[2]0

,η >0, andtol>0.

2. fork= 0,1,2, . . .

3. Solve the linear systemJxr(xk, λk)wk=x[2]k . 4. Normalize the vectorwk: yk =wk/kwkk.

5. Compute the scalarθk satisfying (3.11).

6. Compute the vectorxek+1=xkkyk.

7. Normalize the vectorexk+1: xk+1 =xek+1/kexk+1k.

8. Computeλk+1= max Ax2k+1

x[2]k+1

andλk+1= min

Ax2k+1 x[2]k+1

. 9. untilconvergence: |λk+1−λk+1|/λk+1<tol.

Based on (3.1)–(3.4) and (3.11), we can present NNI as Algorithm 3.1. The main computational work in each iteration is in lines 3,5, and 8. The computational work in line 8 is the same as that for one iteration of the NQZ algorithm, which is 2n3flops,

(11)

since the main computational work for one iteration of the NQZ algorithm is just one evaluation of Av2 for a positive vectorv. If θk needs to be determined in step 5, then an additional 2n3flops are needed. But we will see later in this section that we always have θk = 1 near convergence. Forming the linear system in step 3 requires 2n3 flops in general. But if the tensorAis symmetric in modes two and three [13], i.e.,Ai =ATi for alli= 1, . . . , n, then forming the linear system only requiresO(n2) flops. Solving the linear system in step 3 by the GTH algorithm [6] will require 43n3 flops. Therefore, the computational work (in terms of flop counts) in each iteration of NNI is less than four times (and sometimes just twice) that for each iteration of the NQZ algorithm.

The vector wk can be computed by the GTH algorithm accurately even near convergence, and is guaranteed to be positive. Therefore, Algorithm 3.1 generates a positive vector sequence {xk}, so it is a positivity preserving algorithm. In what follows we will prove some properties ofθk,xk andyk.These properties will help us to establish the global and quadratic convergence of NNI.

3.2. Some basic properties. Lemma 2.4 shows that B(x) is an irreducible nonnegative matrix. Recall that B(x)x = ρ(B(x))x. Then for any orthogonal matrix

x V

direct computation gives xT

VT

B(x)

x V

=

ρ(B(x)) cT

0 L

.

Similarly, for an irreducible nonnegative matrix B(xk), let uk be the unit length positive eigenvector corresponding to ρ(B(xk)). Then for any orthogonal matrix uk Vk

it holds that uTk

VkT

B(xk)

uk Vk

=

ρ(B(xk)) cTk

0 Lk

. (3.18)

WhenB(xk)→B(x),we haveuk→x and chooseVk such thatVk →V,and then we haveck →candLk→L.

Now define

εkk−ρ(A), Bk = 2λkI−B(xk), τk = 2λk−ρ(B(xk)). (3.19) Then from (3.18) we have

uTk VkT

Bk

uk Vk

=

τk −cTk 0 Lk

,

whereLk = 2λkI−Lk. For 2λk6=ρ(B(xk)), it is easy to verify that uTk

VkT

Bk1

uk Vk

=

" 1

τk bTk 0 Lk1

#

withbTk =cTkL

−1 k

τk . (3.20) Lemma 3.3. Assume that the sequence

λk,xk,yk is generated by Algorithm 3.1.

For any subsequence

xkj ⊆ {xk}, we have the following results:

(i) If xkj →vasj → ∞,thenv>0.

(ii) If xkj →x asj→ ∞, thenykj →x asj→ ∞.

(12)

Proof. (i). If limj→∞xkj =v, thenv≥0. LetS be the set of all indices t such that limj→∞x(t)k

j =v(t) = 0. Sincexkj= 1, S is a proper subset of {1,2, . . . , n}.

SupposeS is nonempty. Then by the definition ofλk,

λ0≥λkj = max

Ax2kj x[2]k

j

≥ xTk

jAtxkj

x(t)k

j

2 for allt= 1,2, . . . , n.

Since limj→∞x(t)k

j = 0 for t ∈ S, it holds that limj→∞xTk

jAtxkj = vTAtv = 0 for t ∈ S. Thus, Atpq = 0 for all t ∈ S and for all p, q /∈ S, which contradicts the irreducibility ofA. Therefore,S is empty and thusv>0.

(ii). Sincexkj →x, we haveB(xkj)→B(x).Then we haveukj →xkj →0, andτkj →0, where we have used ρ(B(x)) = 2ρ(A). From (3.7) and (3.20), we get

τkjwkjkjBkj1xkj = ukjuTk

j +ukjcTk

jLkj1VkTjkjVkjLkj1VkTj xkj. SinceLkj1→[ρ(B(x))I−L]1 andτkj →0, we haveτkjVkjLkj1VkTj →0. Note that VkTjukj = 0 andVTx= limk→∞VkTjukj = 0. We then get

jlim→∞ukjcTkjLkj1VkTjxkj =xcT(ρ(B(x))I−L)1VTx= 0.

A combination of the above relations shows that

j→∞lim τkjwkj = lim

j→∞

ukjuTkj +ukjcTkjLkj1VkTjkjVkjLkj1VkTj

xkj =x. Hence,

jlim→∞ykj = lim

j→∞

τkjwkj

τkjwkj =x.

Lemma 3.4. Assume that the sequence

λk,xk,yk is generated by Algorithm 3.1.

Then the sequence{kwkk kyk−xkk}is bounded, that is, there exists a constantM1>

0 such that

kwkk kyk−xkk ≤M1 for allk.

Proof. From (2.12),

Jxr(xkk)yk−Jxr(xkk) (yk−xk) =Jxr(xkk)xk = 2r(xkk)≥0.

This means

Jxr(xkk)yk≥Jxr(xkk) (yk−xk). (3.21) We may assumexk6=yk. From (3.8) and (3.21), we have

x[2]k ≥ kwkk kyk−xkkJxr(xkk)pk, (3.22)

(13)

where pk = (yk−xk)/kyk−xkk with kpkk = 1. Because xk,yk > 0 and kxkk = kykk= 1,we havepk0 andpk 0 for allk.

Suppose{kwkk kyk−xkk}is not bounded. Since {xk} and {pk} are bounded, we have for some subsequence{kj}

jlim→∞

wkjykj −xkj=∞, lim

j→∞xkj =:v, lim

j→∞pkj =:p. (3.23) Sincewkjykj −xkj≤2wkj, we also have limj→∞wkj=∞. By Lemma 3.3 we havev>0. We now provev=x.

Since the sequence

λk is monotonically decreasing and bounded below byρ(A), limj→∞kj = 2α exists. By Theorem 2.5, 2λkjI −B(xkj) are nonsingular M- matrices. Thus 2αI −B(v) is an M-matrix, so 2α ≥ ρ(B(v)). If 2α > ρ(B(v)), then

jlim→∞wkj = lim

j→∞kjI−B(xkj)1

xkj = (2αI−B(v))1v=:w>0, contradictory to limj→∞

wkj=∞. Thus 2α=ρ(B(v)). Now

ρ(B(v)) = lim

j→∞kj = lim

j→∞2 max

Ax2kj x[2]k

j

= 2 max Av2

v[2]

= max

B(v)v v

.

It follows that B(v)v =ρ(B(v))v and then Av2 = 12ρ(B(v))v[2]. Thus v =x by Theorem 2.3.

Now limj→∞xkj =xand limj→∞λkj =α= 12ρ(B(x)) =ρ(A) by Theorem 2.5.

It follows from (3.22) and (3.23) that Jxr(x,ρ(A))p= lim

j→∞Jxr(xkjkj)pkj ≤ lim

j→∞

x[2]k j

wkjykj −xkj = 0.

Thus (2ρ(A)I−B(x))p≤0. Then we havep=±x by Theorem 2.5. But by the definition ofpk,pis neither positive nor negative. The contradiction shows that the sequence{kwkk kyk−xkk}is bounded.

Lemma 3.5. Assume that the sequence

λk,xk,yk is generated by Algorithm 3.1.

Then hkk)can be expressed in the form hkk) =2θkx[2]k

kwkk +R(θkyk,xk, λk), (3.24) whereR(x,xkk)≤M2kx−xkk2 for some constant M2.

Proof. To prove this, we use Taylor’s theorem for the functionr(x,λk) around the pointxk:

r(x,λk) =r(xkk) +Jxr(xkk)(x−xk) +R(x,xkk), (3.25) whereR(x,xkk)≤M2kx−xkk2for some constantM2,noting thatλk≤λ0 for allk.

Therefore, from (3.25), we have

r(θkykk) =r(xkk) +Jx(xkk)(θkyk−xk) +R(θkyk,xkk).

(14)

Sincer(xkk) =12Jxr(xkk)xk by (2.12), we get

r(θkykk) =−r(xkk) +Jxr(xkk)(θkyk) +R(θkyk,xkk)

=−r(xkk) +θkx[2]k

kwkk +R(θkyk,xkk).

Hence, noting thatr(θkykk) =θ2kr ykk

,

hkk) = θkx[2]k

kwkk +θ2kr ykk

+r xkk

=2θkx[2]k

kwkk +R(θkyk,xk, λk), which completes the proof.

Lemma 3.6. For NNI, we have (i) θk = 1ifkyk−xkk ≤ min(x

[2]

k )

M1M2 ,whereM1 andM2 are as in Lemmas 3.4 and 3.5.

(ii) {θk} is bounded below by some constant ξ > 0, assuming that xk 6= x for each k.

Proof. (i). From Lemma 3.4 and assumption, we have

x[2]k

M2kwkk kyk−xkk ≥ min x[2]k

M2kwkk kyk−xkke

≥ min(x[2]k ) M1M2

e≥ kyk−xkke, (3.26) wheree= [1, . . . ,1]T.Then, from (3.26) and Lemma 3.5,

x[2]k

kwkk≥M2kyk−xkk2e≥R(yk,xkk). (3.27) Substituting (3.27) into (3.24), we obtain

hk(1) = 2x[2]k

kwkk +R(yk,xkk)≥ x[2]k

kwkk > x[2]k (1 +η)kwkk, which meansθk= 1.

(ii). From (3.11), we recall that θk=

(

1 ifhk(1)≥ x

[2]

k

(1+η)kwkk; ηk otherwise,

where ηk = η

(1+η)kwkk(µkλk)min

x[2]k y[2]k

<1 withµk = max

Ayk2 y[2]k

> λk. Suppose θk is not bounded below byξ >0. Since xk is bounded, we can find a subsequence {kj}such that

jlim→∞θkj = 0, lim

j→∞xkj =:v. (3.28)

(15)

Note thatv>0 by Lemma 3.3.

As in the proof of Lemma 3.4, we have limj→∞kj = 2α≥ρ(B(v)).

If 2α > ρ(B(v)), then limj→∞wkj =:w >0, as in the proof of Lemma 3.4. In this case,

jlim→∞ykj = lim

j→∞

wkj

wkj =:y>0.

Ifηk is defined only on a finite subset of{kj}, thenθkj = 1 except for a finite number ofj values, contradicting limj→∞θkj = 0. If ηk is defined on an infinite subset{kji} of{kj}, then

ilim→∞ηkji = η

(1 +η)kwk(µ−α)min v

y 2

>0,

where limi→∞µkji =µ > α sinceηkji <1. This is contradictory to limj→∞θkj = 0.

Thus 2α=ρ(B(v)), andv=x as in the proof of Lemma 3.4. Hence, limj→∞xkj = x and by Lemma 3.3 limj→∞ykj =x, which meansxkj and ykj are close enough forjsufficiently large. Therefore, from (i),θkj = 1 forjlarge enough, a contradiction to limj→∞θkj = 0.

This Lemma shows that ifxk andyk are close enough, then the parameterθk in (3.2) can easily be determined, i.e.,θk = 1.

Corollary 3.7. If xk=yk for anyk≥0in Algorithm 3.1, then xk =x. Proof. If xk =yk, then θk = 1 by Lemma 3.6, and it is easily seen from Algo- rithm 3.1 thatλk+1k. By Theorem 3.2, we havexk =x.

4. Convergence analysis. In this section, we prove that the convergence of NNI is global and quadratic, assuming thatxk 6=x for eachk.

4.1. Global convergence of NNI. Theorem 3.2 shows that the sequence λk

is strictly decreasing and bounded below byρ(A),and hence converges. We now show that the limit ofλk is preciselyρ(A).

Theorem 4.1. Let A be an irreducible nonnegative third order tensor and the sequence

λk is generated by Algorithm 3.1. Then the monotonically decreasing se- quence

λk converges toρ(A),and{xk}from Algorithm 3.1 converges to the positive eigenvector x corresponding toρ(A).

Proof. From (3.5), (3.17) and Lemma 3.6, we have

λk−λk+1= min hkk) e x[2]k+1

!

≥min θkx[2]k (1 +η)kwkkxe[2]k+1

!

≥min ξx[2]k (1 +η)kwkkex[2]k+1

!

. (4.1)

Sinceθk≤1 by construction, we havekexk+1k=kxkkykk ≤2. It follows from (4.1) that limk→∞kwkk1min(x[2]k ) = 0.

Suppose min(x[2]k ) is not bounded below by a positive constant. Then there exists a subsequence{kj}such that limj→∞min(x[2]kj) = 0. Sincekxkjk= 1, we may assume that limj→∞xkj = v exists. Then limj→∞min(x[2]k

j) = min(v[2]) = 0. This is a contradiction sincev>0 by Lemma 3.3. Therefore, min(x[2]k ) is bounded below by a positive constant, and thus limk→∞kwkk1= 0.

Références

Documents relatifs

Concretely, the search for correspondences is cast as a hypergraph matching problem using higher-order constraints instead of the unary or pairwise ones used by previous

Le choix de la période a été fait pour avoir une vision globale, de tous les programmes et expériences menés depuis les années 70 , afin de déterminer et mieux cerner le problème

Une connaissance du niveau le plus fin Nous caract´erisons les m´ethodes de simplification par le fait qu’elles prennent en donn´ee d’entr´ee un mod`ele suppos´e ˆetre au niveau

In this paper, we construct arbitrarily high order accurate DG schemes which preserve positivity of the radiative intensity in the simulation of both steady and unsteady

The method detects critical numerical fluxes which may lead to negative density and pressure, and then imposes a simple flux limiter combining the high-order numerical flux with

Keywords: positivity preserving; high order accuracy; compressible Euler equations; gas dynamics; finite difference scheme; essentially non-oscillatory scheme; weighted

In this paper, we first introduce an interesting quadrature rule for two variable polynomi- als on a triangle, by which we can obtain a sufficient condition for a finite volume or a

The purpose of the present paper is thus to get one step further and to provide an integrity basis for isotropic polynomial functions of a completely symmetric third-order tensor..