HAL Id: hal-01964512
https://hal.archives-ouvertes.fr/hal-01964512
Preprint submitted on 22 Dec 2018
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Extended Krylov subspace methods for large-scale differential Sylvester matrix equations
Abdeslem Hafid Bentbib, El Mostafa Sadek, Lakhlifa Sadek, Hamad Alaoui Talibi
To cite this version:
Abdeslem Hafid Bentbib, El Mostafa Sadek, Lakhlifa Sadek, Hamad Alaoui Talibi. Extended Krylov
subspace methods for large-scale differential Sylvester matrix equations. 2018. �hal-01964512�
(will be inserted by the editor)
Extended Krylov subspace methods for large-scale differential Sylvester matrix equations
Abdeslem Hafid Bentbib · El Mostafa Sadek · Lakhlifa Sadek · Hamad Talibi Alaoui
Received: date / Accepted: date
Abstract In this paper, we present a new numerical methods for solving large-scale differ- ential Sylvester matrix equations with low rank right hand sides. These differential matrix equations appear in many applications such as robust control problems, model reduction problems and others. We present two approaches based on extended global Arnoldi process the problem. The first one is based on the expression of the exact solution and a global ex- tended Krylov method for the computation of the exponential of a matrix. The second one is based on low-rank approximate solutions and extended global Arnoldi algorithm. We give some theoretical results and report some numerical experiments to show the effectiveness of the proposed methods.
Keywords Extended global Krylov subspaces, Matrix exponential, low-rank approxima- tion, Differential Sylvester equations.
Mathematics Subject Classification (2010) 65F·15A
Abdeslem Hafid Bentbib
Facult´e des Sciences et Techniques-Gueliz,
Laboratoire de Math´ematiques Appliqu´ees et Informatique, Morocco. E-mail: [email protected]
El Mostafa Sadek (corresponding author) Laboratory of Engineering Sciences for Energy, National School of Applied Sciences of El Jadida, University Chouaib Doukkali, Morocco.
E-mail: [email protected] Lakhlifa Sadek
Facult´e des Sciences d’El Jadida,
Morocco. E-mail: [email protected] Hamad Talibi Alaoui
Facult´e des Sciences d’El Jadida, Morocco. E-mail: talibi [email protected]
1 Introduction
In this paper, we consider the differential Sylvester matrix equation (DSE in short) of the form X˙(t) =AX(t) +X(t)B+EFT, t∈[t0,Tf]
X(t0) =X0 (1)
whereA∈Rn×n,B∈Rp×p, andE∈Rn×s,F∈Rp×s, are full rank matrices, withsn,p.
The initial condition is given in a factored form asX0=Z0Z˜0Tand the matricesAandBare assumed to be large and sparse.
Differential Sylvester matrix equations play a fundamental role in many problems in control, filter design theory, model reduction problems, differential equations and robust control problems; see, e.g.,[1, 10, 16] and the references therein.
The exact solution of the differential Sylvester matrix equation (1) is given by the fol- lowing result.
Theorem 1 [1] The unique solution of the differential Sylvester equations (1) is defined by X(t) =e(t−t0)AX0e(t−t0)B+
Zt t0
e(t−τ)AEFTe(t−τ)Bdτ. (2)
For small or medium-sized differential Sylvester matrix equations, there are several methods to solve this equation, for example Backward Differentiation Formula (BDF) and Rosenbrock method [9, 25].
During the last years, several projection methods based on Krylov subspaces have been proposed for solving large matrix equation, see, e.g.,[2, 5, 6, 15–17, 28]. The main idea em- ployed in these methods is to use a extended Krylov subspace and then apply the Galerkin- type orthogonality condition. In [16], Jbilou and Hached are presented two approaches to solving the large differential Sylvester matrix equation, by using the block extended Krylov subspaces. We use in this work, the extended global Krylov subspaces for solving large differential Sylvester matrix equations.
The rest of the paper is organized as follows. In the next section, we give a new expres- sion of the unique solutionX(t)of the differential Sylvester matrix equation (1). This expres- sion based on the global Arnoldi process. In section 3, we recall the extended global Arnoldi algorithm with some of its properties. In Section 4, we define EGA-exp method by using the matrix exponential approximation by extended global Krylov subspaces and quadrature method for computing numerical solution of large-scale differential Sylvester matrix equa- tions. In Section 5, we present a low-rank solution method for solving the problem (1) by using projections onto extended global Krylov subspacesKmg(A,E)andKmg(BT,F). Fi- nally, Section 6 is devoted to numerical experiments.
Throughout the paper, we use the following notations, The Frobenius inner product of the matricesXandY is defined byhX,YiF=tr(XTY), wheretr(Z)denotes the trace of a square matrixZ. The associated norm is the Frobenius norm denoted bykkF. The Kronecker productA⊗B= [ai,jB]whereA= [ai,j]. This product satisfies the properties:(A⊗B)(C⊗ D) = (AC⊗BD)and(A⊗B)T =AT⊗BT. We also using the matrix productdefined in [8]. The following proposition gives some properties satisfied by the this product.
Proposition 1 Let A,B,C∈Rn×ps, D∈Rn×n, L∈Rp×pandα∈R. Then we have, 1. (A+B)TC=ATC+BTC.
2. AT(B+C) =ATB+ATC.
3. (αA)TC=α(ATC).
4. (ATB)T=BTA.
5. AT(B(L⊗Is)) = (ATB)L.
A blockVm= [V1,V2, ...,Vm]is to beF-orthonormal matrix ifVmTVm=Im. We also have the following result
Lemma 1 ([21])LetVm= [V1,V2, ...,Vm]is be an n×ms F-orthonormal matrix, let Z and Y be a matrices ofRm×sandRms×q. Then we have
kVm(Z⊗Is)kF=kZkF and kVmYkF≤ kYkF.
2 Expression of the exact solution of the differential Sylvester equation
In this section we will give a new expression of the unique solutionX(t)of the differential Sylvester matrix equation (1). This expression based on the global Arnoldi process andPA
be the minimal polynomial ofAforEof degreeqandPBT be the minimal polynomial ofBT forFof degreeq0.
The modified global Arnoldi process constructs anF-orthonormal basisV1,V2,· · ·,Vm
of the matrix Krylov subspace
Km(A,V) =span
V,AV,A2V, ...,Am−1V . The modified global Arnoldi process is described as follows:
Algorithm 1The modified global Arnoldi process(MGA)
Inputs:A∈Rn×n,V∈Rn×sandman integer.
1. SetV1=kVVk
F; 2. Forj=1, ...,m 3. Ve=AVj; 4. Fori=1, ...,j 5. hi,j=hVi,VeiF; 6. Ve=Ve−hi,jVi; 7. end for(i).
8. Computehj+1,j=keVkF; 9. Vj+1=Ve/hj+1,j; 10. end for(j).
LetVm= [V1,V2, ...,Vm]andHemAis the(m+1)×mupper Hessenberg matrix whose en- tries hi,j are defined by Algorithm 1 and HmA is the m×m matrix obtained fromHemA by deleting its last row. We have the following relation
AVm=Vm(HmA⊗Is) +hm+1,mVm+1(eTm⊗Is), whereeTm= [0,0,· · ·,0,1].
The following result shows that the solutionXof (1) can be expressed in terms of the global Arnoldi basis.
Theorem 2 Let q and q0be the degrees of the minimal polynomials of A for E and BT for F respectively. LetVq= [V1,V2, ...,Vq]andWq0 =
W1,W2, ...,Wq0
are the F-orthonormal matrices constructed by applying simultaneously q and q0steps of the global Arnoldi algo- rithm to the pairs(A,E)and(BT,F)respectively. Then the unique solution X of (1) can be expressed as:
X(t) =Vq(Yqq0(t)⊗Is)WqT0. (3) where Yqq0(t)is the solution of the low-order differential Sylvester equation:
Y˙m(t)−HqAYm(t)−Ym(t) HqB0
T
−EeqFeqT0 =0.
whereEeq=kEkFe(q)1 andFeq0=kFkFe(q10).
Proof LetZbe the matrix defined byVq(Yqq0(t)⊗Is)WqT0. Then we have,
Z(t)−˙ AZ(t)−Z(t)B−EFT =Vq(Z(t)˙ ⊗Is)WqT0 −AVq(Z(t)⊗Is)WqT0 −Vq(Z(t)⊗Is)WqT0B−EFT.
=Vq
Y˙qq0(t)−HqAYqq0(t)−Yqq0(t)
HqB0
T
−EeqFeqT0
⊗Is
WqT0. WhenYqq0the solution of the lower order differential Sylvester equation ˙Yqq0(t)−Tq,AYqq0(t)−
Yqq0(t) HqB0
T
−EeqFeqT0 =0, we get
Z(t)−˙ AZ(t)−Z(t)B−EFT=0.
Therefore, using the fact that the solution of (1) is unique, it follows thatX(t) =Vq(Yqq0(t)⊗
Is)WqT0.
3 The extended global Arnoldi process
In this section, we recall the extended global Krylov subspace and extended global Arnoldi process. LetV be a matrix of dimensionn×s. Then the extended global Krylov subspace associated to(A,V)is given by
Kmg(A,V) =span
V,A−1V,AV,A−2V,A2V, ...,Am−1V,A−mV (4) The extended global Arnoldi process constructs anF-orthonormal basis{V1,V2, ...,Vm} of the extended global Krylov subspaceKmg(A,V)([17]). The algorithm is summarized as follows
Algorithm 2The extended global Arnoldi process(EGA)
Inputs:A∈Rn×n,V∈Rn×sandman integer.
1. Compute the globalQR[V,A−1V], i.e ,[V,A−1V] =V1(Ω⊗Is);
2. SetV0= [ ];
3. Forj=1, ...,m
4. SetVj(1)=Vj(:,1 :s);Vj(2)=Vj(:,s+1 : 2s);
5. Vj= [Vj−1,Vj];U= [AV(1)j ,A−1Vj(2)];
6. Fori=1, ...,j 7. Hi,j=ViTU;
8. U=U−Vi(Hi,j⊗Is);
9. end for(i).
10. Compute the global QR decomposition ofU, i.e.,U=Vj+1(Hj+1,j⊗Is);
11. end for(j).
LetVm= [V1,V2, . . . ,Vm]withVi∈Rn×2sand 2m×2mupper block Hessenberg matrix Tm,A=VTm(AVm).
We have the following relation
AVm=Vm+1(Tm,A⊗Is)
=Vm(Tm,A⊗Is) +Vm+1(Tm+1,mA EmT⊗Is),
whereTm,A=VTm+1(AVm), andEmT= [02×2(m−1),I2]is the matrix formed with the last 2 columns of the 2m×2midentity matrixI2m.
4 Using the matrix exponential approximation and quadrature method
In this section, we show the approximation of the solution of the large differential Sylvester matrix equations (1), based on matrix exponential approximation with extended global Krylov subspace and quadrature method.
That the exact solution to (1) is given by X(t) =e(t−t0)AX0e(t−t0)B+
Zt t0
e(t−τ)AEFTe(t−τ)Bdτ. (5)
For obtain on approximate solution of (1), we use the approximate ofe(t−τ)AEande(t−τ)BTF bay extended global Krylov subspaces, and finally, we find the approximate solution ofX(t) by the quadrature method.
In what follows, we consider projections onto extended global Krylov subspaces. Let Vm= [V1, ...,Vm]andWm= [W1, ...,Wm]be the F-orthonormal matrices whose columns form an F-orthonormal basis of the subspaceKmg(A,E)andKmg(BT,F), respectively.
Following [29, 26, 27], an approximation toZA=e(t−τ)AEcan be obtained as Zm,A=Vm
e(t−τ)Tm,AVmE⊗Is
, whereTm,A=VTm(AVm).
In the same way, an approximation toZB=e(t−τ)BTF, is given by Zm,B=Wm(e(t−τ)Tm,BWmF⊗Is), whereTm,B=WTm(BTWm).
Then, you have
e(t−τ)AEFTe(t−τ)B≈Zm,A(t)Zm,BT (t). (6) Assuming thatX(0) =0, therefore the approximation solution of the differential Sylvester equation (1) as follows
Xm(t) =Vm(Xm(t)⊗Is)WTm, (7) where
Xm(t) = Zt
t0Xm,A(τ)XTm,B(τ)dτ, (8)
with
Xm,A(τ) =e(t−τ)Tm,AVTmE Xm,B(τ) =e(t−τ)Tm,BWTmF
The next result shows that the 2m×2mmatrix functionXm(t)is solution of a low-order differential Sylvester matrix equation.
Theorem 3 Let Xm(t)be the matrix function defined by (8), then itXm(t)satisfies the fol- lowing low-order differential Sylvester matrix equation
X˙m(t) =Tm,AXm(t) +Xm(t)TTm,B+EemFemT, t∈[t0,Tf], (9) whereEem=VTmE andFem=WTmF.
Proof The proof can be easily derived from the expression (8) and the result of theorem 1.
The residual associated to the approximationXm(t)is defended bayRm(t) =X˙m(t)− AXm(t)−Xm(t)B−EFT. Next, we give a result that allows us the computation of the norm of the residual.
Theorem 4 Let Xm(t) =Vm(Xm(t)⊗Is)WTmbe the approximation obtained at step m by the extended global Arnoldi method. Then the residual Rm(t)satisfies the relation
kRm(t)k2F≤ kTm+1,mA Xm(t)kF2+kTm+1,mB Xm(t)k2F, (10) whereXm(t)is the2×2m matrix corresponding to the last2rows ofXm(t).
Proof We have
Rm(t) =X˙m(t)−AXm(t)−Xm(t)B−EFT, where Xm(t) =Vm(Xm(t)⊗Is)WTm. Therefore
Rm(t) =Vm(X˙m(t)⊗Is)WTm−AVm(Xm(t)⊗Is)WTm−Vm(Xm(t)⊗Is)WTmB−EFT. We use the following properties
AVm=Vm+1(Tbm,A⊗Is), WTmB= (bTm,BT ⊗Is)WTm+1, Vm=Vm+1
I2sm 02s,2s
, WTm=
I2sm02s,2s
WTm+1,
and
Tbm,A=
Tm,A
Tm+1,mA EmT
, TbTm,B=
TTm,BEm(Tm+1,mB )T ,
we obtained Rm(t) =Vm+1
X˙m(t)⊗Is 0
0 0
WTm+1−Vm+1
Tm,AXm(t)⊗Is 0 Tm+1,mA EmTXm(t)⊗Is0
WTm+1
−Vm+1
Xm(t)TTm,B⊗IsXm(t)Em(Tm+1,mB )T⊗Is
0 0
WTm+1−Vm+1
VTmEFTWm0
0 0
WTm+1
=Vm+1 "
X˙m(t)−Tm,AXm(t)−Xm(t)TTm,B−EemFeT −Xm(t)Em(Tm+1,mB )T
−Tm+1,mA EmTXm(t) 0
#
⊗Is
WTm+1
SinceXm(t)exact solution of the equation
X˙m(t) =Tm,AXm(t) +Xm(t)TTm,B+EemFeT. Then
Rm(t) =Vm+1
0 −Xm(t)Em(Tm+1,mB )T
−Tm+1,mA EmTXm(t) 0
⊗Is
WTm+1.
If you noteXm(t) =EmTXm(t), we have
kRm(t)k2F≤ kTm+1,mA Xm(t)k2F+kTm+1,mB Xm(t)k2F.
The following result shows that the approximationXm(t)is an exact solution of a per- turbed differential Sylvester equation.
Theorem 5 Let Xm(t)be the approximate solution given by (7). Then we have
X˙m(t) = (A−Fm,A)Xm(t) +Xm(t)(B−Fm,B) +EFT, (11) where
Fm,A=Vm+1(Tm+1,mA EmT⊗Is)(VTmVm)−1VTm, Fm,B=WTm(WmWTm)−1[Em(Tm+1,mB )T⊗Is]Wm+1T . Proof By multiplying the (9) left byVmand right byWTm, we have
Vm(X˙m(t)⊗Is)WTm=Vm(Tm,AXm(t)⊗Is)WTm+Vm(Xm(t)TTm,B⊗Is)WTm+Vm(EemFemT⊗Is)WTm
=Vm(Tm,A⊗Is)(Xm(t)⊗Is)WTm+Vm(Xm(t)⊗Is)(TTm,B⊗Is)WTm
+Vm(Eem⊗Is)(FemT⊗Is)WTm, and like
AVm=Vm(Tm,A⊗Is) +Vm+1(Tm+1,mA EmT⊗Is), WTmB= (TTm,B⊗Is)WmT+ (Em(Tm+1,mB )T⊗Is)Wm+1T , so
X˙m(t) = [AVm−Vm+1(Tm+1,mA EmT⊗Is)](Xm(t)⊗Is)WTm
+Vm(Xm(t)⊗Is)[BWTm−(Em(Tm+1,mB )T⊗Is)Wm+1T ] +EFT
= [A−Vm+1(Tm+1,mA EmT⊗Is)(VTmVm)−1VTm]Xm(t)
+Xm(t)[B−WTm(WmWTm)−1(Em(Tm+1,mB )T⊗Is)Wm+1T ] +EFT, Then
X˙m(t) = (A−Fm,A)Xm(t) +Xm(t)(B−Fm,B) +EFT.
The next result states that the error Em(t) =X(t)−Xm(t)satisfies also a differential Sylvester matrix equation.
Theorem 6 Let X(t)e the exact solution of (1) and let Xm(t)be the approximate solution obtained at step m. The errorEm(t) =X(t)−Xm(t)satisfies the following equation
E˙m(t) =AEm(t) +Em(t)B−Rm(t), (12) Proof We have
X(t) =˙ AX(t) +X(t)B+EFT,
Rm(t) =X˙m(t)−AXm(t)−Xm(t)B−EFT, so
E˙m(t) =X(t)˙ −X˙m(t)
=AX(t) +X(t)B+EFT−AXm(t)−Xm(t)B−EFT−Rm(t)
=A(X(t)−Xm(t)) + (X(t)−Xm(t))B−Rm(t)
=AEm(t) +Em(t)B−Rm(t),
Notice that from theorem 6, the error Em(t)can be expressed in the integral form as follows
Em(t) =e(t−t0)AEm,0e(t−t0)B+ Zt
t0
e(t−τ)ARm(τ)e(t−τ)Bdτ, t∈[t0,Tf]. (13) whereEm,0=Em(t0).
Next, we give an upper bound for the norm of the error by using the 2-logarithmic norm defined byµ2(A) =12λmax(A+AT).
Theorem 7 Assume that the matrices A and B are such thatµ2(A) +µ2(B)6=0. Then at step m of the extended global Arnoldi process, we have the following upper bound for the norm of the errorEm(t),
kEm(t)k2≤ kEm,0k2e(t−t0)(µ2(A)+µ2(B))+αm
e(t−t0)(µ2(A)+µ2(B))−1
(µ2(A) +µ2(B)) , (14) where
αm= max
t∈[t0,t]
(max{kTm+1,mA Xm(t)k2,kTm+1,mB Xm(t)k2}).
The matrixXmis the2×2m matrix corresponding to the last2rows of Gm(t).
Proof We first point out thatketAk2≤eµ2(A)t.Using the expression (13) ofEm(t), we obtain the following relation
kEm(t)k2=ke(t−t0)AEm,0e(t−t0)Bk2+ Z t
t0
ke(t−τ)ARm(τ)e(t−τ)Bk2dτ.
Therefore, using (13) and the fact thatke(t−τ)Ak2≤e(t−τ)µ2(A), we get kEm(t)k2≤ kEm,0k2e(t−t0)(µ2(A)+µ2(B))+max
t∈[t0,t]
kRm(t)k2 Z t
t0
e(t−τ)µ2(A)e(t−τ)µ2(B)dτ
≤ kEm,0k2e(t−t0)(µ2(A)+µ2(B))+max
t∈[t0,t]
kRm(t)k2et(µ2(A)+µ2(B)) Zt
t0
eτ(µ2(A)+µ2(B))dτ.
Using the result of theorem 4, we obtain max
t∈[t0,t]
kRm(t)k2=αmand then
kEm(t)k2≤ kEm,0k2e(t−t0)(µ2(A)+µ2(B))+αm
e(t−t0)(µ2(A)+µ2(B))−1 (µ2(A) +µ2(B)) . Next, we give another upper bound for the norm of the errorEm(t).
Theorem 8 Let X(t)be the exact solution to (1) and let Xm(t)be the approximate solution obtained at step m. Then we have
kEm(t)k2≤ kFk2etµ2(B)Γ1,m(t) +kEemk2etµ2(A)Γ2,m(t), (15) where
(
Γ1,m(t) =Rtt
0e−τ µ2(B)kZA(τ)−Zm,A(τ)k2dτ, Γ2,m(t) =Rtt
0e−τ µ2(A)kZB(τ)−Zm,B(τ)k2dτ.
Proof From the expressions ofX(t)andXm(t), we have kEm(t)k2 =kX(t)−Xm(t)k2
=k Zt
t0
(ZA(τ)ZTB(τ)−Zm,A(τ)Zm,B(τ)T)dτk2
=k Zt
t0
(ZA(τ)ZTB(τ)−Zm,A(τ)ZTB(τ) +Zm,A(τ)ZTB(τ)−Zm,A(τ)Zm,BT (τ))dτk2
=k Zt
t0
[(ZA(τ)−Zm,A(τ))ZBT(τ) +Zm,A(τ)(ZB(τ)−Zm,B(τ))T]dτk2
≤ Z t
t0
kZB(τ)k2k(ZA(τ)−Zm,A(τ))k2+kZm,A(τ)k2k(ZB(τ)−Zm,B(τ))k2dτ, Now as
µ2(Tm,A) = 1
2λmax(Tm,A+TTm,A)
≤ 1
2λmax(A+AT)
=µ2(A),
and since
kZB(τ)k2≤e(t−τ)µ2(B)kFk2,
kZm,A(τ)k2≤e(t−τ)µ2(Tm,A)kEemk2≤e(t−τ)µ2(A)kEemk2, we also have
kEm(t)k2≤ kFk2etµ2(B) Zt
t0
e−τ µ2(B)kZA(τ)−Zm,A(τ)k2dτ +kEemk2etµ2(A)
Zt t0
e−τ µ2(A)kZB(τ)−Zm,B(τ)k2dτ.
we get
kEm(t)k2≤ kFk2etµ2(B)Γ1,m(t) +kEemk2etµ2(A)Γ2,m(t),
One can use some known results [19, 27] to derive upper bounds forkZA(τ)−Zm,A(τ)k2, andkZB(τ)−Zm,B(τ)k2, when using the extended global Krylov subspaces.
Theorem 9 Let Zm,A(τ)the approximate of ZA(τ)by extended global Arnoldi method, we get the following upper bound for the exponential approximation error em,A(τ) =ZA(τ)− Zm,A(τ):
kem,A(τ)k2≤ kVm+1(Tm+1,mA ⊗Is)k2 Z τ
0
e(u−τ)V(A)kLm,A(u)k2du, (16) where
(
Lm,A(u) =EmTe(t−u)Tm,AEem⊗Is, V(A) =λmax(A+A2T).
Proof We have
ZA(τ) =e(t−τ)AE,
Zm,A(τ) =Vm(e(t−τ)Tm,AEem⊗Is), Then
ZA0(τ) =−Ae(t−τ)AE=−AZA(τ), and
Zm,A0 (τ) =−Vm(Tm,A⊗Is)(e(t−τ)Tm,AEem⊗Is)
=−[AVm−Vm+1(Tm+1,mA EmT⊗Is)](e(t−τ)Tm,AEem⊗Is)
=−AZm,A(τ) +Vm+1(Tm+1,mA ⊗Is)Lm,A(τ).
Therefore, the errorem,A(τ) =ZA(τ)−Zm,A(τ)is such that
e0m,A(τ) =−Aem,A(τ)−Vm+1(Tm+1,mA ⊗Is)Lm,A(τ), which allows to give the following expression ofem,A:
em,A(τ) =− Z τ
0
e(u−τ)AVm+1(Tm+1,mA ⊗Is)Lm,A(u)du. (17) Asτ−u>0, it follows that
ke(u−τ)Ak2≤e(τ−u)µ2(−A)=e(u−τ)V(A). Then, we get
kem,A(τ)k2≤ kTm+1,mA k2 Z τ
0
e(u−τ)V(A)kLm,A(u)k2du.
Notice that ifV(A)is not known butV(A)≥0 (which is the case for positive semidefinite matrices) then we get the upper bound
kem,A(τ)k2≤ kVm+1(Tm+1,mA ⊗Is)k2 Z τ
0
kLm,A(u)k2du.
To define a new upper bound for the norm of the global errorEm(t), we can use the upper bounds for the errorsem,Aandem,Bin the expression (15) stated in theorem8 to get kEm(t)k2 ≤ kFk2etµ2(B)
Z t t0
e−τ µ2(B)kem,A(τ)k2dτ+kEemk2etµ2(A) Zt
t0
e−τ µ2(A)kem,B(τ)k2dτ,
and then we obtain
kEm(t)k2≤ kFk2etµ2(B)kVm+1(Tm+1,mA ⊗Is)k2 Z t
t0
e−τ µ2(B)Sm,A(τ)dτ (18) +kEemk2etµ2(A)kWm+1(Tm+1,mB ⊗Is)k2
Z t t0
e−τ µ2(A)Sm,B(τ)k2dτ, (19) where
Sm,A(τ) =R0τe(u−τ)V(A)kLm,A(u)k2du, Sm,B(τ) =R0τe(u−τ)V(B)kLm,B(u)k2du.
Sincemis generally very small (m<1000), the factorsLm,AandLm,Bcan be computed using theexpm function of Matlab, and we calculate the approximation of the integral ap- pearing in the right sides of (14) and (??) by quadrature formulas.
The approximate solution Xm(t)could be given as a product of two matrices of low rank. It is possible to decompose it asXm=Z1Z2T where the matrixZ1andZ2 are of low rank (lower than 2m). Consider the singular value decomposition of the 2m×2mmatrix
Xm(t) =Ge1ΣGeT2,
whereΣ is the diagonal matrix of the singular values of Xmsorted in decreasing order.
LetX1,l andX2,l be the 2m×lmatrices of the first l columns ofYe1 andYe2 respectively, corresponding to thel singular values of magnitude greater than some tolerancedtol. We obtain the truncated SVD
Xm(t)≈U1,lΣlU2,lT, whereΣl=diag[σ1, . . . ,σl]. SettingZ1,m=Vm
U1,lΣl1/2⊗Is
, andZ2,m=Wm
U2,lΣl1/2⊗Is
it follows that
Xm≈Z1,mZ2,mT . (20)
This is very important for large problems when one doesn’t need to compute and store the approximationXmat each iteration. We notice that there are cheaper ways to compute the low rank representations of the approximate solution; see [2, 5, 4].
The algorithm of first approach for solving large differential Sylvester matrix equations (EGA-exp)is summarized as follows
Algorithm 3The extended global Arnoldi (EGA-exp) method for DSE’s
1. InputsX0ann×smatrix,Aann×nmatrix, B anp×pmatrix,Eann×smatrix and F anp×smatrix.
2. Choose a tolerancetol>0 and an integermmax. 3. Form=1 :mmax
(a) ComputeVmandTm,Aby Algorithm 1 applied to(A,E).
(b) ComputeWmandTm,Bby Algorithm 1 applied to(BT,F).
(c) Set Eem = kEkFe1(2m), Fem = kFkFe(2m)1 and compute Xm,A(τ) = e(t−τ)Tm,AEem, Xm,B(τ) = e(t−τ)Tm,BFemby using the Matlab function expm.
(d) Use a quadrature method to compute the integral (8) and get an approximation ofXm(t)for each t∈[t0,Tf].
(e) IfkRm(t)kF<tolstop.
(f) Using (20), the approximate solutionXm(t)is given byXm≈Z1,mZ2,mT . 4. End
5 Low-rank approximate solutions by extended global Arnoldi algorithm
In this section, we will apply the extended global Arnoldi algorithm to obtain low-rank approximate solutions to the solutionX(t)of the large differential Sylvester matrix equations (1). This approach has been applied for Lyapunov, Stein or Riccati matrix equations, for more details see [2, 5, 6, 21].
Applying the extended global Arnoldi algorithm to the pairs (A,E)and(BT,F), we get F-orthonormal bases{V1,V2, ...,Vm} and{W1,W2, ...,Wm}of extended global Krylov subspacesKmg(A,E)andKmg(BT,F), respectively and we have
Tm,A=VTm(AVm) and Tm,B=WTm(BTWm), where
Vm= [V1,V2, ...,Vm] and Wm= [W1,W2, ...,Wm].
We then consider approximate solutions of the large differential Sylvester matrix equations (1) that have the low-rank form
Xm(t) =Vm(Ym(t)⊗Is)WTm, (21) whereYm(t)the solution of the reduced differential Sylvester matrix equation
Y˙m(t)−Tm,AYm(t)−Ym(t)TTm,B−EemFemT=0. (22) withEem=kEkFe(2m)1 andFem=kFkFe(2m)1 .
Next, we give a result that allows us the computation of the norm of the residual.
Theorem 10 Let Xm(t) =Vm(Ym(t)⊗Is)WTmbe the approximation obtained at step m by the extended global Arnoldi algorithm. Then, the Frobenius norm of the residual Rm(t)as- sociated to the approximation Xm(t)satisfies the relation
kRm(t)k2F ≤ kTm+1,mA Ym(t)kF2+kTm+1,mB Ym(t)k2F, (23) where Ym(t)is the2×2m matrix corresponding to the last2rows of Ym.
Proof The proof is similar to the theorem (4).
Note that, we have to solving the reduced order differential Sylvester matrix equation (22) by Backward Differentiation Method (BDF) or by the Rosenbrock method, see [9, 16, 25].
We summarize the steps of the second approach extended global Arnoldi whit PDF or Rosenbrock method in the following algorithm
Algorithm 4The extended global Arnoldi and PDF or Rosenbrock method for DSE’s (EGA)
1. InputX0, a tolerancetol>0, an integermmax. 2. Form=1 :mmax
(a) Apply the extended block Arnoldi algorithm to(A,E)and(BT,F)to get theF−orthonormal matri- cesVmandWmand the upper block Hessenberg matricesTm,AandTm,B.
(b) Use the BDF or Rosenbrock method to solve the differential Sylvester equation (22).
(c) IfkRm(t)kF<tolstop.
(d) Compute the approximate solutionXm(t)by using (20).
3. End
6 Numerical experiments
In this section, we present some numerical experiments of large and sparse differential Sylvester matrix equations. We give comparisons between the exponential approach (EGA- exp), low-rank approximate solutions by extended global Arnoldi algorithm via PDF method (EGA-PDF) and low-rank extended global Arnoldi method via Rosenbrock method (EGA- ROS). The algorithms are coded in Matlab2014. All the experiments were performed on a laptop with an Intel Core i5 processor and 4GB of RAM. In all of the examples, the coeffi- cients of the matricesEandFwere random values uniformly distributed on[0,1].
Example 1
In this first example, the matricesAandBare obtained from the centered finite difference discretization of the operators:
LA(u) =∆u+f1(x,y)∂u
∂x+,f2(x,y)∂u
∂y+f(x,y)u LB(u) =∆u+g1(x,y)∂u
∂x+g2(x,y)∂u
∂y+g(x,y)u
on the unit square[0,1]×[0,1]with homogeneous Dirichlet boundary conditions. The num- ber of inner grid points in each direction wasn0andp0for the operatorsLAandLB, respec- tively. The matricesAandBwere obtained from the discretization of the operatorLAandLB
with the dimensionsn=n20andp=p20, respectively. The discretization of the operatorLA(u) andLB(u)yields matrices extracted from theLyapackpackage [23] using the command fdm 2d matrix and denoted as A = fdm(n0,’f 1(x,y)’,’f 2(x,y)’,’f(x,y)’). In this example, n=10,000 ands=4900, respectively, and are named asA=fdm(n0,f1(x,y),f2(x,y),f(x,y)) andB=fdm(s0,g1(x,y),g2(x,y),g(x,y))withf1(x,y) =−exy,f2(x,y) =−sin(xy),f(x,y) = y2,g1(x,y) =−100ex,g2(x,y) =−12xyandg(x,y) =p
x2+y2. For this experiment, we usedr=3. The time interval considered was[0,2]and the initial conditionX0=X(0)was X0=Z0Z0T, whereZ0=0n×2. The tolerance was set to 10−10for the stop test on the resid- ual. For the EGA-BDF and Rosenbrock methods, we used a constant timesteph=0.1. The entries of the matricesEandF were random values uniformly distributed on the interval [0,1]and their rank were set tos=2, we chose a size of 2500×2500, 2500×2500 for the matricesAandB, respectively.
In Figure (1), we plotted the Frobenius norms of the residualskRm(Tf)kF at final timeTf
versus the number of extended global Arnoldi iterations for the EGA-exp, EGA-BDF and EGA-ROS methods.
In Figure (2),the matricesAandBare obtained from the discretisation of the operator LA(u)andLB(u)with dimensionsn=10000 ands=4900, respectively. we plotted the Frobenius norms of the residualskRm(Tf)kF at final timeTf versus the number of extended global Arnoldi iterations for the EGA-exp, EGA-BDF and EGA-ROS methods
Example 2
For the second set of experiments, we use the matricesadd32,pde2961, andthermalfrom the University of Florida Sparse Matrix Collection [12] and from the Harwell Boeing Col- lection (http://math.nist.gov/MatrixMarket). The tolerance was set to 10−7for the stop test
0 5 10 15 20 25
−8
−6
−4
−2 0 2 4
m iterations log10(||R(m)||F)
EGA−exp EGA−BDF EGA−ROS
Fig. 1 Residual norm vs numbermof extended global Arnoldi iterations
0 5 10 15 20 25 30 35
−8
−6
−4
−2 0 2 4 6
m iterations log10(||R(m)||F)
EGA−exp EGA−BDF EGA−ROS
Fig. 2 Residual norm vs numbermof extended global Arnoldi iterations
on the residual. For the EGA-BDF and EGA-Ros methods, we used a constant timestep h=0.01,
In Figure (3),the matrices A=thermal and B=add32 with dimensions n=3456 and p=4960, respectively, ands=3. we plotted the Frobenius norms of the residuals kRm(Tf)kF at final timeTf versus the number of extended global Arnoldi iterations for the EGA-exp, EGA-BDF and EGA-ROS methods
0 2 4 6 8 10 12
−10
−8
−6
−4
−2 0 2 4 6 8
m iterations log10(||R(m)||F)
EGA−exp EGA−BDF EGA−ROS
Fig. 3 Residual norm vs numbermof extended global Arnoldi iterations
In Figure (4), we used the matricesA=pde2961 andB=fdm(s0,100ex,p
2x2+y2,y2− x2)with dimensionsn=2961 ands=8100, respectively, andr=3. we plotted the Frobe- nius norms of the residualskRm(Tf)kFat final timeTfversus the number of extended global Arnoldi iterations for the EGA-exp, EGA-BDF and EGA-ROS methods
1 2 3 4 5 6 7
−10
−8
−6
−4
−2 0 2 4 6
m iterations log10(||R(m)||F)
EGA−exp EGA−BDF EGA−ROS
Fig. 4 Residual norm vs numbermof extended global Arnoldi iterations