HAL Id: inria-00091944
https://hal.inria.fr/inria-00091944
Submitted on 7 Sep 2006
HAL
is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire
HAL, estdestinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Lagrangian Approaches for a class of Matching Problems in Computational Biology
Nicola Yanev, Rumen Andonov, Philippe Veber, Stefan Balev
To cite this version:
Nicola Yanev, Rumen Andonov, Philippe Veber, Stefan Balev. Lagrangian Approaches for a class
of Matching Problems in Computational Biology. [Research Report] PI 1814, 2006, pp.18. �inria-
00091944�
I R I
S A
IN STITUT D
E R
ECHERCHEE N IN
FORMATIQU
EET SYSTÈ MES
ALÉ
ATOIRES
P U B L I C A T I O N I N T E R N E
N
oI R I S A
1814
LAGRANGIAN APPROACHES FOR A CLASS OF MATCHING PROBLEMS IN COMPUTATIONAL BIOLOGY
NICOLA YANEV AND RUMEN ANDONOV AND PHILIPPE
VEBER AND STEFAN BALEV
INSTITUT DERECHERCHE EN INFORMATIQUE ETSYSTÈMESALÉATOIRES
Campus de Beaulieu – 35042 Rennes Cedex – France Tél. : (33) 02 99 84 71 00 – Fax : (33) 02 99 84 71 71 http://www.irisa.fr
!
" !!!# $ ! !
! % & ' () ! & !* +
& ! ! ! !* + $ !&
! & !* & !! &' )
!!)( &, - $ * +!-!
) # !!*
% '!- ' ! ', -%
. !! & #!&/ 0 !
/!/ '! & *
1 ! ! ! &0!! * 1!/&
&!&0 &!'&!
!&* 1/& &! !!! //&&'!
!! &-!!& - !/ * 23' '4
0 &&&/-!/ & /'0# /&!!*
/%' !- /' ! '
-
A C G C A A
A G T C T
ACG_CAA A_GTC_T
5 6 7 ! !
7 ! ! 8 ! ) % !! %
* 7 ! $$ & ! &
$ * 7 ! ! !
! !* 9 ! $ ! %
&:1' &! &
! * ; ( !' ! $ &
!$ & ! - &* -!'5 *)
) ! !* +
% ) &*
9 !! ) & ) ! ! !
! N'()! & !<"=*! &&
m & < (= li' i = 1, . . . , m* <
& = $ (* & &
(!$ &&&$!* & ! & &
! (* 5( ! )) $! * 9(i
j' $! ri=j−i−1
k=1lk* 9 )!
< $=! )&n=N+ 1−m
i=1li <5 *<==* "
&
T ={(r1, . . . , rm)|1≤r1≤ · · · ≤rm≤n}.
& ! ! ! !(U∪V, U×V)') U ={u1, . . . , um} && (& V ={v1, . . . , vn} && $
! * "& & & )
)* M ⊆U×V 6
< = d(u) = 1'u∈U <)d(x) &x=* " ( &
-! =* >) ! &
m*
< = " &' ! ' (ui, vj) ∈ M' (uk, vl) ∈ M & i < k'
j ≤l* " (!$ & & &$!* "
$! *
1 ) < = ' < = ! 3 !
&(& #* 5 *<=) !& & *
|T |=m+n−1
m
+&3 $! ri =j−i−1
k=1lk+i−1* 9 $
! & &
1≤r1<· · ·< rm≤m+n−1
&& & & mm+n−1! *
<=
! " # $ ! " # $
% ! " #
% ! " #
% ! " #
<=
1 2 3
1 2 3 4 5 6 7 8 9
V
U
<=
5 6 <= 2-! & !
'?&* <=.!&)& $(! *
<= !& <=*
@ ! ) & ) ! &! -
)& !< =* 9& & )!!
)!& &)&$!# & &*
+ & G= (U×V, E)* 2 $- !!&
& !* 5 ! )) & $ vij' i = 1, . . . , m' j = 1, . . . , n & &) n×m & < 5 * =* " $ vij' j = 1, . . . , n )
& i * !& ( & $- !&
! ( *
@&! $ G) !& !
& !* 9 & m% G*
!! & AB* + &$&'%
$ &&$) !6 E={(vij, vi+1,l)|i= 1, . . . , m−1,1 ≤j ≤l ≤n}* + &&)$ S & T & & S
$ 3 & T $ * 1)
%% !& ) & < =&
S%T ! G* 5 * !&*
" ) ) $ $ $ ) & *
! ! $$ & *
" ! ) & !* 5 -!'
!* > & !
$' ' %+ 1&%+'
$ )&3& S%T !* + '
!& #&!& ! *
" "$&! )
&& ! !(* " < (=)!
block position
T S
i = 1 i = 2 i = 3 i = 4 i = 5 i = 6
j = 1 j = 2 j = 3 j = 4
5 6 2-! !* " ! ( !& &
) ! ('''''*
(& &%& 8&!!')$ (
&)&! (* ,L&6 L={(i, k)|i < k&(i &k }
) & & ( ) & ( & (* ,
R={(i, k)|(i, k)∈L, k−i >1} <%= (* " ( L\R
& (* + )!! ! &(
*
" ()() &!&(! * 9 )
" !&) #
• cij' i= 1, . . . , m' j= 1, . . . , n'! (i! j
• dijkl'(i, k)∈L'1≤j≤l≤n'& )(i&
k )(i ! j &(k ! l*
"# cij < = ! &
!& (i! &! ')! )
) & (i* "# dijkl &
) ! & ( i & j* ,! < ) &
(=$! 3 ' && # di,j,i+1,l*
" & !& # & "
! 8 ! 3& & * 9 ( <
R=∅=)!# $ && !&
" $ ! 3& S%T !* 9& (
( ')&& !&
{(vij, vkl)|(i, k)∈R, 1≤j≤l≤n}
) )) z%&*
S%T ! & $z%& $& !* 2S%T !
$ -|R| z%&' ( R* " ! && &
S%T !& $&z%& & &!* "" $ 3&
&! !<5 *=*
)) ' &$ ! ! !
& 3& S%T ! ) ! & & &C
$ * " ! $& $ & !
!- O(mn2)* 9 & && ) & ! !
! ! ! !*
j = 2 j = 3 j = 4
i = 1 i = 2 i = 3 i = 4 i = 5 i = 6
j = 1
block position
T S
c1122
c2232
c3243 c4354
c5464
c1132
c3264
c4364
5 6 2-!&!* " 8&!! $ *
"xS%T ! & * " $&z% && * "
&! & (1,2,2,3,4,4)*
,yij $ &$ G* yij (i ! j&
8) * ,Y !!&3&) 6
n j=1
yij= 1 i= 1, . . . , m <=
j l=1
yil− j l=1
yi+1,l≥0 i= 1, . . . , m−1, j= 1, . . . , n−1 <=
yij ≥0 i= 1, . . . , m, j= 1, . . . , n <=
. <= & < = &<= ! < =* " )
Y ∩Bmn - & *
9 & ( ' ) & & $ %
zijkl' (i, k) ∈ L' 1 ≤ j ≤ l ≤ n* " $ & &&& ) ) $ % $ yi = (yi1, ...yin) ∈ Bn ) & ci = (ci1, ...cin) ∈ Rn &
zik = (zi1k1, . . . , zi1kn, zi2k2, . . . , zi2kn, . . . , zinkn) ∈ Bn(n+1)2 (i, k) ∈ L ) &
dik = (di1k1, . . . , di1kn, di2k2, . . . , di2kn, . . . , dinkn)∈Rn(n+1)2 *
. & 2n×n(n+1)2 &%& & - !!& ) % i&k* " -A 3n)<!* A n
)=!& i<!* k=*
1)! & !&3&
zIPL =v(P T P(L)) = min{
m i=1
ciyi+
(i,k)∈L
dikzik} <=
6 y= (y1, . . . , ym)∈Y, <?=
yi=Azik (i, k)∈L < =
yk=Azik (i, k)∈L <D=
zik∈Bn(n+1)2 (i, k)∈L <=
" v(.)) & ! $ $!
& P T P(L)) z $ 3-&*
9 ) & !!&3& <?=%<D= & zik ∈ R
n(n+1)
+ 2 '
) !L!- $ P T P !*
" '$-ci&8* " ! $
cij &&&di,j,i+1,l'l=j, . . . , n* +) &dik n×n
# dijkl $ & &
) & * 9 & ! & ! $
) &) -! *
,A & B ) ! 8* A·B -!&
A & B ) && ! !& EminF & ! !
!&EGF*
,A &B ) 8 n×n* M =A⊗B &3& M(i, j) = mini≤r≤jA(i, r) +B(i, j)
>))!( &! ("! $*
&$' " &3& S%T !
!) &O(mn2)& ! * !!!
! & ,& ! *
Y
, A - # <=%<= ) & &
$ * @ !$ A & <";= ! ) ";!$ *
!i= 1, . . . , n
&(i, n)< =
!i= 1, . . . , m
!j=n−1, . . . ,1
! $aij <A "; H - &! $! A ";
&(i, j)<) =
"3 - ";* ";!$ '
A ";&Y *
@ &!$ ) <=%<=
$- %$&$ Y* "$-<
$ = &!!- !)
Y ) && *
,y % <=%<=* > <=' <= I) f = (fsj, f(i,k)(i+1,j)) i= 1, m−1 j= 1, n G- **
k≤j
f(i,k)(i+1,j)=yij i= 1, m−1 fsj=y1j j= 1, n
>)()!! )(I)!!' I)f -!&
$- %$& I)<! G)* > I)!&
&i, k, p, j ' ( )(i, k) )(p, j)
* % '
%$& y' ** yij = f(i−1,k)(ij) = 1* "' $- !
$f $ $- ! $$ Y $y*
" & # 3& $ ! ! $- %
& & (!$*
") ( (i1, k1) & (i2, k2) i1 < i2 & ) k1 !
$(i2, k2)* ")!L & &
! $3 AB* J)!& H(O(mn3)
!- " *
9 L &' P T P(L) $ & $ && &!&
!* 2 ! !)$ )
i & k' & & ( && (i, k)* " !
& -Dik Dik(j, l) ! ) $ (i, j)& (k, l)*
1 j > l' ! !' Dik(j, l) # *
5 ' P T P(L) D1m*
+ ((i, k), i < k && $[a, b])[i, k]⊆[a, b]* ,&
L(ik) (L && [i, k]* "' !Dik(&
)6
* 9L(ik)={(i, k)}& - $ Dik=
dik (i, k)∈L
˜0 ) <K=
) ˜0 !! - !$ &3& < % 3 ) & =&$ 8 !!!*
* @) ' L(ik) &' - s∈[i, k] &
L(ik)-!(i, k) && [i, s] [s, k]* "
Dik =
Dis·Dsk+dik (i, k)∈L
Dis·Dsk )
<=
9 ! m $ ' & &' !
&!& O(m) !* 5 ' ! !&
& - O(n3) !& < - ! ) (min,+)! =* @$
!- O(mn3)* "! ' n )& & m'&
! ' ! &-! $$*
&L(i)={(i, k1), . . . ,(i, kr)}'k1< k2< . . . kr & *
L(i)={(i, k1), . . . ,(i, kr)} Dikr = (. . .(dik1⊗dik2)⊗. . .)⊗dikr
"!) & ! ! 6
L={(i, k1), . . . ,(i, kr)}=L
{(i, kr)}')$v(L:zijkrl = 1) =dijkrl+ minj≤s≤lv(L : zijkr−1s= 1)*
&+ ' ) *
+( ) , -
+ +
9& !A⊗B' )) 6 M -&3&
M(i, j) = mini≤r≤jA(i, r)'
M(i, j) = min{M(i, j−1), A(i, j)}, j ≥i
5 A⊗B = M+B* 5 ⊗ ! n×n
!- O(n2)&!- " O(rn2)*
L $!L={(i1, k1), . . . ,(ir, kr)}' P T P(L)&!& ) &!%
&!)- e∈(1, m)&L &&
[1, e]' [e, m]* ,I={i1, . . . , is}&& & ' I)&! P T P(L) ) &!&!* !!&&
t≤s−1' !Ditit+1* ")$) 6
" p= (p1, p2, . . . , pn) =Di1i2·Di2i3·. . .·Dis−1is·pp= (0,0, . . . ,0)
ipi=v(P T P(L:y1i= 1)) v(P T P(L)) = min1≤i≤n{pi}
2 ! Dikik+1 &3 p
! $ !! !) *
+ && $'!- P T P(L) %
! O(sn2)!! Ditit+1*
5 )! ' !&!&
&!& '& & ' O(srn2)
')s & &! '&r- &
'$!& "*
) & $)L &&
'!!&3&< =%<D=&zik ∈R
n(n+1)
+ 2 *
, Pyz !! &3& <?=%<D= & zik ∈ R
n(n+1)
+ 2 & PyzI $-
! <?=%<=* +) PyzI & !!*
!& ! )& $&) , -%
<=%<=* " $ !$ !L !!
Pyz % * + $ & < & = L ( Pyz =PyzI * M %%!& )& '&3& ! Y
&! !G* 9L={(i, i+ 1), i= 1, m−1}Pyz & !
I) G !!* ;' !!&
*
# n≥3 L PyzI =Pyz R=
(⇒) + ) (R= (1,3)'m= 3 &n= 3* " ! A = (y11 = y12 = y21 = y22 = 0.5, y32 = 0.75, y33 = 0.25, z1121 = z2132 = z1222 = z1232 = 0.5, z2232=z2233 =z1132 =z1133 = 0.25) ∈Pyz & <)$-&
! A= %$&$ Pyz B = (y11 =y21 =y32 =z1132 = 1)&
C= (y12=y22=y32=z1232= 1) A [B, C]* " 8
! m, n≥3 &R )&*
(⇐)5)& "*
" ( & $ , *