HAL Id: hal-00417717
https://hal.archives-ouvertes.fr/hal-00417717
Submitted on 16 Sep 2009
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
Extracting Constrained 2-Interval Subsets in 2-Interval
Sets
Guillaume Blin, Guillaume Fertin, Stéphane Vialette
To cite this version:
Guillaume Blin, Guillaume Fertin, Stéphane Vialette. Extracting Constrained 2-Interval Subsets in
2-Interval Sets. Theoretical Computer Science, Elsevier, 2007, 385 (1-3), pp.241-263. �hal-00417717�
Extra ting Constrained
2
-Interval Subsetsin
2
-Interval Sets⋆
Guillaume Blin
IGM-LabInfo - UMRCNRS 8049
Université deMarne-la-Vallée
77 454Marne-la-Vallée Cedex 2 - FRANCE
Guillaume Fertin
LINA,FRE CNRS 2729
Université deNantes, 2 rue de la Houssinière
BP 92208 44322 Nantes Cedex 3 - FRANCE
Stéphane Vialette
Laboratoire deRe her he enInformatique (LRI), UMRCNRS8623
Université Paris-Sud, 91405 Orsay Cedex - FRANCE
Abstra t
2
-interval sets were used in [28,29℄ for establishing a general representation for ma ros opi des ribers of RNAse ondary stru tures. In this ontext, we have a2
-intervalforea hlegallo alfoldinagivenRNAsequen e,anda onstrainedpatternmadeofdisjoint
2
-intervalsrepresentsaputativeRNAse ondarystru ture.Wefo us hereontheproblemofextra tinga onstrained patterninasetof2
-intervals.More pre isely, given a set of2
-intervalsD
and a modelR
des ribing if two disjoint2
-intervalsinasolution anbeinpre eden eorder(<
),beallowedtonest(⊏
)and/or be allowed to ross(≬
), we onsider the problemof nding amaximum ardinality subsetD
′
⊆ D
ofdisjoint
2
-intervals su h thatanytwo2
-intervals inD
′
agreewith
R
. The dierent ombinations of restri tions on modelR
alter the omputational omplexity ofthe problem, and need tobe examinedseparately.Inthispaper,weimprovethetime omplexityof[29℄formodel
R
=
{⊏}
bygiving an optimalO(n log n)
time algorithm, wheren
is the ardinality of the2
-interval setD
. We also give a graph-like relaxation for modelR
=
{⊏, ≬}
that is solvable inO(n
2
√
n)
time. Finally, we prove that the onsidered problem is NP- omplete
for model
R
=
{<, ≬}
even for same-length intervals, and give a xed-parameter tra tability resultbasedon the rossing stru tureofD
.The problem of establishing a general representation of stru tured patterns,
i.e., ma ros opi des ribers of RNA se ondary stru tures, was onsidered in
[28,29℄.The approa histoset up ageometri des ription ofheli es by means
of a natural generalization of intervals, namely a
2
-interval. A2
-interval isthe disjoint union of two intervals on the line. The geometri properties of
2
-intervalsprovideapossibleguideforunderstandingthe omputationalom-plexity of nding stru tured patterns in RNA sequen es. Using a model to
represent non sequential informationallows usfor varying restri tions onthe
omplexity of the pattern stru ture. Indeed, two disjoint
2
-intervals,i.e., two2
-intervalsthat donot interse tinany point, an be inpre eden e order(<
), be allowed to nest (⊏
) or be allowed to ross (≬
). Furthermore, the set of2
-intervals and the pattern an have dierent restri tions, e.g., all intervalshave the same length or all the intervals are disjoint. These dierent
om-binations of restri tions alter the omputational omplexity of the problems,
andneedtobeexaminedseparately.Thisexaminationprodu ese ient
algo-rithmsformore restri tivestru tured patterns, andhardness resultsfor those
less restri tive.
In this paper, we onsider the problem of nding a onstrained patternin a
set of
2
-intervals.More pre isely, given a set of2
-intervalsD
and a modelR
des ribing if two disjoint2
-intervals in a solution an be inpre eden e order (<
), be allowed to nest (⊏
) and/or be allowed to ross(≬
), we onsider theproblemofndingamaximum ardinalitysubset
D
′
⊆ D
ofdisjoint
2
-intervals su hthat any two2
-intervalsinD
′
agree with
R
. The problem of ndingthelargest
2
-interval pattern in a set of2
-intervalsD
with respe t to a givenabstra t model, referredhereafter asthe
2
-IntervalPatternproblem,hasbeen introdu edby Vialette [28,29℄. Vialettedivided the problemindierent
lasses based on the stru ture of the modeland gave for most of them either
NP- ompletenessresultsorpolynomial-timealgorithms.Dividingtheproblem
in several lasses was later proved to be extremely useful for approximating
of the
2
-IntervalPattern problem[8℄.⋆
An extended abstra t of this work appeared in Pro eedings of the
15
th
Annual
Symposium on Combinatorial Pattern Mat hing (CPM 2004) [5℄. This work was
partiallysupported by theCNRS ACIMasse deDonnï¾
1
2
es NavGraphe proje t. Email addresses: gblinuniv-mlv.fr(Guillaume Blin),fertinlina.univ-nantes.fr(Guillaume Fertin), vialettelri.fr (Stéphane
Inthe present paper, wefo us onthree spe ial ases of the
2
-Interval Pat-tern problem:
(1) The
2
-intervalsof the solution subsetneed to be pairwisenested,(2) Two
2
-intervals in a solution an only be nested or rossing, and allthe intervalsinvolved inthe2
-interval setD
are disjoint,and(3) Two
2
-intervalsinasolution anonlybe nestedorinpre eden e, andall the intervals involved in the2
-intervalsetD
have the same length. We givepre ise results for these threeproblems. Those three problems are ofimportan e sin e ea h one is a straightforward extension of the problem of
nding a given
2
-interval set in another2
-interval set introdu ed in [29℄ and furtherstudiedin[19℄and[23℄,andhen eisstronglyrelated,inthe ontextofmole ularbiology,topatternmat hing overRNA se ondary stru tures.More
pre isely, in this paper, we improve the time omplexity of the best known
algorithmfor
R
=
{⊏}
by givinganoptimalO(n log n)
timealgorithm.Also,we give a graph-like relaxation for
R
=
{⊏, ≬}
that is solvable inO(n
2
√
n)
time.Finally,weprovethattheproblemisNP- ompletefor
R
=
{<, ≬}
,and,we give a xed-parameter tra tability result based on the rossing stru ture
of
D
. Those results almost omplete the table proposed by Vialette [29℄ (seeTable1)andprovideanimportantsteptowards abetterunderstandingofthe
pre ise omplexity of
2
-interval patternmat hing problems.There are basi ally two main lines of resear h our results refer to: (i)
ar -annotated sequen es and protein topologies,and (ii)
t
-intervals ombinatori s.•
Fora sequen eS
,an ar -annotation ofS
is a set of unordered pairs of po-sitionsinS
. In this ontext, given two ar -annotated sequen esS
1
andS
2
,theAr -PreservingSubsequen e(APS)problemaskstondan
o ur-ren eof
S
1
inS
2
, and the Longest Ar -Preserving CommonSubse-quen e(LAPCS)problemsaskstondthelongest ommonar -annotated
sequen e that o urs both in
S
1
andS
2
. The APS and LAPCS problemsareusefulinrepresentingthe stru turalinformationofRNAand protein
se-quen es[11,21,18,1℄.Thebasi ideaistoprovideameasureforsimilarity,not
onlyonthesequen e level,butalsoonthestru turallevel(anar -annotated
sequen eisviewedasaRNAsequen etogetherwithphosphodiesterbonds).
Furthermore,asimilarproblemto ompare thethree-dimensionalstru ture
of proteins is the Conta t Map Overlap problem des ribed by in[16℄.
Viksnaand Gilbert des ribed algorithmsfor patternmat hing and pattern
learninginTOPS diagram(formaldes ription of protein topologies)[30℄.
•
Our results are also related to the independent set problem in dierentextensions of
2
-interval graphs. A graphG
is at
-interval graph if there is an interse tion model whose obje ts onsist of olle tions oft
intervals,t
≥ 1
, su h thatG
is the interse tion graph of this model [26,20℄. FromOf parti ular interest is the lass of
2
-interval graphs. For example, line graphs,trees and ir ular-ar graphs are2
-intervalgraphs. However, Westand Shmoys [31℄ have shown that the re ognition problem for
t
-intervalgraphsisNP- ompleteforevery
t
≥ 2
(this hastobe omparedwith lineartime re ognition of
1
-interval graphs). In the ontext of sequen e similar-ity, [22℄ ontains an appli ation of graphs having interval number at mosttwo. In [3℄, the authors onsidered the problem of s heduling jobs that are
given as groups of non-interse ting segments onthe real line. Of parti ular
importan e, they showed that the maximum weighted independent set for
t
-interval graphs (t
≥ 2
) is APX-hard even for highly restri ted instan esAlso, they gave a
2t
-approximation algorithm for general instan es basedonafra tional version ofthe Lo alRatio Te hnique [2℄.Finally,some
om-plexityissuesofstandardoptimizationproblems for
t
-intervalsetsare given in[6℄.The remainder of the paper is organized as follows. In Se tion 2 we briey
review the terminologyintrodu ed in [29℄. In Se tion 3, we improve the time
omplexity of the best known algorithmfor model
R
=
{⊏}
.In Se tion4, wegive a graph-like relaxation for model
{⊏, ≬}
that is solvable inpolynomial-time. In Se tion 5, we prove that the
2
-interval pattern problem for modelR
=
{<, ≬}
is NP- omplete even when all intervals involved in the input2
-interval set have the same length. Finally, we give in Se tion 6 axed-parameter tra tabilityresult based on the rossingstru ture of
D
.2 Preliminaries
An interval and a 2-interval represent respe tively a sequen e of ontiguous
basesandpairingsbetweentwointervals,i.e.,stems,inRNAse ondary
stru -tures. Thus,
2
-intervals an be seen as ma ros opi des ribers of RNAstru -tures.
Formally, a
2
-interval is the disjoint union of two intervals on a line. We denote it byD
= (I
1
, J
1
)
whereI
1
andJ
1
are intervals su h thatI
1
< J
1
(here<
is the stri t pre eden e order between intervals) ; in that ase wealso write
Left(D) = I
1
andRight(D) = J
1
. If[x : y]
and[x
′
: y
′
]
are two
intervals su h that
[x : y] < [x
′
: y
′
]
, we will sometimes write
D
= ([x :
y], [x
′
: y
′
])
to emphasize on the pre ise denition of the
2
-intervalD
. LetD
1
= (I
1
, J
1
)
andD
2
= (I
2
, J
2
)
betwo2
-intervals.They are alled disjoint if(I
1
∪J
1
)
∩(I
2
∪J
2
) =
∅
(i.e.,involvedintervalsdonotinterse t). The overing intervalofa2
-intervalD
,writtenCover
(D)
,istheleast interval overingbothLeft
(D)
andRight
(D)
.(I
1
, J
1
)
andD
2
= (I
2
, J
2
)
. We will writeD
1
< D
2
ifI
1
< J
1
< I
2
< J
2
,D
1
⊏ D
2
ifI
2
< I
1
< J
1
< J
2
andD
1
≬ D
2
ifI
1
< I
2
< J
1
< J
2
. Two2
-intervalsD
1
andD
2
areτ
- omparable for someτ
∈ {<, ⊏, ≬}
ifD
1
τ D
2
orD
2
τ D
1
. LetD
be a set of2
-intervals andR
⊆ {<, ⊏, ≬}
be non-empty. The setD
isR- omparableifanytwodistin t2
-intervalsofD
areτ
- omparableforsome
τ
∈ R
.Throughoutthepaper, thenon-emptysubsetR
is alledamodel.Clearly, if a set of
2
-intervalsD
isR
- omparable thenD
is a set of disjoint2
-intervals.The ground set of aset of2
-intervalsD
,writtenGS(
D)
, isthe set ofallsimpleintervalsinvolved inD
,i.e.,GS
(
D) =
S
D
∈D
(Left(D)
∪ Right(D))
. The leftmost (resp. rightmost)elementof a set of disjoint2
-intervalsD
isthe2
-intervalD
i
∈ D
su hthatLeft(D
i
) < Left(D
j
)
(resp.Right(D
j
) < Right(D
i
)
) for allD
j
∈ D − D
i
. Observe that it ould be the ase thatD
i
is both theleftmost and rightmost element of
D
(this is indeed the ase if|D| = 1
or ifD
j
⊏ D
i
for allD
j
∈ D − D
i
).We denehereafter two additionalparameterson
D
.The depthofD
,writtenDepth(
D)
, is the size of a maximum ardinality{≬}
- omparable subset ofD
(a ording to [29℄, this parameter is polynomial-time omputable). The
for-ward rossingnumberof
D
,writtenFCrossing(
D)
,isdenedbyFCrossing(
D) =
max
D
i
∈D
|{D
j
: D
i
≬ D
j
}|
. Clearly,FCrossing(
D) ≥ Depth(D) − 1
for any setD
of 2-intervals.Following[11℄,Vialetteproposedin[29℄,twonaturalrestri tionsontheground
set of
D
(a third restri tion, i.e., balan ed2
-intervals, well-suited forinvesti-gating RNA se ondary stru tures spa e was introdu ed in[8℄):
(1) allthe intervals of the ground set
GS
(
D)
are of the same length,(2) alltheintervalsof thegroundset
GS(
D)
are disjoint,i.e.,if twointervalsI, I
′
∈ GS(D)
overlap, thenI
= I
′
.
Using restri tions on the ground set allows us for varying restri tions on the
omplexity ofthe
2
-intervalset stru ture, andhen e onthe omplexity ofthe problems. These two restri tions involve three levels of omplexity:•
unlimited:no restri tions•
unit: restri tion 1•
disjoint: restri tions 1and 2Given a set of
2
-intervalsD
, a modelR
⊆ {<, ⊏, ≬}
and a positive integerk
, the2
-Interval Patternproblem onsists in ndinga subsetD
′
⊆ D
of
ardinalityatleast
k
su hthatD
′
is
R
- omparable.Forthesakeofbrevity,the2
-Interval Pattern problemwith respe t to a modelR
over anunlimited(resp. unit and disjoint) ground set is abbreviated in
2
-IP-Unl-R
(resp.2
-IP-Unit-R
and2
-IP-Dis-R
).with respe t tothe models
{<}
,{⊏}
,{≬}
and{<, ⊏}
( f. Table 1).Inthisarti le,weanswerthree openproblemsand weimprovethe omplexity
of another one, as shown inTable 1. Moreover, we show that
2
-IP-Unit-{<
,
≬
}
is xed parameter tra table when parameterized by the forward rossingnumber of
D
.2
-IntervalPatternProblemGroundSet
Model Unlimited Unit Disjoint
{<, ⊏, ≬}
NP- ompleteO(n
√
n)
[24℄{⊏, ≬}
NP- ompleteO(n
2
√
n) ⋆
{<, ⊏}
O(n
2
)
{<, ≬}
NP- omplete⋆
?{<}
O(n log n)
{⊏}
O
(n log n) ⋆ •
{≬}
O(n
2
log n)
Table 12
-interval pattern problem omplexity wheren
=
|D|
.When not spe ied, the omplexity omes from[29℄.⋆
ontributions of thepresent paper.•
improvement of theexisting omplexity (whi h wasO(n
2
)
in[29℄).
3 Improving the omplexity of
2
-IP-Unl-{⊏}
The problem of nding the largest
{⊏}
- omparable subset in a set of2
-intervals was onsidered in [29℄. Observing that this problem is equivalent
to nding a largest lique in a omparability graph (a linear time solvable
problem[17℄), an
O(n
2
)
time algorithmwasthusproposed. We improvethat
result by givingan optimal
O(n log n)
time algorithm.The ine ien y of the algorithm proposed in [29℄ lies in the ee tive
on-stru tion of a omparability graph. We show that this onstru tion an be
avoided by onsidering trapezoids instead of
2
-intervals. Re all that atrape-zoid graph is the interse tion graph of a nite set of trapezoids between two
parallellines[9℄(itiseasilyseenthattrapezoidgraphsgeneralizebothinterval
graphs and permutation graphs). Analogously to
2
-intervals, we will denoteby
T
= ([x : y], [x
′
: y
′
])
the trapezoid with top interval
[x : y]
and bottominterval
[x
′
: y
′
]
.
Proposition 1
2
-IP-Unl-{⊏}
issolvable inO(n log n)
time.line. Constru t a olle tion of trapezoids
T = {T
1
, T
2
, . . . , T
n
}
between two parallellinesasfollows.Forea h2
-intervalD
i
= ([x : y], [x
′
: y
′
])
∈ D
,weadd
the trapezoid
T
i
= ([x : y], [
−y
′
:
−x
′
])
to
T
.Claim 2 Forall
1
≤ i ≤ j ≤ n
,the2
-intervalsD
i
andD
j
are{⊏}
- omparable if and only if the trapezoidsT
i
andT
j
are non-interse ting.PROOF. [of Claim℄Let
D
i
= ([x
i
: y
i
], [x
′
i
: y
′
i
])
andD
j
= ([x
j
: y
j
], [x
′
j
: y
′
j
])
be two
2
-intervals ofD
, andT
i
= ([x
i
: y
i
], [
−y
′
i
:
−x
′
i
])
andT
j
= ([x
j
:
y
j
], [
−y
j
′
:
−x
′
j
])
be the two orresponding trapezoids inT
. Suppose thatD
i
andD
j
are{⊏}
- omparable. Without loss of generality, we may assumeD
j
⊏ D
i
. Thus, we havey
i
< x
j
andy
′
j
< x
′
i
. It follows immediately that−x
′
i
<
−y
j
′
,and hen e the two trapezoidsT
i
andT
j
are non-interse ting.Theproof of the onverse isidenti al.
2
Clearly, the olle tion
T
an be onstru ted inO(n)
time. Based on ageo-metri representationof trapezoidgraphsby boxes intheplane, Felsneret al.
[12℄ have designed a
O(n log n)
algorithm for nding a maximum ardinalitysub olle tionof non-interse ting trapezoids in a olle tionof trapezoids, and
the propositionfollows.
2
Based on Fredman's bound for the number of omparisons needed to
om-putemaximumin reasingsubsequen esinpermutation[13℄,Felsneret al.[12℄
argued that their
O(n log n)
time algorithm for ndinga maximumardinal-ity sub olle tion of non-interse ting trapezoids in a olle tion of trapezoids
is optimal. Then itfollows from Proposition 1 that our
O(n log n)
timealgo-rithm for nding a maximum ardinality
{⊏}
- omparable subset in a set of2
-intervalsisoptimal aswell.4 A polynomial-time algorithm for
2
-IP-Dis-{⊏, ≬}
In this se tion, we give an
O(n
2
√
n)
time algorithmfor the
2
-IP-Dis-{⊏, ≬}
problem, wheren
is the ardinality of the set of2
-intervalsD
. Re all that given aset of2
-intervalsD
overadisjointgroundset, theproblemaskstondthesize ofamaximum ardinality
{⊏, ≬}
- omparablesubsetD
′
⊆ D
.Observe
that the
2
-IP-Dis-{⊏, ≬}
problem has aninteresting formulationin terms ofonstrained mat hings in general graphs: Given a graph
G
together with alinearordering
π
of itsverti es,the2
-IP-Dis-{⊏, ≬}
problemis equivalenttoany two distin t edges
{u, v}
and{u
′
, v
′
}
of
M
, neithermax
{π(u), π(v)} <
min
{π(u
′
), π(v
′
)
}
nor
max
{π(u
′
), π(v
′
)
} < min{π(u), π(v)}
o ur.
Roughly speaking, our algorithmis athree-step pro edure. First,the interval
graphof allthe overingintervalsof the
2
-intervalsinD
is onstru ted. Next, allthemaximal liquesofthatgrapharee iently omputed.Finally,forea hmaximal liquewe onstru tanewgraphandndasolutionusingamaximum
ardinalitymat hing algorithm.The size of abest solution found inthe third
step is thus returned. Clearly, the e ien y of our algorithm relies upon an
e ientalgorithmforndingallthemaximal liquesintheinterse tionofthe
overing intervals.We now pro eedwith the detailsof our algorithm.
Let
D = {D
i
: 1
≤ i ≤ n}
beasetof2
-intervals.ConsiderthesetC
D
omposedofallthe overingintervalsofthe
2
-intervalsinD
,i.e.,C
D
=
{Cover(D) : D ∈
D}
. Now, letΩ(
C
D
)
be the interval graph asso iated withC
D
. The graphΩ(
C
D
)
has a vertexv
i
for ea h intervalCover(D
i
)
inC
D
and two verti esv
i
andv
j
ofΩ(
C
D
)
arejoined byanedgeifthetwoasso iatedintervalsCover
(D
i
)
andCover(D
j
)
interse t. An illustration ofC
D
andΩ(
C
D
)
for a given set of2
-intervalsD
is given in Figure 1. Most in the interest in the interval graphΩ(
C
D
)
stems from the followinglemma.Fig. 1. Illustration of
C
D
andΩ(
C
D
)
for a given set of2
-intervalsD
on a disjoint groundset.Lemma 3 Let
D
be a setof2
-intervalsandD
′
be a
{⊏, ≬}
- omparablesubset ofD
. Then,{v
i
: D
i
∈ D
′
}
indu es a omplete graph in
Ω(
C
D
)
.PROOF. Let
D
i
andD
j
be two distin t2
-intervals ofD
′
. Sin e
D
i
andD
j
are{⊏, ≬}
- omparable then it follows that either intervalsCover(D
i
)
andCover(D
j
)
overlaporoneintervalis ompletely ontainedintheother.Inboth ases, intervalsCover
(D
i
)
andCover
(D
j
)
interse t, and hen e verti esv
i
andv
j
arejoinedbyanedgeinΩ(
C
D
)
.Therefore{v
i
: D
i
∈ D
′
}
indu esa omplete
graph in
Ω(
C
D
)
.2
Observethat the onverse isfalsesin e theinterse tionoftwo
2
-intervalsinD
resultsinanedgeinΩ(
C
D
)
,andhen etwo2
-intervalsasso iatedtotwodistin t3 we now only need to fo us on maximal liques of
Ω(
C
D
)
. Several problemsthatare NP- ompleteongeneralgraphshavepolynomial-timealgorithmsfor
interval graphs. The problemof nding allthe maximal liques of a graph is
onesu hexample.Indeed,anintervalgraph
G
= (V, E)
isa hordalgraphandas su h has at most
|V |
maximal liques [14℄. Furthermore, all the maximalliques ofa hordalgraph an befound in
O(n + m)
time,wheren
=
|V |
andm
=
|E|
, by a modi ation of Maximum Cardinality Sear h (MCS) [25,4℄.Let
C
be a maximal lique ofΩ(
C
D
)
. As observed above, any two2
-intervals asso iatedtotwodistin tverti esinthe maximal liqueC
maynotbe{⊏, ≬}
- omparable. Let
D
′
⊆ D
be the set of all
2
-intervals asso iated to verti es in themaximal liqueC
.BasedonC
, onsiderthe graphG
C
= (V
C
, E
C
)
dened byV
C
= GS(
D
′
)
and
E
C
=
{{I, J} : D = (I, J) ∈ D
′
}
. Inother words,the set
of verti es of
G
C
is the groundset ofD
′
and the edges of
G
C
isthe2
-intervalsubset
D
′
itselfviewed asasetof subsetsofsize
2
.Notethatthe onstru tion ofG
C
ispossibleonlybe auseD
′
hasdisjointgroundset.Thefollowinglemma
is animmediate onsequen e of the denition of
G
C
and Lemma3.Lemma 4 Let
C
be a lique inΩ(
C
D
)
andG
C
= (V
C
, E
C
)
be the graph on-stru tedasdetailedabove.Then,{(I
i
1
, J
i
1
), (I
i
2
, J
i
2
), . . . , (I
i
k
, J
i
k
)
}
isa{⊏, ≬}
- omparablesubsetifandonlyif{{I
i
1
, J
i
1
}, {I
i
2
, J
i
2
}, . . . , {I
i
k
, J
i
k
}}
isa mat h-ing inG
C
.Proposition 5 The
2
-IP-Dis-{⊏, ≬}
problem is solvable inO(n
2
√
n)
time,
where
n
is the number of2
-intervals inD
.PROOF. Considerthe algorithmgiven inFigure2.Corre tnessof this
algo-rithm follows from Lemmas 3 and 4. What is left is to prove the time
om-plexity. Clearly, the interval graph
Ω(
C
D
)
an be onstru ted inO(n
2
)
time.
Allthe maximal liques of
Ω(
C
D
)
an be found inO(n + m)
time,wherem
isthe number of edges in
Ω(
C
D
)
[25,4℄. Summingup, the rst two steps an bedone in
O(n
2
)
timesin e
m < n
2
.Wenow turnto the time omplexityof the
loop(infa tthedominanttermofouranalysis).Forea hmaximal lique
C
ofΩ(
C
D
)
,the graphG
C
an be onstru tedinO
(n)
time sin e|C| ≤ n
.Wenowonsider the omputationof a maximal mat hing in
G
C
. Mi aliand Vazirani[24℄ (see also[27℄) gave an
O(
q
|V ||E|)
time algorithmfor nding a maximalmat hing in a graph
G
= (V, E)
. ButG
C
has at mostn
edges (as ea h edgeorresponds to a
2
-interval) and hen e has at most2n
verti es. Then itfol-lowsthatamaximummat hing
M
inG
C
anbefound inO(n
√
n)
time.Sin eΩ(
C
D
)
is an interval graph withn
verti es, it has at mostn
maximal liques[14℄, we on lude that the algorithmas awhole runs in
O(n
2
√
n)
Max
{⊏, ≬}
-Comparable2
-Interval PatternInput: A set of
2
-intervalsD
with disjoint groundsetOutput: The size of a maximum ardinality
{⊏, ≬}
- omparable subset ofD
1. Constru t the interval graph
Ω(
C
D
)
2. Compute all maximal liques in
Ω(
C
D
)
3. For ea h maximal lique
C
inΩ(
C
D
)
3.1. Constru t the graph
G
C
3.2. Compute a maximalmat hing
M
inG
C
3.3. Store the ardinality of
M
inm(C)
4. Return
max
{m(C) : C
is amaximal lique ofΩ(
C
D
)
}
Fig.2.AlgorithmMax
{⊏, ≬}
-Comparable2
-Interval Pattern.5
2
-IP-Unit-{<, ≬}
is NP- ompleteTheorem6below ompletestheanalysis of
2
-IP-Unit-R
and2
-IP-Unl-R
forany model
R
⊆ {<, ⊏, ≬}
(see Table 1).Theorem 6 The
2
-IP-Unit-{<, ≬}
problem is NP- omplete.PROOF. First,we willpresent the two de ision problems we willdeal with
(Exa t 3-CNF-Sat and
2
-IP-Unit-{<, ≬}
). Then, we will give severalin-termediate lemmasthat willnallybeused inProposition 14 tovalidatethe
proof of the NP- ompleteness of the
2
-IP-Unit-{<, ≬}
problem.We provide a polynomial-timeredu tion from the Exa t 3-CNF-Sat
prob-lem: Given a set
V
n
ofn
variables and a setC
q
ofq
lauses (ea h omposedof three literals) over
V
n
, the problem asks to nd a truth assignment forV
n
that satises all lauses of
C
q
. It is well-known that the Exa t 3-CNF-Satproblem is NP- omplete [15℄. For the sake of larity, we now state formally
the
2
-IP-Unit-{<, ≬}
problem: Given a set of2
-intervalsD
, and a positive integerk
,the problemaskstondasubsetD
′
⊆ D
of ardinalitygreaterthan
orequal to
k
,su h thatD
′
is
{<, ≬}
- omparable.Clearly,
2
-IP-Unit-{<, ≬}
problemisinNP.Weshowthatgivenanyinstan eofExa t3-CNF-Satwith
q
lausesonaset ofn
variables,we an onstru tin polynomial-time an instan e of the
2
-IP-Unit-{<, ≬}
problem withk
=
(7n
− 2)q
su hthat there exists a satisfyingtruthassignmentfor the boolean formula i there exists a{<, ≬}
- omparablesubsetD
′
∈ D
of size at least
k
. We detail this onstru tion hereafter.Let
V
n
=
{x
1
, x
2
, ...x
n
}
be a set ofn
variables andC
q
=
{c
1
, c
2
, . . . , c
q
}
be a olle tionofq
lauses. Forthe sake of larity, letus deneD
on the integral linesu h that any interval of the groundset is of size four. Letus start withthe pre ise denition of the representation of a single lause
c
i
ofC
q
as illus-tratedinFigure4.The dottedre tangle ontheleft (resp.right)ispart oftherepresentation of lause
c
i
−1
(resp.c
i+1
). The pre ise adjustment of the rep-resentation of two onse utive lauses is illustrated in Figure 3 and formallydened afterwards. For onvenien e, wewillsplit the representation of
c
i
into seven groups(representedingray):A
i
,B
i
,C
i
L
,C
i
R
,D
i
,E
i
andF
i
.Ea hgroupinturnisdivided intoblo ks (represented inwhite).There are
11 + 2n
blo ks for ea h lause:n
blo ks forA
i
;3
blo ks forB
i
;1
blo k forC
i
L
;n
blo ks forC
i
R
;2
blo ks forD
i
;3
blo ks forE
i
;2
blo ks forF
i
.Fig.3.Jun tion between therepresentation of lauses
c
i
−1
andc
i
For example, in Figure 4 we use three boolean variables and hen e we have
seventeen blo ks. For the sake of larity, in the gures of this se tion, the
intervalsof the ground set might bedrawn ondierent levels.
We now turn to give a pre ise denition of ea h group in the representation
ofa given lause
c
i
.In thefollowing,wewillrefertoanintervalofthe ground set as a simple interval. LetF P
(c
i
)
denote the smallest starting position of any simpleintervalof the representation of lausec
i
.Weset, for onvenien e,F P
(c
1
) = 0
. For any1 < i
≤ q
, we haveF P
(c
i
) = F P (c
i
−1
) + 104n
−
21
. Moreover, letF P
(α)
denote the smallest starting position of any simpleintervalof group
α
∈ {C
i
L
, A
i
, B
i
, C
R
i
, D
i
, E
i
, F
i
|1 ≤ i ≤ q}
.Group
C
i
L
is omposed of one blo k ontaining2n
simple intervals (asillus-tratedinFigure5):
{[F P (C
i
L
) + 7k, F P (C
L
i
) + 7k + 4]
|0 ≤ k ≤ 2n − 1}
,whereF P
(C
i
L
) = F P (c
i
)
.The2n
simpleintervalsoftheblo kofgroupC
i
L
representinthe left torightorder(
x
1
, x
1
, x
2
, x
2
. . . x
n
, x
n
).Bydenition, the simple in-terval representingx
m
inC
i
L
is dened by[F P (C
i
L
) + 14(m
− 1), F P (C
L
i
) +
14(m
− 1) + 4]
. And onsequently, the simple interval representingx
m
inC
i
L
is dened by
[F P (C
i
squares)ofgroup
C
i
L
.Group
D
i
is omposedoftwoblo ks(
D
i
1
andD
i
2
),ea h ontaining2n
−1
simple intervals (as illustrated in Figure 6):{[F P (D
i
) + 5k, F P (D
i
) + 5k + 4]
|0 ≤
k
≤ 4n − 3}
whereF P
(D
i
) = F P (c
i
) + 34n
− 10
. By onstru tion,blo kD
i
1
is omposed of the following simpleintervals:
{[F P (D
i
) + 5k, F P (D
i
) + 5k +
4]
|0 ≤ k ≤ 2n −2}
andblo kD
i
2
is omposedofthefollowingsimpleintervals:{[F P (D
i
) + 5k, F P (D
i
) + 5k + 4]
|2n − 1 ≤ k ≤ 4n − 3}
.
Fig. 6.Des riptionof thesimple intervals ofgroup
D
i
.
Group
A
i
is omposed of
n
blo ks (one blo k for ea h boolean variable),ea h ontaining foursimple intervals(as illustrated inFigure7):
{[F P (A
i
) +
7k, F P (A
i
) + 7k + 4], [F P (A
i
) + 2 + 14l, F P (A
i
) + 6 + 14l], [F P (A
i
) + 5 +
14l, F P (A
i
) + 9 + 14l]
|0 ≤ k ≤ 2n − 1, 0 ≤ l ≤ n − 1}
where
F P
(A
i
) =
F P
(c
i
) + 54n
− 20
. The4n
simple intervals of groupA
i
represent in the left
to right order (
x
1
, x
1
, x
1
, x
1
, x
2
, x
2
, x
2
, x
2
, . . . x
n
, x
n
, x
n
, x
n
). By onstru tion,in any blo k of group
A
i
the se ond (resp. third) simple interval overlaps
both the rst and the third(resp. the se ond and the fourth) simpleinterval.
By denition, the two simple intervals representing
x
m
inA
i
are dened by[F P (A
i
) + 14(m
−1) + 7, F P (A
i
) + 14(m
−1) + 11]
and[F P (A
i
) + 14(m
−1) +
2, F P (A
i
) + 14(m
− 1) + 6]
.And onsequently, the two simpleintervals
repre-senting
x
m
inA
i
are dened by[F P (A
i
) + 14(m
− 1), F P (A
i
) + 14(m
− 1) + 4]
and[F P (A
i
) + 14(m
− 1) + 5, F P (A
i
) + 14(m
− 1) + 9]
.Fig. 7.Des riptionof thesimple intervals ofgroup
A
i
.
Group
B
i
is omposed of three blo ks (one for ea h literal in a lause),
ea h ontaining
2n
simple intervals (as illustrated in Figure 8):{[F P (B
i
1
) +
6k, F P (B
i
1
)+6k +4], [F P (B
2
i
)+6k, F P (B
2
i
)+6k +4], [F P (B
3
i
)+6k, F P (B
3
i
)+
6k + 4]
|0 ≤ k ≤ 2n − 1}
whereF P
(B
i
1
) = F P (c
i
) + 68n
− 20
,F P
(B
i
2
) =
F P
(c
i
)+80n
−20
,F P
(B
i
3
) = F P (c
i
)+92n
−20
.The2n
simpleintervalsofea hblo k of group
B
i
represent inthe left toright order (
x
1
, x
1
, x
2
, x
2
. . . x
n
, x
n
). By denition, the simple intervalrepresentingx
m
inB
i
j
, withj
∈ {1, 2, 3}
, isdened by
[F P (B
i
j
) + 12(m
− 1), F P (B
j
i
) + 12(m
− 1) + 4]
.And onsequently,the simple interval representing
x
m
inB
i
j
, withj
∈ {1, 2, 3}
, is dened by[F P (B
i
j
) + 12(m
− 1) + 6, F P (B
j
i
) + 12(m
− 1) + 10]
.Fig.8.Des riptionofthesimpleintervalsofgroup
B
i
.Duetospa e onsiderations,
the des ription is divided in three lines. Ea h line starts with the end part of the
previous lineinorder to indi atethe onguration ofthewholedes ription.
Group
E
i
is omposedof threeblo ks, ea h ontaining
2n
− 1
simpleintervals (as illustrated in Figure 9):{[F P (E
i
1
) + 6k, F P (E
1
i
) + 6k + 4], [F P (E
2
i
) +
6k, F P (E
i
2
) + 6k + 4], [F P (E
3
i
) + 6k, F P (E
3
i
) + 6k + 4]
|0 ≤ k ≤ 2n − 2}
whereF P
(E
i
1
) = F P (c
i
) + 68n
− 17
,F P
(E
i
2
) = F P (c
i
) + 80n
− 17
,F P
(E
i
3
) =
F P
(c
i
)+92n
−17
.Therefore,ea hsimpleintervalofblo kE
i
j
interse tsexa tly two simple intervalsof blo kB
i
j
, for1
≤ j ≤ 3
.Group
C
i
R
is omposed ofn
blo ks (one blo k for ea h boolean variable),ea h ontainingtwosimpleintervals(asillustratedinFigure10):
{[F P (C
i
R
) +
14k, F P (C
i
R
) + 14k + 4], [F P (C
R
i
) + 14k + 3, F P (C
R
i
) + 14k + 7]
|0 ≤ k ≤
n
− 1}
whereF P
(C
i
R
) = F P (c
i
) + 104n
− 19
.The2n
simpleintervalsofgroupC
i
R
represent in the left to right order (x
1
, x
1
, x
2
, x
2
. . . x
n
, x
n
). By denition, the simple interval representingx
m
inC
i
R
is dened by[F P (C
i
R
) + 14(m
−
1), F P (C
i
R
)+14(m
−1)+4]
.And onsequently,thesimpleintervalrepresentingx
m
inC
i
R
isdened by[F P (C
i
R
) + 14(m
− 1) + 3, F P (C
R
i
) + 14(m
− 1) + 7]
.Therefore, by onstru tion,inany blo kof group
C
i
R
the two simpleintervalsomposing this blo k are overlapping.
Finally, group
F
i
is omposed of two blo ks, ea h ontaining
2n
− 1
simpleintervals (as illustrated in Figure 11):
{[F P (F
i
) + 5k, F P (F
i
) + 5k + 4]
|0 ≤
k
≤ 4n−3}
whereF P
(F
i
) = F P (c
i
) + 118n
−21
.By onstru tion,blo kF
i
1
isFig.9.Des riptionof the simpleintervalsof group
E
i
.AsinFigure8,due to spa e
onsiderations,the des ription isdividedinthree lines.
Fig.10.Des ription of the simpleintervals ofgroup
C
i
R
.omposedofthefollowingsimpleintervals:
{[F P (F
i
)+5k, F P (F
i
)+5k+4]
|0 ≤
k
≤ 2n − 2}
and blo kF
i
2
is omposed of the following simple intervals:{[F P (F
i
) + 5k, F P (F
i
) + 5k + 4]
|2n − 1 ≤ k ≤ 4n − 3}
.
Fig.11.Des ription of thesimpleintervalsof group
F
i
.
Theset ofsimple intervalsof theinstan e of
2
-IP-Unit-{<, ≬}
isobtainedby assembling together in order the representation of the lausesc
1
toc
q
. It is easy to he k the following properties (whi hare represented inFigure12):•
for any1 < i
≤ q
, the smallest position of any simple intervalof groupC
i
L
is greater than the biggest position of any simple interval of groups
E
i
−1
and
B
i
−1
;
•
forany1 < i
≤ q
,the smallestpositionofanysimpleintervalofgroupF
i
−1
isgreater than the biggestpositionof any simple interval of group
C
i
L
;•
forany1 < i
≤ q
, the biggest position of any simpleinterval of groupF
i
−1
isless thanthe smallestposition of any simple intervalof group
D
i
•
for any1
≤ i ≤ q
, the smallest position of any simple interval of groupA
i
isgreater than the biggestpositionof any simple interval of group
D
i
;
•
forany1
≤ i ≤ q
, the biggest positionof any simple intervalof groupA
i
is
less than the smallest positionof any simpleinterval of groups
B
i
and
E
i
;
•
for any1
≤ i ≤ q
, the smallestpositionof any simple intervalof groupC
i
R
isgreaterthan the biggest positionof any simple intervalof groups
B
i
and
E
i
;
•
forany1
≤ i ≤ q
,the biggestpositionof anysimple intervalofgroupC
i
R
is less than the smallest positionof any simpleinterval of groupF
i
.
Now that we have dened the ground set of
D
, let us dene formally the2-intervalsof
D
(partially illustratedin Figure4).Forea h lause
c
i
,D
is omposed of2n
2-intervalsbuiltwithasimpleintervalof group
C
i
L
and asimple intervalof groupA
i
:• {([F P (C
i
L
) + r, F P (C
L
i
) + r + 4], [F P (A
i
) + s, F P (A
i
) + s + 4]),
• ([F P (C
i
L
) + s, F P (C
L
i
) + s + 4], [F P (A
i
) + r, F P (A
i
) + r + 4])
}
withr
= 14(k
− 1), s = r + 7, 1 ≤ k ≤ n
For ea h lause
c
i
,D
is omposed of4n
− 2
2-intervals built with a simple intervalof groupD
i
and a simpleinterval of group
E
i
:• {([F P (D
i
) + 5k, F P (D
i
) + 5k + 4], [F P (E
i
1
) + 6k
′′
, F P
(E
1
i
) + 6k
′′
+ 4]),
• ([F P (D
i
) + 5k
′
, F P
(D
i
) + 5k
′
+ 4], [F P (E
i
2
) + 6k
′′
, F P
(E
2
i
) + 6k
′′
+ 4])
}
with0
≤ k ≤ 2n − 2, 2n − 1 ≤ k
′
≤ 4n − 3, 0 ≤ k
′′
≤ 2n − 2
.Forea h lause
c
i
,D
is omposed of6n
2-intervalsbuiltwithasimpleintervalof group
B
i
and a simpleinterval of group
C
i
R
:• {([F P (B
i
j
) + r, F P (B
j
i
) + r + 4], [F P (C
R
i
) + s, F P (C
R
i
) + s + 4]),
• ([F P (B
i
j
) + r + 6, F P (B
i
j
) + r + 10], [F P (C
R
i
) + s + 3, F P (C
R
i
) + s + 7])
}
withr
= 12(k
− 1), s = 14(k − 1), j ∈ {1, 2, 3}, 1 ≤ k ≤ n
.For ea h lause
c
i
,D
is omposed of4n
− 2
2-intervals built with a simple intervalof groupE
i
and a simpleinterval of group
F
i
:• {([F P (E
i
2
) + 6k
′
, F P
(E
2
i
) + 6k
′
+ 4], [F P (F
i
) + 5k, F P (F
i
) + 5k + 4]),
• ([F P (E
i
3
) + 6k
′
, F P
(E
3
i
) + 6k
′
+ 4], [F P (F
i
) + 5k
′′
, F P
(F
i
) + 5k
′′
+ 4])
}
with2n
− 2 ≤ k ≤ 4n − 3, 0 ≤ k
′
≤ 2n − 2, 4n − 2 ≤ k
′′
≤ 6n − 4}
.Forea h lause
c
i
,D
is omposed of6n
2-intervalsbuiltwithasimpleintervalof group
A
i
and asimple interval ofgroup
B
i
:• {([F P (A
i
) + r + 2, F P (A
i
) + r + 6], [F P (B
i
j
) + s, F P (B
j
i
) + s + 4]),
• ([F P (A
i
) + r + 5, F P (A
i
) + r + 9], [F P (B
i
j
) + s + 6, F P (B
j
i
) + s + 10])
}
with
r
= 14(k
− 1), s = 12(k − 1), j ∈ {1, 2, 3}, 1 ≤ k ≤ n
.For ea h lause
c
i
, in order to represent the lausec
i
, we delete fromD
the 2-interval([F P (A
i
) + r + 2, F P (A
i
) + r + 6], [F P (B
i
j
) + s, F P (B
j
i
) + s + 4])
with
r
= 14(m
− 1)
,s
= 12(m
− 1)
ifx
m
is thevalueof thej
th
literal of
c
i
. In a similar way, ifx
m
is the value of thej
th
literal of
c
i
, we delete fromD
the 2-interval([F P (A
i
) + r + 5, F P (A
i
) + r + 9], [F P (B
i
j
) + s + 6, F P (B
j
i
) + s + 10])
with
r
= 14(m
− 1)
,s
= 12(m
− 1)
.Clearly,this onstru tion an be arriedout inpolynomial-time.Wenowgive
anintuitivedes ription of the dierent elements of the set of
2
-intervalsthatwe have built. Blo k
B
i
1
(resp.B
i
2
andB
i
3
) represents the value of the rst (resp. se ond and third) literal, sayx
m
(orx
m
), of the lausec
i
; for this, the2-interval between the simple interval of the
m
th
blo k of groupA
i
and the simpleinterval ofB
i
1
(resp.B
i
2
andB
i
3
) orrespondingtox
m
(orx
m
) isnot inD
(stillthesimpleintervalsare inGS(
D)
).Forinstan e,inFigure13,the fa t that there is no2-intervalbetween the simple interval orrespondingtox
1
inB
i
1
andasimpleintervalof groupA
i
indi atesthat the rstliteralof lause
c
i
isx
1
.Similarly,the fa tthat thereisno2-intervalbetween thesimpleinterval orrespondingtox
2
(resp.x
3
)inB
i
2
(resp.B
i
3
)and asimple intervalof groupA
i
indi atesthat the se ond (resp. third) literalof lause
c
i
isx
2
(resp.x
3
).Fig. 13.Zoomon group
B
i
oftherepresentation ofa lause
c
i
= (x
1
∨ x
2
∨ x
3
)
The sequen e of blo ks
(C
i
−1
R
,C
i
L
,A
i
,B
i
,C
i
R
)
orresponds to a me hanismwhi h propagates the value of ea h variableof
V
n
. Blo ks(D
i
,E
i
,F
i
)
orre-spond to a literal sele ting me hanism that indi ates, for ea h lause
c
i
, the literal (i.e., the rst, se ond or third) whi h satisesc
i
. Noti e that the two previous intuitivenotions willbe detailedand laried afterwards.We start the proof by giving some properties (Lemmas 8 to 13) about the
maximal ardinality of a set of
{<, ≬}
- omparable2
-intervals inD
in ouronstru tion. Then, these results will be used in Proposition 14to prove the
validity of the redu tion. In the rest of this paper, we will use the following
notations:
where
I
isa simpleintervalbelongingtoblo kX
andJ
is asimpleinterval belongingtoblo kY
;•
for any1
≤ i ≤ q
and any set of groupsα
⊆ {C
i
L
, A
i
, B
i
, C
R
i
, D
i
, E
i
, F
i
}
,D(α)
denotesasetof{<, ≬}
- omparable2
-intervalsbetweenblo ksofgroupsbelongingto
α
(forexample,D(D
i
, E
i
, F
i
)
denotesasetof
{<, ≬}
- omparable2
-intervalsbetween blo ksD
i
1
, D
2
i
, E
1
i
, E
2
i
, E
3
i
, F
1
i
andF
i
2
);•
for any1
≤ i ≤ q
,D(c
i
)
denotes a set of{<, ≬}
- omparable2
-intervals in the representation of lausec
i
.Lemma 7 For any set of groups
α
andβ
,|D(α)| + |D(β)| ≥ |D(α
S
β)
|
.PROOF. The union of the sets
α
andβ
ould result in one of the followingases:
(a)
D(α)
andD(β)
have atleast a2
-interval in ommon;(b) atleast a
2
-interval ofD(α)
and a2
-interval ofD(β)
are not disjoint; ( ) at least a2
-interval ofD(α)
and a2
-interval ofD(β)
are not{<, ≬}
- omparable.
In ase (a) it is lear that the dupli ated
2
-interval willnot be ounted morethan on e in
|D(α
S
β)
|
. In ase (b), only one of the two2
-intervals whi hare not disjoint an be in
D(α
S
β)
. In ase ( ), only one of the two2
-intervals whi h are not
{<, ≬}
- omparable an be inD(α
S
β)
. If none ofthose three ases o ur then,
D(α)
S
D(β)
is{<, ≬}
- omparable, and thus,|D(α)| + |D(β)| = |D(α
S
β)
|
.Therefore,|D(α)| + |D(β)| ≥ |D(α
S
β)
|
.2
By onstru tion, a
2
-interval an only exist between two blo ks thatorre-spond to a single lause ( f. Figure 4). Thus, the maximum ardinality of
a set of
{<, ≬}
- omparable2
-intervals ofD
(i.e., the full representation ofthe booleanformula) an bededu ed from the maximum ardinalityof
D(c
i
)
where
c
i
is a lause ofC
q
, for any1
≤ i ≤ q
. Pre isely, the maximum ardi-nalityofaset of{<, ≬}
- omparable2
-intervalsinthe representationofallthe lauses isless than orequaltoq
· max
i
∈[1,q]
|D(c
i
)
|
.Werst omputethemaximum ardinalityofaset
D(c
i
)
of{<, ≬}
- omparable2
-intervalsbetween blo ks orresponding toa single lausec
i
.Lemma 8
|D(α)| ≤ 3n
forα
=
{C
i
L
, A
i
, B
i
, C
R
i
}
.PROOF. Bythedisjun tion onstraint,atmostonesimpleintervalperblo k
of
A
i
anbeinvolvedina
2
-intervalbetweenblo ksofA
i
andB
i
.Astherearen
blo ksinA
i
,wehave|D(A
i
, B
i
)
| ≤ n
.Similarly,bythedisjun tion onstraint,
at most one simple interval per blo k of
C
i
between blo ks of
B
i
and
C
i
R
. As there aren
blo ks inC
i
R
,|D(B
i
, C
i
R
)
| ≤ n
.Thus, a ording to Lemma 7,
|D(A
i
, B
i
, C
i
R
)
| ≤ |D(A
i
, B
i
)
| + |D(B
i
, C
R
i
)
| ≤
2n
.Moreover, at most one simple interval per blo k of
A
i
an be involved in a
2
-interval between blo ks ofA
i
and
C
i
L
sin e the two2
-intervals between agiven blo k of
A
i
and
C
i
L
are{⊏}
- omparable. As there aren
blo ks inA
i
,|D(C
i
L
, A
i
)
| ≤ n
. Thus, by Lemma 7,|D(C
i
L
, A
i
, B
i
, C
R
i
)
| ≤ |D(A
i
, B
i
, C
R
i
)
| +
|D(C
i
L
, A
i
)
| ≤ 3n
.2
Inthe following,
θ(i, j)
willdenotetheset ofallthesimpleintervalsinB
i
j
andE
i
j
,with1
≤ j ≤ 3
.Thesetδ(i, j)
⊆ θ(i, j)
willdenoteaset ofdisjointsimple intervalsandk(E, i, j)
(resp.k(B, i, j)
)willbethe numberof simpleintervals ofblo kE
i
j
(resp.B
i
j
)whi hareinδ(i, j)
.By onstru tion,ea hsimpleinterval inblo kE
i
j
interse ts twosimple intervalsof blo kB
i
j
( f.Figure14and page 14).Observation 1 (a) If
k(E, i, j) > 0
thenatleastk(E, i, j)+1
simpleintervals of blo kB
i
j
annot belong toδ(i, j)
. Thus,k(B, i, j)
≤ 2n − (k(E, i, j) + 1)
. Hen e,|δ(i, j)| ≤ k(B, i, j) + k(E, i, j) ≤ 2n − (k(E, i, j) + 1) + k(E, i, j) ≤
2n
− 1
.(b)If
k(E, i, j) = 0
thenallthesimpleintervals(i.e.,2n
)ofblo kB
i
j
anbelong toδ(i, j)
. Thus,k(B, i, j)
≤ 2n
. Hen e,|δ(i, j)| ≤ k(B, i, j) + k(E, i, j) ≤ 2n
.Fig. 14. If two simple intervals of blo k
E
i
j
are part ofδ(i, j)
then at least three simple intervals ofblo kB
i
j
annotbelong toδ(i, j)
,and thus|δ(i, j)| ≤ 2n − 1
.Lemma 9 If
|D(D
i
, E
i
, F
i
)
| > 4n − 2
then
|D(c
i
)
| < 7n − 2
.PROOF. Assumethat
|D(D
i
, E
i
, F
i
)
| = 4n−2+γ
with
γ >
0
.Asea hblo kof group
E
i
(i.e.,
E
i
1
, E
2
i
, E
3
i
) is omposed of2n
− 1
simple intervals,there is atleast one simple intervalin ea h blo k of groupE
i
involved in a
2
-intervalof
D(D
i
, E
i
, F
i
)
.
Thus, onsidering only the simple intervalsin groups
B
i
and
E
i
, there are at
most
6n
− 3
(i.e.,3
· (2n − 1)
by Observation 1 (a)) disjoint simpleintervals.By onstru tion, any
2
-interval ofD(A
i
, B
i
, C
i
R
, D
i
, E
i
, F
i
)
is omposed of asimple interval of either group
B
i
or
E
i
. Thus, as there are at most
6n
− 3
disjoint simple intervals in groups
B
i
and
E
i
, there are at most
6n
− 3 2
-intervalsin
D(A
i
, B
i
, C
i
R
, D
i
, E
i
, F
i
)
. As|D(C
i
L
, A
i
)
| ≤ n
( f.proof of Lemma8), by Lemma 7, we an on lude that