• Aucun résultat trouvé

7.3 Alignment combination operators

7.3.1 Alignment composition

For running some alignment methods or processes on heterogeneous re-sources, there is sometimes a need for a starting set of correspondences as a parameter to build more sophisticated alignments [Shvaiko & Euzenat 2013].

The starting set of correspondences is whether created manually or built by composing other alignments. The composition of alignment resources is an operator that creates a new alignment from two existing alignments sharing a common resource. In order to define an operator for composing alignment resources, at first, we need to define the composition at the correspondence level and, then, at the alignment level.

7.3.1.1 Composing correspondences

Given two correspondences c1 = hex, ey, R1i and c2 = het, ez, R2i where, R1 ={(r1, w1), . . . ,(rn, wn)} and R2 ={(s1, b1), . . . ,(sm, bm)} and ey =et, the composition of c1 and c2 is a correspondence:

ccomp=Compζ(c1, c2) =hex, ez, Rcompi where

Rcomp ={(ri∗sj, wiζwj)|i= 1,−n; j = 1,−m}

∗ is the composition operator for the considered relation algebra (e.g. Table 7.1), and ζ is an associative operator for combining the confidence levels (in our case max−min).

Other than the alignment relations defined by the A5 algebra, our ap-proach considers semantic relations used in a large alignment repositories

(such as Bioportal [Noyet al. 2008]). Thus, we created a similar table for composing semantic relations (such as “exactMatch”, “narrowMatch”, etc.).

We used the SKOS entailment rules for defining this table. In the case of composing heterogeneous alignment relations (logical with semantic), we created transformation tables from one type of relations to the other.

◦ @ A ≡ ⊥ G

@ @ > @ ⊥ @,⊥,G A @,A,≡,G A A A,⊥,G A,G

≡ @ A ≡ ⊥ G

⊥ @,⊥,G ⊥ ⊥ > @,⊥,G G @,G A,⊥,G G A,⊥,G >

Table 7.1: Composition table for logical relations as defined by [Euzenat 2008]

Condition 2 Let Γ be the set of individual alignment relations of a certain category (logical, semantic or other). Let Θbe the power set of Γ (Θ = 2Γ).

The set of alignment relations Θshould have closure under the composi-tion operator (∗): ∗: Θ×Θ→Θ

There are many fuzzy composition operators defined in the litera-ture [Portilla et al.2000]. We define a composition operator that uses the fuzzy relation composition and implements different methods for com-bining confidence measure such as the max−min and max−product [Loetamonphong & Fang 2001] compositions.

In order to calculate the new confidence that is attributed to each com-posed alignment relation, we use the fuzzy relation interpretation (see sec-tion 7.2.2.1) and the “max−min” composition of fuzzy sets [Zadeh 1971, Abbasbandyet al. 2006]. By definition this composition is associative, con-sequently the alignment composition is associative.

The confidence measure attributed to the relationri∗sj that we denoted aswiζwj is calculated as follows:

wiζwjrAi∗sj

1◦A2(ex, ez) = max

ey

(min(µrAi

1(ex, ey), µsAj

2(ey, ez))) where µrAi

1(ex, ey) is the membership function of the fuzzy relation be-tween ex and ey for the specific alignment relation ri and µRA

1◦A2 is the membership function that calculates the confidence measure of the resulting fuzzy relationR.

For example, the composition of

hCity,Town,{({=},0.2),({A},0.6)}i and

hTown,Ciudad,{({=},0.3),({@},0.6)}i is

hCity,Ciudad,{({=}∗{=},0.2),({=}∗{@},0.2),({A}∗{=},0.3),({A}∗{@},0.6)}i

=hCity,Ciudad,{({=},0.2),({@},0.2),({A},0.3),({A,@,=,G},0.6)}i

Ai  

Composi)on  

NormalizeBel   Aj  

Ak  

Ak  

Compute  Mass  and   Plausibility   Aggregate  

Correspondences  

Ak   Ak  

Aggrega)on  method  

Figure 7.3: Composition of two alignments

Normalizing resulting confidence measures for fuzzy relations

ex

ey et

es

ez

(⊂, α1) (≡, β2)

(≡, β1) (≡, γ2)

(⊂, γ1) (≡, α2)

Figure 7.4: Multiple paths for alignment composition

Given the fact that there might be multiple alignment relations per cor-respondence (the alignment representation model does not exclude conjunc-tions within alignment relaconjunc-tions). Multiple alignment correspondences might lead to the same composition (multiple common entities) as described in fig-ure 7.4, then the composition of each couple of alignment relations might

give as a result the same relation (cf. composition table). For instance, the compositions ⊂ ∗ ≡, ⊂ ∗ ⊂ and ≡ ∗ ⊂ give the same composed relations, which is⊂. This leads to calculating alignment confidences for each relation.

In this case we apply a simple aggregation to group all the results of calculating the confidence measures using the fuzzy “max” aggregator. Thus, if multiple paths of composition or multiple relations composition have led to the same composed relationr ∈Θ, then we group the results byr and we aggregate the membership functions using the “max” aggregator.

In order to aggregate the membership functions that have been computed for each relation between the two entities ei and ej, we use the fuzzy set interpretation (see section 7.2.2.2). This means that each relationrkofRcomp that denotes the set of true relations between both entities has a membership function µAcomp(rk)

Let c1 = hex, ey, R1i and c2 = hey, ez, R2i be two alignment corre-spondences, the composition of c1 and c2 is a correspondence ccomp = hex, ez, Rcompi where Rcomp{(rk, µAcomp(rk)), rk = (ri ∗sj)|ri ∈ R1, sj ∈ R2, i= 1,−n, j = 1,−m}.

Two normalizations are applied:

1. if Rcomp contains couples of fuzzy relations (rk, µAcomp(rk)), (rs, µAcomp(rs)) where rk = rs = r, then we group these relations in a subsetR and we apply amax triangular co-norm to aggregate their membership value and remove all duplicates inRcomp.

N ormmax(Rcomp) ={(r,max

ri∈RAcomp(ri))}

2. if Rcomp contains couples of fuzzy relations (rk, µAcomp(rk)), (rs, µAcomp(rs)) where rk ⊆ rs and µAcomp(rk) ≥ µAcomp(rs), then rs

is removed from the list of relations since the first one entails it.

7.3.1.2 Composing Alignments

LetA1 and A2 be two alignments, we define:

• Paths(ex, ez, A1, A2) ={(c1, c2)|∃ey∃R∃S : c1 =hex, R, eyi ∈A1 and c2 = hey, S, ezi ∈ A2} as the set of correspondences associating e1 to e3;

• Corrζ(ex, ez) = {Compζ(ci, cj)|(ci, cj) ∈ Path(ex, ez)} as the set of conflicting correspondences resulting from composing each pair of cor-respondences fromP ath(ex, ez). ζ is the confidence combination func-tion (max−min).

• IfP ath(ex, ez) is empty thenCorr(ex, ez) =∅

Finally we define the composition of two alignmentsA1 andA2 aligning respectively R1 to R2,R2 to R3 as:

Compφζ(A1, A2) =Aggrφ({Corrζ(ex, ez)|ex∈R1, ez ∈R3})

WhereAggrφis an operator that normalizes conflicting correspondences as defined in the following section.

Alignment composition is a complex operation especially for ontologi-cal resources with regard to consistency issues. At the current level of our research we define a composition operator and then we apply an aggrega-tion funcaggrega-tion to avoid inconsistency (only for treating conflictual correspon-dences). For further consistency checking, we assume that an external tool can be used.