• Aucun résultat trouvé

Formal Concept Analysis from the Standpoint of Possibility Theory

N/A
N/A
Protected

Academic year: 2021

Partager "Formal Concept Analysis from the Standpoint of Possibility Theory"

Copied!
20
0
0

Texte intégral

(1)

O

pen

A

rchive

T

OULOUSE

A

rchive

O

uverte (

OATAO

)

OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible.

This is an author-deposited version published in : http://oatao.univ-toulouse.fr/

Eprints ID : 15496

The contribution was presented at :

http://www.matap.uma.es/icfca2015/files/a61b70b0124cd72ef5b1442c0223db4a-0.html

Official URL: http://dx.doi.org/10.1007/978-3-319-19545-2

To cite this version : Dubois, Didier and Prade, Henri Formal Concept Analysis from

the Standpoint of Possibility Theory. (2015) In: 13th International Conference Formal

Concept Analysis (ICFCA 2015), 23 June 2015 - 26 June 2015 (Malaga, Spain).

Any correspondance concerning this service should be sent to the repository administrator: staff-oatao@listes-diff.inp-toulouse.fr

(2)

Formal Concept Analysis from the Standpoint

of Possibility Theory

Didier Dubois and Henri Prade

IRIT, Universit´e Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse Cedex 09, France

{dubois,prade}@irit.fr

Abstract. Formal concept analysis (FCA) and possibility theory (PoTh) have been developed independently. They address different con-cerns in information processing: while FCA exploits relations linking objects and properties, and has applications in data mining and clus-tering, PoTh deals with the modeling of (graded) epistemic uncertainty. However, making a formal parallel between FCA and PoTh is fruitful. The four set-functions at work in PoTh have meaningful counterparts in FCA; this leads to consider operators neglected in FCA, and thus new fixed point equations. One of these pairs of equations, paralleling the one defining formal concepts in FCA, defines independent sub-contexts of objects and properties that have nothing in common. The similar-ity of the structures underlying FCA and PoTh is still more striking, using a cube of opposition (a device extending the traditional square of opposition in logic). Beyond the parallel between FCA and PoTh, this invited contribution, which largely relies on several past publications by the authors, also addresses issues pertaining to the possible meanings, degree of satisfaction vs. degree of certainty, of graded object-property links, which calls for distinct manners of handling the degrees. Other lines of interest for further research are briefly mentioned.

1

Introduction

Formal concept analysis (FCA) and possibility theory (PoTh) are two theoretical frameworks that are addressing different concerns in the processing of informa-tion. Namely FCA builds concepts from a relation linking objects to the prop-erties they satisfy, which has applications in data mining, clustering and related fields, while PoTh deals with the modeling of (graded) epistemic uncertainty. This difference of focus explains why the two settings have been developed com-pletely independently for a very long time. However, it is possible to build a formal analogy between FCA and PoTh. Both theories heavily rely on the com-parison of sets, in terms of containment or overlap. The four set-functions at work in PoTh actually determine all possible relative positions of two sets. Then the FCA operator defining the set of objects sharing a set of properties, which is at the basis of the definition of formal concepts, appears to be the counterpart of the set function expressing strong (or guaranteed) possibility in PoTh. Then, it

(3)

suggests that the three other set functions existing in PoTh should also make sense in FCA, which leads to consider their FCA counterparts and new fixed point equations in terms of the new operators. One of these pairs of equations, paralleling the one defining formal concepts, define independent sub-contexts of objects and properties that have nothing in common.

The parallel of FCA with PoTh can still be made more striking using a cube of opposition (a device extending the traditional square of opposition existing in logic, and exhibiting a structure at work in many theories aiming at representing some aspects of the handling of information).

In this survey we shall indicate various issues pertaining to FCA that could be worth studying in the future. For instance, the object-property links in for-mal contexts of FCA may be a matter of degree. These degrees may refer to very different notions, such as the degree of satisfaction of a gradual property, the degree of certainty that an object has, or not, a property, or still the typi-cality of an object with respect to a set of properties. These different intended semantics call for distinct manners of handling the degrees, as advocated in the presentation.

Lastly, other examples of lines of interest for further research, such as the extension of the parallel of FCA with PoTh to conceptual pattern structures, or the applications to the fusion of conflicting pieces of information, to the clustering of sets of objects on the basis of approximate concepts, or to the building of conceptual analogical proportions, are briefly mentioned.

2

Possibility Theory and Formal Concept Analysis

-A Parallel

Formal concept analysis [5,30,43] associates objects with the set of their prop-erties, through a formal context which is a binary relation R on the Cartesian product of the set of objects O and the set of properties P. Thus, knowing only that an object x has some property y, the set Rt(y) = {x ∈ O|(x, y) ∈ R} is the

set of the possible objects corresponding to the elementary piece of knowledge “the object has property y”(in the context R). This suggests a possibilistic read-ing of formal concept analysis and leads to considerread-ing the formal counterpart to possibility theory set-functions in this framework. After introducing some notations, we first provide a short refresher on possibility theory [18,21,47]. 2.1 Describing Objects

An object, or item, is denoted by x, or xi in case we consider several ones at

the same time. A subset of objects is denoted by a capital letter X, and we write X = {x1, . . . , xi, . . . , xm}. A set of objects associated with their respective

sets of properties defines a formal context R ⊆ O × P [30]. An object x is associated with its description, denoted ∂(x). In the following, we only consider simple descriptions, expressible in terms of a subset Y of properties yj, namely,

(4)

Besides, a useful kind of structured description is in terms of attributes. Let a, and A = {a1, . . . , ak, . . . , ar}, respectively denote an attribute, and a

set of attributes. The value of attribute a for x is denoted a(x) = u, where u belongs to the attribute domain Ua. In this case, we shall write ∂(x) =

(a1(x), . . . , ak(x), . . . , ar(x)) = (u1, . . . , uk, . . . , ur). This corresponds to a

com-pletely informed situation where all the considered attribute values are known for x. When it is not the case, the precise value ak(x) will be replaced by the

pos-sibility distribution πak(x). Such a possibility distribution [47] is a mapping from Uak to [0, 1], or more generally any linearly ordered scale. Then πak(x)(u) ∈ [0, 1] estimates to what extent it is possible that the value of ak for x is u. 0 means

impossibility; several distinct values may be fully possible (i.e. at degree 1). The characteristic function of an ordinary subset is a particular case of a possibil-ity distribution. Precise information corresponds to the characteristic function of singletons. An elementary property y can be viewed as a subset of a single attribute domain, i.e. y ⊆ U . Note that while a set of properties Y is conjunc-tive (in the sense that an object possesses all properties in Y ), each property y corresponds to a subset of some attribute domain U that is disjunctive [23]: it is a set of mutually exclusive values, since object x having property y possesses a single attribute value a(x) = u in U .

Taking inspiration from the existence of four set functions in possibility the-ory [20], new operators have been suggested in the setting of formal concept analysis [16]. These set functions are now recalled, emphasizing the symmetrical roles played by the object x and the attribute value u, a point of view unusual in possibility theory, but echoing the symmetrical role played by objects and prop-erties in formal concept analysis. See [20,21] for more complete introductions and surveys on possibility theory.

2.2 Possibility Theory

Let πa(x)(u) denote the possibility that object x has value u ∈ U (for attribute a).

For simplicity, we only consider the single-attribute case here. We assume that πa is bi-normalized: ∀x ∃u πa(x)(u) = 1 and ∀u ∃x πa(x)(u) = 1. This means that

for any object x, there is some fully possible value for attribute a, and that for any value u there is an object x that takes this value. Let X be a set of objects, and y ⊆ U be a property. Then, one can define

(i) the possibility measures [47], denoted by Π: Π(X) = max

x∈Xπa(x)(u) and Π(y) = maxu∈y πa(x)(u).

Π(X) estimates to what extent it is possible that there is an object in X having value u, while Π(y) is the possibility that object x has property y. Π is an indicator of non-empty intersection of the fuzzy set induced by the possibility distribution with an ordinary subset. They are measures of “weak, or potential possibility”. Clearly, Π is max-decomposable with respect to set union.

(5)

(ii) the dual measures of necessity N (or “strong or actual necessity”) [17]: N(X) = min

x∈X1 − πa(x)(u) and N (y) = minu∈y1 − πa(x)(u)

N(X) estimates to what extent it is certain (necessarily true) that an object has value u is in X, while N (y) is the certainty that object x has property y. Note that N (y) = 1 − Π(y) where y = U \ y. N may be viewed as a degree of inclusion of the fuzzy set induced by the possibility distribution into an ordinary subset. N is min-decomposable with respect to set intersection. (iii) the measures of “strong (or actual, or guaranteed) possibility” [19]

∆(X) = min

x∈Xπa(x)(u) and ∆(y) = minu∈y πa(x)(u)

∆(X) estimates to what extent it is possible that all objects in X have value u, while ∆(y) estimates the possibility that object x takes any value in y. ∆ may be viewed as a degree of inclusion of an ordinary subset into the fuzzy set induced by the possibility distribution. ∆ is min-decomposable with respect to set union.

(iv) the dual measures of “weak (or potential) necessity or certainty” [19] ∇(X) = 1 − min

x∈Xπa(x)(u) and ∇(y) = 1 − minu∈y πa(x)(u)

∇(X) estimates to what extent there exists at least one object outside X that has a low degree of possibility of having value u, while ∇(y) measures to what extent x has a low possibility value outside y. Note that ∇(y) = 1 − ∆(y). ∇ is an indicator of non-full coverage of the considered universe by the fuzzy set induced by the possibility distribution together with an ordinary subset. ∇ is max-decomposable with respect to set intersection. 2.3 Formal Context Setting

The classical setting of formal concept analysis defined from a formal context relies on a single operator that associates a subset of objects with the set of properties shared by them (and the dual operator). In [16], this framework has been enlarged with the introduction of three other operators. We now recall the four operators which are counterparts to the possibility theory set functions in the setting of a formal context.

Namely, let R be the formal context. Then R(x) = {y ∈ P|(x, y) ∈ R} is the set of properties of object x, and Rt(y) = {x ∈ O|(x, y) ∈ R} is the set of objects having properties y. Then, four remarkable sets can be associated with a subset X of objects (the notations have been chosen here in order to emphasize the parallel with possibility theory):

– the set RΠ(X) of properties that are possessed by at least one object in X:

RΠ(X) = {y ∈ P|Rt(y) ∩ X = ∅} = 

x∈X

R(x). Clearly, we have RΠ(X

(6)

– the set RN(X) of properties s. t. any object that satisfies one of them is necessarily in X:

RN(X) = {y ∈ P|Rt(y) ⊆ X} = 

x∈X

R(x).

In other words, having any property in RN(X) is a sufficient condition for

belonging to X. Moreover, we have RN(X) = RΠ(X) = P \ RΠ(X), and

RN(X1∩ X2) = RN(X1) ∩ RN(X2).

– the set R△(X) of properties shared by all objects in X:

R△(X) = {y ∈ P|Rt(y) ⊇ X} = 

x∈X

R(x).

In other words, satisfying all properties in R△(X) is a necessary condition for

an object for belonging to X. R△(X) is a partial conceptual characterization

of objects in X: objects in X have all the properties of R△(X) and may have some others (that are not shared by all objects in X). It is worth noticing that RΠ(X) provides a negative conceptual characterization of objects in X

since it gathers all the properties that are never satisfied by any object in X. Moreover, we have R△(X

1 ∪ X2) = R△(X1) ∩ R△(X2). Besides, as can be

seen, RN(X) ∩ R(X) is the set of properties possessed by all objects in X

and only by them.

– the set R∇(X) of properties that are not satisfied by at least one object in X.

R▽(X) = {y ∈ P|Rt(y) ∪ X = O} = 

x∈X

R(x).

Note that R▽(X) = R(X) = P \ R(X). In other words, in context R, for

any property in R▽(X), there exists at least one object outside X that misses

it. Moreover, we have R▽(X1∩ X2) = R▽(X1) ∪ R▽(X2).

Note that RΠ(X) and RN(X) become larger when X increases, while R(X)

and R▽(X) get smaller. The four subsets RΠ(X), RN(X), R(X), and R(X)

have been considered by different authors (with different notations) without any reference to possibility theory. D¨untsch et al. [26,27] calls R△ a sufficiency

operator, and its representation capabilities are studied in the theory of Boolean algebras. Taking inspiration as the previous authors from rough sets [40], Yao [45,46] also considers these four subsets. In both cases, the four operators were introduced. See also [33,41].

2.4 The Cube of Opposition in FCA

Before being able to present the structures of opposition relating the four oper-ators introduced in the previous section, we need to start with a refresher on the Aristotelian square of opposition [39]. The traditional square involves four

(7)

logically related statements exhibiting universal or existential quantifications: it has been noticed that a statement A of the form “every x is p”is negated by the statement O “some x is not p”, while a statement like E “no x is p”is clearly in even stronger opposition to the first statement A. These three statements, together with the negation of the last one, namely I “some x is p”, give birth to the Aristotelian square of opposition in terms of quantifiers A: ∀x p(x), E: ∀x ¬p(x), I: ∃x p(x), O: ∃x ¬p(x), pictured in Fig.1. Such a square is usually denoted by the letters A, I (affirmative half) and E, O (negative half). The names of the vertices come from a traditional Latin reading: AffIrmo, nEgO).

Contraries A:∀x p(x) E:∀x ¬p(x) S u b -alte rns Sub-contraries I:∃x p(x) O:∃x ¬p(x) Sub-alterns Cont radictorie s Contra dictori es

Fig. 1. Square of opposition

As can be seen, different relations hold between the vertices. Namely, (a) A and O are the negation of each other, as well as E and I;

(b) A entails I, and E entails O (we assume that there are some x for avoiding existential import problems);

(c) A and E cannot be true together, but may be false together; (d) I and O cannot be false together, but may be true together.

Recently, it has been noticed that such a square can be generated by a binary relation and a subset that can be composed together [12]. Indeed, let R be a binary relation on a Cartesian product X × Y (nothing forbids Y = X in the construction we are going to describe). We assume R = ∅. Let Rt denote

the transposed relation ((y, x) ∈ Rt iff (x, y) ∈ R). Moreover, we assume that

∀x, R(x) = ∅, which means that the relation R is serial, namely ∀x, ∃y such that (x, y) ∈ R; this is also referred to in the following as the X -normalization condition. In the same way Rtis also supposed to be serial, i.e., ∀y, Rt(y) = ∅

(Y-normalization). We further assume that the complementary relation R ((x, y) ∈ R iff (x, y) ∈ R), and its transpose are also serial, i.e. ∀x, R(x) = Y and ∀y, Rt(y) = X . These conditions enforce a non trivial relation between X and

Y. In the following, set complementations will be denoted by means of overbars. Let S be a subset of Y. We assume S = ∅ and S = Y. The relation R and the subset S give birth to the following subset of X, namely the (left) image of S by R

(8)

R(S) = {x ∈ X | ∃s ∈ S, (x, s) ∈ R} = {x ∈ X | S ∩ R(x) = ∅}.

Similarly, we consider R(S), R(S), and R(S) = {x ∈ X | ∀s ∈ S, (x, s) ∈ R} = {x ∈ X | R(x) ⊆ S}. The four subsets thus defined can be nicely organized into a square of opposition. See Fig.2. Indeed, it can be checked that the set counter-parts of the relations existing between the logical statements of the traditional square of oppositions still hold here. Namely, R(S) and R(S) are complements of each other, as R(S) and R(S); we have R(S) ⊆ R(S) and R(S) ⊆ R(S), thanks to X -normalization condition; R(S) ∩ R(S) = ∅ ; R(S) ∪ R(S) = X . Empty intersection A: R(S) E: R(S) In cl u sio n Full union I: R(S) O: R(S) Inc lusion Complements Comp lements

Fig. 2.Square of oppositions induced by a relation R and a subset S

Let us now consider the complementary relation R. We further assume that R = ∅ (i.e., R = X × Y). Moreover we have also assumed the X -normalization of R, i.e. ∀x, ∃y (x, y) ∈ R. In the same way as previously, we get four other subsets of X from R. Namely, R(S) = {x ∈ X | ∃s ∈ S, (x, s) ∈ R} = {x ∈ X | S∪R(x) = X }; R(S); R(S); R(S) = {x ∈ X | ∀s ∈ S, (x, s) ∈ R} = {x ∈ X | S ⊆ R(x)}. This generates a second square of opposition denoted by aeoi.

As can be seen, when R is a formal context (i.e., X = O, Y = P), we have RΠ(S) = R(S), RN(S) = R(S), R(S) = R(S), R(S) = R(S). The

eight subsets involving R and its complement can be organized into a cube of opposition as in Fig.3. The four formal concept analysis operators correspond to the left side facet of the cube of oppositions. The full cube is then obtained by introducing their complements, giving birth to the right side facet. Since RΠ(S) = RN(S), and R(S) = R(S), the classical square of oppositions AEOI

is given by the four corners RN(S), RN(S), RΠ(S), and RΠ(S), and the second

square aeoi on the back of the cube is given by R∆(S), R(S), R(S), and

R∇(S).

Moreover, in the side facets, all edges are uni-directed, including the diagonal ones, and express inclusions. Indeed, as already established in [16], under the X -and Y–normalization hypotheses, the following inclusion relation holds:

(9)

RN(S), R∆(S), RΠ(S), and R∇(S) constitute, four distinct pieces of information [16], which are only (weakly) related by the above relation.

Lastly, it can be checked that we also have R∆(S) ∩ RN(S) = ∅ and R(S) ∩

RN(S) = ∅ on the one hand, and R∇(S) ∪ RΠ(S) = X and R(S) ∪ RΠ(S) = X

on the other hand. These are the relations that holds on the top and on the bottom facets of the cube respectively.

i: R∇(S) I: RΠ(S) O: RΠ(S) o: R∇(S) a: R∆(S) A: RN(S) E: RN(S) e: R∆(S)

Fig. 3.Cube of opposition in formal concept analysis

The cube of oppositions not only underlie FCA (and PoTh) [25], but also is a setting of interest for building bridges with rough set theory [40] (see [12]), or even formal argumentation [1]!

3

Formal Context Decomposition

In FCA, a formal concept [30] is defined as a pair (X, Y ) ∈ O × P such that R△(X) = Y and Rt△(Y ) = X,

where Rt△(Y ) = {x ∈ O|R(x) ⊇ Y } =

y∈Y Rt(y) is the set X of objects having

all properties in Y , and in this case Y is also the maximal set of properties shared by all objects in X. A formal concept (X, Y ) is a maximal sub-rectangle in the formal context, i.e. is such that X × Y ⊆ R. It can be checked that R▽ gives

back the same Galois connection as the one defined from R△, while RN (or RΠ)

induces another connection, which is now described.

Consider the connection defined from RN in a similar formal way as when

defining formal concepts. It was proposed by Popescu [41] and studied in a general setting of residuated algebras, but not in the usual Boolean setting. Namely, let us consider pairs (X, Y ) s.t. RN(X) = Y and RtN(Y ) = X. As

suggested in [22], the pairs (X, Y ) s.t. RN(X) = Y and RtN(Y ) = X allow us to

characterize independent sub-contexts (i.e. that have no common objects and no common properties). They are thus of interest for the decomposition of a formal

(10)

context into smaller independent ones. This is expressed through the following property, proved in [13,24]:1

Proposition 1. The following properties of pairs (X, Y ) are equivalent 1. RN(X) = Y and RtN(Y ) = X

2. RN(X) = Y and RtN(Y ) = X

3. RΠ(X) = Y and RtΠ(Y ) = X

4. R ⊆ (X × Y ) ∪ (X × Y )

Proof. Let us first show that Property 1 implies Property 4. First it is clear that: RN(X) = Y ⇔ x∈XR(x) = Y ⇔x∈XR(x) = Y .

Denoting X + Y = X × Y , it implies R ⊆ X + Y . Likewise due to RtN(Y ) = X, Rt ⊆ Y + X holds.

Finally: R ⊆ (X +Y )∩(Y +X), which equivalently writes: R ⊆ (X ×Y )∪(X ×Y ). Conversely assume Property 4. Then it is clear that RN(X) ⊆ Y and

RtN(Y ) ⊆ X hold since there is no property possessed by any object in X outside Y , and no object outside X that possesses a property outside Y . Sup-pose RN(X) ⊂ Y , i.e. ∃y∈ Y such that property yis possessed by objects

outside X. But then R(x, y∗) = 1 for some x ∈ X, y ∈ Y . So Property 4 does

not hold. Contradiction.

The invariance of Property 4 with respect to complementation proves that the choice of (X, Y ) versus (X, Y ) in Property 1 is immaterial. Hence the equiv-alence with Property 2. For Property 3, note that RN(X) = Y is equivalent to

RΠ(X) = Y . ⊓⊔

Thus, (X, Y ) and (X, Y ) are two independent sub-contexts in R, in the sense that there is no object / property pair (x, y) of the context R either in X × Y or in X × Y . The above proposition does not involve any minimality in the inclusion Property 4 of the above proposition. In particular, the pair (O, P) trivially satisfies it. However, this result leads to a decomposition of R into a disjoint union of minimal independent sub-contexts. Indeed, suppose two pairs (X1, Y1), (X2, Y2) satisfy Proposition1. It implies that for instance, the pair

(X1∩ X2, Y1∩ Y2) satisfies it (it can be checked that RN(X1∩ X2) = Y1∩ Y2),

and likewise with any element of the partition refining both partitions (X1, X1)

and (X2, X2). Due to point 4 of Proposition1, it yields

R ⊆ ((X1× Y1) ∪ (X1× Y1)) ∩ ((X2× Y2) ∪ (X2× Y2)),

where the intersection on the right-hand side comes down to the union of sub-contexts (X1∩ X2) × (Y1∩ Y2), (X1∩ X2) × (Y1∩ Y2), (X1∩ X2) × (Y1∩ Y2),

(X1 ∩ X2) × (Y1 ∩ Y2). The decomposition of R into minimal subcontexts is

achieved by taking the following intersection 

(X,Y ):RN(X)=Y,RtN(Y )=X

(X × Y ) ∪ (X × Y ).

1

(11)

Example 1. The table below presents a formal context. Pairs ({6, 7, 8}, {c, d, e}), or ({5, 6, 7, 8}, {d, e}), or ({2, 3, 4}, {g, h}) are examples of formal concepts, while ({5, 6, 7, 8}, {a, b, c, d, e}), ({2, 3, 4}, ({f, g, h}), ({1}, {i}) are minimal subcon-texts.

Thus, through the notions of formal sub-contexts and of formal concepts, one sees two key aspects of granulation at work. Namely, on the one hand indepen-dent sub-contexts are separated, while inside each sub-context, formal concepts (X, Y ) are identified where each object in X is associated with each property in Y. However, objects in the extension of a formal concept may not be fully similar since they may also possess properties outside the intension of the concept. They are only similar with respect to the properties associated to the formal concept. Thus, the classical Galois connection founding formal concept analysis (asso-ciated with the actual possibility operator), and the other connection induced by the actual necessity operator, respectively embed two basic ideas associated with the idea of a cluster (see, e.g., [35]), namely

1. any pair of elements in a cluster should be closely related in some sense, and 2. any element of a cluster should be sufficiently separated from any element

outside it.

Moreover, formal concept analysis is also useful for conceptual clustering, where clusters should be associated with labels, obtained in this case as a conjunction of the properties shared by the objects in the cluster [11].

Such an idea can be also stated in terms of graph clustering, taking advantage of an exact parallel between formal concept analysis and bipartite graph analysis [31], as viewing an (ideal) cluster as a group of vertices

1. either with no missing link inside the group, 2. or with no link with vertices outside the group.

These two complementary views are also clearly at the basis of cluster analysis for unipartite graphs [42].

In practice, it is important to introduce some tolerance in the evaluation of the similarity between the members of a cluster and in the separatedness of

(12)

the clusters, leading to a more permissive and approximate view of granules or clusters; see, e.g., [32].

The other (mixed) connections RΘ(S) = T and R(T ) = S where Θ, Λ ∈

{Π, N, ∆, ∇} with Θ = Λ are also worth studying. They have still to be better understood and to be investigated systematically. See [15] for a preliminary discussion, and [7,9] for results in the graded case.

4

Graded Links Between a Property and an Object

Fuzzy extensions of FCA where R is a fuzzy relation in LO×P with L often taken as the unit interval have been proposed early [6,10]. However, the development of a fuzzy formal concept analysis theory requires an appropriate algebra of fuzzy sets [6,8]. While many theoretical studies have been developed, the different gradual interpretations of a fuzzy formal context have not been much discussed. Following [14], this section highlights some basic issues regarding the fact that a “fuzzy” or graded extension of binary formal contexts may convey different semantics: graded satisfaction of properties vs. uncertainty.

4.1 Gradual Properties: Unipolar Vs. Bipolar Scale Interpretation In this first interpretation, the values in the table (which are scalars in L) may be understood as providing a refinement of the cross marks. Namely, they rep-resent to what extent an object has a property, while in the classical model, this relationship was not a matter of degree. It is important to remark that in this view, we do not refine the absence of a property for an object (the blank mark is always replaced by the bottom element 0 of L). This view will be referred to as the positive unipolar interpretation. In this interpretation, Rt(y) (resp. R(x))

is considered as the support of the fuzzy set of objects (resp. properties) satis-fying the property y (resp. the object x). One could also consider the opposite convention namely the negative unipolar interpretation where degrees would rep-resent to which extent an object does not have a property and equivalently provide a refinement of the blank marks.

The most commonly used interpretations, through existing FCA proposals, are implicitly based on the positive unipolar interpretation that allows to map a formal context with quantitative attributes into a fuzzy formal context. In this spirit, conceptual scale theory [44] may be used to achieve a suitable (Boolean) representation by successive subsumptions.

Example 2. For instance, the formal context illustrated in Table2 is obtained from Table1 by a conceptual scaling of both many-valued attributes “Age”and “Salary”. As can be seen, we have two sets of properties with obvious sub-sumption relations between them. Pairs ({P eter, Sophie, M ike, Joe}, {age ≥ 20, salary ≥ 1000}), ({Sophie, M ike}, {age ≥ 20, age ≥ 25, salary ≥ 1000, salary ≥ 1200}), or ({M ike}, {age ≥ 20, age ≥ 25, age ≥ 30, salary ≥ 1000, salary ≥ 1200, salary ≥ 1400}) are formal concepts.

(13)

Table 1. Many-valued relation R1 Pierre Sophie Mike Nahla

Age 22 28 30 22

Salary 1100 1300 1500 1500 Table 2. Context subsumption

R2 Pierre Sophie Mike Nahla

age ≥ 20 × × × × age ≥ 25 × × age ≥ 30 × salaire ≥ 1000 × × × × salaire ≥ 1200 × × × salaire ≥ 1400 × ×

Table 3. Context summarization

R3 Pierre Sophie Mike Nahla

age ‘young’ 1 0.7 0.6 1

salary ‘low’ 1 0.8 0.6 0.6

Knowing the ages and the salaries, the formal context R2 can be re-encoded

in a more compact way, using two fuzzy sets ‘young’ and ‘small’ with decreasing membership functions, as illustrated in Table3.

Observe also that R3 offers a more precise representation of initial data than

Table2. The context in Table3, event though more compact than Table2 high-lights the fact that Mike, and to a lesser extent Sophie are not very young and have a salary that is not really low. It constitutes in some sense the negative of the picture shown on Table1. Note that the type of representation on Table3 can be obtained even without providing interpretable fuzzy sets and thus, by normalizing in L the domain of attribute values. This approach is used in [36].

Another interpretation of the degrees, maybe more in the standard spirit of fuzzy logic would be to replace both the cross marks and the blank marks by values in the scale L (L = [0, 1]). Then L possesses a mid-point acting as a pivoting value between the situations where the object possesses the property to some extent and the converse situation where the object possesses the oppo-site property to some extent. Under this view, a fuzzy formal concept should be learnt together with its negation. This view corresponds to a bipolar scale interpretation.

4.2 Uncertainty

Neither the standard FCA approach nor its fuzzy extension are equipped for representing situations of partial or complete ignorance. To this end, in the

(14)

Boolean case, we need to introduce a proper representation of partial uncer-tainty including ignorance in the relational table of the formal context. One may think of introducing gradations of uncertainty by changing crosses and blanks in the table into probability degrees, or by possibility or necessity degrees. In the probabilistic case, one number shall assess the probability that a considered property holds for a given object (its complement to 1 corresponding to the probability it does not hold). However, this is assuming precise knowledge on the probability values, which is not really appropriate if we have to model the state of complete ignorance. It is why we investigate the use of the possibilistic setting in the following.

In the possibilistic setting, crosses may be replaced by positive degrees of necessity for expressing some certainty that an object satisfies a property. The blanks could be refined by possibility degrees less than 1, expressing that it is little possible that an object satisfies a property. However, this convention using a single number in the unit interval for each entry in the context may be misleading as when the number replaces a blank or a cross, the meaning of the number is not the same.

In the possibilistic setting, possibility and necessity functions are related by the duality relation N (A) = 1 − Π(A), that holds for any event A, where A denotes the opposite event [18]. Then, for entries (x, y) in the table, we use a representation as a pair of necessity degrees (α, 1 − β) where α = N ((x, y) ∈ R) (resp. 1−β = N ((x, y) ∈ R)) corresponds to the necessity (certainty) that object xhas (resp. does not have) property y. Moreover, we should respect the property min(α, 1 − β) = 0, since min(N (A), N (A)) = 0 in agreement with complete ignorance, in which case nothing (i.e., neither A nor A) is even somewhat certain. Pairs (1,0) and (0,1) correspond to completely informed situations where it is known that object x has, respectively does not have, property y. The pair (0,0) reflects total ignorance, whereas pairs (α, 1 − β) s.t. 1 > max(α, 1 − β) > 0 correspond to partial ignorance.

An uncertain formal context is thus represented by

RU = {(α(x, y), 1 − β(x, y)) | x ∈ O, y ∈ P}

where α(x, y) ∈ [0, 1] , β(x, y) ∈ [0, 1]. A relational database with fuzzily-known attribute values is theoretically equivalent to the fuzzy set of all ordinary data-bases corresponding to the different possible ways of completing the informa-tion consistently with the fuzzy restricinforma-tions on the attribute values. So, an uncertain formal context may be viewed as a weighted family of all standard formal contexts obtained by changing uncertain entries into sure ones. More precisely, one may consider all the completions of an uncertain formal con-text. This is done by substituting entries (x, y) that are uncertain, i.e., such that 1 > max(α(x, y), 1 − β(x, y)) by a pair (1,0), or a pair (0,1). Replacing (α(x, y), 1 − β(x, y)) by (1, 0) is possible at degree β(x, y), the possibility that x has property y. Similarly, replacing (α(x, y), 1 − β(x, y)) by (0, 1) is possible at degree 1 − α(x, y), the possibility that x does not have the property y. In this way, one may determine to what extent a particular completion (a context C) is

(15)

possible, by aggregating the possibility degrees associated with each completed entry (using min operator). Formally, one can write

π(C) = min(min(x,y):(x,y)∈Cβ(x, y), min(x,y):(x,y)∈C1 − α(x, y)).

Likewise the degree of possibility that (X, Y ) is a formal concept of RU is

π(X, Y ) = sup{π(C) : C such that (X, Y ) is a formal concept of C}. Useful completions are those where partial certainty becomes full certainty. Indeed, given an uncertain formal context and a threshold pair (u, v), let us replace all entries of the form (α, 0) such that α  u with (1, 0) and entries of the form (0, 1 − β) such that 1 − β  v with (0, 1). All such replacements have possibility 1 according to the above formula. Remaining entries, which are more uncertain, can be systematically substituted either by (1,0), or by (0,1). Consid-ering, the two extreme cases where all such entries are changed into (1,0) and the case when where all such entries are changed into (0,1) gives birth to upper and lower completions, respectively. In this way, two classical (Boolean) formal contexts, denoted R∗

(u,v) and R∗(u,v) are obtained as respective results of the

two completions. They allow to determine, for a given threshold (u, v), maximal extensions (resp. minimal intensions) and minimal extensions (resp. maximal intensions) of uncertain formal concepts. It is clear that R∗(u,v) ⊆ R∗(u,v). Let us

illustrate the idea with an example.

Example 3. Table4 exhibits a formal context where some entries are pervaded with uncertainty. Let us examine the situation regarding formal concepts. Take u= 0.7, v = 0.5 for instance. In context R∗(0.7,0.5), examples of formal concepts

are pairs ({6, 7, 8}, {c, d, e}), or ({5, 6, 7, 8}, {d, e}), or ({2, 3, 4}, {g, h}), although with u = 0.9, the last formal concept would reduce to ({2, 3}, {g, h}), i.e., the extent of the concept is smaller.

Now consider R∗

(0.7,0.5), where the entries with low certainty levels (either in

favor or against the existence of the link between x and y) are turned into Table 4. Uncertain formal concepts

1 2 3 4 5 6 7 8 a × b × × c (0.5,0) × × × d × × × × e × × × × f (0, 0.8) × (0, 0.3) g × × (0.8, 0) h × × (0.8, 0) i ×

(16)

positive links. Then, ({2, 3, 4}, {g, h}) remains unchanged as a formal concept, while a larger concept now emerges, namely ({5, 6, 7, 8}, {c, d, e}). However, one may prefer to consider the results obtained from R∗(0.7,0.5), where only

the almost certain information is changed into positive links. In the example, if we move down u to 0.5, and use R∗(0.5,0.5) we still validate the larger former

concept ({5, 6, 7, 8}, {c, d, e}). This illustrates the fact that becoming less and less demanding on the level of certainty, may enable the fusion of close concepts (here ({6, 7, 8}, {c, d, e}), and ({5, 6, 7, 8}, {d, e}), providing a more synthetic view of the formal context.

This small example is intended to illustrate several points. First of all, it should be clear that being uncertain about the existence of a link between an object and a property is not the same as being certain about a gradual link. Second, under uncertainty, there are formal concepts whose boundaries are not affected by uncertainty, while others are. Lastly, regarding certain enough pieces of information as fully certain may help simplifying the analysis of the formal context. Besides, the proposed setting may also handle inconsistent information by relaxing the constraint min(α, 1 − β) = 0. This would amount to introducing paraconsistent links between objects and properties.

5

More Lines for Further Research

Let us briefly conclude this survey of works in FCA inspired by PoTh by men-tioning other examples of lines of interest for further research:

– The parallel of FCA with PoTh leading to the introduction of new operators extends to conceptual pattern structures [28,29], where the description ∂(x) of an object x, may, e.g., be a possibilistic knowledge base [2];

– Applications of FCA to the fusion of conflicting pieces of information issued from multiple sources using pattern structures for labeling sets of possible values in terms of sources supporting them [3];

– The clustering of sets of objects on the basis of approximate concepts [24,32], with labeling of the clusters [38];

– The building of conceptual analogical proportions [37] on the basis of the formal definition of analogical proportions in non-distributive lattices [34], conceptu-alization and analogical reasoning being two basic cognitive activities [4].

References

1. Amgoud, L., Prade, H.: A formal concept view of abstract argumentation. In: van der Gaag, L.C. (ed.) ECSQARU 2013. LNCS, vol. 7958, pp. 1–12. Springer, Heidelberg (2013)

2. Assaghir, Z., Kaytoue, M., Prade, H.: A possibility theory-oriented discussion of conceptual pattern structures. In: Deshpande, A., Hunter, A. (eds.) SUM 2010. LNCS, vol. 6379, pp. 70–83. Springer, Heidelberg (2010)

(17)

3. Assaghir, Z., Napoli, A., Kaytoue, M., Dubois, D., Prade, H.: Numerical informa-tion fusion: lattice of answers with supporting arguments. In: Proceedings of 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI’11), pp. 621–628, Boca Raton, 7–9 November 2011

4. Barbot, N., Miclet, L., Prade, H.: Analogical Proportions and the Factorization of Information in Distributive Lattices. In: CEUR Workshop Proceedings of the Tenth International Conference on Concept Lattices and Their Applications CLA 2013, pp. 175-186, La Rochelle, 15-18 October 2013

5. Barbut, M., Monjardet, B.: Ordre et Classification. Alg`ebre et Combinatoire. Tome 2. Hachette, Paris (1970)

6. Bˇelohl´avek, R.: Fuzzy Galois connections. Math. Logic Q. 45, 497–504 (1999) 7. Bˇelohl´avek, R.: Optimal triangular decompositions of matrices with entries from

residuated lattices. Int. J. Approximate Reasoning 50(8), 1250–1258 (2009) 8. Bˇelohl´avek, R., Vychodil, V.: What is a fuzzy concept lattice. In: Proceedings

CLA2005, pp. 34–45, Olomuc, Czech Republic (2005)

9. Bˇelohl´avek, R., Vychodil, V.: Discovery of optimal factors in binary data via a novel method of matrix decomposition. J. Comput. Syst. Sci. 76(1), 3–20 (2010) 10. Burusco, A., Fuentes-Gonzalez, R.: The study of the L-fuzzy concept lattice.

Mathw. Soft Comput. 3, 209–218 (1994)

11. Carpineto, C., Romano, G.: GALOIS: an order-theoretic approach to conceptual clustering. In: Proceedings of 10th International Conference on Machine Learning, pp. 33–40. Morgan Kaufmann, Amherst, 27–29 June 1993

12. Ciucci, D., Dubois, D., Prade, H.: The structure of oppositions in rough set theory and formal concept analysis - toward a new bridge between the two settings. In: Beierle, C., Meghini, C. (eds.) FoIKS 2014. LNCS, vol. 8367, pp. 154–173. Springer, Heidelberg (2014)

13. Djouadi, Y., Dubois, D., Prade, H.: Possibility theory and formal concept analysis: context decomposition and uncertainty handling. In: H¨ullermeier, E., Kruse, R., Hoffmann, F. (eds.) IPMU 2010. LNCS, vol. 6178, pp. 260–269. Springer, Heidel-berg (2010)

14. Djouadi, Y., Dubois, D., Prade, H.: Graduality, uncertainty and typicality in formal concept analysis. In: Cornelis, C., Deschrijver, G., Nachtegael, M., Schockaert, S., Shi, Y. (eds.) 35 years of Fuzzy Sets Theory - Celebratory Volume Dedicated to the Retirement of Etienne E. Kerre. SFSC 261, pp. 127–147. Springer, Heidelberg (2011)

15. Djouadi, Y., Prade, H.: Possibility-theoretic extension of derivation operators in formal concept analysis over fuzzy lattices. Fuzzy Optim. Decis. Making 10(4), 287–309 (2011)

16. Dubois, D., Dupin de Saint Cyr, F., Prade, H.: A possibility-theoretic view of formal concept analysis. Fundamenta Informaticae 75(1–4), 195–213 (2007) 17. Dubois, D., Prade, H.: Fuzzy Sets and Systems: Theory and Applications.

Acad-emic Press, New York (1980)

18. Dubois, D., Prade, H.: Possibility Theory: An Approach to Computerized Process-ing of Uncertainty. Plenum Press, New York (1988)

19. Dubois, D., Prade, H.: Possibility theory as a basis for preference propagation in automated reasoning. In: Proceedings of 1st IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 1992), San Diego, pp. 821–832, 8–12 March 1992 20. Dubois, D., Prade, H.: Possibility theory: qualitative and quantitative aspects.

In: Gabbay, D., Smets, P. (eds.) Quantified Representation of Uncertainty and Imprecision of Handbook of Defeasible Reasoning and Uncertainty Management Systems, vol. 1, pp. 169–226. Kluwer Academic Publisher, Dordercht (1998)

(18)

21. Dubois, D., Prade, H.: Possibility theory. In: Meyers, R. (ed.) Encyclopedia of Complexity and Systems Science, pp. 6927–6939. Springer, New York (2009) 22. Dubois, D., Prade, H.: Possibility theory and formal concept analysis in

informa-tion systems. In: Proceedings of Internainforma-tional Fuzzy Systems Associainforma-tion World Congress and Conference of the Europe Society for Fuzzy Logic and Technology (IFSA-EUSFLAT 2009), pp. 1021–1026, Lisbon, 20–24 July 2009

23. Dubois, D., Prade, H.: Gradualness, uncertainty and bipolarity: making sense of fuzzy sets. Fuzzy Sets Syst. 192, 3–24 (2012)

24. Dubois, D., Prade, H.: Possibility theory and formal concept analysis: characteriz-ing independent sub-contexts. Fuzzy Sets Syst. 196, 4–16 (2012)

25. Dubois, D., Prade, H.: From Blanch´e’s hexagonal organization of concepts to formal concept analysis and possibility theory. Logica Universalis 6(1–2), 149–169 (2012) 26. D¨untsch, I., Gediga, G.: Approximation operators in qualitative data analysis. In: de Swart, H., Orlowska, E., Schmidt, G., Roubens, M. (eds.) Theory and Appli-cation of Relational Structures as Knowledge Instruments, pp. 216–233. Springer, Heidelberg (2003)

27. D¨untsch, I., Orlowska, E.: Mixing modal and sufficiency operators. Bull. Sect. Log. Pol. Acad. Sci. 28(2), 99–106 (1999)

28. Ferr´e, S., Ridoux, O.: A logical generalization of formal concept analysis. In: Gan-ter, B., Mineau, G.W. (eds.) ICCS 2000. LNCS, vol. 1867. Springer, Heidelberg (2000)

29. Ganter, B., Kuznetsov, S.O.: Pattern structures and their projections. In: Del-ugach, H.S., Stumme, G. (eds.) ICCS 2001. LNCS (LNAI), vol. 2120, p. 129. Springer, Heidelberg (2001)

30. Ganter, B., Wille, R.: Formal Concept Analysis. Springer-Verlag, Heidelberg (1999) 31. Gaume, B., Navarro, E., Prade, H.: A parallel between extended formal concept analysis and bipartite graphs analysis. In: H¨ullermeier, E., Kruse, R., Hoffmann, F. (eds.) IPMU 2010. LNCS, vol. 6178, pp. 270–280. Springer, Heidelberg (2010) 32. Gaume, B., Navarro, E., Prade, H.: Clustering bipartite graphs in terms of

approx-imate formal concepts and sub-contexts. Int. J. Comput. Intell. Syst. 6(6), 1125– 1142 (2013)

33. Georgescu, G., Popescu, A.: Non-dual fuzzy connections. Arch. Math. Log. 43(8), 1009–1039 (2004)

34. Miclet, L., Prade, H.,Guennec, D.: Looking for analogical properties in a formal concept analysis setting. In: Napoli, A., Vychodil, V. (eds.) proceedings of 8th International Conference on Concept Lattices and Their Application (CLA 2011), pp. 295–307. INRIA, Nancy, 17–20 October 2011

35. H¨oppner, F., Klawonn, F., Kruse, R., Runkler, T.: Fuzzy Cluster Analysis: Methods for Classification. Data Analysis and Image Recognition. Wiley, New York (1999) 36. Messai, N., Devignes, M., Napoli, A., Tabbone, M.: Many-valued concept lattices for conceptual clustering and information retrieval. In: Proceedings of 18th Euro-pean Conference on Artificial Intelligence, pp. 722–727, Patras (2008)

37. Miclet, L., Barbot, N., Prade, H.: From analogical proportions in lattices to pro-portional analogies in formal concepts. In: Schaub, T., Friedrich, G., O’Sullivan, B. (eds.) Proceedings of 21st European Conference on Artificial Intelligence (ECAI14), pp. 627–632. IOS Press, Prague, 18–22 August 2014

38. Navarro, E., Prade, H., Gaume, B.: Clustering sets of objects using concepts-objects bipartite graphs. In: H¨ullermeier, E., Link, S., Fober, T., Seeger, B. (eds.) SUM 2012. LNCS, vol. 7520, pp. 420–432. Springer, Heidelberg (2012)

39. Parsons, T.: The traditional square of opposition. In: Zalta, E.N. (ed.) The Stanford Encyclopedia of Philosophy, Fall 2008 edn (2008)

(19)

40. Pawlak, Z.: Rough Sets. Theoretical Aspects of. Reasoning about Data. Kluwer Academic Publisher, Norwell (1991)

41. Popescu, A.: A general approach to fuzzy concepts. Math. Log. Q. 50, 265–280 (2004)

42. Schaeffer, E.: Graph clustering. Comput. Sci. Rev. 1, 27–64 (2007)

43. Wille, R.: Restructuring lattice theory: an approach based on hierarchies of con-cepts. In: Rival, I. (ed.) Ordered Sets, pp. 445–470. Reidel, Dordrecht (1982) 44. Wolff, K.E.: Concepts in fuzzy scaling theory: order and granularity. Fuzzy Sets

Syst. 132, 63–75 (2002)

45. Yao, Y., Chen, Y.: Rough set approximations in formal concept analysis. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V. LNCS, vol. 4100, pp. 285– 305. Springer, Heidelberg (2006)

46. Yao, Y.: A comparative study of formal concept analysis and rough set theory in data analysis. In: Tsumoto, S., Slowi´nski, R., Komorowski, J., Grzymala-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 59–68. Springer, Heidelberg (2004)

47. Zadeh, L.A.: Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst. 1, 3–28 (1978)

(20)

Figure

Fig. 1. Square of opposition
Fig. 2. Square of oppositions induced by a relation R and a subset S
Fig. 3. Cube of opposition in formal concept analysis
Table 2. Context subsumption R 2 Pierre Sophie Mike Nahla
+2

Références

Documents relatifs

We show how a fuzzy context can be scaled to a “Minimum Pattern Struc- ture” (MnPS), a special kind of pattern structures, that is close to interval pattern structures when

Concept lattices of RDF graphs In [10], in addition to the RDF graph, a formal context corresponding to the background knowledge is considered. This context has a set of resources

Accordingly, we present in the following a knowledge discovery process based on Formal Concept Analysis and Pattern Structures that is applied to sets of RDF triples, taking

Construction of conceptual structures (i.e. frames) of a terminology by means of cognitive research of their linguistic units (i.e. terms) gave us a holistic picture of a

He described the different motor and sensory nerves involved by sectioning various nerve branches and spinal roots while testing for the viability of the reflex (Sherrington,

It can be shown that this ball always exist, is unique, and can be determined in linear time.. (1) The radius of B(x

The reference [S]: The textbook introduces the concept of function (figure 1) through the definition: "Any binary relation between two sets – or just in one set-

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des