• Aucun résultat trouvé

Cartification: from Similarities to Itemset Frequencies

N/A
N/A
Protected

Academic year: 2022

Partager "Cartification: from Similarities to Itemset Frequencies"

Copied!
1
0
0

Texte intégral

(1)

Cartification: from Similarities to Itemset Frequencies

Bart Goethals

University of Antwerp, Belgium bart.goethals@ua.ac.be

Abstract.We propose a transformation method to circumvent the problems with high dimensional data. For each object in the data, we create an itemset of the k-nearest neighbors of that object, not just for one of the dimensions, but for many views of the data. On the resulting collection of sets, we can mine frequent itemsets; that is, sets of points that are frequently seen together in some of the views on the data.

Experimentation shows that finding clusters, outliers, cluster centers, or even subspace clustering becomes easy on the cartified dataset using state-of-the-art techniques in mining interesting itemsets.

Références

Documents relatifs

Owing to the interest shown by the community and the potential for future research in clustering in nuclei, the members of the Interna- tional Advisory Committee (most of whom

1 For instance, if we post the global constraint for frequent closed itemsets (FCIs) proposed in [6] in a CP model and if in addition we post the constraint specifying that the user

In Table 1, one can see that when using the tuned version and evaluating with F 1 , the best perfor- mance for each baseline algorithm is obtained for the same number of output

We have study other dynamic spectrum access techniques such as auction theory, Game theory, Markov models and Multi Agent Systems... For certain wireless conditions, some

Given a user-specified mininum frequency threshold σ and a knowl- edge base KB = (F, H), the contextual frequent pattern mining problem consists in enumerating all the

In this case the two positive integers |y| and |y − 1| are consecutive, therefore they are relatively prime. If the product of two relatively prime integers is a square, then each

In this section we develop and discuss the variety of similarity and distance functions for objects (starting with binary qualitative variables, then for arbitrary

For each class of pro- gram (e.g., Tcas includes 37 faulty versions), we report the averaged number of positive test cases |T + |, the averaged number of negative test cases |T − |,