• Aucun résultat trouvé

Advanced Human-Machine Interaction Interaction Data Analysis TD03: Mining Sequences of Itemsets

N/A
N/A
Protected

Academic year: 2022

Partager "Advanced Human-Machine Interaction Interaction Data Analysis TD03: Mining Sequences of Itemsets"

Copied!
1
0
0

Texte intégral

(1)

Advanced Human-Machine Interaction Interaction Data Analysis

TD03: Mining Sequences of Itemsets

INSA Rouen Normandie - Normandie University

TD Machine

1. Explore how to use the Apriori algorithm using Java using Weka (https://www.cs.waikato.ac.nz/

~ml/weka/) or in Python (e.g. apyori1.1.1).

2. Try to apply Apriori on the following data : https://storm.cis.fordham.edu/~gweiss/data-mining/

weka-data/weather.arff (or from Moodle). Why is this not possible ? Pre-process the data so that Apriori could be applied.

3. Using Apriori, identify :

the three best association rules (maximum condence and support) to predict a game (i.e. rules containing "play=yes") ;

the three best association rules (maximum condence and support) to predict the absence of a game (i.e. rules containing "play=no") ;

4. Use the sequence generator presented last week to generate new data that you could mine for the pre- diction of a particular context (e.g. item or item-set).

1

Références

Documents relatifs

The paper estab- lishes a surprisingly simple characterization of Palm distributions for such a process: The reduced n-point Palm distribution is for any n ≥ 1 itself a log Gaussian

Negotiations to remove tariffs on Environmental Goods (EGs) and Environmental Services (ESs) at the Doha Round, followed by negotiations towards an Environmental Goods Agreement

QuantMiner handles both categorical and numerical attributes and optimizes the intervals of numeric attributes during the process of min- ing association rules1. It is based on

We  thank  the  anonymous  reviewers  for  their  helpful  suggestions.  Data  Mining  Concepts  and

While the idea of interaction of a domain expert with discovered rules is not new [2], to the best of our knowledge, EasyMiner is the only web-based system which supports the

The main motivation for using Genetic Algorithm (GA) is that a GA performs a global search and copes better with attribute in- teraction than the greedy rule induction algorithms

In this work, we evaluate the predictive accuracy of a recommender system based on the IMSApriori algorithm and describe our extension to the Frequent Itemset Graph used in [NM03b]

Analyst hardly orientates himself or herself in a big list of mined rules. Therefore, we enumerate here some hints of how to prune the result from less-interesting rules and so