• Aucun résultat trouvé

Sequential pattern mining for analyzing visitor trajectories

N/A
N/A
Protected

Academic year: 2021

Partager "Sequential pattern mining for analyzing visitor trajectories"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01890429

https://hal.inria.fr/hal-01890429

Submitted on 8 Oct 2018

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Sequential pattern mining for analyzing visitor trajectories

Nyoman Juniarta, Miguel Couceiro, Amedeo Napoli, Chedy Raïssi

To cite this version:

Nyoman Juniarta, Miguel Couceiro, Amedeo Napoli, Chedy Raïssi. Sequential pattern mining for analyzing visitor trajectories. ISWS 2018 - International Semantic Web Research Summer School 2018, Jul 2018, Bertinoro, Italy. �hal-01890429�

(2)

Sequential pattern mining for analyzing visitor trajectories

Nyoman Juniarta, Miguel Couceiro, Amedeo Napoli, and Chedy Raïssi Université de Lorraine, CNRS, Inria, LORIA, F-54000 Nancy, France

[email protected]

1. Introduction

2. Dataset and visitor behaviors

3. Workflow

4. Clustering 5. Sequential pattern mining

6. Results

Hecht Museum is an archaeological museum in Haifa, Israel.

In this work, we present our work on the grouping and mining of trajectory patterns of visitors in this museum.

Within the framework of CrossCult project, we are working on a dataset of the trajectories of 254 visitors in Hecht Museum.

Each trajectory contains a list of visited items.

An example is illustrated in the table below.

Table I. An example of one visitor trajectory

Start Finish Item Room

14:10:09 14:11:14 102 1

14:12:45 14:20:19 402 4

14:22:10 14:25:42 407 4

Based on his/her movement, a visitor can be grouped as one of four defined behaviors: ant, grasshopper, butterfly, and fish, as illustrated in the figures below.

Each trajectory is modeled as a sequence of items.

The trajectory in Table I becomes:

(102, 402, 407)

Distance between any two sequences is measured by simACS [1].

7. Conclusion

We apply two algorithms to obtain two lattices:

• MFCS (mining frequent contiguous sequences) [2]

(102, 202, 301, 402) → (102, 202, 301)

• MRGS (mining rare general sequences) [3]

(102, 202, 301, 402) → (102, 202, 402)

6. Jumping emerging pattern (JEP)

From the two lattices, there are some concepts whose extent contains visitors from the same label.

The intent of such concepts is a JEP.

8. References

[1] E. Egho et al., 2015, On measuring similarity for sequences of itemsets. Data Mining and Knowledge Discovery 29(3), 732–764.

[2] A. Buzmakov et al., 2016, On mining complex sequential data by means of FCA and pattern structures. International Journal of General Systems 45(2), 135–159.

[3] V. Codocedo et al., 2017, A proposition for sequence mining using pattern structures.

Proceedings of ICFCA, 106–121.

Our results highlight some interesting patterns that may define visitor

behaviors. This can help museum researchers to analyze and evaluate the placement of items and the visiting styles.

Furthermore, these patterns can be analyzed to build a recommendation system for future visitors.

Visitors Patterns Cluster Behavior

MFCS

70, 107, 121, 133, 201, 202 (1, 1, 402) A ant

103, 165, 188 (701, 707) B ant

4, 8, 32 (101, 102, 101) C ant/butterfly

46, 47 (101, 602) D grasshopper

89, 163 (602, 203) D grasshopper

MR

GS 71, 79 (701, 504) D grasshopper

97, 98 (701, 406) D grasshopper

E fish

Dataset of visitor paths

Sequential pattern minings Concept lattices

Grouping/clustering Labeled

dataset

Finding JEPs for each

label

Mapping labels into behaviors Ant Grasshopper Butterfly Fish

Patterns Count

(1, 1, 1) 33

(1, 7) 13

(1, 1) 66

Patterns Count

(1, 3) 38

(3, 1) 9

(4, 7) 31

(7, 4) 11

Concept

Intent: Set of common paths Extent: Set of visitors

Références

Documents relatifs

the mining of complex sequential data and dynamics in adapting two algorithms based on pattern structures, the analysis of the trajectories based on jumping emerging patterns

National Research University Higher School of Economics, Moscow, Russia Amedeo Napoli?. LORIA (CNRS – Inria Nancy Grand Est – Université de Lorraine), Vandoeuvre les

In this paper, we present an approach for analyzing sequential datasets within the framework of pattern structures, an extension of FCA dealing with complex (non binary) data..

In order to address this drawback, [12] have proposed a global constraint exists-embedding to encode the subsequence relation, and used projected frequency within an ad hoc

Indeed, contextual sequential pattern mining does not only aim to mine sequences that are specific to one class, but to extract all sequential patterns in a whole sequence database,

Application of sequential pattern mining to the analysis of visitor trajectories.. Nyoman Juniarta,

Nyoman Juniarta, Miguel Couceiro, Amedeo Napoli, and Chedy Raïssi Université de Lorraine, CNRS, Inria, LORIA, F-54000 Nancy,

With regards to trajectory mining, one direction is to try to extend the existing methods and movement pattern definitions to indoor spaces, or alternatively investigate new