• Aucun résultat trouvé

Image Results Exploration using Ephemeral Clustering

N/A
N/A
Protected

Academic year: 2021

Partager "Image Results Exploration using Ephemeral Clustering"

Copied!
3
0
0

Texte intégral

(1)

HAL Id: hal-01070167

https://hal.archives-ouvertes.fr/hal-01070167

Submitted on 6 Oct 2014

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Image Results Exploration using Ephemeral Clustering

José G. Moreno, Gaël Dias

To cite this version:

José G. Moreno, Gaël Dias. Image Results Exploration using Ephemeral Clustering. 2012 World Wide Web Conference (WWW 2012)., Apr 2012, Lyon, France. 2 p. �hal-01070167�

(2)

Image Results Exploration using Ephemeral Clustering

Jose G.Moreno and Ga¨el Dias HULTECH / GREYC - CNRS UMR 6072

Abstract

The recent shift in human-computer interaction from desktop to mobile computing fosters the needs of new interfaces for web image search results exploration. We present two different strategies to cluster results gathered from a text- based image search engine and propose an adapted interface for handled devices. For that purpose, we suggest to expand the original text query based on labels of ephemeral text clusters and compare it to a query log-based approach. In particular, we present a new ephemeral clustering algorithm, the H I SGK-means, which proposes a more compact representation of web search results compared to the commonly used STC algorithm and best fits to the mobile devices constraints by limiting browsing and zooming.

1 Proposed Frameworks

Our approach consists in two different methodologies to address the organization of web image search results. First, an ephemeral clustering approach is presented and a sec- ond one is based on query logs. Independently of the ap- proach, the proposed algorithms must achieve the following goal: building a web image search results taxonomy for mo- bile devices in response to a text query in order to facilitate user exploration. Moreover, the methodologies are based on existing search engines services in order to propose a frame- work comparable to the work proposed in [2] and provide a meta image search engine for mobile devices capable to deal with real-world queries.

The main goal of ephemeral text clustering is to organize web snippets into a compact taxonomy, guaranteeing that at each level of the hierarchy, the most suitable number of clusters is found. Then, meaningful labels must be found for each cluster (i.e. a small set of representative words). The first framework is based on the ephemeral clustering algo- rithm called Hierarchical InfoSimba-based Global K-means (H I SGK-means) [1], which uses web snippet results to gen- erate a hierarchical representation of web search results.

The clusters are groups of web page results and are labelled within the clustering process. Additionally, the H I SGK- means offers four important characteristics for our work:

(1) optimum clustering is guaranteed, (2) the labelling step is included in the clustering process to avoid unlabelled clus- ters, (3) it is language-independent and (4) a compact tax- onomy is built. The procedure to get web image search clusters is defined in algorithm 1. The procedure to get web image search clusters is defined in algorithm 1. The sec- ond framework is based on query logs obtained through the Google Suggestions API, which allows to get frequently-used queries related to a given query. This service is usually used for query completion. As the previous approach, an image

web search engine is queried after query expansion based on query suggestions is made. The results are then displayed in clusters, where each proposed query suggestion is a clus- ter containing the results of the image web search engine.

The algorithm 1 describes the query log-based framework.

In fact, each group is a set of images obtained by the image search API from Google and labelled by the cluster names obtained from theH I SGK-means algorithm concatenated to the original query for the first approach and from the query expansion suggestions via query logs for the second approach. In particular, both approaches retrieve semanti- cally separated results. The ephemeral clustering based ap- proach finds the cluster names “sells services”, “cars new”,

“land rover” and “onca america” (the animal) for the first four image clusters and the query log-based approach sug- gests “jaguar”, “jaguar usa”, “jaguar car” and “jaguar an- imal”. In particular, we developed a prototype for the An- droid operating system for smart-phones, which implements facilitating interaction facilities to explore results. So, in or- der to facilitate the exploration phase, each cluster of image results can be explored with left and right movements as a typical gallery exploration. Moreover, the different groups can be explored using up and down movements allowing a quick exploration of different meanings involved in the orig- inal query.

References

[1] Dias, G., Cleuziou, G., and Machado, D. Informa- tive polythetic hierarchical ephemeral clustering. In IEEE/WIC/ACM International Conference on Web In- telligence (WIC 2011)(2011).

[2] Ding, H., Liu, J., and Lu, H. Hierarchical clustering- based navigation of image search results. In 16th Annual ACM International Conference on Multimedia (MM 2008)(2008), 741–744.

1

(3)

A Framework Algorithm

Algorithm 1Image clusters using different frameworks.

Input:T extQuery, F rameworkj

Output:ImgCluster

ExpandedSet=F rameworkj(T extQuery) forEach elementclusteriinClusterSetdo

ClusterN ame=getClusterN ame(clusteri) ExpandQuery=concat(T extQuery, ClusterN ame) ImgClusteri=getImgResults(ExpandQuery) ImgClusterN amei=ClusterN ame

end for

returnImgCluster

B Example Results

(a) HISGK (b) Query Log

Figure 1: Results for “jaguar” using ephemeral clustering (HISGK-means) and query logs.

C Link youtube video

http://www.youtube.com/watch?v=HmDIgDnUYgQ

2

Références

Documents relatifs

Abstract: In this paper, we are interested to segment weld radiographic images using the level set method (LSM) based on kernel fuzzy c-means clustering (KFCM) in order to

In this paper, in order to achieve higher order accuracy in the temporal discretization, we have used Total Variation Diminishing (TVD) Runge Kutta (RK) methods.. The spatial

Cette étude nous montre que la Dimedone et l’ortho-phénylènediamine deux produits commerciaux et bon marché, réagissent pour donner l’intermédiaire

The wavelength range probed by SOIR allows for a detailed chemical inventory of the Venus atmosphere at the terminators in the upper mesosphere and lower thermosphere (80 to 180

Each Cluster spacecraft contains 11 identical instruments: an active spacecraft potential control instrument (ASPOC), an ion spectrometry instrument (CIS), an electron drift

1) The choice of the unit square to build the strip implies that the corresponding tiling involves only two tiles. Actually, if the unit square is replaced by any other unit cell

Following this idea and based on Kim's critical state model [7], a method t o determine the transport critical current density has been developed [8]. - Temperature dependence

To address the problems related to the organization of web image search results for mobile devices and improve the results obtained with image search systems, we explore two