• Aucun résultat trouvé

Institut Télécom: Partner Introduction

N/A
N/A
Protected

Academic year: 2022

Partager "Institut Télécom: Partner Introduction"

Copied!
5
0
0

Texte intégral

(1)

Institut Télécom:

Partner Introduction

Pierre Senellart, Télécom ParisTech

(2)

Institut Télécom & Télécom ParisTech

Group gathering several public engineering and management school around France.

Leading French engineering school in information technology. One of the schools of ParisTech, the Paris Institute of Technology. Located inside Paris.

Formerly known asENST.

DBWEB Our research group inside Télécom ParisTech.

(3)

The DBW

EB

Team

4 permanent researchers (+1 multi-affiliated member):

Talel Abdessalem Bogdan Cautis Jean-Louis Dessalles Pierre Senellart

Michalis Vazirgiannis (also at AUEB and École polytechnique) 6 PhD candidates (+2 multi-affiliated students), one of them working on Web archiving:

Marilena Oita

One PhD position open at the moment for ARCOMEM

Other PhD, postdoc, and visiting positions potentially open as well

(4)

Research Interests

At the confluence of four areas of research:

Web data management, social networks XML database systems

Database theory Cognitive science Especially, for ARCOMEM:

Crawling of Web objects and intelligent crawling Information extraction and information integration Social network mining

Relevance and ranking of Web data

(5)

Relevant Existing Collaborations

Inside the ARCOMEM consortium:

Internet Memory Foundation deep Web crawling, crawling from RSS feeds

Yahoo! (Sihem Amer-Yahia) querying and exploration of social networks

Outside ARCOMEM:

INRIA Saclay – Île-de-France (Serge Abiteboul) Web data management

University of Oxford (Michael Benedikt, Georg Gottlob) deep Web querying, information extraction

Références

Documents relatifs

Offline phase: constructs a dynamic site map (limiting the number of URLs retrieved), learns the best traversal strategy based on importance of navigation patterns (selecting

• Detect the type of Web application, kind of Web pages inside this Web application, and decide crawling actions accordingly. • Directly targets useful content-rich areas,

The AAH deals with two different cases of adaptation: first, when (part of) a Web application has been crawled before the template change and a recrawl is carried out after that (a

7/44 Introduction Crawling of Structured Websites Learning as You Crawl Conclusion.. Importance of

Offline phase: constructs a dynamic site map (limiting the number of URLs retrieved), learns the best traversal strategy based on importance of navigation patterns (selecting

Discovering new URLs Identifying duplicates Crawling architecture Crawling Complex Content Focused

We use the following scheme for class hierarchies: An abstract subclass of Robot redefines the executionLoop method in order to indicate how the function processURL should be called

The World Wide Web Discovering new URLs Identifying duplicates Crawling architecture Crawling Complex Content Conclusion.. 23