• Aucun résultat trouvé

Hot Topics Hot Topics in Web Search Pierre Senellart (

N/A
N/A
Protected

Academic year: 2022

Partager "Hot Topics Hot Topics in Web Search Pierre Senellart ("

Copied!
7
0
0

Texte intégral

(1)

Hot Topics

Hot Topics in Web Search

Pierre Senellart

([email protected])

(2)

New Aspects of Web Search

Web 2.0: interaction-rich interfaces, social networks, mash-ups, user-provided content

Deep Web: data accessible only through HTML forms

Information Extraction: search at the level of objects inside pages Semantic Web: machine-readable content, complex search queries

(3)

Web 2.0

Definition

Web 2.0: buzzwordabout:

rich dynamic interfaces, especially with the help ofAJAX

(Asynchronous JavaScript and XML) technologies: GMail, Google Suggest

user-editable content,collaborativework and social networks: blogs, Wikipedia, MySpace, Facebook

aggregationof content from multiple sources andpersonalization:

Netvibes, Yahoo! Pipes

(4)

Web 2.0 Impact on Web Search

How to crawl AJAX Web sites?

Previous lecture about social networks and search in social networks How to exploit user feedback in search engines: relevance feedback, cf. slides by C. Manning and P. Raghavan, Stanford:

http://www.stanford.edu/class/cs276/handouts/

lecture9-queryexpansion-1up.pdf

Mash-ups, integration, cf http://pipes.yahoo.com/, http://www.google.com/ig

(5)

Deep Web

cf. slides on the course website:

http://pierre.senellart.com/enseignement/2008-2009/inf396/

5-hot-topics/deep-web.pdf

(6)

Information Extraction

cf. course by William Cohen, CMU:

http://www.cs.cmu.edu/~wcohen/10-707/01-16-introduction.ppt

(7)

Semantic Web

cf. online tutorial by Ivan Herman:

http://www.w3.org/2006/Talks/0428-Beijing-IH/

Références

Documents relatifs

– le droit d’intégrer tout ou partie du document dans un document composite et de le diffuser dans ce nouveau document, à condition que : – L’auteur soit informé. Les

For a HTML page, or for text, the browser must also know what character set is used (this has precedence over the information contained in the document itself). Also returned:

Small scale: disk storage, with memory mapping (cf. mmap) techniques; secondary index for offset of each term in main index Large scale: distributed on a cluster of machines;

Discovering new URLs Identifying duplicates Crawling architecture Recrawling

Imagine you want to build a simple speech recognition system, or at least a spoken term detection (STD) system in a new dialect, for which only very few audio examples are

Search- Monkey contributes to the transformation from the current web to a Semantic Web by creating an ecosystem of developers, publishers and end-users where all participants

a class based representation of documents on carousels, a stack structure of classes according to their ranking, and a meta carousel for the localization of class stacks associ-

Following figure 1, we create various types of profiles differing in terms of the number of pages (Single or Multiple) and the level of data used (Instance or Page).. Note that we