• Aucun résultat trouvé

Understanding the Hidden Web

N/A
N/A
Protected

Academic year: 2022

Partager "Understanding the Hidden Web"

Copied!
28
0
0

Texte intégral

(1)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Understanding the Hidden Web

Pierre Senellart

Journées GEMO — 2nd June 2005

(2)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

The Hidden Web

Definition (Hidden Web)

The set of webpages (which may or may not be dynamically generated) not accessible from thehyperlinked structureof the World Wide Web.

Size estimate (2001) :500 times larger than thesurface Web.

How to understand it and benefit from its content?

(3)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

The Hidden Web

Definition (Hidden Web)

The set of webpages (which may or may not be dynamically generated) not accessible from thehyperlinked structureof the World Wide Web.

Size estimate (2001) :500 times larger than thesurface Web.

How to understand it and benefit from its content?

(4)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

The Hidden Web

Definition (Hidden Web)

The set of webpages (which may or may not be dynamically generated) not accessible from thehyperlinked structureof the World Wide Web.

Size estimate (2001) :500 times larger than thesurface Web.

How to understand it and benefit from its content?

(5)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

(6)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

(7)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

(8)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

(9)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

(10)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Results Queries querying

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

(11)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Only interested in Web Services withno side effects:

Ok Yellow Pages

Publication databases . . .

Not Ok Booking services

Mailing list management . . .

(12)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Only interested in Web Services withno side effects:

Ok Yellow Pages

Publication databases . . .

Not Ok Booking services

Mailing list management . . .

(13)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Only interested in Web Services withno side effects:

Ok Yellow Pages

Publication databases . . .

Not Ok Booking services

Mailing list management . . .

(14)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Only interested in Web Services withno side effects:

Ok Yellow Pages

Publication databases . . .

Not Ok Booking services

Mailing list management . . .

(15)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Wrapping Web Service Descriptions

Analyzing thestructureof:

HTML forms

Result webpages

(16)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Conceptual Model

IsAontologyofconcepts(simple DAG)

Person

Man Woman

Thing

Proceedings Article Book Publication

n-ary typedroles

AuthorOf(Person,Publication) HasName(Person,Name)

(17)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Conceptual Model

IsAontologyofconcepts(simple DAG)

Person

Man Woman

Thing

Proceedings Article Book Publication

n-ary typedroles

AuthorOf(Person,Publication) HasName(Person,Name)

(18)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Services and queries

Example

Servicegiving authors from publication titles

A*←WrittenBy(P,A),HasTitle(P,T),Input(T)

Query

Service with no input Example

<A,T*>*←WrittenBy(P,A),Article(P),HasTitle(P,T), KeywordOf(“xml”,P)

(19)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Services and queries

Example

Servicegiving authors from publication titles

A*←WrittenBy(P,A),HasTitle(P,T),Input(T)

Query

Service with no input Example

<A,T*>*←WrittenBy(P,A),Article(P),HasTitle(P,T), KeywordOf(“xml”,P)

(20)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Semantic Interpretation of a Service

How to analyze a Web Service into this formalism?

Field labels and variable names Example requests

Concrete type descriptions

Linguistic analysis of plain text descriptions

(21)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Semantic Interpretation of a Service

How to analyze a Web Service into this formalism?

Field labels and variable names Example requests

Concrete type descriptions

Linguistic analysis of plain text descriptions

(22)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Semantic Interpretation of a Service

How to analyze a Web Service into this formalism?

Field labels and variable names Example requests

Concrete type descriptions

Linguistic analysis of plain text descriptions

(23)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Semantic Interpretation of a Service

How to analyze a Web Service into this formalism?

Field labels and variable names Example requests

Concrete type descriptions

Linguistic analysis of plain text descriptions

(24)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues:

Subsumptionof input/output parameters Missing input parameters

Composition of webservices

(25)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues:

Subsumptionof input/output parameters Missing input parameters

Composition of webservices

(26)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues:

Subsumptionof input/output parameters Missing input parameters

Composition of webservices

(27)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues:

Subsumptionof input/output parameters Missing input parameters

Composition of webservices

(28)

Understanding the Hidden

Web Pierre Senellart

Introduction Process description

Discovery Wrappers Semantic Analysis Indexing and Querying

Summary

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Results Queries querying

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Références

Documents relatifs

It also outlines how Web Services communicate (choreography) and how they are composed (orchestration) [2]. 3) Goals are used by service requesters to specify what

For example, in our semantics, the answer to the query “hall discovered every moon” is computed by checking to see if, for every

One possible solution to date time- implicit queries is to seek for related temporal references over complementary web resources, such as document collections (e.g.,

The task of data discovery and extraction is to discover new data sources those are unknown in advance for the query processing according to the data inference configuration.. The

– Query optimization and evaluation techniques that focus on cost models to estimate the execution time of a plan, and on searching the space of plans of any shape, i.e., bushy

The goal of these rules is to construct a query execution plan which can be evaluated more efficient than processing the original query, e.g., by changing the evaluation order of

The approach is based on ”implementing” interpolation and definability results in predi- cate logic, while extending them to take into account access restrictions, modifying them

Going more into details, a SDWS annotation prototype should provide information about general properties regarding the content that is provided by the SDWS (→ content properties)