• Aucun résultat trouvé

Leveraging Data Sources Exported as Forms and Web Services

N/A
N/A
Protected

Academic year: 2022

Partager "Leveraging Data Sources Exported as Forms and Web Services"

Copied!
62
0
0

Texte intégral

(1)

Leveraging Data Sources

Exported as Forms and Web Services

Pierre Senellart

29 September 2006

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 1 / 23

(2)

Introduction The Hidden Web

The Hidden Web

Definition (Hidden Web, Deep Web, Invisible Web)

The part of Web content (which may or may not be dynamically

generated) not accessible from the hyperlinked structure of the World Wide Web. Typically: HTML forms, Web Services.

Size estimate (2001) : 500 times larger than the surface Web.

How to understand it and benefit from its content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 2 / 23

(3)

Introduction The Hidden Web

The Hidden Web

Definition (Hidden Web, Deep Web, Invisible Web)

The part of Web content (which may or may not be dynamically

generated) not accessible from the hyperlinked structure of the World Wide Web. Typically: HTML forms, Web Services.

Size estimate (2001) : 500 times larger than the surface Web.

How to understand it and benefit from its content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 2 / 23

(4)

Introduction The Hidden Web

The Hidden Web

Definition (Hidden Web, Deep Web, Invisible Web)

The part of Web content (which may or may not be dynamically

generated) not accessible from the hyperlinked structure of the World Wide Web. Typically: HTML forms, Web Services.

Size estimate (2001) : 500 times larger than the surface Web.

How to understand it and benefit from its content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 2 / 23

(5)

Introduction The Hidden Web

Understanding the Hidden Web

Purpose

Intensional indexing of the Hidden Web High-level queries

⇒ a semantic search engine over the Hidden Web

In a fully, unsupervised, way!

Difficult and broad problem. Possible restriction to some domain (e.g.

publications).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 3 / 23

(6)

Introduction The Hidden Web

Understanding the Hidden Web

Purpose

Intensional indexing of the Hidden Web High-level queries

⇒ a semantic search engine over the Hidden Web

In a fully, unsupervised, way!

Difficult and broad problem. Possible restriction to some domain (e.g.

publications).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 3 / 23

(7)

Introduction The Hidden Web

Understanding the Hidden Web

Purpose

Intensional indexing of the Hidden Web High-level queries

⇒ a semantic search engine over the Hidden Web

In a fully, unsupervised, way!

Difficult and broad problem. Possible restriction to some domain (e.g.

publications).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 3 / 23

(8)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(9)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(10)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(11)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(12)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(13)

Introduction The Hidden Web

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Results Queries querying

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 4 / 23

(14)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(15)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(16)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(17)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(18)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(19)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(20)

Introduction Imprecise data

Imprecise data

Many tasks of the process generate imprecise data, with some confidence value:

Information Extraction Natural Language Processing Data Cleaning

Schema Matching . . .

Need for a way to manage this imprecision, to work with it throughout an entire complex process.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 5 / 23

(21)

Introduction A Probabilistic XML Warehouse

A Probabilistic XML Warehouse

Query interface (tree-pattern with

join) Update interface

(insertions, deletions)

Probabilistic XML Warehouse

Module 1 transaction Update } Module 2 Module 3 + confidence

+ confidence

Query Results

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 6 / 23

(22)

Process description

Outline

1 Introduction

2 Process description Web Service Discovery

Wrapping Web Service Descriptions Web Service Semantic Analysis Web Service Indexing and Querying

3 Conclusion

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 7 / 23

(23)

Process description Discovery

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Usually, only interested in a specific domain

⇒ focused crawling?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 8 / 23

(24)

Process description Discovery

Web Service Discovery

Crawling the World Wide Web for:

HTML forms implementing a Web Service UDDI registries

WSDL descriptions

Other resources (XML, HTML, Web as a full-text index. . . )

Usually, only interested in a specific domain

⇒ focused crawling?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 8 / 23

(25)

Process description Discovery

Web Service Filtering

Only interested in Web Services with no side effects:

OK Yellow Pages

Publication databases . . .

Not OK Booking services

Mailing list management . . .

Filtering using a number of heuristics:

No credit card number, password, email address (?) HTTP GET should be side-effect free

. . .

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 9 / 23

(26)

Process description Discovery

Web Service Filtering

Only interested in Web Services with no side effects:

OK Yellow Pages

Publication databases . . .

Not OK Booking services

Mailing list management . . .

Filtering using a number of heuristics:

No credit card number, password, email address (?) HTTP GET should be side-effect free

. . .

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 9 / 23

(27)

Process description Discovery

Web Service Filtering

Only interested in Web Services with no side effects:

OK Yellow Pages

Publication databases . . .

Not OK Booking services

Mailing list management . . .

Filtering using a number of heuristics:

No credit card number, password, email address (?) HTTP GET should be side-effect free

. . .

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 9 / 23

(28)

Process description Discovery

Web Service Filtering

Only interested in Web Services with no side effects:

OK Yellow Pages

Publication databases . . .

Not OK Booking services

Mailing list management . . .

Filtering using a number of heuristics:

No credit card number, password, email address (?) HTTP GET should be side-effect free

. . .

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 9 / 23

(29)

Process description Discovery

Web Service Filtering

Only interested in Web Services with no side effects:

OK Yellow Pages

Publication databases . . .

Not OK Booking services

Mailing list management . . .

Filtering using a number of heuristics:

No credit card number, password, email address (?) HTTP GET should be side-effect free

. . .

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 9 / 23

(30)

Process description Wrappers

Analyzing HTML forms

Analyzing the structure of HTML forms.

Issues

What are the relevant form fields?

What is the concrete type of each field?

What is the label of each field?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 10 / 23

(31)

Process description Wrappers

Analyzing HTML forms

Analyzing the structure of HTML forms.

Issues

What are the relevant form fields?

What is the concrete type of each field?

What is the label of each field?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 10 / 23

(32)

Process description Wrappers

Analyzing HTML forms

Analyzing the structure of HTML forms.

Issues

What are the relevant form fields?

What is the concrete type of each field?

What is the label of each field?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 10 / 23

(33)

Process description Wrappers

Probing HTML forms

Probing HTML forms to retrieve sample HTML answer pages:

With dictionary words

With nonsense words

With domain words

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 11 / 23

(34)

Process description Wrappers

Probing HTML forms

Probing HTML forms to retrieve sample HTML answer pages:

With dictionary words

With nonsense words

With domain words

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 11 / 23

(35)

Process description Wrappers

Probing HTML forms

Probing HTML forms to retrieve sample HTML answer pages:

With dictionary words

With nonsense words

With domain words

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 11 / 23

(36)

Process description Wrappers

Query-answer Web pages

Extract data from query-answer Web pages.

Issues

What part of the Web page contains the answer?

How to extract structured content?

How to label this structured content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 12 / 23

(37)

Process description Wrappers

Query-answer Web pages

Extract data from query-answer Web pages.

Issues

What part of the Web page contains the answer?

How to extract structured content?

How to label this structured content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 12 / 23

(38)

Process description Wrappers

Query-answer Web pages

Extract data from query-answer Web pages.

Issues

What part of the Web page contains the answer?

How to extract structured content?

How to label this structured content?

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 12 / 23

(39)

Process description Wrappers

Automatic wrapper induction using domain knowledge

Annotate pages with knowledge domain (finite automata techniques)

(Both imperfect and incomplete)

Use machine learning to generalize the result into a structural extraction wrapper (work in progress with Mostrare ).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 13 / 23

(40)

Process description Wrappers

Automatic wrapper induction using domain knowledge

Annotate pages with knowledge domain (finite automata techniques)

(Both imperfect and incomplete)

Use machine learning to generalize the result into a structural extraction wrapper (work in progress with Mostrare ).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 13 / 23

(41)

Process description Semantic Analysis

Conceptual Model

IsA ontology of concepts (simple DAG)

Person

Man Woman

Thing

Proceedings Article Book Publication

n-ary typed roles

AuthorOf(Person,Publication) HasName(Person,Name)

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 14 / 23

(42)

Process description Semantic Analysis

Conceptual Model

IsA ontology of concepts (simple DAG)

Person

Man Woman

Thing

Proceedings Article Book Publication

n-ary typed roles

AuthorOf(Person,Publication) HasName(Person,Name)

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 14 / 23

(43)

Process description Semantic Analysis

Semantic representation of a service

What is a service described by?

A n-uple of typed input parameters A complex (= nested) type of its output Semantic relations between inputs and outputs

Definition (Complex types) S : set of concepts

T ←− S | <T , . . . , T > | T ∗

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 15 / 23

(44)

Process description Semantic Analysis

Semantic representation of a service

What is a service described by?

A n-uple of typed input parameters A complex (= nested) type of its output Semantic relations between inputs and outputs

Definition (Complex types) S : set of concepts

T ←− S | <T , . . . , T > | T ∗

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 15 / 23

(45)

Process description Semantic Analysis

Semantic representation of a service

What is a service described by?

A n-uple of typed input parameters A complex (= nested) type of its output Semantic relations between inputs and outputs Definition (Complex types)

S : set of concepts

T ←− S | <T , . . . , T > | T ∗

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 15 / 23

(46)

Process description Semantic Analysis

Semantic representation of a service

What is a service described by?

A n-uple of typed input parameters A complex (= nested) type of its output Semantic relations between inputs and outputs Definition (Complex types)

S : set of concepts

T ←− S | <T , . . . , T > | T ∗

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 15 / 23

(47)

Process description Semantic Analysis

Services and queries

Example

Service giving authors from publication titles

A* ← WrittenBy(P,A),HasTitle(P,T),Input(T)

Example Query:

<A,T*>* ← WrittenBy(P,A), Article(P), HasTitle(P,T), KeywordOf(“xml”,P)

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 16 / 23

(48)

Process description Semantic Analysis

Services and queries

Example

Service giving authors from publication titles

A* ← WrittenBy(P,A),HasTitle(P,T),Input(T)

Example Query:

<A,T*>* ← WrittenBy(P,A), Article(P), HasTitle(P,T), KeywordOf(“xml”,P)

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 16 / 23

(49)

Process description Semantic Analysis

Managing extensional information

How to represent extensional information (i.e. documents) in this formalism?

Definition

A document is a service with no input.

Complex types: natural representation of a DTD.

(Disjunctions a|b simulated by (a?,b?)).

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 17 / 23

(50)

Process description Semantic Analysis

Semantic Interpretation of a Service

How to analyze a Web Service?

Field labels, variable names, tag names Concrete type descriptions

(e.g. \d{4}-\d{2}-\d{2} is a date)

Linguistic analysis of plain text descriptions and pages linking to the service

Note: Extracting concepts is easier than extracting relations between concepts.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 18 / 23

(51)

Process description Semantic Analysis

Semantic Interpretation of a Service

How to analyze a Web Service?

Field labels, variable names, tag names Concrete type descriptions

(e.g. \d{4}-\d{2}-\d{2} is a date)

Linguistic analysis of plain text descriptions and pages linking to the service

Note: Extracting concepts is easier than extracting relations between concepts.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 18 / 23

(52)

Process description Semantic Analysis

Semantic Interpretation of a Service

How to analyze a Web Service?

Field labels, variable names, tag names Concrete type descriptions

(e.g. \d{4}-\d{2}-\d{2} is a date)

Linguistic analysis of plain text descriptions and pages linking to the service

Note: Extracting concepts is easier than extracting relations between concepts.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 18 / 23

(53)

Process description Semantic Analysis

Semantic Interpretation of a Service

How to analyze a Web Service?

Field labels, variable names, tag names Concrete type descriptions

(e.g. \d{4}-\d{2}-\d{2} is a date)

Linguistic analysis of plain text descriptions and pages linking to the service

Note: Extracting concepts is easier than extracting relations between concepts.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 18 / 23

(54)

Process description Indexing and Querying

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues

Subsumption of input/output parameters Missing input parameters

Composition of Web Services

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 19 / 23

(55)

Process description Indexing and Querying

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues

Subsumption of input/output parameters Missing input parameters

Composition of Web Services

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 19 / 23

(56)

Process description Indexing and Querying

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues

Subsumption of input/output parameters Missing input parameters

Composition of Web Services

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 19 / 23

(57)

Process description Indexing and Querying

Web Service Indexing and Querying

Given a query, represented as an Analyzed Web Service, how to know which known web services to query?

Issues

Subsumption of input/output parameters Missing input parameters

Composition of Web Services

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 19 / 23

(58)

Process description Indexing and Querying

Differences with classical database querying

Three main differences:

Information can be queried only through views (Local As View)

Nested types

Incomplete information

Three sources of complexity!

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 20 / 23

(59)

Process description Indexing and Querying

Differences with classical database querying

Three main differences:

Information can be queried only through views (Local As View)

Nested types

Incomplete information

Three sources of complexity!

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 20 / 23

(60)

Conclusion

Outline

1 Introduction

2 Process description

3 Conclusion

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 21 / 23

(61)

Conclusion

Web Service Semantic Interpretation Process

World Wide Web

HTML form WSDL discovery UDDI

World Wide Web

WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Results Queries querying

Web Services Index indexing

Web Services Index indexing

analysis

Analyzed Web Services WSDL wrappers HTML form

WSDL discovery UDDI

World Wide Web

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 22 / 23

(62)

Conclusion

Perspectives

Still a lot to do... In particular:

Answering queries using views on the semantic model.

Continue work on automatic wrapper induction.

Form probing.

System implementation of the whole process on a specific domain.

Pierre Senellart (INRIA & U. Paris-Sud) Leveraging Forms & Web Services 29 September 2006 23 / 23

Références

Documents relatifs

For these, we split the RATING data by assigning last (according to time stamp) 5 records from each user as a test data, the remaining data was used as train data. Based on

In this paper, we describe a solution for representing, acquiring and querying assessment data that uses (1) domain ontologies and standard terminologies to give formal descriptions

Toxicology is increasingly a systems discipline, requiring the analysis of multi-scale Omics data [2, 17, 25]. This typically require tools and databases that can

– the ability to manage large amounts of structured and unstructured content, which may be imported and integrated from existing sources, but also may be generated by end users, who

Databases like Wikipedia (especially the categorization schema used within) or RDF or ontology based information systems like DBpedia or freebase 3 (both initialized by

This year, we initiated the first Linked Data for Information Extraction Challenge, showing that it is possible to create large-size training and evaluation data sets, which allows

When analysing large RDF datasets, users are left with two main options: using SPARQL or using an existing non-RDF-specific big data language, both with its own limitations.. The

The current paper contributes to this honourable goal by exposing a method for the development of complex web services which embodies a number of quality