In Search of Truth (on the Deep Web)

Academic year: 2022

Divesh Srivastava, AT&T Labs Inc.

divesh@research.att.com

Abstract. The Deep Web has enabled the availability of a huge amount of useful information and people have come to rely on it to fulfill their information needs in a variety of domains. We present a recent study on the accuracy of data and the quality of Deep Web sources in two domains where quality is important to people's lives: Stock and Flight. We observe that, even in these domains, the quality of the data is less than ideal, with sources providing conflicting, out-of-date and incomplete data. Sources also copy, reformat and modify data from other sources, making it difficult to discover the truth. We describe techniques proposed in the literature to solve these problems, evaluate their strengths on our data, and identify directions for future work in this area.
