Provenance for Database Transformations

Val Tannen, University of Pennsylvania

Database transformations (queries, views, mappings) take apart, filter, and recombine source data in order to populate warehouses, materialize views, and provide inputs to analysis tools. As they do so, applications often need to track the relationship between parts and pieces of the sources and parts and pieces of the transformations' output. This relationship is what we call database provenance.

This tutorial presents an approach to database provenance that relies on two observations. First, provenance is a kind of annotation, and we can develop a general approach to annotation propagation that also covers other applications, for example to uncertainty and access control. In fact, provenance turns out to be the most general kind of such annotation, in a precise and practically useful sense. Second, the propagation of annotation through a broad class of transformations relies on just two operations: one when annotations are jointly used and one when they are used alternatively. This leads to annotations forming a specific algebraic structure, a commutative semiring.
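The two operations can be illustrated with a minimal Python sketch. It is not taken from the tutorial: the names `Semiring`, `join`, `project`, and the sample relations `R` and `S` are hypothetical, and the annotations used here are plain multiplicities (the counting semiring) chosen only to keep the arithmetic easy to follow. A join uses annotations jointly (`times`); a projection that merges several tuples into one uses them alternatively (`plus`).

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Semiring:
    """A commutative semiring (K, plus, times, zero, one). Hypothetical helper."""
    zero: Any
    one: Any
    plus: Callable[[Any, Any], Any]   # alternative use of annotations
    times: Callable[[Any, Any], Any]  # joint use of annotations


def join(sr, r, s):
    """Join R(a, b) with S(b, c) on b; annotations are multiplied."""
    out = {}
    for (a, b), k1 in r.items():
        for (b2, c), k2 in s.items():
            if b == b2:
                t = (a, b, c)
                out[t] = sr.plus(out.get(t, sr.zero), sr.times(k1, k2))
    return out


def project(sr, r, cols):
    """Keep the columns in `cols`; tuples that collapse together are added."""
    out = {}
    for t, k in r.items():
        key = tuple(t[i] for i in cols)
        out[key] = sr.plus(out.get(key, sr.zero), k)
    return out


# Counting semiring: annotations are multiplicities (bag semantics).
NAT = Semiring(zero=0, one=1, plus=lambda x, y: x + y, times=lambda x, y: x * y)

# Illustrative data: each relation maps a tuple to its annotation.
R = {("a", "x"): 2, ("a", "y"): 1}
S = {("x", "c"): 3, ("y", "c"): 1}

joined = join(NAT, R, S)               # ("a","x","c") -> 2*3, ("a","y","c") -> 1*1
result = project(NAT, joined, (0, 2))  # both derivations of ("a","c") are summed
print(result)
```

Swapping `NAT` for another `Semiring` instance changes what the annotations mean without touching `join` or `project`, which is the point of treating propagation algebraically.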

The semiring approach works for annotating tuples, field values and attributes in standard relations, in nested relations (complex values), and for annotating nodes in (unordered) XML. It works for transformations expressed in the positive fragment of relational algebra, nested relational calculus, unordered XQuery, as well as for Datalog, GLAV schema mappings, and tgd constraints.

Specific semirings correspond to earlier approaches to provenance, while others correspond to forms of uncertainty, trust, cost, and access control.
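As a hedged illustration of how one provenance expression can be read under several semirings, the sketch below evaluates a small polynomial, p·r + q·s, with four different choices of operations. Everything in it is an assumption made for the example: the function `evaluate`, the list-of-monomials encoding, the variable names, the sample values, and the integer encoding of clearance levels (0 = public, larger = more restricted, infinity = never accessible).

```python
from functools import reduce
import math

# Provenance of one output tuple: p*r + q*s, written as a list of monomials,
# each monomial being the list of source-tuple variables used jointly.
provenance = [["p", "r"], ["q", "s"]]


def evaluate(poly, zero, one, plus, times, valuation):
    """Evaluate a polynomial in the semiring (plus, times, zero, one)."""
    total = zero
    for monomial in poly:
        term = reduce(times, (valuation[v] for v in monomial), one)
        total = plus(total, term)
    return total


# Boolean semiring: is the output tuple present, given which sources are present?
exists = evaluate(provenance, False, True,
                  lambda x, y: x or y, lambda x, y: x and y,
                  {"p": True, "r": True, "q": False, "s": True})

# Counting semiring: how many times is the output tuple derived?
count = evaluate(provenance, 0, 1,
                 lambda x, y: x + y, lambda x, y: x * y,
                 {"p": 2, "r": 3, "q": 1, "s": 1})

# Tropical semiring (min, +): cost of the cheapest derivation.
cost = evaluate(provenance, math.inf, 0,
                min, lambda x, y: x + y,
                {"p": 2, "r": 3, "q": 1, "s": 1})

# Access-control semiring (min, max) over assumed clearance levels:
# joint use needs the most restrictive level, alternatives the least restrictive.
level = evaluate(provenance, math.inf, 0,
                 min, max,
                 {"p": 0, "r": 2, "q": 1, "s": 1})

print(exists, count, cost, level)
```

The expression is computed once and interpreted afterwards, which is one way to read the claim that provenance is the most general kind of annotation.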

This is joint work with J.N. Foster, T.J. Green, Z. Ives, and G. Karvounarakis, done in part within the frameworks of the Orchestra and pPOD projects.
