• Aucun résultat trouvé

(1)ProvSQL: Provenance and Probability Management in PostgreSQL Pierre Senellart, Louis Jachiet, Silviu Maniu, Yann Ramusat RESEARCH

N/A
N/A
Protected

Academic year: 2022

Partager "(1)ProvSQL: Provenance and Probability Management in PostgreSQL Pierre Senellart, Louis Jachiet, Silviu Maniu, Yann Ramusat RESEARCH"

Copied!
1
0
0

Texte intégral

(1)ProvSQL: Provenance and Probability Management in PostgreSQL Pierre Senellart, Louis Jachiet, Silviu Maniu, Yann Ramusat RESEARCH. UNIVERSITY PARIS. ÉCOLE NORMALE S U P É R I E U R E. ProvSQL in Brief. Supported Queries and Provenance Annotations. Light-weight extension for PostgreSQL ě 9.5. Large subset of (nested) SQL, without aggregates or UDFs:. Transparent computation of provenance of SQL queries as a term algebra circuit. – SELECT … FROM … WHERE, JOIN, DISTINCT, GROUP BY – UNION, UNION ALL, EXCEPT. Allows probabilistic query evaluation with various techniques:. Variety of provenance formalisms:. – Enumeration of possible worlds – Knowledge compilation to d-DNNF via external tools (c2d, Dsharp, d4) – Monte-Carlo sampling. – arbitrary provenance semirings – arbitrary provenance semirings with monus (m-semirings) – where-provenance. Open-source and available at https://github.com/PierreSenellart/provsql. How ProvSQL Computes Various Provenance Forms and Probabilities Personnel id name. position. city. classification.       . Director Janitor Analyst Field agent Double agent HR Analyst. New York New York Paris Berlin Paris Paris Berlin. unclassified restricted confidential secret top_secret restricted secret. John Paul Dave Ellen Magdalen Nancy Susan. database with provenance annotations. t1 t2 t3 t4 t5 t6 t7. g2 circuit in the universal m-semiring. a. g1. g3. a. ‘. a. city. query evaluation. New York g1 Paris g2 Berlin g3. SELECT DISTINCT city FROM Personnel EXCEPT SELECT DISTINCT P1.city FROM Personnel P1, Personnel P2 WHERE P1.city = P2.city AND P1.id < P2.id. ‘. b. ‘. t1. t2. b. b. b. ‘. b. t3. t5. t6. t4. t7. counting m-semiring (bag semantics). Boolean function m-semiring (Boolean provenance). g2 ´. g2 g1. ^. g1 ␣. ␣. _. t1. _. ^. t2. ␣. ^. t3. ^. t5. _. t6. t4. +. +. ˆ. ˆ. ˆ. ˆ. +. ˆ. 3. 5. 6. 4. 7. ^. 1. t7. 2. ( = 1). ( = 0). g2. d-DNNF. +. _. (deterministic decomposable negation normal form). ˆ. ^ ␣. _. ^. probability computation (independent tuples). t6. ^. +. 1´¨. ˆ. ˆ. ˆ. Pr(t6). 1´¨ ␣. t3. ( = 0). a ´ b = max(a ´ b, 0). knowledge compilation. ^. ´. ^. _. ^. +. ´. g3. ^. g3. 1´¨. ␣. t5. Pr(t3). Pr(t5). References [] Pierre Senellart. Provenance and probabilities in relational databases: From theory to practice. SIGMOD Record, ():–, . [] Pierre Senellart, Louis Jachiet, Silviu Maniu, and Yann Ramusat. ProvSQL: Provenance and probability management in PostgreSQL. Proceedings of the VLDB Endowment (PVLDB), ():–, . Demonstration..

(2)

Références

Documents relatifs

Choose any social network: real-life acquaintance network [Amaral et al., 2000], Twitter follower graph [Kwak et al., 2010], scientific collaboration network [Amaral et al.,

Answering queries using views, in relational and XML settings Answering queries in the presence of limited access patterns, e.g., in the deep Web..

Create a class InvertedIndex that will be used to store an in-memory version of an inverted index (that is, for each token occurring in the collection, this token and the set of

For each of the following queries (without the quotation marks), note the number of answers given by Google: “Bonnie and Clyde”, “bonnie clyde”, “bonny and Clyde”, “Bonnie

. Implement the PageRank iterative algorithm on such a graph. Do not forget to normalize the adjacency lists so that the sum of all outgoing edges of a given node is one. 15) and

Once you follow the links for creating a pipe, you will be presented with the interface of the graphical editor: on the left, a list of all boxes that can be used inside a pipe; in

The serialized form is a textual, linear representation of the tree; it complies to a (sometimes complicated) syntax;.. There exist an object-oriented model for the tree form:

The expression “textual value of an Element N” denotes the concatenation of all the Text node values which are descendant of N, taken in the document order.. Gemo, Lamsade, LIG,