• Aucun résultat trouvé

Probabilities and Provenance on Trees and Treelike Instances

N/A
N/A
Protected

Academic year: 2022

Partager "Probabilities and Provenance on Trees and Treelike Instances"

Copied!
57
0
0

Texte intégral

(1)

Probabilities and Provenance on Trees and Treelike Instances

Antoine Amarilli1, Pierre Bourhis2, Pierre Senellart1,3,4 September 7th, 2016

1Télécom ParisTech

2CNRS CRIStAL

3National University of Singapore

4École normale supérieure de Paris 1/7

(2)

How to travel to Highlights from Paris?

(3)

How to travel to Highlights from Paris?

2/7

(4)

How to travel to Highlights from Paris?

(5)

How to travel to Highlights from Paris?

2/7

(6)

How to travel to Highlights from Paris?

(7)

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

2/7

(8)

How to travel to Highlights from Paris?

(9)

How to travel to Highlights from Paris?

50%

(Metro|RER)*|(Bus|Tram)*

2/7

(10)

How to travel to Highlights from Paris?

50%

(11)

How to travel to Highlights from Paris?

50%

90%

(Metro|RER)*|(Bus|Tram)*

2/7

(12)

How to travel to Highlights from Paris?

50%

42%

37%

78% 90%

72%

(13)

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

50%

90%

42%

37%

90%

83%

78%

72%

What is the probability

that I can attend Highlights 2016?

2/7

(14)

Problem statement

Input:

? QueryQ (Metro|RER)*|(Bus|Tram)*

DatabaseDor graph

% Probabilitieson facts or edges 50%

Output: theprobabilitythat the query is true under the distribution (assuming independence of all probabilistic events)

Complexity: already#P-hardin the input database! (from #MONOTONE-SAT)

(15)

Problem statement

Input:

? QueryQ (Metro|RER)*|(Bus|Tram)*

DatabaseDor graph

% Probabilitieson facts or edges 50%

Output: theprobabilitythat the query is true under the distribution (assuming independence of all probabilistic events)

Complexity: already#P-hardin the input database! (from #MONOTONE-SAT)

3/7

(16)

Problem statement

Input:

? QueryQ (Metro|RER)*|(Bus|Tram)*

DatabaseDor graph

% Probabilitieson facts or edges 50%

Output: theprobabilitythat the query is true under the distribution (assuming independence of all probabilistic events)

Complexity: already#P-hardin the input database! (from #MONOTONE-SAT)

(17)

Problem statement

Input:

? QueryQ (Metro|RER)*|(Bus|Tram)*

DatabaseDor graph

% Probabilitieson facts or edges 50%

Output:theprobabilitythat the query is true under the distribution (assuming independence of all probabilistic events)

Complexity: already#P-hardin the input database! (from #MONOTONE-SAT)

3/7

(18)

Problem statement

Input:

? QueryQ (Metro|RER)*|(Bus|Tram)*

DatabaseDor graph

% Probabilitieson facts or edges 50%

Output:theprobabilitythat the query is true under the distribution (assuming independence of all probabilistic events)

Complexity: already#P-hardin the input database!

(from #MONOTONE-SAT)

(19)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(20)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(21)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(22)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(23)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(24)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(25)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(26)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(27)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(28)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(29)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(30)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(31)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(32)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(33)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(34)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(35)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(36)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

(37)

Using treewidth to make the problem tractable

Treewidth by example:

• Treeshave treewidth1

• Cycleshave treewidth2

k-cliquesand(k−1)-gridshave treewidthk−1

→ Treelike: thetreewidthis bounded by aconstant

4/7

(38)

Tractability on treelike instances

(RER|metro)*

|(bus|tram)*

MSO query Treelike data

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

(39)

Tractability on treelike instances

Tree automaton (RER|metro)*

|(bus|tram)*

MSO query Treelike data

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

5/7

(40)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query Treelike data

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

(41)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query

linear [Courcelle]

Treelike data

Query answer

TRUE

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

5/7

(42)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query Treelike data

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

(43)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query

Provenance circuit

linear Treelike data

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

5/7

(44)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query

Provenance circuit

linear Treelike data

Probability

95%

linear

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

(45)

Tractability on treelike instances

Tree automaton Tree encoding

(RER|metro)*

|(bus|tram)*

MSO query

Provenance circuit

linear Treelike data

Probability

95%

linear

Theorem

For anyfixedBoolean MSO queryqandk∈N,

given a databaseDoftreewidthkwithindependent probabilities, we can compute inlinear timethe probability thatDsatisfiesq

5/7

(46)

Lower bound

What can we do for unbounded-treewidth instances?

... not much. Theorem

For any graph signatureσ, there is afirst-orderqueryqsuch that for any constructibleunbounded-treewidthclassI,

probability evaluation ofqonI is#P-hardunder RP reductions

3 1

2

4 3 4

2 1

maps vertices to vertices maps edges to vertex-disjoint paths

Proof idea:extract instances of a hard problem astopological minors using recentpolynomial bounds[Chekuri and Chuzhoy, 2014]

(47)

Lower bound

What can we do for unbounded-treewidth instances?... not much.

Theorem

For any graph signatureσ, there is afirst-orderqueryqsuch that for any constructibleunbounded-treewidthclassI,

probability evaluation ofqonI is#P-hardunder RP reductions

Proof idea:extract instances of a hard problem astopological minors using recentpolynomial bounds[Chekuri and Chuzhoy, 2014]

6/7

(48)

Lower bound

Theorem

For any graph signatureσ, there is afirst-orderqueryqsuch that for any constructibleunbounded-treewidthclassI,

probability evaluation ofqonI is#P-hardunder RP reductions

Proof idea:extract instances of a hard problem astopological minors using recentpolynomial bounds[Chekuri and Chuzhoy, 2014]

(49)

Lower bound

Theorem

For any graph signatureσ, there is afirst-orderqueryqsuch that for any constructibleunbounded-treewidthclassI,

probability evaluation ofqonI is#P-hardunder RP reductions

Proof idea:extract instances of a hard problem astopological minors using recentpolynomial bounds[Chekuri and Chuzhoy, 2014]

6/7

(50)

Lower bound

Theorem

For any graph signatureσ, there is afirst-orderqueryqsuch that for any constructibleunbounded-treewidthclassI,

probability evaluation ofqonI is#P-hardunder RP reductions

3 1

2

4 3 4

2 1

maps vertices to vertices maps edges to vertex-disjoint paths

Proof idea:extract instances of a hard problem astopological minors

(51)

Future and ongoing work

• Improving thelower bound:

• Fromgraphstoarbitrary aritydatabases

• FromFOdown tounions of conjunctive querieswith6=

• Complexity inqueryanddatabase— currentlyΩ

22. ..2

|Q|

× |D|

Which queries canefficientlybe compiled to automata?

• Non-Booleanqueries: efficientenumerationof query results?

• Other tasks: probabilisticconditioning

“Knowing that I’m here, what’s the probability that RER B is up?” Thanks for your attention!

7/7

(52)

Future and ongoing work

• Improving thelower bound:

• Fromgraphstoarbitrary aritydatabases

• FromFOdown tounions of conjunctive querieswith6=

• Complexity inqueryanddatabase— currentlyΩ

22. ..2

|Q|

× |D|

Which queries canefficientlybe compiled to automata?

• Non-Booleanqueries: efficientenumerationof query results?

• Other tasks: probabilisticconditioning

“Knowing that I’m here, what’s the probability that RER B is up?” Thanks for your attention!

(53)

Future and ongoing work

• Improving thelower bound:

• Fromgraphstoarbitrary aritydatabases

• FromFOdown tounions of conjunctive querieswith6=

• Complexity inqueryanddatabase— currentlyΩ

22. ..2

|Q|

× |D|

Which queries canefficientlybe compiled to automata?

• Non-Booleanqueries: efficientenumerationof query results?

• Other tasks: probabilisticconditioning

“Knowing that I’m here, what’s the probability that RER B is up?” Thanks for your attention!

7/7

(54)

Future and ongoing work

• Improving thelower bound:

• Fromgraphstoarbitrary aritydatabases

• FromFOdown tounions of conjunctive querieswith6=

• Complexity inqueryanddatabase— currentlyΩ

22. ..2

|Q|

× |D|

Which queries canefficientlybe compiled to automata?

• Non-Booleanqueries: efficientenumerationof query results?

• Other tasks: probabilisticconditioning

“Knowing that I’m here, what’s the probability that RER B is up?”

Thanks for your attention!

(55)

Future and ongoing work

• Improving thelower bound:

• Fromgraphstoarbitrary aritydatabases

• FromFOdown tounions of conjunctive querieswith6=

• Complexity inqueryanddatabase— currentlyΩ

22. ..2

|Q|

× |D|

Which queries canefficientlybe compiled to automata?

• Non-Booleanqueries: efficientenumerationof query results?

• Other tasks: probabilisticconditioning

“Knowing that I’m here, what’s the probability that RER B is up?”

Thanks for your attention!

7/7

(56)

References I

Chekuri, C. and Chuzhoy, J. (2014).

Polynomial bounds for the grid-minor theorem.

InSTOC.

Courcelle, B. (1990).

The monadic second-order logic of graphs. I. Recognizable sets of finite graphs.

Inf. Comput., 85(1).

(57)

Image credits

• Slide 2:

https://commons.wikimedia.org/wiki/File:

Paris_Metro_map.svg(cropped), user Umx on Wikimedia Commons, public domain

http://www.parisvoyage.com/images/cartoon18.jpg, ParisVoyage, fair use

http://www.vianavigo.com/fileadmin/galerie/pdf/CGU_t_.pdf (cropped), RATP, fair use

• Slides 4 and 5: https://commons.wikimedia.org/wiki/File:

Carte_Transilien_RER_sch%C3%A9matique.svg(modified), user Benjamin Smith on Wikimedia Commons, license CC BY-SA 4.0 international.

Références

Documents relatifs

A regular annotation is a plain text that is collected based on a fixed structure [2], while ontology-based annotation is a set of instances of classes based on the domain

Formally, we showed that the evaluation of monadic second-order (MSO) queries is tractable (in data complexity) over several probabilistic database formalisms, assuming that

for any I of treewidth densely unbounded poly-logarithmically, q has no OBDDs of polynomial size on instances of I.. → Meta-dichotomy on the connected UCQ 6= for which

Introduction Upper bounds Semiring provenance Correlations Lower bounds Conclusion.. Probabilities and Provenance on Trees and

→ Linear time provenance circuit computation on trees and treelike instances:.. for

Provenance Circuits for Trees and Treelike Instances

This second dichotomy result thus shows that bounded-treewidth is necessary for some UCQ 6= queries to have tractable OBDDs; it applies to a more restricted class than the FO query

This second dichotomy result thus shows that bounded-treewidth is nec- essary for some UCQ 6= queries to have tractable OBDDs; it applies to a more restricted class than the FO query