• Aucun résultat trouvé

Query Evaluation on Probabilistic Data A Story of Dichotomies

N/A
N/A
Protected

Academic year: 2022

Partager "Query Evaluation on Probabilistic Data A Story of Dichotomies"

Copied!
256
0
0

Texte intégral

(1)

Query Evaluation on Probabilistic Data A Story of Dichotomies

Antoine Amarilli May 19, 2020

(2)

Table of contents

Introduction and problem statement

Existing results

More general queries: Dichotomy on homomorphism-closed queries More restricted instances: Words, trees and bounded treewidth More restricted instances: Unweighted instances

Conclusion and open problems

2/42

(3)

Uncertain data management

Relational databases managedata, represented here as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(4)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

3/42

(5)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(6)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

3/42

(7)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(8)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

3/42

(9)

Uncertain data management

Relational databases managedata, represented here as alabeled graph WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud

Benny Technion

İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay

A.

B.

Télécom Paris

Paris Sud

Technion

ParisTech

IP Paris

Paris-Saclay

(10)

Uncertain data model

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

80% 10%

40% 80%

100%

90% 90% 50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

4/42

(11)

Uncertain data model

A.

B.

Télécom Paris

Paris Sud

Technion

ParisTech

IP Paris

Paris-Saclay 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(12)

Uncertain data model

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

4/42

(13)

Uncertain data model

A.

B.

Télécom Paris

Paris Sud

Technion

ParisTech

IP Paris

Paris-Saclay 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(14)

Uncertain data model

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

80% 10%

40% 80%

100%

90% 90% 50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

4/42

(15)

Uncertain data model

A.

B.

Télécom Paris

Paris Sud

Technion

ParisTech

IP Paris

Paris-Saclay

80% 10%

40% 80%

100%

90% 90% 50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?

0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(16)

Uncertain data model

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?

0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

4/42

(17)

Uncertain data model

A.

B.

Télécom Paris

Paris Sud

Technion

ParisTech

IP Paris

Paris-Saclay 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(18)

Uncertain data model

A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

What is theprobabilityof this possible world?0.03%

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

4/42

(19)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

(20)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

5/42

(21)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

(22)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

5/42

(23)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

(24)

Queries

A central task in databases is toevaluate queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern?

• e.g.,∃x y z x y z

We want ahomomorphismfrom the pattern to the graph (not necessarilyinjective)

Formally: anexistentially quantified conjunction of atoms (edges)

Union of conjunctive queries(UCQ): can I find a match ofsome pattern?

e.g., ∃x y z x y z

∨ ∃x y z w x y z w

• Formally: afinite disjunctionof CQs

5/42

(25)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40% 80%

100%

90% 90% 50% 90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(26)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(27)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(28)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(29)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(30)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):

1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(31)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(32)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×

(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(33)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(34)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(

1−

(1−10%)×(1−40%)

)× (

1−(1−50%)×(1−90%))

)

×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(35)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

(1−10%)×(1−40%) 1−(1−50%)×(1−90%)) ×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(36)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(1−(1−10%)×(1−40%))× (

1−(1−50%)×(1−90%))

)

×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

6/42

(37)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

×(1−80%×(1−(1−90%)×(1−90%))), i.e.,97.65792%

(38)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

Here we can do better (in PTIME):1−(1−80%)×(1−(1−(1−10%)×(1−40%))× (1−(1−50%)×(1−90%)))×(1−80%×(1−(1−90%)×(1−90%))),

i.e.,97.65792%

6/42

(39)

Problem statement: Probabilistic query evaluation (PQE)

Wefixa queryQ, for instance the CQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis thetotal probabilityof the worlds which satisfy the query:

• Formally:P

W⊆D,W|=QPr(W)

Intuition:theprobabilitythat the query is true

We can always compute the probability in exponential time (go over all possibilities)

(40)

Research goal: Understanding the complexity of PQE What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

7/42

(41)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata

In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

(42)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

7/42

(43)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

(44)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

7/42

(45)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

(46)

Research goal: Understanding the complexity of PQE

What is the complexity ofPQE(Q)depending on the queryQ?

→ Note that we studydata complexity, i.e.,Qisfixedand the input is thedata In this talk: several dichotomies on the PQE problem:

Existing results:

PQE(Q)is in #P for any UCQQand is#P-hardfor some CQs

Dichotomyby Dalvi and Suciu:PQE(Q)for a UCQQis either#P-hardorPTIME

More general queries: dichotomy onhomomorphism-closed queries

PQE(Q)is#P-hardfor all homomorphism-closed queries not equivalent to a safe UCQ

Restricted instances:PQE(Q)for MSO queries...

• Isin PTIMEif the input data is restricted to havebounded treewidth

• Isintractable otherwiseunder some assumptions

Restricted instances:if all probabilities are50%then the complexity is the same

7/42

(47)

Table of contents

Introduction and problem statement Existing results

More general queries: Dichotomy on homomorphism-closed queries More restricted instances: Words, trees and bounded treewidth More restricted instances: Unweighted instances

Conclusion and open problems

(48)

Basic complexity results

Whenever we canevaluateQin PTIME, thenPQE(Q)is in#P

#P:counting class of problems expressible as thenumber of accepting paths of a nondeterministic polynomial-time Turing Machine

Nondeterministically guess a possible world, then test the query

In particular,PQE(Q)is in#Pfor anyUCQQ

For some queriesQ, the taskPQE(Q)is inPTIME

e.g., single-atom CQs

e.g., x y z

9/42

(49)

Basic complexity results

Whenever we canevaluateQin PTIME, thenPQE(Q)is in#P

#P:counting class of problems expressible as thenumber of accepting paths of a nondeterministic polynomial-time Turing Machine

Nondeterministically guess a possible world, then test the query

In particular,PQE(Q)is in#Pfor anyUCQQ

For some queriesQ, the taskPQE(Q)is inPTIME

e.g., single-atom CQs

e.g., x y z

(50)

Basic complexity results

Whenever we canevaluateQin PTIME, thenPQE(Q)is in#P

#P:counting class of problems expressible as thenumber of accepting paths of a nondeterministic polynomial-time Turing Machine

Nondeterministically guess a possible world, then test the query

In particular,PQE(Q)is in#Pfor anyUCQQ

For some queriesQ, the taskPQE(Q)is inPTIME

e.g., single-atom CQs

e.g., x y z

9/42

(51)

Basic complexity results

Whenever we canevaluateQin PTIME, thenPQE(Q)is in#P

#P:counting class of problems expressible as thenumber of accepting paths of a nondeterministic polynomial-time Turing Machine

Nondeterministically guess a possible world, then test the query

In particular,PQE(Q)is in#Pfor anyUCQQ

For some queriesQ, the taskPQE(Q)is inPTIME

e.g., single-atom CQs

e.g., x y z

(52)

PQE is sometimes #P-hard

Let us show thatPQE(Q)is#P-hardfor the CQQ:

x y z w

Reduce from the problem ofcounting satisfying valuationsof a Boolean formula

• e.g., given(xy)z, compute that it has3satisfying valuations

This problem is already#P-hardfor so-calledPP2DNF formulas:

Positive(no negation) andPartitioned variables:X1, . . . ,XnandY1, . . . ,Ym

2-DNF: disjunction of clauses likeXiYj

Example:φ: (X1Y1)(X1Y2)(X2Y2)(X3Y1)(X3Y2)

a01 a02 a03

a1 a2 a3 1/2 1/2 1/2

b1

b2

b01

b02 1/2

1/2 1

1 1

1 1

Idea: Satisfying valuationsofφcorrespond topossible worldswith amatchofQ

10/42

(53)

PQE is sometimes #P-hard

Let us show thatPQE(Q)is#P-hardfor the CQQ: x y z w

Reduce from the problem ofcounting satisfying valuationsof a Boolean formula

• e.g., given(xy)z, compute that it has3satisfying valuations

This problem is already#P-hardfor so-calledPP2DNF formulas:

Positive(no negation) andPartitioned variables:X1, . . . ,XnandY1, . . . ,Ym

2-DNF: disjunction of clauses likeXiYj

Example:φ: (X1Y1)(X1Y2)(X2Y2)(X3Y1)(X3Y2)

a01 a02 a03

a1 a2 a3 1/2 1/2 1/2

b1

b2

b01

b02 1/2

1/2 1

1 1

1 1

Idea: Satisfying valuationsofφcorrespond topossible worldswith amatchofQ

(54)

PQE is sometimes #P-hard

Let us show thatPQE(Q)is#P-hardfor the CQQ: x y z w

Reduce from the problem ofcounting satisfying valuationsof a Boolean formula

• e.g., given(xy)z, compute that it has3satisfying valuations

This problem is already#P-hardfor so-calledPP2DNF formulas:

Positive(no negation) andPartitioned variables:X1, . . . ,XnandY1, . . . ,Ym

2-DNF: disjunction of clauses likeXiYj

Example:φ: (X1Y1)(X1Y2)(X2Y2)(X3Y1)(X3Y2)

a01 a02 a03

a1 a2 a3 1/2 1/2 1/2

b1

b2

b01

b02 1/2

1/2 1

1 1

1 1

Idea: Satisfying valuationsofφcorrespond topossible worldswith amatchofQ

10/42

(55)

PQE is sometimes #P-hard

Let us show thatPQE(Q)is#P-hardfor the CQQ: x y z w

Reduce from the problem ofcounting satisfying valuationsof a Boolean formula

• e.g., given(xy)z, compute that it has3satisfying valuations

This problem is already#P-hardfor so-calledPP2DNF formulas:

Positive(no negation) andPartitioned variables:X1, . . . ,XnandY1, . . . ,Ym

2-DNF: disjunction of clauses likeXiYj

Example:φ: (X1Y1)(X1Y2)(X2Y2)(X3Y1)(X3Y2)

a01 a02 a03

a1 a2 a3 1/2 1/2 1/2

b1

b2

b01

b02 1/2

1/2 1

1 1

1 1

Idea: Satisfying valuationsofφcorrespond topossible worldswith amatchofQ

(56)

PQE is sometimes #P-hard

Let us show thatPQE(Q)is#P-hardfor the CQQ: x y z w

Reduce from the problem ofcounting satisfying valuationsof a Boolean formula

• e.g., given(xy)z, compute that it has3satisfying valuations

This problem is already#P-hardfor so-calledPP2DNF formulas:

Positive(no negation) andPartitioned variables:X1, . . . ,XnandY1, . . . ,Ym

2-DNF: disjunction of clauses likeXiYj

Example:φ: (X1Y1)(X1Y2)(X2Y2)(X3Y1)(X3Y2)

a01 a02 a03

a1 a2 a3 1/2 1/2 1/2

b1

b2

b01

b02 1/2

1/2 1

1 1

1 1

Idea: Satisfying valuationsofφcorrespond topossible worldswith amatchofQ

10/42

Références

Documents relatifs

Même mobilité par voie antérieure ou postérieure pour les implants en silicone et pour les prothèses en.

Vue de la face antérieure d’un poignet après section du retinaculum flexorum montrant le nerf médian avec sa branche motrice (à gauche sur l’image, sur un fil tracteur) et

 La chirurgie a une place limitée, mais sous-estimée, dans la prise en charge des mains sclérodermiques.  A condition d’être réalisée dans un cadre multidisciplinaire,

ALLONGEMENT DE LA LATENCE MOTRICE DISTALE RALENTISSEMENT DE LA CONDUCTION SENSITIVE DÉNERVATION DU COURT ABDUCTEUR DU POUCE. CONDUCTION NORMALE DE LA BRANCHE CUTANÉE PALMAIRE

Plus logique car c’est la face profonde, visible, du tendon qui est atteinte. Anatomiquement possible

With respect to the radiographic measurements (Table 2), after ligamentous sectioning those wrists with a deeper RS fossa and a greater volar tilt have a scaphoid that is less likely

FIGURE 54-1 The spectrum of radiographic presentation from grades I to V in patients with rheumatoid arthritis after Connor and Morrey [9] and

 Surtout ne pas rater les quelques lésions graves qui sont..