• Aucun résultat trouvé

A Dichotomy for Homomorphism-Closed Queries on Probabilistic Graphs

N/A
N/A
Protected

Academic year: 2022

Partager "A Dichotomy for Homomorphism-Closed Queries on Probabilistic Graphs"

Copied!
34
0
0

Texte intégral

(1)

A Dichotomy for Homomorphism-Closed Queries on Probabilistic Graphs

Antoine Amarilli1and İsmail İlkan Ceylan2 September 16, 2020

1Télécom Paris

(2)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

2/7

(3)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(4)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

2/7

(5)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(6)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

2/7

(7)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data

(8)

Uncertain data management

In this talk, we managedatarepresented as alabeled graph

WorksAt

Antoine Télécom Paris Antoine Paris Sud

Benny Paris Sud Benny Technion İsmail U. Oxford

MemberOf

Télécom Paris ParisTech Télécom Paris IP Paris

Paris Sud IP Paris Paris Sud Paris-Saclay Technion CESAER

Antoine

Benny

İsmail

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

→ Problem:we are notcertainabout the true state of the data 2/7

(9)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

80% 10%

40% 80%

100%

90% 90% 50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(10)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

3/7

(11)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(12)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

3/7

(13)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER

80% 10%

40% 80%

100%

90% 90% 50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(14)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW:

Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

3/7

(15)

Uncertain data model A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Uncertain data model:TID, for tuple-independent database

Each fact (edge) carries aprobability

Each fact exists with its givenprobability

All facts areindependent

Possible worldW:subset of facts

ProbabilityofW: Pr(W) = Y

F∈W

Pr(F)

!

× Y

F∈W/

1−Pr(F)

!

(16)

Homomorphism-closed queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern? e.g., x y z

Union of conjunctive queries(UCQ): doesone of the CQsmatch?

→ Homomorphism-closed queryQ: ifGsatisfiesQandGhas a homomorphism toG0 thenG0 also satisfiesQ

They generalizeCQsandUCQs, but alsoregular path queries(RPQs),Datalog, etc.

4/7

(17)

Homomorphism-closed queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern? e.g., x y z

Union of conjunctive queries(UCQ): doesone of the CQsmatch?

→ Homomorphism-closed queryQ: ifGsatisfiesQandGhas a homomorphism toG0 thenG0 also satisfiesQ

They generalizeCQsandUCQs, but alsoregular path queries(RPQs),Datalog, etc.

(18)

Homomorphism-closed queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern? e.g., x y z

Union of conjunctive queries(UCQ): doesone of the CQsmatch?

→ Homomorphism-closed queryQ: ifGsatisfiesQandGhas a homomorphism toG0 thenG0 also satisfiesQ

They generalizeCQsandUCQs, but alsoregular path queries(RPQs),Datalog, etc.

4/7

(19)

Homomorphism-closed queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern? e.g., x y z

Union of conjunctive queries(UCQ): doesone of the CQsmatch?

→ Homomorphism-closed queryQ: ifGsatisfiesQandGhas a homomorphism toG0 thenG0 also satisfiesQ

They generalizeCQsandUCQs, but alsoregular path queries(RPQs),Datalog, etc.

(20)

Homomorphism-closed queries

Query:maps a graph (without probabilities) to YES/NO

Conjunctive query(CQ): can I find a match of apattern? e.g., x y z

Union of conjunctive queries(UCQ): doesone of the CQsmatch?

→ Homomorphism-closed queryQ: ifGsatisfiesQandGhas a homomorphism toG0 thenG0 also satisfiesQ

They generalizeCQsandUCQs, but alsoregular path queries(RPQs),Datalog, etc.

4/7

(21)

Problem statement: Probabilistic query evaluation (PQE) Here is the problemPQE(Q):

Wefixa queryQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40% 80%

100%

90% 90% 50%

90%

100%

Theoutputis theprobabilitythat the query is true

→ Question: What is the complexity ofPQE(Q)depending on the queryQ?

(22)

Problem statement: Probabilistic query evaluation (PQE) Here is the problemPQE(Q):

Wefixa queryQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis theprobabilitythat the query is true

→ Question: What is the complexity ofPQE(Q)depending on the queryQ?

5/7

(23)

Problem statement: Probabilistic query evaluation (PQE) Here is the problemPQE(Q):

Wefixa queryQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis theprobabilitythat the query is true

→ Question: What is the complexity ofPQE(Q)depending on the queryQ?

(24)

Problem statement: Probabilistic query evaluation (PQE) Here is the problemPQE(Q):

Wefixa queryQ: x y z

Theinputis a TIDD: A.

B.

İ.

Télécom Paris

Paris Sud

Technion

U. Oxford

ParisTech

IP Paris

Paris-Saclay

CESAER 80%

10%

40%

80%

100%

90%

90%

50%

90%

100%

Theoutputis theprobabilitythat the query is true

→ Question: What is the complexity ofPQE(Q)depending on the queryQ?

5/7

(25)

Results on PQE

Existing dichotomy on theunions of conjunctive queries(UCQs):

Theorem [Dalvi and Suciu, 2012]

Some UCQsQaresafeandPQE(Q)is inPTIME

All others areunsafeandPQE(Q)is#P-hard

We study PQE forhomomorphism-closed queriesand show: Theorem [Amarilli and Ceylan, 2020]

For anyquery Q closed under homomorphisms:

EitherQis equivalent to asafe UCQandPQE(Q)is inPTIME

In all other cases,PQE(Q)is#P-hard

Sobad news: all homomorphism-closed queries arehardexcept safe UCQs

(26)

Results on PQE

Existing dichotomy on theunions of conjunctive queries(UCQs):

Theorem [Dalvi and Suciu, 2012]

Some UCQsQaresafeandPQE(Q)is inPTIME

All others areunsafeandPQE(Q)is#P-hard

We study PQE forhomomorphism-closed queriesand show:

Theorem [Amarilli and Ceylan, 2020]

For anyquery Q closed under homomorphisms:

EitherQis equivalent to asafe UCQandPQE(Q)is inPTIME

In all other cases,PQE(Q)is#P-hard

Sobad news: all homomorphism-closed queries arehardexcept safe UCQs

6/7

(27)

Results on PQE

Existing dichotomy on theunions of conjunctive queries(UCQs):

Theorem [Dalvi and Suciu, 2012]

Some UCQsQaresafeandPQE(Q)is inPTIME

All others areunsafeandPQE(Q)is#P-hard

We study PQE forhomomorphism-closed queriesand show:

Theorem [Amarilli and Ceylan, 2020]

For anyquery Q closed under homomorphisms:

EitherQis equivalent to asafe UCQandPQE(Q)is inPTIME

In all other cases,PQE(Q)is#P-hard

Sobad news: all homomorphism-closed queries arehardexcept safe UCQs

(28)

Results on PQE

Existing dichotomy on theunions of conjunctive queries(UCQs):

Theorem [Dalvi and Suciu, 2012]

Some UCQsQaresafeandPQE(Q)is inPTIME

All others areunsafeandPQE(Q)is#P-hard

We study PQE forhomomorphism-closed queriesand show:

Theorem [Amarilli and Ceylan, 2020]

For anyquery Q closed under homomorphisms:

EitherQis equivalent to asafe UCQandPQE(Q)is inPTIME

In all other cases,PQE(Q)is#P-hard

Sobad news: all homomorphism-closed queries arehardexcept safe UCQs

6/7

(29)

Results on PQE

Existing dichotomy on theunions of conjunctive queries(UCQs):

Theorem [Dalvi and Suciu, 2012]

Some UCQsQaresafeandPQE(Q)is inPTIME

All others areunsafeandPQE(Q)is#P-hard

We study PQE forhomomorphism-closed queriesand show:

Theorem [Amarilli and Ceylan, 2020]

For anyquery Q closed under homomorphisms:

EitherQis equivalent to asafe UCQandPQE(Q)is inPTIME

In all other cases,PQE(Q)is#P-hard

(30)

What’s next?

The result only applies tographs, not higher-arity databases

• Weconjecturethat it holds for arbitrary arity

Adapting tounweightedPQE, where all probabilities are1/2?

• We have a recent result onnon-hierarchical self-join-free CQs [Amarilli and Kimelfeld, 2020]

• Recent paper by Suciu and Kenig [Kenig and Suciu, 2020]

Thanks for your attention!

7/7

(31)

What’s next?

The result only applies tographs, not higher-arity databases

• Weconjecturethat it holds for arbitrary arity

Adapting tounweightedPQE, where all probabilities are1/2?

• We have a recent result onnon-hierarchical self-join-free CQs [Amarilli and Kimelfeld, 2020]

• Recent paper by Suciu and Kenig [Kenig and Suciu, 2020]

Thanks for your attention!

(32)

What’s next?

The result only applies tographs, not higher-arity databases

• Weconjecturethat it holds for arbitrary arity

Adapting tounweightedPQE, where all probabilities are1/2?

• We have a recent result onnon-hierarchical self-join-free CQs [Amarilli and Kimelfeld, 2020]

• Recent paper by Suciu and Kenig [Kenig and Suciu, 2020]

Thanks for your attention!

7/7

(33)

References i

Amarilli, A. and Ceylan, I. I. (2020).

A dichotomy for homomorphism-closed queries on probabilistic graphs.

InICDT.

Amarilli, A. and Kimelfeld, B. (2020).

Uniform reliability of self-join-free conjunctive queries.

arXiv preprint arXiv:1908.07093.

Dalvi, N. and Suciu, D. (2012).

The dichotomy of probabilistic inference for unions of conjunctive queries.

J. ACM, 59(6).

(34)

References ii

Kenig, B. and Suciu, D. (2020).

A dichotomy for the generalized model counting problem for unions of conjunctive queries.

arXiv preprint arXiv:2008.00896.

Références

Documents relatifs

Based on this, indexes for faster exact shortest path query answering have been proposed using fixed-width decompositions [5, 47] (FWDs) in the context of exact graphs: the idea is

This equation also appears in the work of Solodovnikov [10], he showed that if a Riemannian metrics g admits lots of geodesically equivalent metrics some of them having a

For the purposes of answering standard CQs without negation, the minimal canonical model behaves as a usual canonical model, which means that our semantics generalizes the

Abstract. The present work aims to exploit the interplay between the alge- braic properties of rings and the graph-theoretic structures of their associated graphs. We

The classifications of finite dimensional irreducible representations for the Yangian (closed spin chains) and for the reflection algebras (open spin chains) are directly linked to

We have identified six key challenges which are important in a Linked Data ecosystem where not all data is free – they are: i) building dataset reputation, ii) developing

Technologies such as data warehousing, extract, transform and load (ETL), enterprise application integration (EAI) and enterprise information integration (EII) have

The Closed Graph Theorem establishes the converse when E and F are suitable objects of topological algebra, and more specifically topological groups, topological vector spaces