• Aucun résultat trouvé

Top-k Querying of Unknown Values under Order Constraints

N/A
N/A
Protected

Academic year: 2022

Partager "Top-k Querying of Unknown Values under Order Constraints"

Copied!
1
0
0

Texte intégral

(1)

Top-k Querying of Unknown Values under Order Constraints

Antoine Amarilli, Yael Amsterdamer, Tova milo, Pierre Senellart

Research Problem:

Given a set of items with unknown values estimate (A) top-k items (B) their values

based on known values and partial order constraints

Motivating Example:

Top-k categories compatible with a given product

Results

Algorithm for Top-k and Interpolation

Proposition: if 𝒞 implies a total order,

the expected value of 𝑥 ∈ 𝒳 can be computed in PTIME

Theorem: for general 𝒞 , interpolation and top-k are in FP

#𝑃

• Weighted sum of expected values over linear extensions of 𝒞, weights by the probability of each ordering

• Nondeterministically sum over linear extensions

Hardness (tight bounds)

Theorem [Rademacher 2007]: interpolation is FP

#𝑃

-hard

• Even without exact-value constraints

Theorem: top-k is FP

#𝑃

-hard even without expected values

• Reduction from interpolation!

• Use top-k to compare the value of x to fresh exact-value

• Bound the denominator

• Rational number identification using polynomial # comparisons

Approximations

• Solving with high probability is hard unless 𝑁𝑃 ⊆ 𝐵𝑃𝑃

• An FPRAS with bounded error 𝜀 on expected value

By connection to volume computation

High degree PTIME complexity, may admit efficient implementations

Model:

𝒳 = 𝑥

1

, … , 𝑥

𝑛

a set of variables, 𝒳

𝜎

selected variables

• We assume 𝑥𝑖 takes values in [0,1]

𝒞 a set of order and exact-value constraints over 𝒳

• Exact values are rationals

pw 𝒞 - possible worlds, all valuations of 𝒳 that satisfy 𝒞

We assume a uniform pdf over pw 𝒞

Top-k semantics: k items with highest expected values

• Desirable properties (in literature)

• A value estimate compatible with top-k

• Interpolation over posets – independent contribution

Splitting Lemma

• The influence relation x ⟷ 𝑥′ is the symmetric, reflexive, and transitive closure of ≺ on 𝒳\𝒳𝑒𝑥𝑎𝑐𝑡

• Its equivalence classes are used in a definition of uninfluenced decomposition 𝒞1, 𝒞2, … , 𝒞𝑚 of 𝒞

• Proof by a bijection over possible worlds

Tractable Cases

tree-shaped constraint sets

Theorem: if 𝒞 is tree-shaped,

V 𝒞 can be computed in time O 𝒳

2

• Bottom-up processing,

propagating a piecewise polynomial function for the volume of subtree based on root’s parent value

• Complexity proof by induction

Theorem: if 𝒞 is tree-shaped,

𝑥 ’s marginal can be computed in O 𝒳

𝑒𝑥𝑎𝑐𝑡

⋅ 𝒳

2

• A similar bottom-up scheme

• Computing the pdf as a function from 𝑣 to V(𝒞𝑥=𝑣)

• 𝒳𝑒𝑥𝑎𝑐𝑡 factor is due to the pieces of the polynomial

Corollary: if 𝒞 is decomposable to (reverse-)tree-shaped, interpolation and top-k can be solved in PTIME

LTCI, Télécom ParisTech,

Université Paris-Saclay Bar Ilan University Tel Aviv University DI, École normale supérieure, PSL Research University

Inria Paris

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 0.2 0.4 0.6 0.8 1

y

x

𝑥 ≥ 𝑦, 0.3 = 𝑤 ≤ 𝑦 ≤ 𝑧 = 0.9

0 0.2

0.4 0.6

0.81 0

0.2 0.4 0.6 0.8 1

0 0.2 0.4 0.6 0.8 1

Y

Z

X

𝑥 ≥ 𝑦, 0.3 = 𝑤 ≤ 𝑦 ≤ 𝑧

Sports Electronics Clothing

Cell Phones

TVs Shoes Watches

Smartphones

Diving Gear Diving Watches

Wearable Devices

0.5 0.5

0.1

0.9 1

?

𝑥0 ≤ 𝑥1 ≤ ⋯ ≤ 𝑥𝑖−1 ≤ 𝑥𝑖 ≤ 𝑥𝑖+1 ≤ ⋯ ≤ 𝑥𝑗−1 ≤ 𝑥𝑗 ≤ 𝑥𝑗+1 ≤ ⋯ ≤ 𝑥𝑛 ≤ 𝑥𝑛+1

𝛼 𝑣𝑖 𝑣𝑗 𝛽

= = = =

Fragment. Distribution independent from other fragments.

Marginals follow (rescaled) Beta distribution by connection to order statistics

𝑥0 = 𝑣0 𝑥1

𝑥2 𝑥3

𝑣4 = 𝑥4 𝑥5 𝑥6 = 𝑣6 𝑥7 = 𝑣7 𝑥8

𝑥11 = 𝑣11 𝑣9 = 𝑥9 𝑥10 = 𝑣10

Références

Documents relatifs

To stay faithful to the physically motivated interaction between the surface normal and the light source direction, we will introduce an indirect coupling between the unit normal

Specifically, we devise a model of network services and their different instances by extending the resource graph model; such a model allows us to formu- late the

The no-gap second order conditions of Pontryagin and bounded strong minima in optimal control problems with initial-final state constraints and mixed state-control

We propose to make a trade-off between the computational time to update episode knowledge when data changes and memory usage, thanks to the setting of a depth boundary dot on the

L'origine du SPC étant la combinaison de WFD et de la dispersion spectrale, j'ai fait varier cette dernière pour observer la génération de SPC. Cela donne une courbe similaire quant

La contribution de la cytogénétique dans l’échec des grossesses humaines pourrait donc être encore plus élevée que les estimations fondées sur les premières séries

This paper proposes a complete solution to perform task- space (operational-space) inverse dynamics while taking into account various tasks, unilateral constraints such as

These are linear forms small at several points, and the generalization to vectors [3] of Nesterenko’s linear independence criterion [10] enables one to deduce a lower bound on the