• Aucun résultat trouvé

A Circuit-Based Approach to Efficient Enumeration

N/A
N/A
Protected

Academic year: 2022

Partager "A Circuit-Based Approach to Efficient Enumeration"

Copied!
2
0
0

Texte intégral

(1)

A Circuit-Based Approach to Efficient Enumeration

Antoine Amarilli April 21, 2017

Many problems in data management require us to evaluate a query on some data, and compute a set of answers. For instance, we may write a conjunctive query in SQL on a relational data base, or an XPath query on data trees. On large data sets, however, the complete set of answers can be huge, and impossible to compute completely. This makes it necessary to compute quickly an initial set of solutions, and to compute more solutions as they are requested by the user, analogously to how search engine results are presented in a paginated fashion.

The theoretical computer science community has been studiying enumeration algo- rithms to give a formal foundation to such tasks. An enumeration algorithm for a query on a dataset consists of two phases: apreprocessing phase, where the input data is read and indexed; and acomputation phase, where the results are computed and output one after the other. The efficiency of the algorithm is measured by giving the complexity of the preprocessing phase, and thedelay, i.e., the time spent in the computation phase to compute each successive answer. The most stringent requirements on enumeration algo- rithms impose linear-time preprocessing, i.e., we index the data set in linear time, and constant-delay, i.e., we produce each solution after a constant amount of time. When the size of the solutions can be large, linear-delay requires that the delay spent to produce each solution is linear in the size of that solution (but independent from the size of the data set).

This talk will present my recent work with Pierre Bourhis, Louis Jachiet, and Stefan Mengel, which is currently under review [1]. In this work, we study constant-delay enumeration algorithms to evaluate queries on data trees, and relational instances of a restricted kind, i.e., bounded-treewidth instances. We show this result by leveraging the tools of knowledge compilation, and factorized representations, to compute efficiently a circuit representation of the answers of the query to be enumerated, using a variant of our earlier work [2,3]. We then show how the valuation of such circuits, corresponding to the answers, can be enumerated efficiently, thanks to structural restrictions on the circuits [5]. This allows us to re-prove the well-known result [4, 6] that the answers to monadic second-order queries with free first-order variables can be enumerated with linear-preprocessing and constant-delay over structures of bounded treewidth. We also derive new enumeration results for circuit enumeration and factorized representations.

1

(2)

The talk may also sketch some of our ongoing work using the same techniques to show efficient enumeration results for queries when the underlying data structures can be updated, in the spirit of works such as [7].

References

[1] Antoine Amarilli, Pierre Bourhis, Louis Jachiet, and Stefan Mengel.A Circuit-Based Approach to Efficient Enumeration. Under review, 2017.

[2] Antoine Amarilli, Pierre Bourhis, and Pierre Senellart. Provenance circuits for trees and treelike instances. In ICALP, 2015.

[3] Antoine Amarilli, Pierre Bourhis, and Pierre Senellart. Tractable lineages on treelike instances: Limits and extensions. In PODS, 2016.

[4] Guillaume Bagan. MSO queries on tree decomposable structures are computable with linear delay. In CSL, 2006.

[5] Adnan Darwiche. On the tractable counting of theory models and its application to truth maintenance and belief revision. J. Applied Non-Classical Logics, 11(1-2), 2001.

[6] Wojciech Kazana and Luc Segoufin. Enumeration of monadic second-order queries on trees. TOCL, 14(4), 2013.

[7] Katja Losemann and Wim Martens. MSO queries on trees: enumerating answers under updates. InCSL-LICS, 2014.

2

Références

Documents relatifs

Evaluating self-reported pressure ulcer prevention measures in persons with spinal cord injury using the revised Skin Management Needs Assessment Checklist: reliability study... Values

Barbiturate-anesthetized cats with severed middle-ear muscles were monaurally stimulated with 1 ms duration bursts of 0. Evoked responses were recorded with a wire

Research on amplitude compression for listeners with reduced dynamic range has been primarily concerned with obtaining further insight into the negative results

(2005) used a flux package consisting of a sonic anemometer and two CPC’s to determine fluxes in two very broad size ranges.. de Leeuw

The problem consists in simultaneously determining for each period t ∈ T : (1) the partition of each demand d ∈ D t into boxes; (2) the batching of boxes into groups of up to f

Additionnal rheological data might be obtained using the presented procedure of off-equilibrium torsion testing, at various temperatures, and by varying phase fraction values

In the worst case, minimal generator constraints require a large number of clauses making the encoding of MNRs ever, large.. To tackle this problem, we consider the same

A mes chers parents : NAKACH Hassnae et MELLIANI Mansour Merci pour votre amour, pour tout l’enseignement que vous m’avez transmis, pour avoir toujours cru en moi et m’avoir