• Aucun résultat trouvé

Efficient Query Processing on Modern Hardware

N/A
N/A
Protected

Academic year: 2022

Partager "Efficient Query Processing on Modern Hardware"

Copied!
1
0
0

Texte intégral

(1)

Efficient Query Processing on Modern Hardware

Thomas Neumann

Lehrstuhl für Informatik III: Datenbanksysteme Fakultät für Informatik

Technische Universität München

ABSTRACT

Most database systems translate a given query into an expression in a (physical) algebra, and then start evaluating this algebraic expression to produce the query result. The traditional way to execute these algebraic plans is the iterator model:

Every physical algebraic operator conceptually produces a tuple stream from its input, and allows for iterating over this tuple stream. This is a very nice and simple interface, and allows for easy combination of arbitrary operators,but it clearly comes from a time when query processing was dominated by I/O and CPU consumption was less important: The iterator interface causes thousands of expensive function calls, degrades the branch prediction of modern CPUs, and ofter results in poor code locality and complex book-keeping.

On modern hardware query processing can be improved considerably by processing tuples in a data centric, and not an operator centric, way. Data is processed such that it can be kept in CPU registers as long as possible. Operator boundaries are blurred to achieve this goal. In combination with an code compilation framework this results in query code that rivals the speed of hand-written code. When using these techniques in the HyPer DBMS, TPC-H Query 1 for example can single-threaded aggregated the scale factor 1GB data set in about 68ms on commodity hardware.

3

Références

Documents relatifs

We would then want to boost the weight of query words that are topically similar to “m´ edecine” but that occur in other documents (it would be no point in selecting words that occur

By using Pig Latin as an intermediate layer of abstraction between SPARQL and MapRe- duce, the mapping is automatically compatible to future versions of Hadoop (including major

In this paper we present FRaQuE (Framework for Rapid Query Processing Evaluation), a framework aimed to offer to a non-expert user the possibility to evaluate query processors

The optimization problem for approximate query evaluation seems to be much harder than commonly known exact query opti- mization: in addition to selection of one of equivalent

For instance, although using a SPARQL endpoint to evaluate a complete query is the best in terms of the number of requests and the amount of transferred data, having multiple

The former simplifies the maintenance of queries by making them human-readable, while the latter provides a syntax agnostic representation that allows multiple templates to be

In this article, we first introduce N EO EMF, a multi-database model persistence framework able to store very large models in an efficient way according to specific

While approaches such as [16] allow approximating the recall of incomplete query results, source selection approaches dedicated to the computation of queries with LIMIT