• Aucun résultat trouvé

Scalability and Optimisation of GroupBy-Joins in MapReduce Scalability and Optimisation of GroupBy-Joins in MapReduce

N/A
N/A
Protected

Academic year: 2021

Partager "Scalability and Optimisation of GroupBy-Joins in MapReduce Scalability and Optimisation of GroupBy-Joins in MapReduce"

Copied!
21
0
0

Texte intégral

Références

Documents relatifs

The second chapter: titled Honors of Ethics in the Poetry of Urwa Ibn al-Ward, in which we extracted the ethical aspects mentioned in the poetry of Urwa, so that in the conclusion of

So, our final model uses word n-grams trained on data from all categories as features and logistic regression as the classification algorithm..

We show that we can improve the replication rate (i.e., reduce the communication cost) and the maximum reducer input (i.e., reduce the longest running time and the space requirements

In this paper we consider the implementation of the Partition Based Spatial Merge Join [13] provided by SpatialHadoop, de- noted as Sjmr, which is the only spatial join algorithm

The goal of our study is to compute the work loads of all reducers, and derive in particular the maximum load and various other statistics based on the loads.. Thus, we only

Theorem 4 (Algorithm correctness) Algorithm 3 assigns each pair of overlapping intervals to at least one reducer in common, where q = 3nw+S T , each of the two relations, X and Y

Theorem 5 (Upper bounds from the heuristic) The above heuristic using a bin size b = q 2 where q is the reducer capacity achieves the following three upper bounds: The total number