• Aucun résultat trouvé

Lossy compression of unordered rooted trees

N/A
N/A
Protected

Academic year: 2022

Partager "Lossy compression of unordered rooted trees"

Copied!
1
0
0

Texte intégral

(1)

Lossy compression of unordered rooted trees

Romain Aza¨ıs, Jean-Baptiste Durand, and Christophe Godin

Inria team BIGS Universit´e Grenoble Alpes Universit´e de Lorraine Inria team MISTIS 54 506 Vandœuvre-l`es-Nancy, France 38 041 Grenoble, France

[email protected] [email protected]

Inria team Virtual Plants Universit´e Montpellier 2 34 095 Montpellier, France [email protected]

Abstract

A classical compression method for trees is to exploit subtree repeats in the structure by representing them by directed acyclic graphs. We propose a lossy compression method that consists in computing a structure with high redundancy that approximates the initial data.

Trees are commonly used to represent hierarchical data appearing in computer science or in biology. Compression methods often take advantage of repeated sub- structures appearing in the tree (see the survey [1]). Directed Acyclic Graph (DAG) compression is a classical approach that exploits subtree repeats in the structure.

However, it should be noted that trees without a high level of redundancy are often insufficiently compressed by this procedure. Self-nested trees are such that all their complete subtrees of a given height are isomorphic. The systematic repetition of subtrees gives them remarkable compression properties by this approach.

We address lossy compression for unordered trees. Loss can be acceptable for visual representation of scenes composed of plants, for example. Our method consists in computing the DAG version of a self-nested tree that closely approximates the tree to compress. A first approximation has been proposed in [2] in which the authors compute in polynomial time the Nearest Embedding Self-nested Tree (NEST) of the initial structure, namely the self-nested tree that minimizes the edit distance to the initial tree and that embeds it. We focus on the presentation of two new algorithms to find a self-nested structure that approximates the initial tree better than the NEST.

These solutions rely on a technique to find the centroid of a forest of small height and may be computed in polynomial time for trees with bounded degree. We prove on a simulated dataset that the error rates of these lossy compression methods are always better than the loss involved in the previous algorithm (on average, we observe a substantial gain of around 20%), while the compression rates are equivalent.

References

[1] S. Sakr, “XML compression techniques: A survey and comparison,” Journal of Com- puter and System Sciences, vol. 75, no. 5, pp. 303 – 322, 2009.

[2] C. Godin and P. Ferraro, “Quantifying the degree of self-nestedness of trees. Application to the structural analysis of plants,”IEEE TCBB, vol. 7, no. 4, pp. 688–703, Oct. 2010.

Références

Documents relatifs

They were identified as benzyl cinnamate, benzyl ben- zoate, α -amyrin and α -amyrin acetate by compar- ison of their spectra with those of authentic samples, as well as by direct

An implicit context modeling of the prediction errors, due to the low resolution image representation including variable block size structure, is then exploited to the for

We then proposed an algorithm to detect these patterns and to merge them, thus leading to compressed structures corresponding to DAGs augmented with return edges7. In this

Here, we performed immunofluorescence labelling of different synaptic proteins in fixed and paraffin embedded human hippocampal sections and report reduced levels of

In contrast, since it snaps values to the middle of intervals, simple range quantization [27] may alter the persistence of critical point pairs in the decompressed data, by

This chapter presents a technology framework in which we can reduce costs, while developing urban information systems that hold up to increasing demands

This last feature stems from a non-reversible phenomenon which leads to the spheroidization of the AgNWs (due to the total surface energy reduction). This

Since a null distance between two subtrees denotes isomorphic structures, these algorithms can be exploited to build the compression graph of T (Godin and Ferraro, 2009)..