• Aucun résultat trouvé

One query at a time: incremental, collective ontology matching

N/A
N/A
Protected

Academic year: 2022

Partager "One query at a time: incremental, collective ontology matching"

Copied!
2
0
0

Texte intégral

(1)

One Query at a Time: Incremental, Collective Ontology Matching

Thomas Kowark and Hasso Plattner

Hasso Plattner Institute

August-Bebel-Str. 88, 14482 Potsdam, Germany {firstname.lastname}@hpi.de

1 Introduction

Ontology matching is not an end in itself but a prerequisite for applications like query answering over different data sources. In software repository analysis, for example, queries reflect process metrics that researchers want to investi- gate. Hence, being able to answer such queries on multiple data sets without having to perform data transformations or manual query rewriting is a desir- able objective. Unfortunately, the terminological differences and often complex correspondences between different software repository representations impede a completely automatic matching and necessitate user input to create more com- prehensive alignments [4].

Our work is concerned with the question of when and how users should in- troduce their expertise into the matching process. Similar to other approaches, such as the keyword-based information retrieval tasks, which Ellis et al. [1] use to extract user knowledge about ontology alignments, we integrate the input pro- cess into the desired application – query translation between different software repositories. From the way this task is carried out, both simple and complex correspondences are inferred. In future translation tasks, these correspondences are reused to incrementally reduce the number of required user interactions. As users only translates their queries of interest, the overall effort for alignment creation is collectivized. In this poster, we present the general architecture of our system and the rules used for complex alignment extraction.

2 Rule-Based Inference of Ontology Alignments

Our approach assumes a setup were a source and a target repository are de- scribed by ontologies OR1 and OR2 containing the respective TBoxes TR1 and TR2, respectively. A query translation thus aims to recreate a query, which was originally issued onOR1, by using concepts fromTR2. A graph based abstraction is used to express the queries in a query language independent manner [2]. After a preprocessing step performed an automatic ontology matching, users can trans- form the remaining unmatched elements of those query graphs using the editor shown in Figure 1. To this end, they select the input element(s) and provide an according output graph. In simple cases, the output graph is structurally similar

(2)

to the input graph, and only node and edge labels change. Inference of element correspondences is straightforward and comprises concept equivalence and sub- sumption. If relabelling does not suffice and the graph structure changes, complex correspondences are inferred. Our system employs a rule set to determine which types of correspondences users provide through their input/output graph trans- formations. The rules are based on the patterns identified by Ritze et al. [3]. For any subsequent query translations, existing transformations are automatically applied by the system, hence, users only have to provide correspondences for missing elements and the required manual effort gradually decreases.

Figure 1.Query graph editor for translating input graph elements (marked red) to an output graph on the right hand side.

References

1. Ellis, J.B., Hassanzadeh, O., Srinivas, K., Ward, M.J.: Collective ontology align- ment. In: Proceedings of the Ontology Matching Workshop (2013)

2. Kowark, T., Dobrigkeit, P., Zeier, A.: Towards a shared repository for patterns in virtual team collaboration. In: 5th International Conference on New Trends in Information Science and Service Science (2011)

3. Ritze, D., Meilicke, C., Sv´ab-Zamazal, O., Stuckenschmidt, H.: A pattern-based ontology matching approach for detecting complex correspondences. In: Shvaiko, P., Euzenat, J., Giunchiglia, F., Stuckenschmidt, H., Noy, N.F., Rosenthal, A. (eds.) OM. CEUR Workshop Proceedings, vol. 551. CEUR-WS.org (2008)

4. Shvaiko, P., Euzenat, J.: Ontology matching: State of the art and future chal- lenges. IEEE Trans. on Knowl. and Data Eng. 25(1), 158–176 (Jan 2013), http://dx.doi.org/10.1109/TKDE.2011.253

Références

Documents relatifs

Evaluating different matchers on these generated tests, we have observed that (a) the difficulties en- countered by a matcher at a test are preserved across the seed ontology, (b)

QALC indexes, used for document selection, are made of single and multi-word terms retrieved by a 2-step procedure: (1)Êautomatic term extraction from questions through part-of-

For example, given that the gold standard is partial with complete target and complete source (case ii), and given the correspondence < a, b, =, 1.0 >, the correspondence <

Then, the key discovery tools VICKEY [3] and LinkEx are applied for the discovery of simple keys, conditional keys, and link keys from the instances of O 1 and O 2 , exploiting

The HarmonicSS platform will offer a variety of services, including big data mining tools for lymphoma prediction and patient stratification, tools for analyzing genetic data,

In ontology matching case, the performance of various matchers over a range of data sets are available and we would like to test if the average of their performances is random...

This note uncovers an unsuspected link between the score matching tech- nique (Hyv¨arinen, 2005, 2008) for learning the parameters of unnormalized density models over

We show that a simple denoising autoencoder training criterion is equiv- alent to matching the score (with respect to the data) of a specific energy based model to that of