• Aucun résultat trouvé

Data Interlinking with Formal Concept Analysis and Link Keys

N/A
N/A
Protected

Academic year: 2022

Partager "Data Interlinking with Formal Concept Analysis and Link Keys"

Copied!
1
0
0

Texte intégral

(1)

Data Interlinking with Formal Concept Analysis and Link Keys

J´erˆome Euzenat

INRIA & Univ. Grenoble Alpes, Grenoble, France

Abstract. Data interlinking, the problem of linking pairs of nodes in RDF graphs corresponding to the same resource, is an important task for linked open data. We introduced the notion of link keys as a way to identify such node pairs [1]. Link keys generalise in several ways keys in relational algebra. Thus, we consider how they could be extracted from data with Formal Concept Analysis. We show that an appropriate encoding makes the notion of candidate link keys correspond to formal concepts [2]. However candidate link keys are not yet link keys as they need to be selected through appropriate measures. We discuss how the measurement and concept extraction processes may be interleaved. If time permits we will also discuss extensions of this model to residual link keys and mutually dependent link keys1.

Keywords: Linked Data, Formal Concept Analysis

References

1. Atencia, M., David, J., Euzenat, J.: Data interlinking through robust linkkey extrac- tion. In: Proc. 21st European Conference on Artificial Intelligence (ECAI), Praha (CZ). (2014) 15–20

2. Atencia, M., David, J., Euzenat, J.: What can FCA do for database linkkey extrac- tion? In: Proc. 3rd ECAI workshop on What can FCA do for Artificial Intelligence?

(FCA4AI), Praha (CZ). (2014) 85–92

1 This work has been developed in collaboration with Manuel Atencia, J´erˆome David and Amedeo Napoli.

Références

Documents relatifs

If other nodes (or any Linked Data application) browses the DSSN the activity streams, data & media artefacts and any RDF resource can be retrieved via standard

Knowledge bases are a fundamental resource for doing entity linking. They often use linked data to provide information about entities, their semantic cat- egories and their

Datalift is an open source platform that include numerous converters, for transforming raw data into RDF, the resource description framework model which is the basis of linked

Adding data to the Web of Data consists of (i) publishing data as RDF graphs: a standard data format, (ii) linking these datasets together, by identifying equivalent resources in

Using the declarative Silk - Link Specification Language (Silk-LSL), data publishers can specify which types of RDF links should be discovered between data sources as well as

Linked Data is about employing the Resource Description Framework (RDF) and the Hypertext Transfer Protocol (HTTP) to publish structured data on the Web and to connect data between

Adding data to the Web of Data consists of (i) publishing data as RDF graphs: a standard data format, (ii) linking these datasets together, by identifying equivalent resources in

Globally extracting a set of link keys across several RDF data sources raises several issues, as link keys differ from database keys in various aspects: (a) they relax two