• Aucun résultat trouvé

The Bag Semantics of Ontology-Based Data Access (Extended Abstract)

N/A
N/A
Protected

Academic year: 2022

Partager "The Bag Semantics of Ontology-Based Data Access (Extended Abstract)"

Copied!
1
0
0

Texte intégral

(1)

The Bag Semantics of Ontology-Based Data Access (Extended Abstract)

?

Charalampos Nikolaou, Egor V. Kostylev, George Konstantinidis, Mark Kaminski, Bernardo Cuenca Grau, and Ian Horrocks

Department of Computer Science, University of Oxford, UK

Ontology-based data access (OBDA) is an increasingly popular approach to enable uniform access to multiple data sources with diverging schemas. In OBDA, an ontology provides a unifying conceptual model for the data sources together with domain knowledge. The ontology is linked to each source by global- as-view (GAV) mappings, which assign views over the data to ontology predi- cates. Users access the data by means of queries formulated using the vocabulary of the ontology; query answering amounts to computing the certain answers of the query over the union of ontology and the materialisation of the views defined by the mappings. The formalism of choice for representing ontologies in OBDA is the description logic DL-LiteR which underpins OWL 2 QL. DL-LiteR was designed to ensure that queries against the ontology are first-order rewritable;

that is, they can be reformulated as a set of relational queries over the sources.

An important observation about the conventional semantics of OBDA is that it is set-based: the materialisation of the views defined by the mappings is for- malised as a virtual ABox consisting of a set of facts over the ontology predi- cates. This treatment is, however, in disagreement with the semantics of database views, which is based on bags (multisets) and where duplicate tuples are retained by default. The distinction between set and bag semantics in databases is very important in practice since it influences the evaluation of aggregate queries that combine various aggregation functions (e.g., Min, Max, Sum, Count, Avg) with the grouping functionality provided in SQL by theGroupByconstruct.

In this paper we study a bag semantics for OBDA that is compatible with the semantics of standard databases and can provide the foundations for the future study of aggregate queries. We propose the ontology languageDL-LitebagR and its restrictionDL-Litebagcore, where ABoxes consist of a bag of facts. We define the bag semantics of conjunctive query (CQ) answering and show that it is com- patible with the conventional set semantics. However, we show thatDL-Litebagcore ontologies may not admit a universal model while CQ answering becomescoNP- hard in data complexity, hence losing first-order rewritability of CQs. To regain tractability, we study the class of rooted CQs, which captures most practical OBDA queries, and show that it admits universal models and enjoys first-order rewritability for DL-Litebagcore ontologies. Unfortunately, these properties do not extend toDL-LitebagR where CQ answering remainscoNP-hard.

?This is the extended abstract of the paper accepted for publication in the proceedings of IJCAI 2017. The full paper may be accessed atarXiv:1705.07105 [cs.AI]. Work supported by the Royal Society under a University Research Fellowship, the EPSRC projects ED3 and DBOnto, and the Research Council of Norway via the Sirius SFI.

Références

Documents relatifs

We study the data complexity of conjunctive query answering under these new semantics, and show a general tractability result for all known first-order rewritable ontology languages.

Both the Biological Collections Ontology (BCO) and the Ontologized Minimum Information About BIobank data Sharing (OMIABIS) are ontologies rooted in specimens and

Answering conjunctive queries (CQs) over Description Logics (DL) ontologies is an important reasoning task with many applications in knowledge represen- tation.. Intensive

(Extensions of both query language and DL are considered in [7].) The probabilistic component follows the semantics of probabilistic ABoxes from [6], which itself is based on

The use of hornMTL and datalogMTL ontologies (with both diamond and box operators in rule bodies and only box operators in rule heads) for querying temporal log data was advocated

In the context of OBDA, the conceptual schema provides end-users with a vocabulary they are familiar with, at the same time masking how data are concretely stored, and enriching

In order to model the relations with access limitations we use a special type of oracle Turing machine (OTM) as follows: (1) the OTM, apart from the standard tape, has an oracle

Searching information on the Web of Data requires user friendly approaches, similar to the ease of keyword-based search engines, but relying on the RDF. In this