• Aucun résultat trouvé

AskOmics: a user-friendly interface to Semantic Web technologies for integrating local datasets with reference resources

N/A
N/A
Protected

Academic year: 2021

Partager "AskOmics: a user-friendly interface to Semantic Web technologies for integrating local datasets with reference resources"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-02401750

https://hal.archives-ouvertes.fr/hal-02401750

Submitted on 10 Dec 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

AskOmics: a user-friendly interface to Semantic Web technologies for integrating local datasets with reference

resources

Xavier Garnier, Anthony Bretaudeau, Fabrice Legeai, Anne Siegel, Olivier Dameron

To cite this version:

Xavier Garnier, Anthony Bretaudeau, Fabrice Legeai, Anne Siegel, Olivier Dameron. AskOmics:

a user-friendly interface to Semantic Web technologies for integrating local datasets with reference

resources. JOBIM 2019 - Journées Ouvertes Biologie, Informatique et Mathématiques, Jul 2019,

Nantes, France. pp.1. �hal-02401750�

(2)

AskOmics: a user-friendly interface to Semantic Web technologies for integrating local datasets with reference resources

Xavier Garnier

1

, Anthony Bretaudeau

1,2

, Fabrice Legeai

1,2

, Anne Siegel

1

and Olivier Dameron

1

1

Univ Rennes, CNRS, Inria, IRISA - UMR 6074, F-35000 Rennes, France

2

Institut de G´ en´ etique, Environnement et Protection des Plantes (IGEPP) - Institut national de la recherche agronomique (INRA) : UMR1349, Agrocampus Ouest - Agrocampus Ouest, UMR1349 IGEPP, F-35 042

Rennes, France Corresponding author: [email protected]

1 Introduction

The study of biological mechanisms require the production of large and heterogeneous datasets.

These omics datasets are obtained routinely into labs, and are also available from public databases.

Each of them have their own format and linking them require lot of time to link them. To ease integration of this data, we have developed AskOmics, a web tool designed to integrate heterogeneous biological data, and query them using a user-friendly interface. The software uses the semantic web technologies in order to homogenize data. AskOmics is used by several research team to analyze genomics, transcriptomics and pathways data. Software development is still ongoing and new features are added as new versions are released. Next version (AskOmics 3) will brings a new set of features.

2 AskOmics, integration and query using the semantic web technologies

AskOmics is a web software that uses the semantic web technologies (RDF/SPARQL) to integrate multiple data formats, and query them through a user-friendly interface. During data integration, user provides input files in common formats (CSV, GFF and BED). AskOmics internally generates the corresponding RDF triples and load them into a triplestore. Two kinds of information are generated, the content, corresponding to the raw data, and the abstraction, which describe how the raw data are organized and interlinked.

The query interface is composed of a dynamic graph that uses the generated abstraction to rep- resent the entities integrated. Users interact with the graph to build a complex query linking several datasets integrated in Askomics. When the query is built, AskOmics internally converts the graph into a SPARQL query and use it to interrogate the triplestore. Results are returned to the web interface and can be downloaded by the user.

AskOmics source code is available under AGPL3 licence at https://github.com/askomics/

askomics. The GenOuest bioinformatics platform hosts a sandbox instance at https://askomics.

genouest.org.

3 Ongoing work

We are currently developing version 3 of AskOmics (https://github.com/xgaia/flaskomics).

This version will bring new features: the possibility to generate an AskOmics abstraction from RDF data, the hierarchy management between entities, and the implementation of federated queries against external endpoints, such as uniprot. Users will be able to link their data to large existing databases without having to import them locally.

This version has a new graphical user interface, build with React, a Javascript library made for building user interface. this change will provide a more modern and maintainable interface. The Python API is being refactored using Flask framework and Celery task queue for better performance by reducing the number of calls and making asynchronous call for long tasks such as data integration.

This new version is splitted into several micro services, AskOmics, a Python API and a Javascript interface. Celery, a task queue, to execute long task asynchronously, Redis, a worker, dependency of Celery, Virtuoso, a triplestore, for storing RDF and accepting SPARQL queries, and Nginx, a web proxy for url redirecting to the services. All these microservices are provided as docker containers.

AskOmics deployment and upgrading is therefore very easy.

Références

Documents relatifs

In this demonstration we will showcase ActiveRaUL [6], a Web service that op- erates on a model defined according to the RDFa User Interface Language (RaUL) 3 that consists of

Static content (e.g., an SVG world map) will be referenced as Datasource and composed with dynamic content built from RDF Datasources (e.g., the FAO Geopolitical Ontology) using

In this demo pa- per, we present the extended Privacy Preference Manager (PPM) that enables users to (1) create privacy preferences using the Privacy Pref- erence Ontology (PPO) and

The task of data discovery and extraction is to discover new data sources those are unknown in advance for the query processing according to the data inference configuration.. The

Setting up SemWIQ requires: (1) setting up a mediator host, (2) setting up wrappers, (3) specification of mappings from local concepts (i.e. database tables and attributes) to

Keywords: Visual SPARQL querying, SPARQL generation, data triplification, data ingestion, semantic data management.. 1

In this demo, we present a user-friendly interface that we designed and implemented in order to allow users to make robots of different types and capabilities achieve basic

Problem: The abundance of resources and the existence of many links in a Web application, make it difficult to look for a specific item. Context of application: