• Aucun résultat trouvé

Semantic Annotation Workflow using Bio-Ontologies

N/A
N/A
Protected

Academic year: 2021

Partager "Semantic Annotation Workflow using Bio-Ontologies"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: lirmm-01052546

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01052546

Submitted on 28 Jul 2014

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Semantic Annotation Workflow using Bio-Ontologies

Emmanuel Castanier, Clement Jonquet, Soumia Melzi, Pierre Larmande,

Manuel Ruiz, Patrick Valduriez

To cite this version:

Emmanuel Castanier, Clement Jonquet, Soumia Melzi, Pierre Larmande, Manuel Ruiz, et al.. Seman-tic Annotation Workflow using Bio-Ontologies. Workshop on Crop Ontology and Phenotyping Data Interoperability, Mar 2014, Montpellier, France. 1 p., 2014. �lirmm-01052546�

(2)

Semantic Annotation Workflow using

Bio-Ontologies

Castanier

 E.

1

,

 Jonquet C.

2

,

 Melzi S.

2

,

 Larmande P.

3

,

 Ruiz M.

4

,

 Valduriez P.

1 1 – Zenith, INRIA‐LIRMM

2 – SMILE, UM2‐LIRMM  3 – DIADE, IRD

4 – AGAP, CIRAD

NCBO Annotator: Ontology-based annotation workflow

Customized IBC Annotator for database schemas

• First, direct annotations are created by

recognizing concepts in raw text.

• Second, annotations are semantically expanded using knowledge of the

ontologies.

• Third, all annotations are aggregated and scored according to the context in which

they have been created.

Introduction

Biologists have adopted ontologies:

•To provide canonical representation of scientific knowledge

•To annotate experimental data to enable interpretation, comparison, and discovery across databases

•To facilitate knowledge-based applications for decision-support, natural language processing, and data

integration

But off-the-shelf solutions for the biologist to use

ontologies are rare (versions, format, availability, license, overlap, etc.)

Automatically process complex biological resources text content and generate annotations :

•Large-scale – to scale up to many resources and ontologies

•Automatic – to keep precision and accuracy

•Easy to use and to access – web service approach •Customizable – to fit very specific needs

•Smart – to leverage the knowledge contained in ontologies There have been success stories to reproduce: GO

annotations, PubMed indexing, etc.

The challenge

R. Coletta, E. Castanier, P. Valduriez, C. Frisch, D-H. Ngo, Z. Bellahsene: Public data integration with WebSmatch. Workshop on Open Data. 2012, pp. 5-12.

C. Jonquet, N. H. Shah, M. A. Musen.The Open Biomedical Annotator, In AMIA

Symposium on Translational BioInformatics. 2009. pp. 56-60.

C. Jonquet, P. LePendu, S. Falconer, A. Coulet, N. F. Noy, M. A. Musen, N. H. Shah.NCBO Resource Index: Ontology-Based Search and Mining of Biomedical Resources, Web

Semantics. 2011. Vol. 9 (3), pp. 316-324.

J. Wollbrett, P. Larmande, F. De Lamotte, M. Ruiz.Clever generation of rich SPARQL queries from annotated relational schema: application to Semantic Web Service creation for biological databases. BMC Bioinformatics. 2013; 14:126.

• Convert SQL database schemas to RDF/RDFS

with BioSemantic

• Annotate with NCBO Annotator and WebSmatch using customized NCBO services

Annotator relies on WebSmatch to create

mappings between elements of schemas and ontological concepts

Indexing IBC related data with the workflow to

enhance semantic search and mining of data

Montpellier, France

In collaboration with

Références

Documents relatifs

The Bag of Visual Words technique makes it possible to recognize the object by using training in the similarity of visual words. We focus on this feature and assume that

Figure 1 shows the system architecture of our QA system Tikka. Out of the five modules of the question analysis component, only the question classifier and the topic and

“Less is more” illustrates the collaboration dynamic that embodies the Long Tail (http://en.wikipedia.org/wiki/Long_Tail) principle within the Social Web; a huge

Based on the MTCO, business process modelers may model the contract obligation fulfillment process as Contract Workflow Models(CWM) using Business Process Modeling Notation (BPMN)

In our work, we train ResNet [9] models for the task of image classification, for both the coarse classification of plant images and fine-grained leaf

The immediate outcome of the annotation process is an an- notated corpus. One obvious type of analysis is then to test certain hypotheses or assumptions against the newly created

Flowchart 4 shows how genetic information disclosures are allowed or denied in a based on organizational policies, State, and Federal laws. When genetic information or test results

The last type of nodes implements ensemble methods – there is a copy node and a k-means node, which divides the data into clusters by the k-means algorithm (both D → [D] n ), and