• Aucun résultat trouvé

FAIRDOM approach for semantic interoperability of systems biology data and models

N/A
N/A
Protected

Academic year: 2022

Partager "FAIRDOM approach for semantic interoperability of systems biology data and models"

Copied!
1
0
0

Texte intégral

(1)

1

FAIRDOM approach for semantic interoperability of systems biology data and models

Olga Krebs

1

, Katy Wolstencroft

3

, Natalie Stanford

2

, Norman Morrison

2,

, Martin Golebiewski

1

, Stuart Owen

2

, Quyen Nguyen

1

, Jacky Snoep

2

, Wolfgang Mueller

1

, and Carole Goble

2

.

1 Heidelberg Institute for Theoretical Studies, Heidelberg, Germany

2School of Computer Science, University of Manchester, UK

3Leiden Institute of Advanced Computer Science, Leiden, NL

ABSTRACT

The ability to collect and interlink heterogeneous data and model collections is essential in Systems Biology. Effective data exchange and comparison requires sufficient data annotation. This is particu- larly apparent in Systems Biology, where data heterogeneity means that multiple community metadata standards are required for the annotation of a whole investigation, including data, models and pro- tocols.

Here we describe FAIRDOM (http://fair-dom.org/) strategy in the context of semantic data management in the openSEEK , a web- based resource for sharing and exchanging Systems Biology data and models.

1 INTRODUCTION

Data integration is an essential part of Systems Biology.

Scientists need to combine different sources of information in order to model biological systems, and relate those mod- els to available experimental data for validation. . Currently, only a small fraction of the data and models produced dur- ing Systems Biology investigations are deposited for reuse by the community, and only a smaller fraction of that data is standards compliant, semantic content.

By embedding semantic technologies into familiar data management tools, the SEEK [1] enables semantic annota- tion of new data and the generation and querying of linked- data compliant datasets, whilst hiding the complexities of ontologies and metadata from its users. The SEEK is based on the ISA infrastructure (Investigations, Studies and As- says), a standard format for describing how individual ex- periments (assays) are aggregated into wider studies and investigations [2]. This poster will present the semantic data integration in SEEK, and how it supports the whole life cycle of data collection, annotation, sharing and reuse of Systems Biology data and resources.

2 THE JERM ONTOLOGY AND JERM TEMPLATES

The JERM Ontology is an application ontology designed to describe the relationships between items in SEEK (for ex-

* To whom correspondence should be addressed: olga.krebs@h-its.org

ample, data, models, experiment descriptions, samples, pro- tocols, standard operating procedures and publications); and to enable these relationships to be expressed with formal semantics. It is based on the idea of the Minimal Infor- mation Models (https://www.biosharing.org), which have been collected under the umbrella of MIBBI (Minimum Infor- mation for Biological and Biomedical Investigations). The JERM takes the specification one step further, expressing the minimum information model as an OWL ontology (http://bioportal.bioontology.org/ontologies/JERM).

The majority of laboratory scientists use spreadsheets for the daily management and manipulation of data, so the RightField semantic spreadsheet application [3] (also devel- oped during this work) is used to embed semantic annota- tion into the data. RightField-enabled spreadsheets allow the collection of semantic information by stealth.

By embedding the JERM metadata model in a spreadsheet format, and enabling the use of JERM (and other) vocabu- lary terms for annotation, the process of standardized se- mantic data collection can become part of the existing data management activities in the laboratory. JERM spreadsheet templates have been developed for a wide range of experi- mental data types.

REFERENCES

1.Wolstencroft, K., Owen, S., du Preez, F., Krebs, O., Mueller, W., Goble, C. and Snoep, J.L. (2011) The SEEK: a platform for sharing data and

models in Systems Biology. Methods Enzymol, 500, 629-655.

2. Rocca-Serra, P., Brandizi, M., Maguire, E., Sklyar, N., Taylor, C., Begley, K., Field, D., Harris, S., Hide, W., Hofmann, O. et al.(2010) ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the communi- ty level. Bioinformatics, 26, 2354-2356.

3. Wolstencroft, K., Owen, S., Horridge, M., Krebs, O., Mueller, W., Snoep, J.L., du Preez, F. and Goble, C. (2011)RightField:

embedding ontology annotation in spreadsheets. Bioinformatics, 27, 2021-2022

Copyright c2015 for this paper by its authors. Copying permitted for private and academic purposes

Références

Documents relatifs

The Schema Editor also combines the extension of ontologies (the Sensor Type Editor) with using them to annotate instance data (the Sensor Instance Editor). This is one of the

In this study, we applied a set of measures covering the semantic aspects of open data portal design and maintenance to evaluate the quality of government open data portals..

The goal of this workshop is to bring together researchers cross-cutting the fields of semantic web, data management and health informatics to discuss the unique challenges in

En este sentido, tanto del mundo acad´ emico como de la industria, est´ an comprometidos con diferentes esfuerzos para describir sem´ anticamente da- tos e informaci´ on [18],

The CAPAS service aims to extend and improve the existing service with the purpose of achieving a better and more objective assignment of funds as well as to increase the

In this article we will describe two different vocabularies/models for rep- resenting lexico-semantic data on the Semantic Web: in the first case from a synchronic perspective, that

The design of URIs satisfies the Linked Data principles 1) and 2). We also provide relevant information for the URI of each entity, and this implementation helps satisfy the

In this paper we present an approach for scalable semantic version control for Linked Data based on a delta-based storage and on-demand reconstruction of historic versions3.