• Aucun résultat trouvé

FAIRDOM approach for semantic interoperability of systems biology data and models

N/A
N/A
Protected

Academic year: 2022

Partager "FAIRDOM approach for semantic interoperability of systems biology data and models"

Copied!
1
0
0

Texte intégral

(1)

1

FAIRDOM approach for semantic interoperability of systems biology data and models

Olga Krebs

1

, Katy Wolstencroft

3

, Natalie Stanford

2

, Norman Morrison

2,

, Martin Golebiewski

1

, Stuart Owen

2

, Quyen Nguyen

1

, Jacky Snoep

2

, Wolfgang Mueller

1

, and Carole Goble

2

.

1 Heidelberg Institute for Theoretical Studies, Heidelberg, Germany

2School of Computer Science, University of Manchester, UK

3Leiden Institute of Advanced Computer Science, Leiden, NL

ABSTRACT

The ability to collect and interlink heterogeneous data and model collections is essential in Systems Biology. Effective data exchange and comparison requires sufficient data annotation. This is particu- larly apparent in Systems Biology, where data heterogeneity means that multiple community metadata standards are required for the annotation of a whole investigation, including data, models and pro- tocols.

Here we describe FAIRDOM (http://fair-dom.org/) strategy in the context of semantic data management in the openSEEK , a web- based resource for sharing and exchanging Systems Biology data and models.

1 INTRODUCTION

Data integration is an essential part of Systems Biology.

Scientists need to combine different sources of information in order to model biological systems, and relate those mod- els to available experimental data for validation. . Currently, only a small fraction of the data and models produced dur- ing Systems Biology investigations are deposited for reuse by the community, and only a smaller fraction of that data is standards compliant, semantic content.

By embedding semantic technologies into familiar data management tools, the SEEK [1] enables semantic annota- tion of new data and the generation and querying of linked- data compliant datasets, whilst hiding the complexities of ontologies and metadata from its users. The SEEK is based on the ISA infrastructure (Investigations, Studies and As- says), a standard format for describing how individual ex- periments (assays) are aggregated into wider studies and investigations [2]. This poster will present the semantic data integration in SEEK, and how it supports the whole life cycle of data collection, annotation, sharing and reuse of Systems Biology data and resources.

2 THE JERM ONTOLOGY AND JERM TEMPLATES

The JERM Ontology is an application ontology designed to describe the relationships between items in SEEK (for ex-

* To whom correspondence should be addressed: [email protected]

ample, data, models, experiment descriptions, samples, pro- tocols, standard operating procedures and publications); and to enable these relationships to be expressed with formal semantics. It is based on the idea of the Minimal Infor- mation Models (https://www.biosharing.org), which have been collected under the umbrella of MIBBI (Minimum Infor- mation for Biological and Biomedical Investigations). The JERM takes the specification one step further, expressing the minimum information model as an OWL ontology (http://bioportal.bioontology.org/ontologies/JERM).

The majority of laboratory scientists use spreadsheets for the daily management and manipulation of data, so the RightField semantic spreadsheet application [3] (also devel- oped during this work) is used to embed semantic annota- tion into the data. RightField-enabled spreadsheets allow the collection of semantic information by stealth.

By embedding the JERM metadata model in a spreadsheet format, and enabling the use of JERM (and other) vocabu- lary terms for annotation, the process of standardized se- mantic data collection can become part of the existing data management activities in the laboratory. JERM spreadsheet templates have been developed for a wide range of experi- mental data types.

REFERENCES

1.Wolstencroft, K., Owen, S., du Preez, F., Krebs, O., Mueller, W., Goble, C. and Snoep, J.L. (2011) The SEEK: a platform for sharing data and

models in Systems Biology. Methods Enzymol, 500, 629-655.

2. Rocca-Serra, P., Brandizi, M., Maguire, E., Sklyar, N., Taylor, C., Begley, K., Field, D., Harris, S., Hide, W., Hofmann, O. et al.(2010) ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the communi- ty level. Bioinformatics, 26, 2354-2356.

3. Wolstencroft, K., Owen, S., Horridge, M., Krebs, O., Mueller, W., Snoep, J.L., du Preez, F. and Goble, C. (2011)RightField:

embedding ontology annotation in spreadsheets. Bioinformatics, 27, 2021-2022

Copyright c2015 for this paper by its authors. Copying permitted for private and academic purposes

Références

Documents relatifs

The SEmantic Networks of Data: Utility and Privacy (SEND UP) project focuses on such databases and will follow two main goals: (1) prevent illegitimate use of private data

The approach presented in this paper, however, also enables non-semantic- experts to work with Linked Open Data.. 2 visuaLOD - a Linked Data

presented a kernel for semantic graphs derived from a local neighborhood of a node and applied the approach to learning on the RDF-graph of the Semantic Web’s Linked Open data

To enable semantic access, querying and visualisation of this small molecule data, there is a need for an ontology which organises the data according to chemical structure,

The idea to enhance project management systems with semantic technologies enables for example a better search and reuse of already existing data.. Thus it reduces time and costs

Choosing Linked Data principles and Ontologies to make rare disease data link- able at the source was our first design decision, as RDF was designed with the objective of

Based on analyzing several existing Linked Data-driven user interfaces supporting data exploration and querying, we present a novel user visual interface which enables

• We provide the first of its kind linked data model, called Semantic Hadith for publishing Hadith as Linked Data and for linking with other key knowledge sources in the Islamic