• Aucun résultat trouvé

Agronomic Linked Data (AgroLD): a Knowledge-based System to Enable Integrative Biology in Agronomy

N/A
N/A
Protected

Academic year: 2021

Partager "Agronomic Linked Data (AgroLD): a Knowledge-based System to Enable Integrative Biology in Agronomy"

Copied!
3
0
0

Texte intégral

(1)

(2)

Agronomic Linked Data (AgroLD): a

Knowledge-based System to Enable

Integrative Biology in Agronomy

Pierre Larmande

⇤† 1,2

, Nordine El Hassouni

2,3

, Aravind Venkatesan

4

,

Clement Jonquet

2,5

, Manuel Ruiz

‡ 2,3

1 Institut de Recherche pour le D´eveloppement (IRD) – Institut de recherche pour le d´eveloppement

[IRD] : UMR232, Montpellier – 911 avenue Agropolis,BP 6450134394 Montpellier cedex 5, France

2 Institut de Biologie Computationnelle (IBC) – Centre de Coop´eration Internationale en Recherche

Agronomique pour le D´eveloppement, Institut National de la Recherche Agronomique, Institut National de Recherche en Informatique et en Automatique, Universit´e de Montpellier, Centre National de la

Recherche Scientifique – Building 5 - 860 rue de St Priest 34095 Montpellier, France

3 Centre de coop´eration internationale en recherche agronomique pour le d´eveloppement (CIRAD) –

CIRAD – Av Agropolis, Montpellier, France

4 European Bioinformatics Institute [Hinxton] (EMBL-EBI) – EMBL-EBI, Wellcome Trust Genome

Campus, Hinxton, Cambridgeshire, CB10 1SD, UK, United Kingdom

5Laboratoire d´Informatique de Robotique et de Micro´electronique de Montpellier (LIRMM) –

Universit´e de Montpellier : UMR5506, Centre National de la Recherche Scientifique : UMR5506 – 161 rue Ada - 34095 Montpellier, France

Plant science is a multi-disciplinary scientific discipline that includes research areas such as -omics, physiology, genetics, plant breeding, systems biology and the interaction of plants with the environment to name a few. Among other things, agronomic research aims to im-prove crop health, production and study the environmental impact on crops. Researchers need to understand deeply the implications and interactions of the various biological processes, by linking data at di↵erent scales (e.g., genomics, proteomics and phenomics). Recent advances in high-throughput technologies have resulted in a tremendous increase in the amount of ge-nomics or phege-nomics data produced in plant science. This increase, in conjunction with the heterogeneity and variability of the data, presents a major challenge to adopt an integrative research approach. We are facing an urgent need to e↵ectively integrate and assimilate com-plementary datasets to understand the biological system as a whole. The Semantic Web o↵ers technologies for the integration of heterogeneous data and its transformation into explicitly knowledge thanks to ontologies. We have developed AgroLD (the Agronomic Linked Data – www.agrold.org), a knowledge-based system that exploits the Semantic Web technology and some of the relevant standard domain ontologies, to integrate genome to phenome information on plant species widely studied by the plant science community. We present some integration results of the project, which initially focused on genomics, proteomics and phenomics. Currently, AgroLD contains hundreds millions of triples created by annotating more than 50 datasets com-ing from 10 data sources such as Gramene.org [1] and TropGeneDB [2] with 10 ontologies such as Gene Ontology [3] and Plant Trait Ontology [4]. Our objective is to o↵er a domain specific

Speaker

Corresponding author: pierre.larmande@ird.frCorresponding author: ruiz@cirad.fr

(3)

knowledge platform to solve complex biological and agronomical questions related to the impli-cation of genes/proteins in, for instances, plant disease resistance or high yield traits. We expect the resolution of these questions to facilitate the formulation of new scientific hypotheses to be validated with a knowledge-oriented approach.

1. Monaco MK, Stein J, Naithani S, Wei S, Dharmawardhana P, Kumari S, et al. Gramene 2013: Comparative plant genomics resources. Nucleic Acids Res. 2014;42.

2. Hamelin C, Sempere G, Jou↵e V, Ruiz M. TropGeneDB, the multi-tropical crop information system updated and extended. Nucleic Acids Res. 2013;41.

3. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontol-ogy: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet [Internet]. 2000;25:25–29. Available from: http://dx.doi.org/10.1038/75556

4. Cooper L, Walls RL, Elser J, Gandolfo MA, Stevenson DW, Smith B, et al. The plant ontology as a tool for comparative plant anatomy and genomic analyses. Plant Cell Physiol. 2013;54:e1.

Keywords: Plant Molecular Biology, Data Integration Knowledge Management, Semantic Web, Linked Data, RDF, SPARQL

Références

Documents relatifs

LinkedCJ, our new ontology for Chinese academic journals, inherits a list of clas- ses, object properties and data properties from existing ontologies of semantic

The rich suite of datasets include data about potential algal biomass cultivation sites, sources of CO 2 , the pipelines connecting the cultivation sites to the CO 2 sources and

Pour aider à la recherche sémantique, certains outils sont souvent proposés soit pour assister les utilisateurs dans la construction de leurs requêtes, soit pour cacher le

E.g., if Interstellar is the initial item, Stanley Kubrick was annotated in one of its reviews, and 2001: A Space Odyssey was discovered through Stanley Kubrick, then 2001: A

The Agronomic Linked Data project (AgroLD) is a Semantic Web knowledge base designed to integrate data from various publicly available plant centric data sources.. The aim of

Currently, SG consists of 12 databases covering various plant species such as Banana, Cocoa, Maize and Rice. AgroLD is being developed in phases to expose all of these databases

Based on these techniques, the users of Disease Compass can browse causal chains of a disease and obtain related information about the selected disease and abnormal states

The design of URIs satisfies the Linked Data principles 1) and 2). We also provide relevant information for the URI of each entity, and this implementation helps satisfy the