• Aucun résultat trouvé

Apprendre à lire aux ordinateurs pour améliorer le blé

N/A
N/A
Protected

Academic year: 2021

Partager "Apprendre à lire aux ordinateurs pour améliorer le blé"

Copied!
27
0
0

Texte intégral

(1)

Apprendre à lire aux ordinateurs pour améliorer le blé

Apprendre à lire aux ordinateurs pour améliorer le blé

Hadi Quesneville

Journées Nationales de la Science Ouverte 6 Décembre 2018

Hadi Quesneville

Journées Nationales de la Science Ouverte 6 Décembre 2018

(2)

Wheat e-infrastructure goals Wheat e-infrastructure goals

Provide the research community with a single entry point of access to genetic, phenotypic and genomics resources.

Provide the research community with a single entry point of access to genetic, phenotypic and genomics resources.

Promote the development of services on top of current databases.

Promote the development of services on top of current databases.

Authority to define guidelines for data curation, nomenclature, standards and integration.

Authority to define guidelines for data curation, nomenclature, standards and integration.

Registry for bioinformatics tools.

Registry for bioinformatics tools.

(3)

WheatIS nodes (#12) WheatIS nodes (#12)

CerealsDB

UCW

(4)

Distributed information system Distributed information system

Node (Platform)  resources Computes / Storage

Integrative database:

genomic, genetic, and phenotypic information, comparative genomics, and functional genomics

Hub Portal Services

A network of bioinformatics platformes

(5)

E-infrastructures key issues E-infrastructures key issues

Interoperability Interoperability Share

Share FindFind ReuseReuse

Semantics Standards

(6)

FAIR principles FAIR principles

To be Findable:

To be Findable:

Persistent identifier (like a DOI or Handle)

Rich metadata to describe the data (making sure it is findable through disciplinary

discovery portals.)

To be Accessible:

To be Accessible:

Data open using a standardised

protocol Clarity and transparency around the conditions

governing access and reuse.

To be

Interoperable: To be

Interoperable:

Use community agreed formats, language

and vocabularies.

Contain links to related information using identifiers.

To be Re- usable:

To be Re- usable:

Maintain its initial richness (not diminished for explaining the findings in one publication).

Clear machine readable licence a nd provenance inf ormation on how the data was formed.

(7)

FINDABLE:

DATA DISCOVERY

FINDABLE:

DATA DISCOVERY

(8)

Full text search of distributed databases

Full text search of distributed

databases

(9)

wheatis.org

wheatis.org

(10)

ONTOLOGIES MAPPING

ONTOLOGIES MAPPING

(11)

Ontologies Ontologies

WIPO:

The Wheat INRA Phenotypes Ontology is developped in the frame of the BreedWheat project.

Observations variables

The WIPO v1.3 contains 262 variables.

WTO:

Wheat Trait Ontology

General traits found in the literature: traits of resistance, development, nutritional, baking quality, etc.

(12)

Goal Goal

1. Link wheat phenotyping data to the literature

2. Semantic enrichment of the wheat phenotyping data

• Align the traits of agronomical interest (WTO) and experimental data (WIPO)

finds correspondences between semantically related entities of the ontologies

(13)

Alignment example Alignment example

WTO WIPO

(14)

WIPO:0000101 Fusarium score, Susceptibility to fusarium

head blight FUS-SCORE_score

WIPO:Biotic stress/Fusarium score/Fusarium score, Susceptibility to fusarium head

blight/FUS-SCORE_score ID:0030905 resistance to Fusarium head blight

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to Fusarium head blight

WIPO:0000105 Spikes with fusarium at 350DD post

fusarium inoculation (INOC+350DD) FUS-SPK.350DD_score

WIPO:Biotic stress/Spikes with fusarium post fusarium inoculation/Spikes with fusarium at 350DD post fusarium inoculation

(INOC+350DD)/FUS-SPK.350DD_score ID:0030905 resistance to Fusarium head blight

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to Fusarium head blight

WIPO:0000106 Spikes with fusarium at 450DD post

fusarium inoculation (INOC+450DD) FUS-SPK.450DD_score

WIPO:Biotic stress/Spikes with fusarium post fusarium inoculation/Spikes with fusarium at 450DD post fusarium inoculation

(INOC+450DD)/FUS-SPK.450DD_score ID:0030905 resistance to Fusarium head blight

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to Fusarium head blight

WIPO:0000113 Septoria tritici - Field Disease Index ST-FDI

WIPO:Biotic stress/Septoria tritici - Field Disease Index/Septoria tritici - Field Disease Index/ST-

FDI ID:0000107 resistance to Septoria Leaf Blotch

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to septoria/resistance to Septoria Leaf Blotch

WIPO:0000114 Septoria score on leaf 1 ST-F1-SCORE_score

WIPO:Biotic stress/Septoria score on leaf/Septoria score on leaf 1/ST-F1-

SCORE_score ID:0000107 resistance to Septoria Leaf Blotch

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to septoria/resistance to Septoria Leaf Blotch

WIPO:0000115 Septoria score on leaf 2 ST-F2-SCORE_score

WIPO:Biotic stress/Septoria score on leaf/Septoria score on leaf 2/ST-F2-

SCORE_score ID:0000107 resistance to Septoria Leaf Blotch

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to septoria/resistance to Septoria Leaf Blotch

WIPO:0000145 Powdey mildew score, Susceptibility to

powdery mildew PM-SCORE_score

WIPO:Biotic stress/Powdey mildew score/Powdey mildew score, Susceptibility to

powdery mildew/PM-SCORE_score ID:0030907 resistance to Powdery Mildew

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to mildew/resistance to Powdery Mildew

WIPO:0000152 Brown rust score BR-SCORE_1 to 9 score WIPO:Biotic stress/Brown rust score/Brown rust

score/BR-SCORE_1 to 9 score ID:0031017 resistance to Leaf Rust

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to rust/resistance to Leaf Rust

WIPO:0000162 Yellow rust score, Susceptibility to stripe

rust YR-SCORE_score

WIPO:Biotic stress/Disease score Yellow Rust/Yellow rust score, Susceptibility to stripe

rust/YR-SCORE_score ID:0000089 resistance to Stripe Rust

/FSOV Concept/plant property/response to environmental condition/response to biotic stress/pest resistance/pathogen resistance/resistance to a fungal pathogen/resistance to rust/resistance to Stripe Rust

WTO WIPO

Alignment example

Alignment example

(15)

•Publication data from the European OpenMinTed project

Processing PubMed collection Documents 3 881

Genes 10 254

Taxa 14 853

Phenotypes 8 792 Markers 1 941

Processing PubMed

(16)

Parsing litterature

Parsing litterature

(17)

Integrating results

Integrating results

(18)

SEARCH EXAMPLES

SEARCH EXAMPLES

(19)

Exact match

Exact match

(20)

Access to bibliographie

Access to bibliographie

(21)

Semantic match

Semantic match

(22)

Semantic match

Semantic match

(23)

TDM enrichment

TDM enrichment

(24)

TDM enrichment

TDM enrichment

(25)

Conclusion Conclusion

Text mining vocabulary closer to natural language

Data findable by non-specialists of the data production

Fill the gap between community

(26)

Availability Availability

Tools are open access

www.weatis.org/search.php openminted.eu

Ontologies

agroportal.lirmm.fr

www.weatis.org/DataStandards.php

fairsharing.org/collection/WheatDataInteroperability Guidelines

(27)

Acknowledgment Acknowledgment

T. Letellier, R. Flores,

S. Aubin, C. Nedellec, et al.

D. Edwards, G. Lazo, M. Caccamo, et al.

Références

Documents relatifs

Cement Used Above Ground 1) Plastic pipe, fittings and solvent cement used inside or under a building in a drainage or vent- ing system shall conform to. a) CSA-B181.1, “ABS

Negation requires the complementation of automata, relying on the determinacy of acceptance games, while existential quantifiers require us to simulate an alternating automaton by

Keywords Aggregation-fragmentation equations, polymerization process, size repartition, long-time asymptotic, mass conservation, WENO numerical scheme..

The results revealed convergent selection signatures for the cultivars across the sampled locations in each country in some genomic regions that included candidate genes involved

Geraniol was not toxic for Table 1 Repellent effect of DEET, permethrin, carvacrol, geraniol, cuminaldehyde and cinnamaldehyde on Anopheles gambiae from reference strains,

Herein we report the case of a 3½-year-old boy with severe arterial hypertension and abdominal angina due to a diffuse multivisceral FMD involvement, successfully managed by a

Further- more, in a study on autumnal dawn singing of Winter Wrens (Troglodytes troglodytes), males increased song output before sunrise on the day after a simulated

Moreover, several of them (i.e. macrolides, phenicols, oxazolidinones) cannot be considered strict inhibitors of protein synthesis, but rather modulators because