• Aucun résultat trouvé

From the ELN to Data Warehouse

N/A
N/A
Protected

Academic year: 2022

Partager "From the ELN to Data Warehouse"

Copied!
18
0
0

Texte intégral

(1)

Mercredi 12 Février 2014 Frank Hoonakker

From the ELN to Data Warehouse

Applied to chemical reactions

(2)

2 / 11 2

La chimie de recherche

*

• > 50 000 companies

• > 2 000 universities

600 000 chemists

– 5,4 Billion € (consumables)

– 300 reactions / chemist / years :

180 000 000 reactions / year

50% > failed reactions > 70%

~ 3% pulished

170 000 000 / year «lost chemistry»

* UE & US

(3)

3 / 11 3

Bernard Werber : Nouvelle encyclopédie du savoir relatif et absolu

In scientific journals, only successful experiments are reported. But we should also report those that do not work. Due to lack of information, failed experiments are reproduced indefinitely by other scholars ignorant of their failure ...

Données accessibles aujourd’hui

(4)

4 / 11 4

Données accessibles aujourd’hui

Publications 3 %

Companies 27 %

Lost Chemistry

Unexploitable 70 %

Example - CASREACT : 44 millions reactions

150 000 per week

Only successful

Not all field are usable

Protocol non accessible (need publication) Database content : unknown

Indiscriminate search engines

(5)

5 / 11 5

Chemist’s WorkFlow

Aldrich ($ 1 798M),

Johnson Matthey (€ 9 105 M)…

Réactifs

Données Publications

Système d’information

Archivage Publication

Recherche

Recherche Achat

(6)

6 / 11 6

Changing Worflow 1:

Condensed Graph of Reactions (CGR)

Transformation of a reaction into CGR

Conventional bonds: simples,

doubles, aromatics …

Dynamicals bonds:

Create a simple, break a simple, …

eSniff

Google for chemistry.

(7)

7 / 11 7

Changing Worflow 1:

Condensed Graph of Reactions (CGR)

Transformation of a reaction into CGR

Conventional bonds: simples,

doubles, aromatics …

Dynamicals bonds:

Create a simple, break a simple, …

eSniff

Google for chemistry.

(8)

8 / 11 8

Changing Worflow 1:

Condensed Graph of Reactions (CGR)

eSniff

Google for chemistry.

Similarity for reaction:

• Easier (draw what you need)

• Faster (one query -> find the best)

• Smarter (new knowledge)

• More relevant (think as a chemist)

Use Failed reactions

• Anticipate side reactions

• Calculate protecting group stability

(9)

9 / 11 9

Changing Worflow 2:

Generating Data Warhouse

eShare

Publish (Success

Failed)

ePro eSniff

Private DB

Academic

Feed

Exploit

WEB 2.0

eSniff

eMol

Hot line

Secu- rity

Crowdsourcing

KNOWLEDGE

(10)

10 / 11 10

Changing Worflow : Are the Chemist Agree ?

Would you be interested in using these data ?

Yes % No%

Private

(11)

11 / 11 11

Changing Worflow : Are the Chemist Agree ?

Would you be interested in publishing data ?

Yes % No%

Private

(12)

12 / 11 12

Changing Worflow : Today

eShare Starting

Patent

PhD Thesis

Existing data in Paper Notebook (ECLEIR project)

(13)

13 / 11 13

Changing Worflow : T oday

Today :

800 utilisateurs

1 000 000 réactions

In 3 years :

6 500 utilisateurs

1 800 000 réactions

In 5 years :

10 400 utilisateurs ePro

2 500 000 réactions

Leader «lost Chemistry»

20 000 000

15 000 000

10 000 000

5 000 000

0

35 000 30 000 25 000 20 000 15 000 10 000 5 000 0

Chimistes Données

(14)

14 / 11 14

Changing Worflow : T omorow

Author

Date

Reaction

Cond. Signature CGR

Properties of reactions

Tomorrow

Similarity

Selectivity

Chemists

(15)

15 / 11 15

Changing Worflow : T omorow

Author Date

Reaction

Cond. Signature CGR

Properties of reactions

Tomorrow

Similarity

Selectivity

IC50.

Author

ΔS

EC50 [C]

Log K

Properties of compound

Chemists

Other Field : Biology,

Pharmacology…

(16)

16 / 11 16

Changing Worflow : T omorow

eShare

(17)

17 / 11 17

Conclusion

• Data existing, but are not easily accessible

• Both failed and successful experiment are needed

• Scientists agree to publish some of them, but need an easy way

• The more natural way is to use an ELN

• Laboratory notebook is the starting point for all knowledge…

• eNovalys propose an easy way from ELN to data Warehouse

for the chemists and associate scientist.

(18)

18 / 11 18

Thank Y ou

www.enovalys.com

Références

Documents relatifs

Definition 1: This operator allows displaying cube data that correspond to a fact and several dimensions according to an analysis time interval.. Every quadruplet is constituted by

In the same context, authors in (Solodovnikova et al., 2015) have also investigated the problem of business requirements evolution. They defined a formalism for modeling the

tem is a positive and monotonically decreasing function of the thickness of the ice layer. It follows that an arbitrarily thick layer of ice should form at the water vapor interface

In this context, we build on available multimodal articulatory data and present an inver- sion framework to recover articulation from audio- visual speech information.. Speech

To solve the problem of organization and standardization of these datasets, a controlled vocabulary was used for annotating data, increasing the accuracy of

These farmers sell therefore all their production to small and sporadic exporters, called ‘golondrinos’ (swallows). Finally, with these intermediaries, they face

First, an optimal reciprocally convex inequality is proposed, which provides an optimal bound for the reciprocally convex combination, while less slack matrix variables are

Although these South African donors had significantly (p < 0.01) higher plasma levels of IFNα protein (median 1–2 fg/mL) than healthy European controls, IFNα levels in