• Aucun résultat trouvé

The Central Role of Data‘Capturing and Sharing Chemistry Research Data’

N/A
N/A
Protected

Academic year: 2022

Partager "The Central Role of Data‘Capturing and Sharing Chemistry Research Data’"

Copied!
19
0
0

Texte intégral

(1)

 

      

The Central Role of Data

‘Capturing and Sharing Chemistry Research Data’

Simon Coles

School of Chemistry,

University of Southampton, U.K.

s.j.coles@soton.ac.uk

This work is licensed under a Creative Commons Licence Attribution-ShareAlike 3.0 http://creativecommons.org/licenses/by-sa/3.0/

(2)

 

      

Current Situation - Data Generation

Synthesis Characterisation

(3)

 

      

Current Situation – Data Management

“Data from experiments conducted as recently as six months ago might be suddenly deemed important, but those researchers may never find those numbers – or if they did might not know what those numbers meant”

“Lost in some research assistant’s computer, the data are often irretrievable or an undecipherable string of digits”

“To vet experiments, correct errors, or find new breakthroughs, scientists desperately need better ways to store and retrieve research data”

“Data from Big Science is … easier to handle, understand and

archive. Small Science is horribly heterogeneous and far more vast.

In time Small Science will generate 2-3 times more data than Big Science.”

‘Lost in a Sea of Science Data’ S.Carlson, The Chronicle of Higher Education (23/06/2006)

(4)

 

      

Current Situation – Data and Publishing

(5)

 

      

Separating Data from Interpretations

Underlying data (Institutional data repository) Intellect &

Interpretation (Journal article, report,

etc)

(6)

 

      

Smart Labs

(7)

 

      

Laboratory IRs and Information

Management

(8)

 

      

The R4L Repository

Deposit

Search / Browse

Create new compound Add experiment data and metadata

(9)

 

      

Blogging Experiments

A repository can…

• Allow one to put, store and get digital objects

• Provide minimal search and browse functions

• NOT provide the

presentation and discussion functions essential to a

scientific study

• Social networking tools and

approaches can provide a

way…

(10)

 

      

Facilitating Research

• Facilitates ‘geographically distributed collaborative research’

• Useful approach for sharing ‘failed’ experiments?

(11)

 

      

Machines Blogging Experiments

• Automatic upload by scientific instrument

(12)

 

      

Comments and Annotation

• A picture says a thousand words!

• Chemists like to sketch!

• Need for more advanced

Blog tools / technology

(13)

 

      

Current Situation - Data Deluge

Cl

Cl Cl

Cl Cl

Cl Cl

Cl Cl

Cl Cl

Cl Cl

O

O

O

O N N

N N

N+ O

O O N+

O O O

30,000,000

1.5,000,000

450,000

(14)

 

      

Laboratory Data Management and

Archive

(15)

 

      

The eCrystals Public Data Archive

http://ecrystals.chem.soton.ac.uk

(16)

 

      

NCS Data Publication Policy

• Joint publication: Timed release of data tied to conventional journal article

• Separate publication: Independent release of data so that it can be cited e.g. from a journal article, grant report, poster

• ‘Accidental’ or ‘undesired’ results: Immediate release after agreement with concerned parties

• Never to be formally published results: Automatic release after three years

• Embargo feature: default 3 years, but timescale can be defined by depositor

• Record can be made public at any time (following agreement from all concerned parties)

• Roles of all concerned parties defined (originator, etc)

• Data citation, DOI, Rights

http://www.ncs.chem.soton.ac.uk/pub_pol.htm

(17)

 

      

Linking and aggregating

• Link data and associated

‘publications’

• Dataset annotated with metadata

• Semantic publishing on WWW and in journals

http://www.rsc.org/Publishing/Jou

rnals/ProjectProspect/index.asp

http://www.ukoln.ac.uk/projects/e

bank-uk/pilot/

(18)

 

      

Aggregator services

Institutional data repositories

Deposit , Validation

Publication

Validation Data analysis

Search, harvest Presentation services / portals

Data discovery, linking, citation

Laboratory repository Deposit

eCrystals ‘Global Federation’ Model

Publishers: peer- review journals, conference proceedings, etc

Curation Preservation

Subject Repository

Institution Library &

Information Services Data creation

& capture in

“Smart lab”

Data discovery, linking, citation

Search, harvest

Search, harvest

Deposit

Deposit

Deposit

(19)

 

      

Changing Times!

Information Providers

Information Consumers

All I am saying is that now is the time to develop

the technology to deflect an asteroid

Références

Documents relatifs

Besides, we present an example for publishing the Global Database of Events, Language and Tone (GDELT) as a Streaming Linked Data resource.. We open-sourced the code of our resource

MIAPPE v1.1 Overview Study Plant Material study metadata Experimental design + factors Observation unit Assay m m m m Investigation m Publication metadata Data files

This release includes the first three years of MaNGA data plus a new suite of derived data products based on the MaNGA data cubes, a new data access tool for MaNGA known as Marvin,

All observations of SSOs published in the Gaia DR2, both for astrometry and photometry, are based on measurements obtained by single CCDs.. The TDI rate is an instrumental constant,

Top panel: robust dispersion of the radial velocity residuals as a function of the external G ext RVS magnitude for Gaia DR2 versus the CU6GB, SIM, RAVE, APOGEE, and GB

Gaia DR2 represents the planned major advance with respect to the first intermedi- ate Gaia data release (Gaia DR1, Gaia Collaboration 2016a), making the leap to a

Finally, the spatial and spectral residuals for the unbinned maximum likelihood fit of the curved power law spectrum are displayed in Fig. Spectral residuals for nine sub-sectors

Stars fainter than G ext RVS = 12 are removed from the pipeline; the sour- ceId is also searched for in the auxiliary catalogues to identify standard stars (Sect. 4.1) and store