• Aucun résultat trouvé

The creation of a new type of scientific deposit: Software

N/A
N/A
Protected

Academic year: 2021

Partager "The creation of a new type of scientific deposit: Software"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01738741

https://hal.inria.fr/hal-01738741

Submitted on 20 Mar 2018

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Distributed under a Creative Commons Attribution| 4.0 International License

The creation of a new type of scientific deposit: Software

Yannick Barborini, Roberto Di Cosmo, Antoine R. Dumont, Morane Gruenpeter, Bruno Marmol, Alain Monteil, Jozefina Sadowska, Stefano

Zacchiroli

To cite this version:

Yannick Barborini, Roberto Di Cosmo, Antoine R. Dumont, Morane Gruenpeter, Bruno Marmol, et al.. The creation of a new type of scientific deposit: Software. RDA Eleventh Plenary Meeting, Berlin, Germany, Mar 2018, Berlin, Germany. 2018. �hal-01738741�

(2)

Software Heritage sponsors : Contacts : hal.inria.fr josephine.sadowska@inria.fr hal.archives-ouvertes.fr ccsd@ccsd.cnrs.fr www.softwareheritage.org morane@softwareheritage.org

The creation of a new type of scientific deposit:

Software

CCSD¹, HAL-Inria², Software Heritage³

Y. Barborini¹, R. Di Cosmo³, A.R. Dumont³, M. Gruenpeter³, B. Marmol¹, A. Monteil², J. Sadowska², S. Zacchiroli³

Software preservation: a scientific challenge

Software has become an indissociable support of technical and scientific knowledge. The preservation of this universal body of knowledge has become as essential as preserving research articles and data sets.

Software preservation is a pillar of reproducibility. In the quest for making scientific results

reproducible, and pass the knowledge over to future generations, the three main pillars are:

scientific articles, that describe the results, the data sets used or produced, and the software

that embodies the logic of the data transformation[1].

Figure 1: The pillars of knowledge preservation

Software deposit

The collaboration between Software Heritage

(SWH), Hal-Inria and the CCSD has resulted with a

new type of scientific deposit in the national open archive.

Researchers have now the possibility to deposit software source code on Hal-Inria.

Figure 2: The form dedicated to software deposits

The steps for a software deposit:

• deposit a source code archive (.zip) • choose deposit type: software

• add associated metadata • add the software authors

• accept the archival of the deposit on SWH

Figure 3: The life cycle of research software

The descriptive metadata

To ensure an accurate description of the software, different metadata are available on the deposit form and are preserved with the software in the SWH archive. An example:

Provided by the system: MUST:

- Hal identifier - title

- publication date - description - swh-id - authors

SHOULD: MAY:

- license - dependencies - keywords - platform/OS - repository - funding

The intrinsic and persistent identifier

To be able to reproduce an experiment, knowing the exact version of the software used is essential. Soft-ware Heritage will provide the swh-id, intrinsically bound to software components, ensuring persis-tent traceability across future development and or-ganizational changes. The swh-id, like a fingerprint of the Software is specific, persistent and unique. It does not depend on an ID resolver.

Figure 4: The deposit on Hal-Inria

The actors

Software Heritage took the challenge to collect, preserve and share all software that is publicly available in

source code form. Hal-Inria is the open archive of Inria- The French Institute for Research in Computer Sci-ence and Automation. Hal-Inria provides, since 2005, access to the Hal platform, developed by the CCSD-The Center for Direct Scientific Communication. Its main mission is to provide tools, in the respect of open access principles, for archiving and dissemination of scientific publications and data.

Transfer deposit to SWH

Once the deposit is validated, it is pushed to SWH using SWORD protocol. SWH will proceed with the injection of the source code into Alexandria's

Li-brary of Software and will generate the intrinsic

identifier - the swh-id. Hal retrieves the swh-id to use in the citation format.

Figure 5: The software deposit workflow

Figure 6: Browse the deposit on Software Heritage

Software citation

Following the software citation principles[2] and thus considering that software is a legitimate and citable product of research, we have proposed a ci-tation format containing metadata submitted with the software.

Figure 7: Software citation format[3]

Citation is essential for promoting the recognition of software as a valuable research output, and en-suring that the authors have their contributions recognised and rewarded[4].

Références

1.Roberto Di Cosmo, Stefano Zacchiroli (2017) Software Her-itage: Why and How to Preserve Software Source Code. iPRES 2017. https://hal.archives-ouvertes.fr/hal-01590958

2.Smith et al. (2016), Software citation principles. PeerJ Com-put. Sci. 2:e86; DOI 10.7717/peerj-cs.862.

3.Yolanda Gil (2015) Documenting Software through Meta-data. Geosoft.

4.Mike Jackson (2014) How to cite and describe software. The Software Sustainability Institute https://www.soft-ware.ac.uk/how-cite-and-describe-software

Références

Documents relatifs

Everyone will be permitted to modify and redistribute GNU, but no distributor will be allowed to restrict its further redistribution.’ Compare these hopefully

Keywords: minimizing the errors, language for the description of the problem, the principle of “many eyes”, meta-language, method of analyzing the context of words and documents,

Our last point in regard to social norms is that translating models into software in the broader context of relationships between com- putational chemistry and the industry leads to

In this paper, the lack of quality within today’s interactive systems due to insuf- ficient software engineering methods, which don’t consider end users in terms of active

These initiatives are driven by such factors as the considerable impact of the Improvisation Technologies CD-ROM by William Forsythe, the interest of artists and the public to the

Centre de recherche INRIA Paris – Rocquencourt : Domaine de Voluceau - Rocquencourt - BP 105 - 78153 Le Chesnay Cedex Centre de recherche INRIA Rennes – Bretagne Atlantique :

Help researchers and laboratories to improve the management and dissemination of software and data improves the smooth functioning of research. Definition (L,

A power law was observed when AE events occur at constant stress (creep tests) but not when the mechanical behaviour is mainly an elastic response (tensile tests at low temperatures