• Aucun résultat trouvé

IMPROVING OPEN DATA ACCESSIBILITY THROUGH PACKAGE DEVELOPMENT AND COMMUNITY WORK

N/A
N/A
Protected

Academic year: 2021

Partager "IMPROVING OPEN DATA ACCESSIBILITY THROUGH PACKAGE DEVELOPMENT AND COMMUNITY WORK"

Copied!
1
0
0

Texte intégral

(1)

DRIVEN DTU

IMPROVING OPEN DATA ACCESSIBILITY THROUGH PACKAGE DEVELOPMENT AND COMMUNITY WORK

Diego Kozlowski1 Pablo Tiscornia2 Guido Weksler3,4 German Rosati4,5 Natsumi Shokida6 Antonio Vazquez Brust7,8 Demian Zayat9 Elio Campitelli10,4

1 FSTM-UL, 2 INDEC, 3FCE-UBA, 4CONICET, 5IHSS-UNSAM,6 Economía Femini(s)ta, 7FLACSO,8 UTDT, 9FD-UBA, 10CIMA 2020

The eph and presentes packages were developed by members of the R user group (RUG) in Buenos Aires, RenBaires, involving developers with diverse backgrounds. Their purpose is to improve access to public information.

It is an example of the impact of having a strong regional community on package development for improving data access.

EPH - holatam.github.io/eph OVERVIEW

The eph package [1] has as a goal to facilitate the work of those R- users that work with the Argentian Permanent Household survey, which doesn’t count with an official API. Some of its functionalities are:

I

Data gathering,

I

building data pools for cross-time analysis,

I

Organize the information from nomenclatures of occupation and economic activities,

I

organize labels of the database,

I

map the information by agglomerates,

I

replicate the official methodology for poverty and indigence.

GOALS

I

With this, we aim to ease the work of non-expert users, so they can focus on the data analysis, instead of the technical details. We also include warnings and detailed documentation for raising awareness on those things that might have an impact on the results (like data validity).

I

As the majority of the users of the survey come from Argentina or elsewhere in Latin America, and as a way to bring the R code

towards our community, the documentation of the package was written in Spanish.

One use-case of this package is in [3], a periodical report on gender inequalities in Argentina. Figure 1 was taken from this report.

FIGURE 1

presentes. diegokoz.github.io/presentes OVERVIEW

The presentes package includes the publicly available data about vic- tims of state terrorism during the last military dictatorship in Argentina.

The extensive research made by the unique registry of victims of state terrorism (RUVTE) and the memory park is available mostly in PDF files [4] or a webpage not available for bulk download [5]. And include infor- mation about:

I

Oficial victims of illegal repression,

I

victims of illegal repression without a legal claim,

I

victims data from the Parque de la memoria (memory park) database,

I

clandestine detention centers.

FIGURE 2

GOALS

These dataset include many relevant personal information about the victims origin as well as place and date of detention/kidnapping

and discovery of the mortal remains. Also, Clandestine Detention Center (CDC) records where extended with geolocatization obtained from their addresses.

Figure 2 shows the lo- cation of the CDC using Leaflet [6].

ACKNOWLEDGEMENT

The Doctoral Training Unit Data-driven computational modelling and applications (DRIVEN) is funded by the Luxembourg National Research Fund under the PRIDE programme (PRIDE17/12252781).

https://driven.uni.lu

REFERENCES

[1] Diego Kozlowski et al. holatam/eph: dplyr compatibilities. Version 0.3.1. May 2020. doi: 10.5281/zenodo.3842011. url:

https://doi.org/10.5281/zenodo.3842011.

[2] Hadley Wickham. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2016. isbn:

978-3-319-24277-4. url: https://ggplot2.tidyverse.org.

[3] Natsumi Shokida, Daiana Serpa, and Julieta Moure. La desigualdad de género se puede medir. url:

https://ecofeminita.github.io/EcoFemiData/informe_desigualdad_genero/trim_2019_03/informe.nb.html (visited on 06/08/2020).

[4] RUVTE. Informe de Investigación. es. Oct. 2017. url:

https://www.argentina.gob.ar/sitiosdememoria/ruvte/informe (visited on 06/08/2020).

[5] Parque de la Memoria. Base de datos de consulta pública. es-ES. url:

http://basededatos.parquedelamemoria.org.ar/registros/ (visited on 06/08/2020).

[6] Joe Cheng, Bhaskar Karambelkar, and Yihui Xie. leaflet: Create Interactive Web Maps with the JavaScript ’Leaflet’

Library. R package version 2.0.3. 2019. url: https://CRAN.R-project.org/package=leaflet.

Références

Documents relatifs

The remainder of the paper is structured as follows. Firstly, the research approach focused on 5 key issues will be detailed. Data collection and data treatment based on

A researcher will be better placed to conduct ethical research if they are mindful of the context they are working in and make a proactive effort to respect, and ensure benefits

In my opinion, now the nomination completely satisfies all mandatory criteria for safeguarding cultural heritage: it is the element of Intangible Cultural Heritage,

The present element demonstrates its main demand and purpose with sufficient information that completely meet the Convention’s definition of intangible cultural heritage in

However, key informants reported that once Muskrat Falls began, more individu- als chose to engage with E-RGM to work at the project and many exist- ing mobile workers chose to

Even if a country is using its own national stamp it is necessary to have the SGTIN number in human-readable form on the packaging unit for international track-and-trace

more impartial and longer lasting. Only then do we get to work on changing behaviour as a means of reinforcing the impact of the priority measures that have been taken. Most of

Recognizing the importance of the Regional Action Plan for Malaria Control and Elimination in the Western Pacific (2010–2015) 1 as a road map to guide national programmes, as