• Aucun résultat trouvé

Using kittens to unlock photo-sharing website datasets

N/A
N/A
Protected

Academic year: 2021

Partager "Using kittens to unlock photo-sharing website datasets"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01306513

https://hal.archives-ouvertes.fr/hal-01306513

Submitted on 24 Apr 2016

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Using kittens to unlock photo-sharing website datasets

Simon Gascoin

To cite this version:

Simon Gascoin. Using kittens to unlock photo-sharing website datasets. EGU General Assembly, Apr 2016, Vienna, Austria. 2016. �hal-01306513�

(2)

Printing:

Using kittens to unlock photo-sharing website datasets

Simon Gascoin - CNRS/Centre d’Etudes Spatiales de la Biosphère, Toulouse, France. [email protected]

Context

Mining photo-sharing websites is a promising approach to complement in situ and satellite observations of the environment, however a challenge is to deal with the large degree of noise inherent to online social datasets.

Data

Method

Result

Then all images tagged with “chat”, “gat” or “gato” (cat in French, Spanish and Catalan languages) were queried. The tag “cat” was not considered in order to exclude the results from North America where Flickr got popular earlier than in Europe. The number of “cat” images per month was used to fit a gauss model of the number of images uploaded in Flickr with time. This model was used to remove this trend in the numbers of snow-tagged photographs.

Using the Flickr application programming interface all the public images metadata tagged at least with one of the following words were queried: “snow”, “neige”, “nieve”, “"neu” (snow in French, Spanish and Catalan languages). The search was limited to the geotagged pictures in the Pyrenees area. However, the number of public pictures available for a given time interval depends on several factors, including the Flickr website popularity and the development of digital photography.

The method was evaluated on a time series of the snow cover area in the Pyrenees.

The comparison with MODIS snow cover area shows that the method effectively removes the first-order trend in the flick data (Spearman’s R increases from 0.5 to 0.8)

References

Gascoin, S., Hagolle, O., Huc, M., Jarlan, L., Dejoux, J.-F., Szczypta, C., Marti, R., and Sánchez, R.: A snow cover climatology for the Pyrenees from MODIS snow products, Hydrol. Earth Syst. Sci., 19, 2337-2351, doi:10.5194/hess-19-2337-2015, 2015.

Gascoin, S. Using kittens to unlock photo-sharing website datasets. Journal of Brief Ideas, doi:10.5281/zenodo.44809, 2016.

This study was restricted to 2003-2011. Since 2011 the advent of smartphones with built-in GPS and camera has strongly increased the amount of geotagged data. These data have the potential to overcome some limitations of remote sensing products like cloud obstruction provided that (i) the users choose a public sharing license (ii) the hosting website provides open source API to programmers.

Conclusion

MODIS snow maps were

processed to generate a daily

cloud-free snow cover

climatology over 2000-2013

(Gascoin et al., 2015).

Références

Documents relatifs

During the design phase we considered four data sources: Genomic Data Commons (GDC, [JFGS17]), containing over 310,000 files, across over 32,000 cases, in 40 projects, covering

The Center for Expanded Data Annotation and Retrieval (CEDAR) developed a suite of tools¾the CEDAR Workbench¾that allows users to build metadata templates using ontologies to

For this reason, the ontology described in this paper is currently being integrated into the sys- tem to allow users to use META-SHARE as the basic vocabulary for querying

The evaluation results showed that a simple combination of different feature spaces using classifiers not specifically designed for taking into account the big variety of concepts

[r]

We have designed a multilingual search interface (a front-end for the Flickr database) where users can search in three modes: no translation, automatic translation in the

Although the precision compared to the baseline decreases to 0.3075, substituting the thesaurus terms for the original query text works better in 12 of the 25 cases, showing

The first ap- proach called follow-your-nose (link traversal) [11] is based on following links between data to discover potentially rele- vant data, and the second one is