• Aucun résultat trouvé

Linking entities for enriching and structuring social media content

N/A
N/A
Protected

Academic year: 2022

Partager "Linking entities for enriching and structuring social media content"

Copied!
1
0
0

Texte intégral

(1)

Linking Entities for Enriching and Structuring Social Media Content

Raphaël Troncy

EURECOM Biot, France

[email protected]

ABSTRACT

Social media platforms such as Twitter, Facebook or LinkedIn become a reliable source of news and play a key role for being aware of events around the world. Using social me- dia to recognize, enrich or summarize events is however very challenging. In the first part of this talk, we will present ADEL, a novel hybrid architecture for an adaptive entity linking system, that combines methods from the nat- ural language processing, information retrieval and semantic fields. The framework enables to link all the mentions oc- curring in a text to their entity counterparts in a knowledge base. It is modular and adaptive since it enables to pro- cess text written in different languages and of different kind (newswire, tweets, blog posts, etc.) while entities can be of common types (PERSON, LOCATION and ORGANI- ZATION) or specific ones (dates, numbers) and be disam- biguated in generic or specialized knowledge bases. We will show how ADEL can outperform the state-of-the-art sys- tems on the reference NEEL challenges that happens in the yearly #Micropost workshop (2014-2016).

In the second part of this talk, we will present a framework that can collect microposts from more than 12 social plat- forms and that contain media items, as a result of a query – for example a trending event. We will then show how we can automatically create different visual storyboards that reflect what users have shared about this particular event. The vi- sualization emphasizes the different aspects of storyboards.

A graph view shows the relationships between microposts and topics that we automatically extract, while the timeline view emphasizes the time dimension. The user can watch and interact with the summarized view of all the topics or select a particular one with the additional details. In addi- tion, the states of different views are persistent through the URLs which makes easy sharing possible.

Copyright is held by the author/owner(s).

WWW’16 Companion,April 11–15, 2016, Montréal, Québec, Canada.

ACM 978-1-4503-4144-8/16/04.

http://dx.doi.org/10.1145/2872518.2892109.

BIO

Dr. Rapha¨el Troncy1 is an Associate Professor in the Data Science Depart- ment of EURECOM lead- ing the Multimedia Seman- tics group. He is also co- chair of the W3C Media Fragments Working Group and W3C Incubator Group on Multimedia Semantics, contributes to the W3C Web Annotations Working Group and numerous W3C Community Groups such as Schema.org or Open Linked Education. He is work- ing on information extrac-

tion (project ASRAEL2and in particular named entity link- ing (framework NERD3), ontology modeling in the multi- media domain (project DOREMUS4) and large scale data integration. Applications range from smart cities (project 3cixty5) to social TV and second screen (projects LinkedTV6 and NexGen-TV7) where semantic web, information extrac- tion and multimedia analysis technologies are used together.

Acknowledgments

This work has been partially supported by Bpifrance within the NexGen-TV Project, under grant number F1504054U.

1. REFERENCES

[1] J. Plu, G. Rizzo, and R. Troncy. Revealing Entities from Textual Documents Using a Hybrid Approach. In 3rd International Workshop on NLP & DBpedia, 2015.

[2] G. Rizzo, M. van Erp, and R. Troncy. Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web. In9thInternational Conference on Language Resources and Evaluation (LREC’14), 2014.

1http://www.eurecom.fr/~troncy

2http://asrael.limsi.fr/

3http://nerd.eurecom.fr/

4http://doremus.org/

5http://3cixty.com/

6http://www.linkedtv.eu/

7http://nexgentv.fr/

597

Références

Documents relatifs

Several useful features can be used for EL, such as entity context information in original sentence, popularity of each entity’s sense, word co-occurrence in

In the MULTISENSOR (Mining and Understanding of multilin- guaL contenT for Intelligent Sentiment Enriched coNtext and Social Oriented inteRpretation) project 1 , we have developed

This paper describes a code mixed named entity recogni- tion from Social media text in English-Hindi and English- Tamil language pair data.. Our proposed approach is a hy- brid model

In order to measure the adaptation performance of word embedding features, we trained the WAN network on the well-formed Formal Set 1 and reported results on the other data sets

In this work we have identified the mention by creating an entity recognition model on the given training data and link them to the DBpedia using DBpedia Spotlight.. The rest of

Crowdsourcing of Linked Open Data from social media about earthquakes and other disasters supports information filtering and grouping tasks [2].. These ob- servations about

In the present paper, we mainly cover the audio processing aspects of the system, includ- ing speech recognition, speaker diarization and linking (across recordings), the use of

This paper shows how text-based ontology acquisition meth- ods can be enriched by taking all types of domain specific textual units into account, named entities as well as terms,