• Aucun résultat trouvé

Preface

N/A
N/A
Protected

Academic year: 2022

Partager "Preface"

Copied!
5
0
0

Texte intégral

(1)

Semantic Web and Information Extraction SWAIE 2012

Workshop in conjunction with the 18th International Conference on

Knowledge Engineering and Knowledge Management Galway City, Ireland, 9 October 2012

Edited by:

Diana Maynard Marieke van Erp

Brian Davis

(2)
(3)

Preface

There is a vast wealth of information available in textual format that the Seman- tic Web cannot yet tap into: 80% of data on the Web and on internal corporate intranets is unstructured, hence analysing and structuring the data - social ana- lytics and next generation analytics - is a large and growing endeavour. The goal of the 1st workshop on Semantic Web and Information Extraction was to bring researchers from the fields of Information Extraction and the Semantic Web to- gether to foster inter-domain collaboration. To make sense of the large amounts of textual data now available, we need help from both the Information Extrac- tion and Semantic Web communities. The Information Extraction community specialises in mining the nuggets of information from text: such techniques could, however, be enhanced by annotated data or domain-specific resources.

The Semantic Web community has already taken great strides in making these resources available through the Linked Open Data cloud, which are now ready for uptake by the Information Extraction community. The workshop invited contributions around three particular topics: 1) Semantic Web-driven Informa- tion Extraction, 2) Information Extraction for the Semantic Web, and 3) appli- cations and architectures on the intersection of Semantic Web and Information Extraction.

SWAIE 2012 had a number of high-quality submissions. From these, the 6 best papers were chosen for the two paper sessions of the programme: 4 long paper presentations and 2 short ones. Additionally, we held a lightning talks session where attendees could present brief 3-minute talks about late-breaking work or demos around the workshop themes, and a panel session involving some general discussion about these themes. To initiate the workshop, a keynote talk was provided by D.J. McCloskey, NLP Architect in IBM’s new Watson Solutions division. The keynote presented the post-Watson role of Information Extraction and its intersection with the Semantic Web.

We would like to thank the many people who helped make SWAIE 2012 such a success: the Programme Committee, the paper contributors, the invited speaker and panellists, and all the participants present at the workshop who engaged in lively debate.

Diana Maynard, University of Sheffield Marieke van Erp, VU University Amsterdam Brian Davis, DERI Galway

iii

(4)

Programme Committee

• Georgeta Bordea, DERI Galway, Ireland

• Matje van de Camp, Tilburg University, The Netherlands

• Christian Chiarcos, Information Sciences Institute, USA

• Hamish Cunningham, University of Sheffield, UK

• Thierry DeClerck, DFKI, Germany

• Robert Engels, Western Norwegian Research Institute, Norway

• Phil Gooch, City University London, UK

• Seth Grimes, Alta Plana Corporation

• Siegfried Handschuh, DERI, Ireland

• Dirk Hovy, Information Sciences Institute, USA

• Laurette Pretorius, University of South Africa, South Africa

• Birgit Proell, Johannes Kepler University of Linz, Austria

• Giusseppe Rizzo, EURECOM, France

• Piek Vossen, VU University, The Netherlands

• Marie Wallace, IBM Dublin, Ireland

• Ren´e Witte, Concordia University, Montreal, Canada

(5)

Contents

Unsupervised Improvement of Named Entity Extraction in Short Informal Context Using Disambiguation Clues

Mena B. Habib and Maurice van Keulen. . . 1 LODIE: Linked Open Data for Web-scale Information Extraction

Fabio Ciravegna, Anna Lisa Gentile and Ziqi Zhang . . . 11 Ontologies as a Source for the Automatic Generation of Gram-

mars for Information Extraction Systems

Thierry Declerck and Paul Buitelaar . . . 23 Identifying Consumers’ Arguments in Text

Jodi Schneider and Adam Wyner . . . 31 Identifying and Extracting Quantitative Data in Annotated Text

Don J. M. Willems, Hajo Rijgersberg and Jan L. Top . . 43 Scenario-Driven Selection and Exploitation of Semantic Data for

Optimal Named Entity Disambiguation

Panos Alexopoulos, Carlos Ruiz and Jos´e-Manuel G´omez- P´erez . . . 55

v

Références

Documents relatifs

Nevertheless, since in our case the BTC datasets are used as representative samples of the full Web of Linked Data, this decrease can be related to the fact that in 2012 and 2011

Evaluation To determine whether the lack of more detailed information has a negative effect on the performance of author disambiguation using these reference data, we have taken

While many general tools and algorithms already exist for data profiling, most of them cannot be used for graph datasets, because they assume a relational data structure, a

Given a lan- guage grammar, the tool allows to generate lightweight web editors, sup- porting syntax highlighting, syntax validation, scoping, and code com- pletion.. DSLFORGE

Table 2 presents the results of the survey of RDF(S) and OWL usage in our corpus, where for features with non-trivial semantics, we present the measures mentioned in the

To address this problem, the Asset De- scription Metadata Schema (ADMS), a common meta-model for semantic assets is developed to enable repository owners to publish their

They show that regardless of the implicit temporal nature of the query, web snippets contain a broad range of temporal information that can be used as a

Publishing mathematical knowledge is feasible within the current Linked Data architecture, but it would benefit from better application-layer support for dealing with combinations