• Aucun résultat trouvé

Knowledge Graph Quality Management

N/A
N/A
Protected

Academic year: 2022

Partager "Knowledge Graph Quality Management"

Copied!
2
0
0

Texte intégral

(1)

Invited Talk: Knowledge Graph Quality Management

Gianluca Demartini?

School of Information Technology and Electrical Engineering, University of Queensland

GP South Building, Staff House Road, St Lucia QLD 4072 Australia

e-mail:demartini@acm.org

Abstract. This talk discusses recent research related to managing data quality for Knowledge Graphs and also some applications related to the biomedical domain. First, the talk will show how to deal with noisy labels in datasets which are used to train machine learning models. Then, the completeness dimension of knowledge graphs will be discussed. The fo- cus will be on work aiming to estimate the expected number of instances for a class in order to measure the level of data completeness. Related to this, the talk will show how crowdsourced knowledge graphs receive contribution towards increasing completeness and how different people contribute at different levels over time. Finally, it will be demonstrated how human bias reflected in the contributed data can be represented in the knowledge graph and surfaced to users. The second part of the talk will discuss a couple of application scenarios in the biomedical do- main where knowledge graph are used, including entity extraction and information access.

Speaker Bio: Dr. Gianluca Demartini is an Associate Professor in Data Science at the University of Queensland, School of Information Technology and Electrical En- gineering. His main research interests are Information Retrieval, Semantic Web, and Human Computation. He received Best Paper awards at the AAAI Conference on Human Computation and Crowdsourcing (HCOMP) in 2018 and at the European Conference on Information Retrieval (ECIR) in 2016 and the Best Demo award at the International Semantic Web Conference (ISWC) in 2011. He has published more than 100 peer-reviewed scientific publications including papers at major venues such as WWW, ACM SIGIR, VLDBJ, ISWC, and ACM CHI. He has given several in- vited talks, tutorials, and keynotes at a number of academic conferences (e.g., ISWC, ICWSM, WebScience, and the RuSSIR Summer School), companies (e.g., Facebook), and Dagstuhl seminars. He is an ACM Distinguished Speaker since 2015. He serves as co-editor in chief for the Human Computation Journal, area editor for the Journal of Web Semantics, editorial board member for the Information Retrieval journal, and as Crowdsourcing and Human Computation Track co-Chair at WWW 2018. He has

?Copyright©2019 for this text by its author. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).

i

(2)

been Program Committee member for several conferences including WWW, SIGIR, KDD, AAAI, IJCAI, ISWC, and ICWSM. Before joining the University of Queens- land, he was Lecturer at the University of Sheffield in UK, post-doctoral researcher at the eXascale Infolab at the University of Fribourg in Switzerland, visiting researcher at UC Berkeley, junior researcher at the L3S Research Center in Germany, and intern at Yahoo! Research in Spain. In 2011, he obtained a Ph.D. in Computer Science at the Leibniz University of Hanover focusing on Semantic Search.

ii

Références

Documents relatifs

Recognizing that logical reasoning is only one component of the overall task of answering questions, the team at SRI is in the process of formulating similar challenges for

the knowledge embedding is to learn entities and relations in KGs as vector representations in the low-dimensional space. Such representations can be used in many applications, e.g.

We will demonstrate a reusable framework for developing knowledge graphs that supports general, open-ended development of knowledge curation, interaction, and inference..

The main classes in the ontology are: SoftwareImage (image from which a container is deployed), ImageLayer (a line within the Docker file that install software within the

This integrated data space then serves as the basis for our fourth stage, where the integration of graph-specific machine learning capabilities transforms the passive data space into

The DDL workflow comprises four steps: (1) Data Property Reconciliation and Po- tential Identifiers Finder, (2) Entity Type and Domain Recognition, (3) Entity Disam- biguation, and

In this article, we propose the Sematch framework for entity search in the knowledge graph that combines natural language query processing, entity linking, entity type linking

Multilingual aspect: Defining the labels in the ontology in different languages allows to visualize the graph in a multilingual form as shown in Figure 1 which describes the