• Aucun résultat trouvé

Semantic Web and Best Practice in Watson

N/A
N/A
Protected

Academic year: 2022

Partager "Semantic Web and Best Practice in Watson"

Copied!
1
0
0

Texte intégral

(1)

Semantic Web and Best Practice in Watson

Chris Welty IBM Research

1 About the Speaker

Dr Chris Welty is a Research Scientist at the IBM T.J. Watson Research Center in New York. His principal area of research is Knowledge Representation, specif- ically ontologies and the semantic web, and he spends most of his time applying this technology to Natural Language Question Answering as a member of the DeepQA/Watson team. He is best known as a co-developer of the OntoClean methodology with Nicola Guarino, and as the co-chair of the W3C RIF working group.

2 Speech Abstract

IBM’s revolutionary Watson system has successfully beaten human Jeopardy champions, and is now being extended and used in other domains, such as healthcare question answering, and financial data analysis. It is a common mis- conception that Watson is a through-and-through formally semantic system, which translates questions into formal language queries, and returns answers by executing those queries over a large knowledge base. In actual fact, Watson uses a variety of technologies to produce candidate answers to each question, and semantic technologies are primarily used in the subsequent candidate an- swer ranking components. In particular, linked data sources are used to provide typing evidence for candidate answers, but also several other answer ranking components rely more or less on semantics and linked data.

In this speech Dr Welty discusses these semantic components and the data upon which they operate, giving examples of expected and unexpected be- haviours, and how these affect the resulting answers returned by the system.

He also touches upon the methods and practices employed in developing and testing Watson, giving useful suggestions for practitioners building real world large-scale cognitive systems.

Références

Documents relatifs

Co-occurrence selection Without drug name normalization our dataset contains a total of 23,865,029 drug-adverse event co-occurrences, 1,847,073 more than reported in the IMJS

Keywords: Semantic Technologies, Customer Satisfaction, Forecast Models, Survey Data, Verbatim, Model Building, Ontologies and

As data is collected we need to record its context so that it may be properly interpreted, and when it is published we need to know its provenance so that we can understand quality

Semantic Web, Portal, Personalization, Single sign-on, Data integration, Data

The basic idea behind our approach is to use a reasoning system over the set of educational linked data on the Semantic Web so that the system can show a list of inter-related

By employing geospatial semantic features of knowledge bases, it is possible to integrate several data stores and only find information, that is valid within the range of a

ontology is an essential part of any solution to the problems of medical terminology. In Albania the knowledge about ontology are very little compared to

At the core of the platform lies the CrossCult Knowledge Base (CCKB), a repository for storage, management and retrieval of cultural heritage information, building on standard