• Aucun résultat trouvé

Personal Knowledge Base Systems

N/A
N/A
Protected

Academic year: 2021

Partager "Personal Knowledge Base Systems"

Copied!
3
0
0

Texte intégral

(1)

HAL Id: hal-01592601

https://hal.inria.fr/hal-01592601

Submitted on 25 Sep 2017

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Personal Knowledge Base Systems

Serge Abiteboul, David Montoya

To cite this version:

Serge Abiteboul, David Montoya. Personal Knowledge Base Systems. PAP 2017, Personal analytics

and privacy, Sep 2017, Skopje, Macedonia. �hal-01592601�

(2)

Personal Knowledge Base Systems

Serge Abiteboul INRIA & ENS, Paris

serge.abiteboul@inria.fr

David Montoya Engie Ineo & ENS Cachan

david@montoya.one September 24, 2017

The typical Internet user has personal data spread over several devices and across several online systems. Geeks already know how to control their per- sonal data. It is now (or soon will be) possible for everyone to do the same, and there are many advantages to doing so. Everyone should now manage his/her personal information management system. We explain the main fea- tures of personal information management systems (sometimes called, self-data management systems). We consider advantages they provide from societal view- points. We argue that they are coming because of a number of reasons, political, economical, technical, and societal.

We believe that a main argument for their future success is that they enable new functionalities based on knowledge integration, and thus that the future systems are personal knowledge management systems. Such a system should integrate into a coherent whole data of very different nature, in particular, emails and other messages, calendars, contacts, GPS locations, web searches, bills, bank and medical records, etc. The integration of data that typically have to be exported from various systems should be performed by transforming all the information into a single knowledge base on a machine the user controls.

The resulting system thus acts as a digital home for all the digital knowledge of the person.

This integration allows the user to query her personal information within and across different dimensions. It also enables performing analytics to learn from all this information. We believe that many users would be reluctant to give access to all their information to a single company, or if they did that, they would require strong guarantee of privacy. The fact that the integration, the query evaluation, and analysis happen on a system directly controlled by the user guarantees her privacy.

To illustrate the possibly very rich derivation of knowledge, suppose that a user, say Alice, met a friend, say Bob. The agenda indicates the time of a meeting with “Bob”. Some text messages may remove the ambiguity on which

D´epartement d’informatique de l’ENS, ´Ecole normale sup´erieure, CNRS, PSL Research University, 75005 Paris, France

1

(3)

Bob she met (Alice may know several Bobs). The phone number he used has long been aligned with a specific Bob. The location of Alice, as provided by her phone’s GPS, provides the name of the place where they met. Bob’s timeline in his social network provides pictures of the event. Finally, from her web search history, we can even learn about topics they discussed. A meeting that had traces in different systems turned into a rich event in the KB.

Designing such a personal KB is not easy: Data of completely different nature has to be modeled in a uniform manner, pulled into the knowledge base, and integrated with other data. Different from related work in the area of information integration, we cannot rely on outside sources, we cannot expect redundancy, and we have a very limited and particular set of attributes.

We will mention such a system, the Thymeflow system that for instance aligns events based on time, e.g., a meeting in the user’s calendar and a GPS position, or on space, e.g., the address of a contact and a GPS position.

Acknowledgments We will discuss works performed with collegues that we would like to thank:

• Benjamin Andr´ e, Cozy Cloud

• Daniel Kaplan, Fing

• Am´ elie Marian, Rudgers University

• Thomas Pellissier Tanon, ENS Lyon

• Pierre Senellart, ENS

• Fabian M. Suchanek, Telecom ParisTech

The bibliography provides references to these works. More references can of course be found in these papers.

References

[1] Serge Abiteboul, Benjamin Andr, Daniel Kaplan: Managing your digital life. Commun. ACM 58(5): 32-35 (2015)

[2] Serge Abiteboul, Amlie Marian: Personal Information Management Sys- tems, Tutorial, EDBT/ICDT Conference, 2015.

https://www.slideshare.net/ameliemarian

[3] David Montoya, Thomas Pellissier Tanon, Serge Abiteboul, Fabian M.

Suchanek: Thymeflow, A Personal Knowledge Base with Spatio-temporal Data. CIKM 2016: 2477-2480, 2015

[4] David Montoya, Serge Abiteboul, Pierre Senellart: Hup-me: inferring and reconciling a timeline of user activity from rich smartphone data. SIGSPA- TIAL/GIS 2015: 62:1-62:4

2

Références

Documents relatifs

when a user activates a data source (i.e. Facebook), he is then redirected to the service provider site to grant the PeDF the required level of

LinkedCJ, our new ontology for Chinese academic journals, inherits a list of clas- ses, object properties and data properties from existing ontologies of semantic

This project is aimed at knowledge acquisition and metadata integration, providing fast response for a variety of complicated queries, such as finding articles based on same

l’a fait remarquer dans l’extrait 43, la combinaison que l’on peut voir ci-dessus ne fonctionnait pas, c’est pourquoi le groupe propose une autre solution, faisant preuve ici de

For every protein pair the following information is stored: the calculated 22 features normalized and in their initial form, if there exists evidence about

Design parameters and performance characterisation of the DBD plasma actuators that will be employed further on in the thesis to realize instability/flow control in the

A general approach for understanding such tables with the help of linked open data web [2] would be to (1) link the val- ues in each cell to known entities in a linked data

Like a DSSP, a Personal Dataspace Support Platform (PDSP) should provide a set of services to access the data contained in the dataspace, and allow basic handling of this data,