• Aucun résultat trouvé

Adventures in Linked Data: Building a Connected Research Environment

N/A
N/A
Protected

Academic year: 2022

Partager "Adventures in Linked Data: Building a Connected Research Environment"

Copied!
63
0
0

Texte intégral

(1)

Adventures in Linked Data: Building a Connected Research Environment

Lisa Goddard, Memorial University Libraries

& INKE Research Group Access 2012, Montreal QC

October 19th, 2012

(2)

Canadian Writing Research Collaboratory (CWRC)

CFI funded initiative to establish an online

infrastructure for

literary research in and about Canada.

www.cwrc.ca

(3)

Canadian Writing Research Collaboratory (CWRC)

“To enable unprecedented avenues for studying the words that most move people

in and about Canada.”

Authoring Dissemination Discovery

(4)

Collaboration

Emergent Scholars

Senior Scholars Librarians Programmers Project

Managers

“We need to stop talking around the issue of the single-author monograph as the

benchmark for excellence.” (DH Manifesto, 2004)

(5)

Why Linked Data for the Humanties?

(6)

Improved Search

Find all references to food in Joyce’s Ulysses.

(7)

New Discovery Tools

Instead of looking for a needle in a

haystack, an effective text mining tool will show you the shape of the haystack and tell you what needles are there that you'd want to stick in your foot. (Rockwell,

2011)

(8)

Interoperability

Disparate data sources and incompatible data structures are among the biggest

obstacles for 21st century humanities researchers.

(RIN, 2011)

(9)

Collaboration

Linked data doesn’t just accommodate collaboration, it enforces collaboration.

(10)

Big (Text) Data

Scale is a new horizon of intellectual inquiry.

What kinds of humanistic

phenomena appear only at scale? (Liu, 2012)

(11)

Heterogeneity

Text data is messy.

(12)

Getting Started

(13)

Dev Platform

The Orlando Project

(14)

Identify Top Level Entities

http://domain.org/doc.html Annotation

(15)

Mint URIs for Entities

http://this.ca/doc

http://this.ca/event http://this.ca/place

http://this.ca/person

http://this.ca/org

http://this.ca/book http://this.ca/film

http://this.ca/annotation Annotation

(16)

Cool URIs Don’t Change

Sir Tim Berners-Lee is judging you.

Do you really feel that the old URIs cannot be kept

running? If so, you chose them very badly.

- Sir Tim

(17)

Minting URIs

Abstract away from implementation details.

http://tiger.cwrc.ca/person.php?id=

virginia-woolf&format=rdf

http://cwrc.ca/person/virginia-woolf

(18)

Canonical URIs

http://dbpedia.org/resource/Virginia_woolf (303 redirects and content negotiation)

http://dbpedia.org/data/

Virginia_Woolf.rdf http://dbpedia.org/page/

Virginia_Woolf

(19)

URI Patterns

http://dbpedia.org/resource/Mary_Shelley

http://id.loc.gov/authorities/names/n85300519 http://viaf.org/viaf/95216565/

http://cwrc.ca/person/6qMXIR5UNQSiupA

(20)

Define Relationships

http://this.ca/doc

http://this.ca/event http://this.ca/place

http://this.ca/org

http://this.ca/book http://this.ca/film

subjectOf

bornIn

employs employedBy

adaptedFrom

published creator

setIn

annotates subjectOf

hostedBy

http://this.ca/annotation Annotation

http://this.ca/person

(21)

RDF Statement

wrote

http://viaf.org/viaf/39385478/

http://purl.org/dc/elements/1.1/creator

http://dbpedia.org/resource/Mrs_Dalloway

predicate

subject object

(22)

Machine Readable Definitions

http://this.ca/doc

http://this.ca/event http://this.ca/place

http://this.ca/org

http://this.ca/book http://this.ca/film

subjectOf

bornIn

employs employedBy

adaptedFrom

published creator

setIn

annotates subjectOf

hostedBy

http://this.ca/annotation Annotation

? http://this.ca/person

(23)

Ontologies

The semantic web is basically an accessibility initiative for machines.

(24)

Role of Ontologies

Define entities (classes)

Define relationships (properties)

Impose rules to support machine reasoning.

Machine-readable (RDF)

(25)

Reuse or Build?

An ontology is for life.

(26)

Ontology Dowsing

(27)

foaf:Person (Class)

(28)

frbr:Work (Class)

(29)

Assigning Entities to Classes

cwrc:Woolf rdf:type

rdf:type

http://xmlns.com/foaf/0.1/Person http://cwrc.ca/person/456789

http://www.w3.org/1999/02/22-rdf-syntax-ns#type

http://cwrc.ca/person/456789 http://purl.org/vocab/frbr/core#Work http://www.w3.org/1999/02/22-rdf-syntax-ns#type

foaf:Person

cwrc:

MrsDalloway frbr:Work

(30)

Ontology: Predicates

Person to Person

e.g. Relationship Ontology (rel:EnemyOf)

Document to Document

e.g. Bibliographic Ontology (bibo:reviewOf)

Annotation to Document

e.g. Open Annotation Ontology (oa:hasTarget)

(31)

Ontology: Predicates

Person Person

LifePartnerOf

Domain Range

(32)

Classes and predicates work together to support reasoning.

Woolf Mrs.

Dalloway livedIn

Woolf Mrs.

Dalloway lifePartnerOf

Woolf Mrs.

Dalloway creator

(33)

CWRC RDF Ontologies

Entity Relevant Ontologies

Person FOAF, MADS, EAC-CPF

Work FRBR, MODS, DC

Place WGS84 Geo Positioning, Geonames Organization FOAF, EAC-CPF

Event OWL Time, Event

Annotation Open Annotation, OAI-ORE

(34)

So Far

defined our major entities and relationships

minted URIs to represent those things within the CWRC data store

selected ontologies that will help computers to reason about our entities and relationships

(35)

Data Model

(36)

Integrating the Data Model

Creates

(37)
(38)
(39)

CWRC-Writer Tagging Toolbar

(40)
(41)

Results from Authority File.

Results from Authority File.

(42)

API Lookup to Web Services.

API Lookup to Web Services.

(43)

VIAF OpenSearch

http://viaf.org/viaf/search?query=local.names+all+%2

2margaret%20atwood%22+&maximumRecords=100&startRecor d=1&sortKeys=holdingscount&httpAccept=application/rs

s%2Bxml

(44)

DBPedia RESTful API

curl -H "Content-Type: text/xml" -d @post.xml http://dbpedia.org/fct/service > result.xml

<?xml version="1.0"?>

<query

xmlns="http://openlinksw.com/services/facets/1.

0" inference="" same-as="">

<text>Mary Shelley</text>

<view type="text" limit="5" offset=""/>

</query>

(45)

Add custom identifier.

Add custom identifier.

(46)
(47)
(48)

Entity in XML

<span name="ent_1798" class="entity person start” type="person” entity="true” />

Russell Bertrand

<span name="ent_1798" class="entity person end” type="person” entity="true” />

(49)

Entity in RDF

<rdf:Description rdf:ID="ent_1798”>

<w:type type="props">person</w:type>

<w:content type="props">Bertrand Russell</

w:content>

<w:term type="info">Bertrand Russell, 1872-1970</

w:term>

<w:viafid type="info">36924137</w:viafid>

<w:bnf type="info">11923140</w:bnf>

<w:dnb type="info">118604287</w:dnb>

<w:lc type="info">n79056054</w:lc>

<w:certainty type="info">definite</w:certainty>

</rdf:Description>

(50)
(51)
(52)
(53)

List of named entities.

Add Relationship.

Add Relationship.

(54)
(55)
(56)
(57)

Relations Tab

(58)

Relationship in RDF

<rdf:Description rdf:about="ent_1798"

w:external="false">

<w:isAParentOf w:text="is a parent of"

w:external="false">

<rdf:Description rdf:about="ent_1802"

w:external="false" />

</w:isAParentOf>

</rdf:Description>

(59)

Entity in RDF

<rdf:Description rdf:ID="ent_1798">

<w:id type="offset">ent_1798</w:id>

<w:offset type="offset">117</w:offset>

<w:length type="offset">16</w:length>

<w:entity type="offset">true</w:entity>

<w:type type="props">person</w:type>

<w:content type="props">Bertrand Russell</w:content>

<w:term type="info">Bertrand Russell, 1872-1970</w:term>

<w:viafid type="info">36924137</w:viafid>

<w:bnf type="info">11923140</w:bnf>

<w:dnb type="info">118604287</w:dnb>

<w:lc type="info">n79056054</w:lc>

<w:certainty type="info">definite</w:certainty>

</rdf:Description>

(60)

“Patricia Spence was nanny to Russell’s children before becoming his third wife.”

http://cwrc.ca/person/12345

oa:hasTarget oa:h

asBody

oa:hasSemanticTag

http://cwrc.ca/work/12345

oa:Annotation

http://cwrc.ca/annotation/12345

(61)

CWRC Entity Management System

Entity Management Interface

(add, edit, merge, split entities)

Indexed By Entity storage

Entity lookup

CWRC Writer (document authoring interface)

Reads From Writes To

Accesses

Triple Store Populates

Discovery Tools Enables

Write s XM

L Docs

(62)

Credits

CWRC

Susan Brown

Mariana Parades-Olea Jeffery Antoniuk

Denilson Barbosa Ruth Knetchel

Omar Rodriguez-Arenas

INKE

Ray Siemens Stan Ruecker Harvey Quamen John Simpson Jentery Sayers

(63)

Related Links

CWRC Writer Beta:

http://cwrctc.artsrn.ualberta.ca/

CWRC Writer Overview:

http://www.cwrc.ca/projects/infrastructure-proj ects/technical-projects/cwrc-writer/

About CWRC:

http://www.cwrc.ca/about/

Références

Documents relatifs

And we believe that today, data journalism is a premier type of such intermediaries, because data journalists combine the expertise of how to access, interpret,

In order to update the definition of sexual health, identify challenges and opportunities in addressing sexual health, and identify strategies that countries and regions could

Traditional OA systems are not as suitable for aligning LOD ontology schemas; for example, equivalence relations are limited among LOD concepts so that OA systems for LOD

This paper focuses on the conversion of IFC Building Information Models (BIM) to RDF Abox graphs using the emerging W3C Linked Building Data (LBD) modular ontologies: BOT

The publication is carried out following the principles of Linked Data through a life-cycle that includes a research process for the specification of reliable data sources, the

LD-R aims to apply the idea of component-based application development into RDF data model hence enhancing current user interfaces to view, browse and edit Linked Data..

Kerstin Schneider received her diploma degree in computer science from the Technical University of Kaiserslautern 1994 and her doctorate (Dr. nat) in computer science from

In this paper, we present a novel approach that leverages the small graph patterns oc- curring in the Linked Open Data (LOD) to automatically infer the se- mantic relations within