• Aucun résultat trouvé

Forging New Links: Libraries in the Semantic Web

N/A
N/A
Protected

Academic year: 2022

Partager "Forging New Links: Libraries in the Semantic Web"

Copied!
97
0
0

Texte intégral

(1)

Forging New Links:

Libraries in the Semantic Web

Lisa Goddard & Gillian Byrne Memorial University Libraries

Computers in Libraries, Washington D.C.

March 23rd, 2011

(2)

The Gist

General Semantic Web

•How it works.

•A few tools.

•Who’s involved?

Lisa

Libraries & Linked Data

•What it solves.

•Issues & obstacles.

•Where we are now.

Gillian

(3)

Web Search Problems

(4)

High Recall, Low

Precision

(5)

Vocabulary Dependent

(6)

Returns Single Web Pages

(7)

Access to Deep Web

(8)

Identity

(9)

Comparisons

Academic Staff Member

(University College London) Faculty Member

(McGill) Equivalent to?

(10)

Semantic Queries

Find all soccer players, who played as goalkeeper for a club that has a stadium with more than

40,000 seats and who are born in a country with more than 10 million inhabitants.

(11)

Structured Databases

(12)
(13)

The Semantic Solution

1. Structured data

2. Vocabularies & Reasoning

3. Linking

(14)

Machine-Actionable Data

(15)

Structured Data: RDF

(16)

Semantic Web Metadata: RDF

Data model for writing simple statements about web objects.

RDF statements are written as “triples”.

Subject Object

Shakespeare Macbeth

Predicate

Wrote

V

Statement

(17)

RDF Triples

Subject Predicate Object

Shakespeare Shakespeare Anne Hathaway Shakespeare Stratford

Macbeth England Scotland

Wrote Wrote Married Lived in Is in

Set in Part of Part of

King Lear Macbeth

Shakespeare Stratford

England Scotland UK

UK

(18)

RDF Graph: A Semantic Net

AnneHathaway

Shakespeare

Stratford

UK

Macbeth KingLear

Scotland England

wrote

isIn

setIn

(19)
(20)

Vocabularies: Ontologies

(21)

Controlled Vocabulary

Terms and definitions are posted online, so they can be shared by many different organizations.

http://www.mun.ca/lit.owl

#wrote

#setIn

#play #book

#poem

#narrated

xmlns:lit=http://www.mun.ca/lit.owl lit:book

lit:poem lit:wrote

(22)

Shared Vocabularies

Namespaces allow us to combine several vocabularies while maintaining distinct meaning of each element.

#person

#partOf

http://gmu.edu/bio.owl

http://mit.edu/geo.rdf http://mun.ca/lit.owl

#wrote

#setIn

#married

#play #book

#poem

#narrated #isIn

#country

#city

#region

#birthdate

#deathdate

(23)

RDF: Semantic Net & Namespaces

bio:AnneHathaway

bio:Shakespeare

geo:Stratford

geo:UK

lit:Macbeth lit:KingLear

geo:Scotland geo:England

lit:wrote

geo:isIn

lit:setIn

xmlns:geo=http://mit.edu/geo.rdf xmlns:bio=http://gmu.edu/bio.owl xmlns:lit=http://mun.ca/lit.owl

*In RDF namespace URIs always resolve.

(24)

FOAF: People

<rdf:RDF

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"

xmlns:foaf="http://xmlns.com/foaf/0.1/"

<foaf:Person rdf:ID="me">

<foaf:name>Lisa Goddard</foaf:name>

<foaf:title>Ms.</foaf:title>

<foaf:givenname>Lisa</foaf:givenname>

<foaf:family_name>Goddard</foaf:family_name>

<foaf:homepage rdf:resource="http://twitter.com/lisagoddard"/>

<foaf:workplaceHomepage rdf:resource="http://www.library.mun.ca"/>

<foaf:schoolHomepage rdf:resource="http://www.queensu.ca"/>

<foaf:knows>

<foaf:Person>

<foaf:name>Gillian Byrne</foaf:name>

<foaf:mbox rdf:resource="mailto:gbyrne@mun.ca"/></foaf:Person></foaf:knows>

<foaf:knows>

<foaf:Person>

<foaf:name>Dianne Keeping</foaf:name>

<foaf:mbox rdf:resource="mailto:dckeep@mun.ca"/>

</foaf:Person></foaf:knows>

</foaf:Person>

</rdf:RDF>

(25)

Resolvable URIs

xmlns:foaf="http://xmlns.com/foaf/0.1/"

<foaf:Person rdf:ID="me">

<foaf:name>Lisa Goddard</foaf:name>

<foaf:workplaceHomepage

rdf:resource="http://www.library.mun.ca"/>

</foaf:Person>

http://xmlns.com/foaf/0.1/#term_workplaceHomepage

(26)

Resolvable URIs

(27)

Web Ontology Language: OWL

An ontology describes a particular domain of knowledge (e.g. bikes, whiskey).

•Establishes controlled vocabulary.

•Models relationships between entities & concepts.

•Built-in rules and datatypes that support reasoning.

(28)

Reasoning: Ontologies

(29)

OWL Reasoning: Inverse

lit:wrote owl:inverseOf lit:writtenBy lit:settingFor owl:inverseOf lit:setIn

bio:Shakespeare lit:wrote lit:Macbeth lit:Macbeth lit:setIn geo:Scotland

lit:Macbeth lit:writtenBy bio:Shakespeare geo:Scotland lit:settingFor lit: Macbeth

Axiom

Explicit Fact

Implicit Fact

(30)

OWL Reasoning: Transitive

hasParent rdfs:subpropertyOf hasAncestor hasAncestor rdf:type owl:TransitiveProperty

Lisa hasParent Geoffrey Geoffrey hasParent Elizabeth Elizabeth hasParent Beatrix

Lisa hasAncestor Geoffrey Geoffrey hasAncestor Elizabeth Lisa hasAncestor Elizabeth Geoffrey hasAncestor Beatrix Lisa hasAncestor Beatrix

Axiom

Explicit Facts

Implicit Facts

(31)

OWL Reasoning: Symmetrical

bio:married rdf:type owl:SymmetricProperty

bio:AnneHathaway bio:married bio:Shakespeare

bio:Shakespeare bio:married bio:AnneHathaway

Axiom

Explicit Fact

Implicit Fact

(32)

OWL Reasoning: Equivalent

act:borrows owl:equivalentProperty lib:checkedout lit:WilliamShakespeare owl:sameAs bio:Shakespeare

bio:Lisa act:borrows lib:book

bio:Shakespeare lit:wrote lit:Macbeth

bio:Lisa lib:checkedout lib:book

lit:WilliamShakespeare lit:wrote lit:Macbeth

Explicit Facts Axiom

Implicit Facts

(33)

Finding Ontologies

(34)

Tabulator RDF Browser

(35)

Protege editor

to develop Ontologies.

Free

download.

Developed by Stanford

(36)

Linking Distributed Data

(37)

Linking Data

(38)

Linking Data

(39)

Linking Hubs

(40)

Semantic Linking Hubs

(41)

Link Discovery

Discover other representations of an object.

(tw:lisagoddard owl:sameas fb:lgoddard) Discover related resources.

(e.g. Person <-> Publication)

(42)

Link Discovery: Silk Server

Directors Movies

dbpedia:director foaf:made

(43)

Natural Language Processing

Identify people, places, things, concepts in

unstructured text files.

Disambiguate terms.

Suggest links to existing entities.

(44)
(45)
(46)
(47)
(48)

RDF Publishing Tools

(49)

Structured Data Crosswalks

(50)

Drupal 7 CMS

(51)

Semantic MediaWiki

(52)

Semantic MediaWiki

(53)

Semantic MediaWiki

(54)

Semantic Blogging: Zemanta

(55)

Related Media

Related Tags Related

Articles

(56)

Mainstream Linked Data

(57)

Growth of RDFa

Usage of RDFa increased 510% between Mar 2009 and Oct 2010

(58)

Websites with RDF(a)

(59)
(60)

Google Faceted

Recipe

Search

(61)

Google Place Pages

(62)

Semantic Technologies

(63)

Library Linked Data

(64)

Disconnected Data

(65)

Silos in the Library

Source: Provincial Archives of Newfoundland & Labrador

(66)
(67)
(68)

Lost Content

(69)

Weak Links

(70)

Missed Opportunities

(71)

What’s our value?

• 2.0 thinking: our value is linked to our data

• 3.0 thinking: our value is linked to our

(re)useable, shareable data?

(72)

Making the Connection

(73)

Enhanced Linking

http://info.library.mun.ca/uhtbin/cgisirsi/dMp3Asia73/QEII/269560141/18/X100/XAUTHOR/M acpherson,+Cluny,+1879-1966

http://viaf.org/viaf/76097050

(74)

Enhanced Content

(75)

Enhanced personalization

Julie

History 1012

List ID: 12

Newfoundland to confederation

@prefix aiiso: <http://purl.org/vocab/aiiso/schema#>.

@prefix resource: <http://purl.org/vocab/resourcelist/schema# >.

@prefix foaf: <http://xmlns.com/foaf/0.1/> .

@prefix bibo: <http://purl.org/ontology/bibo/> .

@prefix dcterms: <http://purl.org/dc/terms/> .

(76)

Obstacles

(77)

Competing Vocabularies

...how many ways to describe a book, journal article or a place?

Ian Millard, Hugh Glaser, Manuel Salvadores, Nigel Shadbolt http://eprints.ecs.soton.ac.uk/21681/5/cold2010-slides.pdf

(78)

Co-referencing

1. http://dbpedia.org/resource/Cluny_MacPherson

2. http://dbpedia.org/resource/Dr._Cluny_MacPherson 3. http://mpii.de/yago/resource/Cluny_MacPherson

4. http://rdf.freebase.com/ns/guid.9202a8c04000641f800000 00005e34c1

5. http://viaf.org/viaf/76097050

6. http://umbel.org/umbel/ne/wikipedia/Cluny_MacPherson

(79)
(80)

Discovering

• Lots (and lots and lots) of linked data out there

• How to find it?

(81)
(82)

Querying

(83)
(84)

Trust

The largest hurdle to library adoption of Linked Data, though, may not be

educational or technological …The

sticking point for librarians may be an issue of trust.

- Ross Singer, “Linked Data Now!”

(85)

Preservation

What happens when a dataset or linking hub

disappears?

(86)

Data ownership

Library Catalogue

Digital Archive

Database Repository

Ejournal

(87)

Licensing

“You shall not use the data made available through the GC Open Data Portal in any way which, in the opinion of Canada, may bring disrepute to or prejudice the reputation of Canada."

(88)

VoID

:DBpedia a void:Dataset ; dcterms:license

<http://www.gnu.org/copyleft/fdl.html> .

schema to describe linked

datasets

(89)

Oh - One more thing…

“who’s minding the ranch?”

(90)

RDA

• Works with in MARC, but also works as a linked

data Metadata Vocabulary

(91)
(92)

RDF Converters

(93)

Publishing Tools

(94)

Where we are now

(95)

Age of Chaotic Innovation?

LIBRIS (Swedish Union Catalog)

Library of Congress (LCSH, OSI)

German National Library Hungarian National Library British Library

Europeana

Linked Periodicals Data Virtual International Authority File

Dewey Decimal Classification BIBSYS’ authority files

Thesaurus for Economics Rameau

Swedish Subject Headings German Subject Headings

Metadata Authority Description Schema (MADS)

(96)

The Chaos Tamers

• W3C Linked Library Data Incubator Group

• IFLA Semantic Web Interest Group

• CKAN Linked Library Data Group

• LITA/ALCTS Linked Library Data Interest Group

(97)

Questions?

Source: http://www.jenniferbetman.com

Références

Documents relatifs

Through a use case of world health organization data, this project shows that current linked open data technologies have a potential to represent heterogonous

It is also of critical importance for the MSc in Web Science to incorporate an extension to this Semantic Wiki which will allow the instructors to query multiple repositories

In this context, the system should be able to automatically retrieve and integrate information from different sources and to registry educational information about the tools

Linked Data is about employing the Resource Description Framework (RDF) and the Hypertext Transfer Protocol (HTTP) to publish structured data on the Web and to connect data between

This tool provides much easy-to-use functionality to facilitate users in this definition work, such as drag-and-drop mapping, visualization mapping, data source annotation,

Existe-t-il un triangle ABC dont le périmètre est égal à 134 centimètres, le rayon du cercle inscrit est égal à 12 centimètres et le rayon du cercle circonscrit est égal à

Existe-t-il un triangle ABC dont le périmètre est égal à 134 centimètres, le rayon du cercle inscrit est égal à 12 centimètres et le rayon du cercle circonscrit est égal à

The design of URIs satisfies the Linked Data principles 1) and 2). We also provide relevant information for the URI of each entity, and this implementation helps satisfy the