Interoperability Fundamentals: - PowerPoint PPT Presentation

About This Presentation
Title:

Interoperability Fundamentals:

Description:

Interoperability Fundamentals: OAI-PMH and OAI-ORE. SUETr ... London School of Economics Library. Dr Robert Sanderson. Dept. of ... Conformance Tester ... – PowerPoint PPT presentation

Number of Views:80
Avg rating:3.0/5.0
Slides: 23
Provided by: ukol
Category:

less

Transcript and Presenter's Notes

Title: Interoperability Fundamentals:


1

Interoperability Fundamentals OAI-PMH and
OAI-ORE
Interoperability Fundamentals OAI-PMH and
OAI-ORE SUETr Interoperability Event 9th
December 2008 London School of Economics
Library Dr Robert Sanderson Dept. of Computer
Science University of Liverpool azaroth_at_liverpoo
l.ac.uk http//www.openarchives.org/ore/ http/
/foresite.cheshire3.org/
2

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Overview
  • OAI Protocol for Metadata Harvesting
  • Introduction
  • Technical Details
  • Example
  • OAI Object Reuse and Exchange
  • Introduction
  • ORE for Repositories (Motivation)?
  • RDF and Atom
  • Support

3

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • OAI Protocol for Metadata Harvesting
  • Does pretty much what it says on the tin An
    XML over HTTP protocol ... that allows a client
    to harvest ... all of the metadata records in a
    repository.

Service Provider
Data Provider
OAI-PMH Request ListIdentifiers, GetRecords
OAI-PMH Response (Records)?
Local Fetch Record
Local Store Record
4

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Architecture
  • Distinction between Data Provider (repository)
    and Service Provider (someone who does something
    with the data)
  • Most service providers are aggregators of more
    than one repository Eg Search, Analysis,
    Summarization, Caching, Proxies, ...
  • Or could be used for inter-repository
    transfer/update, where the Service Provider is
    also a Data Provider.
  • Distinction between Centralized and Distributed
    architecture
  • Centralized Harvest everything into one place
    and then search (PMH)?
  • Distributed Leave data where it is and search
    remotely (Z39.50/SRU)
  • But can be combined distributed search over
    centralized database providing an SRU interface
    and single distributed databases

5

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Technical Details
  • Single URL end point that handles protocol eg
    http//www.cheshire3.org/services/oai?
  • Operation (verb) as a parameter
  • Identify Tell me about
    yourself
  • ListMetadataFormats Tell me which formats you
    support
  • ListSets What sets of
    records do you support
  • ListIdentifiers Retrieve headers
    for records
  • ListRecords Retrieve full
    records
  • GetRecord Retrieve single
    known record
  • List operations by timestamp of update to the
    record
  • ...?verbListIdentifersmetadataPrefixoai_dcfrom
    2008-12-01
  • Hence can ask only for changed records since you
    last harvested
  • Compare to RSS/Atom (even order isn't
    guaranteed!)?

6

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Support?
  • LOTS of libraries, as the protocol is easy to
    implement.
  • Simple google stats for oai-pmh download
    (language)?
  • c 2,450
  • perl 4,520
  • c 5,440
  • ruby 19,800
  • python 21,700
  • java 28,000
  • php 47,300
  • Okay not all are implementations, but you get the
    picture!
  • Active mailing list (still!)
  • Repository Explorer / Conformance Tester
  • Lots of service providers looking to suck up
    data (eg OAIster)?

7

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Example Interaction
  • Harvester wants to fetch all of the metadata
    records in a repository since it last harvested,
    in the simple dublin core format.
  • Verb to use ListRecordshttp//repo.example.org/
    oai?verbListRecordsfrom2008-11-01metadataPrefi
    xoai_dc
  • Response

ltOAI-PMHgt ltresponseDategt2002-06-01T192030Zlt/res
ponseDategt ltrequest verb"ListRecords"
from"2008-11-01" metadataPrefix"oai_dc"gt
http//repo.example.org/oai?lt/requestgt
ltListRecordsgt ltrecordgt ltheadergt
ltidentifiergtoaiarXiv.orghep-th/9901001lt/iden
tifiergt ltdatestampgt2008-12-02lt/datestamp
gt lt/headergt ltmetadatagt
lt!-- Record Is Here --gt
lt/metadatagt lt/recordgt
ltrecordgt ... lt/recordgt ... lt/ListRecordsgt lt/OAI
-PMHgt
8

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • I Don't Want All This _at_)!
  • Problem In order to download the records you
    want, you have to download everything and then
    filter it. This just wastes everyone's time.
  • Solution (?)
  • There are server defined sets of records (not
    nested).
  • Each record knows which sets it is a member of.
  • Can fetch only those records which are part of a
    named set.
  • How are the sets defined? By the
    server/repository admin...
  • Many people have tried to add search
    functionality to OAI-PMH...
  • This is Wrong Wrong Wrong and shows a fundamental
    misunderstanding of the role of OAI-PMH in the
    overall information landscape!
  • For search, there's OpenSearch and SRU. (Another
    talk!)?

9

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • What is ORE?
  • A method for making complex digital objects
    available over the web...
  • In order for the object and its component parts
    to be easily and seamlessly reused as parts of
    other objects and in other contexts...
  • And exchanged between organizations,
    infrastructures and services.
  • A set of projects funded by the Andrew W.
    Mellon Foundation, the Coalition for Networked
    Information, Microsoft, the National Science
    Foundation, and the Joint Information Systems
    Committee, under the Open Archives Initiative.

10

Interoperability Fundamentals OAI-PMH and
OAI-ORE
Who is Responsible? Principal Investigators Car
l Lagoze (Cornell
University)? Herbert Van de Sompel (Los Alamos
National Labs)? Editors Pete Johnston
(Eduserv Foundation)? Michael Nelson
(Old Dominion University)? Rob
Sanderson (University of
Liverpool)? Simeon Warner (Cornell
University)? Technical and Advisory
Boards Including Liz Lyon, Peter Murray Rust,
Les Carr, Richard Jones, Julie Allinson, Andy
Powell, Lorcan Dempsey, John Erickson, MacKenzie
Smith, Tony Hammond, Savas Parastatidis, Robert
Tansley, Jane Hunter, Tim Cole, Leigh Dodds, Tim
DiLauro, Jeff Young, ...
11

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Main Idea of ORE
  • Create a way to describe an Aggregation of
    Resources... and the relationships between
    them... without changing the way we do
    things... without changing the resources
    themselves... in a manner consistent with the
    web architecture
  • Add boundary information over top of the
    connected resources on the web
  • Publish this information using existing
    technologies... which we call a Resource Map
  • This is concept is nothing new...

12

Interoperability Fundamentals OAI-PMH and
OAI-ORE
The Sky
An Aggregation of Stars
13

Interoperability Fundamentals OAI-PMH and
OAI-ORE
The Web
... with Boundary Information
... and Additional Relationships
Aggr
ReM
14

Interoperability Fundamentals OAI-PMH and
OAI-ORE
ORE for Repositories
Key 1 URI 2 Formats 3 Title 4 Authors 5
Creation Dates 6 Similar Objects 7 Versions 8
Links out 9 Citations in/out a Abstract b
Journal
15

Interoperability Fundamentals OAI-PMH and
OAI-ORE
16

Interoperability Fundamentals OAI-PMH and
OAI-ORE
17

Interoperability Fundamentals OAI-PMH and
OAI-ORE
RDF The ORE Data Model is defined as a Graph,
and expressed in RDF. We express these
relationships as triples
X
4
1
Aggr
ReM
3
5
2
6
URI-ReM URI-Aggr URI-Aggr URI-1 URI-5
oredescribes oreaggregates oreaggregates dcterm
sreferences rdfseeAlso
URI-Aggr URI-1 ... URI-6 URI-2 URI-X
18

Interoperability Fundamentals OAI-PMH and
OAI-ORE
Where's the Data? Triples can also have literal
strings, numbers, dates etc URI-Aggr
dctermsmodified 2008-12-09T103000Z URI-Aggr
dctitle Rob's New Aggregation In
our examples, the green aggregated resources are
the different formats for the same work. That
makes the Aggregation a resource that somehow
represents the work in the abstract, and the
Resource Map a description of that. URI-ReM
oredescribes URI-Aggr URI-ReM dctermsmodifie
d 2008-12-09T103000Z URI-Aggr oreaggregates
URI-ps, URI-pdf, URI-html URI-Aggr dctitle
Parametrization of ... URI-Aggr dctermsmodifie
d 2006-01-18T063000Z URI-Aggr oresimilarTo
infodoi/10.1142/S02177... URI-Aggr
dctermscreator URI-Hui URI-Hui foafname H
ui Li URI-ps dcformat application/postscri
pt ...
19

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Serializations
  • RDF has MANY serializations, including simple
    triple formats, XML formats, and RDFa a way to
    embed RDF in XHTML.
  • Recommended are RDF/XML and RDFa.
  • Also recommended is an Atom serialization
  • Each Aggregation is an Atom ltentrygt, and the atom
    elements are mapped to the predicate (middle)
    part of the triple, eg author ? dctermscreator
  • Aggregated Resources are referenced in ltlinkgt
    elements.
  • Anything that can't be expressed natively in atom
    goes into an ltoretriplesgt extension block.
  • This allows aggregations to sit in regular Atom
    feeds for discovery
  • And plays nicely with other Atom based protocols
    like OpenSearch or other GData like systems

20

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Support
  • Not as much as OAI-PMH ... yet! Version 1.0 only
    released in October.
  • Libraries Foresite Toolkit http//foresite-tool
    kit.googlecode.com/Java (ORE 0.9, Richard
    Jones) and Python (ORE 1.0, me)Idea Build an
    object model on top of RDF graph a
    Aggregation() a.title New Aggregation
  • Validator Atom/ORE Validator http//www.openarc
    hives.org/ore/1.0/atom-validatorFrom Los Alamos
    National Labs, plus other transforms
  • Generic RDF Libraries, ConvertersAvailable in
    most languages... talk to me about writing a
    foresite library!

21

Interoperability Fundamentals OAI-PMH and
OAI-ORE
  • Repository Operations
  • Create Send ORE in Atom via SWORD from client
  • Update Send ORE in Atom via SWORD from client
    to existing URI
  • Search Return ORE via OpenSearch/SRU
  • Harvest Return ORE via OAI-PMH
  • Archive Archive ORE Resource Map plus
    Aggregated Resources
  • Export Export ORE (to be created by other)
  • Import Create from ORE
  • Real Life Example
  • Wrapper around Flickr API to export
    Photos/Photosets (Rob)?
  • ORE Importer into Omeka Digital Library Platform
    (Sean Hannan)?
  • Ran importer against flickr wrapper to import
    photos out of flickr, along with metadata,
    different sizes, etc. Seamless Interoperability!
  • Other examples DSpace, Fedora, MyExperiment,
    JSTOR, WordPress, ...

22

Interoperability Fundamentals OAI-PMH and
OAI-ORE
Thank You )? Questions? URLs Me
azaroth_at_liverpool.ac.uk PMH
http//www.openarchives.org/pmh/ ORE
http//www.openarchives.org/ore/ Foresite
http//foresite-toolkit.googlecode.com/ This
http//www.csc.liv.ac.uk/azaroth/papers/suetr-ore
.pdf (Bonus points for expressing the above as
an ORE Aggregation!)?
Write a Comment
User Comments (0)
About PowerShow.com