Title: Fun with Geospatial Metadata, CUGIR, CORC, MARC, and OAI: The CSDGM to MARC Grant Project
1Fun with Geospatial Metadata, CUGIR, CORC, MARC, and OAI: The CSDGM to MARC Grant Project
- Adam Chandler, Olin Library
- Elaine Westbrooks, Mann Library
- Vivek Uppal, Dept. of Computer Science
Metadata Working Group, October 12, 2001
2Project Workflow
Converter
Authority Control
3Problem
- How do we create a persistent URL pointing to
deep web digital objects across very different
systems?
4Overview (1)
- What is GIS?
- What is CUGIR?
- What are Geospatial Metadata?
- What is CORC?
5Overview (2)
- What is the FGDC Clearinghouse?
- Bucket Architecture
- What is OAI?
- Demonstration
- Outcomes
6What is GIS?
- A system for capturing, storing, checking,
manipulating, analyzing and displaying data which
are spatially referenced to the Earth. (DoE,
1987)
7GIS components (diagram): People, Software, Data, Procedures, Hardware
8Examples of GIS
9CUGIR
- Brief History
- Subject areas
- Unrestricted access to data and metadata
- National Geospatial Data Clearinghouse (NSDI)
10Geospatial Metadata
Content Standard for Digital Geospatial Metadata
- Brief History
- Definition
- Purpose / Uses / Benefits
- CUGIR Examples TAZ
11CORC
- What is CORC?
- Pathfinders
- Harvests metadata
- Authority control
- Classification
- Metadata: MARC / DC-RDF
12FGDC Clearinghouse
13(No Transcript)
14CUGIR
15(No Transcript)
16Show Metadata
17(No Transcript)
18Non-Persistent URL
http://130.11.52.184/servlet/FGDCServlet/retrieve?on2elFdbcurrentrp3mr1accurrentcid181
19Clearinghouse Problems
- 0 hits for users 50% of the time (unpublished research)
- 10% or more of nodes are offline (Z39.50)
- FGDC Clearinghouse is not well known
- Record maintenance is undefined
- Harvesting is not part of the model
- Data/metadata are coupled with the server software (i.e., smart archive, dumb object DL model)
20Open Archives Initiative
- low-barrier interoperability
- data-provider service-provider model
- metadata harvesting model
- shared metadata format; parallel, community-specific metadata formats
- authentication deliberately kept outside the protocol
- Source: Herbert Van de Sompel, CS 502, Spring 2001
Diagram: Service Provider -> HTTP (GET request) -> Data Provider; Data Provider -> XML response -> Service Provider
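The request/response exchange between a service provider and a data provider can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the project's harvester: the endpoint `http://example.org/oai` and the sample response are hypothetical, and the `verb` and `metadataPrefix` parameter names and XML namespaces follow OAI-PMH 2.0, which postdates this talk. No network call is made; the XML response is an inline string.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical data-provider endpoint (assumption, not the project's address).
BASE_URL = "http://example.org/oai"

# Dublin Core element namespace used by the shared oai_dc format.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_request(base_url, verb, **params):
    """Return the HTTP GET URL a service provider would send."""
    query = {"verb": verb}
    query.update(params)
    return base_url + "?" + urlencode(query)


def parse_titles(xml_text):
    """Extract every dc:title from a data provider's XML response."""
    root = ET.fromstring(xml_text)
    return [element.text for element in root.iter("{%s}title" % DC_NS)]


# Made-up response in the shape of an OAI-PMH 2.0 ListRecords reply.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample geospatial data set</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

print(build_request(BASE_URL, "ListRecords", metadataPrefix="oai_dc"))
print(parse_titles(SAMPLE_RESPONSE))
```

The low barrier is visible in the sketch: the request is an ordinary URL and the response is plain XML, so a harvester needs nothing beyond an HTTP client and an XML parser.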
21OAI example
22(No Transcript)
23Buckets
- Buckets are part of the larger Smart Object,
Dumb Archive (SODA) DL Model (Maly, Nelson, & Zubair,
1999). SODA is a reaction to the vertically
integrated (and non-interoperable) DLs that
tended to grow from the ad-hoc origins of many
popular DLs (Esler & Nelson, 1998). Separating
the functionality of the archive from that of the
DL allows for greater interoperability and
federation of DLs. The archive's purpose is to
provide DLs the location of buckets (the DLs can
poll the buckets themselves for their metadata),
and the DLs build their own indexes.
Source: M. Nelson, "Smart Objects and Open Archives," D-Lib Magazine, February 2001, 7(2).
http://www.dlib.org/dlib/february01/nelson/02nelson.html
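The division of labor described in the quotation can be illustrated with a toy model. This is a minimal sketch, not code from the project or from Nelson's bucket implementation: the class names, the `WEB` dictionary standing in for the network, the example.org locations, and the sample titles are all hypothetical.

```python
# Stand-in for the network: maps a location (URL) to the bucket living there.
WEB = {}


class Bucket:
    """A 'smart object': it carries its own metadata and answers for itself."""

    def __init__(self, location, metadata):
        self.location = location
        self._metadata = dict(metadata)
        WEB[location] = self  # "publish" the bucket at its location

    def get_metadata(self):
        return dict(self._metadata)


class Archive:
    """A 'dumb archive': it only tells DLs where the buckets are."""

    def __init__(self):
        self._locations = []

    def register(self, bucket):
        self._locations.append(bucket.location)

    def list_locations(self):
        return list(self._locations)


class DigitalLibrary:
    """A DL polls the buckets directly and builds its own index."""

    def __init__(self):
        self.index = {}

    def harvest(self, archive):
        for location in archive.list_locations():
            metadata = WEB[location].get_metadata()  # ask the bucket itself
            for word in metadata["title"].lower().split():
                self.index.setdefault(word, set()).add(location)

    def search(self, word):
        return sorted(self.index.get(word.lower(), set()))


archive = Archive()
archive.register(Bucket("http://example.org/bucket/1", {"title": "Sample roads layer"}))
archive.register(Bucket("http://example.org/bucket/2", {"title": "Sample wetlands layer"}))

dl = DigitalLibrary()
dl.harvest(archive)
print(dl.search("roads"))
```

Note that `Archive` never stores or searches metadata; any number of `DigitalLibrary` instances could harvest the same archive and index it differently.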
24On Buckets and OAI
Just as buckets break the dependency of the
information objects on archives, the Open
Archives Initiative (OAI) breaks the dependency
of archives on DLs.
- Michael L. Nelson
25On Buckets and OAI
The OAI does not address the issue of smart
objects, but the archives in the OAI are very
similar to the archives described in the SODA
model in that they have minimal functionality.
OAI archives aim for greater interoperability
through performing less sophisticated functions
(no keyword search functions defined, T&C is not
handled at the protocol level) -- a sort of
Reduced Instruction Set Computer philosophy for
archives.
- Michael L. Nelson
26Problem Revisited
- How do we create a persistent URL pointing to
deep web digital objects across very different
systems?
27Project Workflow
Converter
Authority Control
28Step 1 Convert SGML -> MARC
29Step 2 MARC Bucket Location
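Steps 1 and 2 can be sketched as a small crosswalk. This is a minimal sketch under stated assumptions, not the project's converter: it assumes the CSDGM record is available as well-formed XML rather than raw SGML, maps only three elements (title to 245 $a, abstract to 520 $a, theme keyword to 650 $a) as an illustrative subset, and represents MARC fields as plain tuples. The sample record and bucket URL are hypothetical.

```python
import xml.etree.ElementTree as ET

# Illustrative subset of a CSDGM-to-MARC mapping (assumption for this sketch):
# path below the <metadata> root -> (MARC tag, subfield code).
CROSSWALK = [
    ("idinfo/citation/citeinfo/title", "245", "a"),
    ("idinfo/descript/abstract", "520", "a"),
    ("idinfo/keywords/theme/themekey", "650", "a"),
]


def csdgm_to_marc(xml_text, bucket_url):
    """Return MARC fields as (tag, subfield, value) tuples."""
    root = ET.fromstring(xml_text)  # root element is <metadata>
    fields = []
    # Step 1: copy each mapped CSDGM element into its MARC field.
    for path, tag, code in CROSSWALK:
        for element in root.findall(path):
            text = (element.text or "").strip()
            if text:
                fields.append((tag, code, text))
    # Step 2: record the bucket location in an 856 field, subfield u.
    fields.append(("856", "u", bucket_url))
    return fields


# Made-up CSDGM record for illustration only.
SAMPLE = """<metadata>
  <idinfo>
    <citation><citeinfo><title>Sample roads layer</title></citeinfo></citation>
    <descript><abstract>Hypothetical road centerlines.</abstract></descript>
    <keywords><theme><themekey>transportation</themekey></theme></keywords>
  </idinfo>
</metadata>"""

for tag, code, value in csdgm_to_marc(SAMPLE, "http://example.org/bucket/1"):
    print(f"{tag} ${code} {value}")
```

Keeping the mapping in a data table rather than in code means new element-to-field pairs can be added without touching the conversion logic.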
30Step 3a Import to CORC (MARC)
MARC
View
856 Bucket
31Step 3b Import to CORC (DC RDF)
DC RDF
View
856 Bucket
32Step 4a Import DC-RDF
33Step 4b Import DC RDF to Local MySQL Database
34MARC Record in Voyager
Click here to see bucket
35Table Bucket
36Table DC
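One way a bucket table and a DC table could work together to keep a URL persistent is sketched below. This is a minimal sketch under stated assumptions: the column names, the `resolve` and `relocate` helpers, and the example.org URLs are hypothetical, the project's actual schema is not shown in the transcript, and SQLite stands in for the local MySQL database so the example runs without a server.

```python
import sqlite3

# In-memory SQLite database standing in for the local MySQL database.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE bucket (
    bucket_id INTEGER PRIMARY KEY,
    data_url  TEXT NOT NULL
);
CREATE TABLE dc (
    bucket_id INTEGER NOT NULL REFERENCES bucket(bucket_id),
    element   TEXT NOT NULL,
    value     TEXT NOT NULL
);
""")


def add_record(bucket_id, data_url, dc_pairs):
    """Store a bucket's current data location and its Dublin Core pairs."""
    conn.execute("INSERT INTO bucket VALUES (?, ?)", (bucket_id, data_url))
    conn.executemany(
        "INSERT INTO dc VALUES (?, ?, ?)",
        [(bucket_id, element, value) for element, value in dc_pairs],
    )


def resolve(bucket_id):
    """Map the stable bucket ID to wherever the data lives right now."""
    row = conn.execute(
        "SELECT data_url FROM bucket WHERE bucket_id = ?", (bucket_id,)
    ).fetchone()
    return row[0] if row else None


def relocate(bucket_id, new_url):
    """Update the location when the underlying system changes."""
    conn.execute(
        "UPDATE bucket SET data_url = ? WHERE bucket_id = ?", (new_url, bucket_id)
    )


add_record(1, "http://old.example.org/roads.zip", [("title", "Sample roads layer")])
relocate(1, "http://new.example.org/roads.zip")
print(resolve(1))
```

Catalog records would cite only the bucket ID (for example in an 856 field), so moving the data requires one `relocate` call rather than edits to every record that points at it.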
37Project Outcomes
- 1. Access for patrons
  - 1,300 MARC records in OPAC
  - 1,300 MARC records in CORC/WorldCat
  - 2,600 DC-RDF records in OAI
- 2. OAI data provider (DLF agreement)
- 3. CORC evaluation
- 4. Foundation for CUGIR 2.0
- 5. Data for cross-collection access research