PANDAS: PANDORA Digital Archiving System. Archiving Web Resources Conference Information Day, Canberra

1
PANDAS
PANDORA Digital Archiving System
Archiving Web Resources Conference Information Day
Canberra, 12 November 2004
Paul Koerbin
Digital Archiving Branch
National Library of Australia
pkoerbin_at_nla.gov.au
2
PANDAS
  • Description and purpose
  • PANDORA Digital Archiving System
  • Web-based workflow management system
  • Developed to manage the web archiving processes
    at the National Library of Australia
  • Written in Java on Apple WebObjects application
    development platform
  • First version released in June 2001
  • Second (current) version released August 2002

3
PANDAS
  • Description and purpose
  • Record administrative metadata about titles
    selected (or rejected or monitored) for national
    preservation
  • Schedule and initiate harvesting
  • Manage the quality assurance process and
    associated problem reporting and fixing
  • Prepare items for public display through the
    PANDORA home page
  • Manage access restrictions
  • Generate management reports
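
The recording and scheduling functions listed on this slide can be pictured with a small sketch. It is not taken from PANDAS itself (which the slides describe as a Java/WebObjects application); the names `Status`, `TitleRecord`, `is_due` and `due_titles`, and the idea of a fixed harvest interval in days, are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from enum import Enum
from typing import List, Optional


class Status(Enum):
    """Selection decisions named on the slide (hypothetical encoding)."""
    SELECTED = "selected"
    REJECTED = "rejected"
    MONITORED = "monitored"


@dataclass
class TitleRecord:
    """Hypothetical administrative metadata for one title."""
    name: str
    seed_url: str
    status: Status
    harvest_interval_days: Optional[int] = None  # None: no recurring harvest
    last_harvested: Optional[date] = None        # None: never harvested


def is_due(record: TitleRecord, today: date) -> bool:
    """Return True if a selected title should be harvested on `today`."""
    if record.status is not Status.SELECTED:
        return False
    if record.harvest_interval_days is None:
        return False
    if record.last_harvested is None:
        return True  # selected and scheduled, but never harvested yet
    next_run = record.last_harvested + timedelta(days=record.harvest_interval_days)
    return today >= next_run


def due_titles(records: List[TitleRecord], today: date) -> List[TitleRecord]:
    """Filter the records whose harvest is due, preserving input order."""
    return [r for r in records if is_due(r, today)]
```

In this sketch only titles marked as selected are ever scheduled; rejected and monitored titles keep their metadata but are never returned by `due_titles`.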

4
PANDAS
5
PANDAS
  • How it works
  • Connects with and utilises other software and
    protocols for specific functions
  • Provides an interface to the harvesting software;
    currently this is HTTrack (http://www.httrack.com)
  • Uses WebDAV protocol to provide content managers
    with remote access to the harvested files
  • Uses Z39.50 protocol to access the National
    Bibliographic Database to extract metadata from
    the MARC record
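
As a rough illustration of what "an interface to the harvesting software" could involve, the sketch below assembles an HTTrack command line for one seed URL. The slides do not say how PANDAS actually invokes HTTrack, so the function name `build_httrack_command`, its parameters and the example paths are assumptions; only the `-O` (output path) and `-rN` (mirror depth) options are standard HTTrack command-line options.

```python
from typing import List, Optional


def build_httrack_command(seed_url: str,
                          output_dir: str,
                          depth: Optional[int] = None) -> List[str]:
    """Build an argument list for one HTTrack harvest (illustrative only).

    seed_url   -- starting URL of the title to be harvested
    output_dir -- directory where the mirror is written (HTTrack -O)
    depth      -- optional maximum link depth (HTTrack -rN)
    """
    if not seed_url.startswith(("http://", "https://")):
        raise ValueError("seed_url must be an absolute http(s) URL")
    if depth is not None and depth < 1:
        raise ValueError("depth must be a positive integer")

    cmd = ["httrack", seed_url, "-O", output_dir]
    if depth is not None:
        cmd.append(f"-r{depth}")
    return cmd


# The list can be handed to subprocess.run(cmd, check=True) on a machine
# where HTTrack is installed; it is not executed here.
cmd = build_httrack_command("http://www.example.org/", "/tmp/harvest/example", depth=3)
```

Passing the command as a list rather than a single string avoids shell quoting problems when a seed URL contains characters such as `&` or `?`.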

6
PANDAS
  • How it works
  • Title and subject listings and title entry pages
    are generated on-the-fly from PANDAS metadata
  • Some static web pages (documents, information)
  • Search engine
  • Unique identifying number generated by PANDAS
  • Persistent URL applied to title entry page
  • http://nla.gov.au/nla.arc-21220
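
The relationship between the unique identifying number, the persistent URL and the listings generated on the fly can be sketched as follows. The URL pattern is inferred from the single example on the slide (`nla.arc-21220`); the `Title` class, the function names, the listing layout and the sample titles and subjects are hypothetical and do not reflect the actual PANDAS data model.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Iterable

RESOLVER_BASE = "http://nla.gov.au/"  # base taken from the slide's example URL


@dataclass(frozen=True)
class Title:
    """Hypothetical minimal metadata for one archived title."""
    pi: int        # unique identifying number assigned by the system
    name: str
    subject: str


def persistent_url(pi: int) -> str:
    """Build the persistent URL for a title entry page (inferred pattern)."""
    return f"{RESOLVER_BASE}nla.arc-{pi}"


def subject_listing(titles: Iterable[Title]) -> str:
    """Render a plain-text subject listing from metadata at request time."""
    by_subject = defaultdict(list)
    for title in titles:
        by_subject[title.subject].append(title)

    lines = []
    for subject in sorted(by_subject):
        lines.append(subject)
        for title in sorted(by_subject[subject], key=lambda t: t.name.lower()):
            lines.append(f"  {title.name} <{persistent_url(title.pi)}>")
    return "\n".join(lines)


# Sample records are invented for illustration; only 21220 appears on the slide.
sample = [
    Title(21220, "Example Title B", "Politics"),
    Title(10001, "Example Title A", "Politics"),
    Title(10002, "Example Title C", "Arts"),
]
listing = subject_listing(sample)
```

Because the listing is computed from the metadata each time, a change to a title's name or subject would show up in the listing without any static page being edited.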

7
PANDAS
  • Demonstration

8
PANDAS
  • Planned developments and future directions
  • Ongoing development and enhancement of PANDAS
  • Improve robustness of system
  • Re-engineer PANDAS software
  • Need to achieve greater efficiencies and increase
    scale of web archiving activity

9
PANDAS
  • Planned developments and future directions
  • Automatically ingest and process larger volumes of
    online publications and associated metadata
    batches
  • Comply with international standards and adopt
    standard tools (IIPC)
  • Incorporate other collection methods: domain
    harvesting, deep web, deposit

10
PANDAS
  • Planned developments and future directions
  • Automate collection of more preservation metadata
    and develop metadata management interface
  • Improve access and discovery paths to the
    Archive's resources as it continues to grow

11
PANDAS
  • Availability
  • PANDORA partner agencies
  • Authenticated users
  • Public access to archived resources
  • PANDAS evaluation system
  • Documentation (manuals, data model, etc.) available
    online at http://pandora.nla.gov.au

12
PANDAS
  • Questions?