Moving to PANDAS 3 - PowerPoint PPT Presentation

1 / 26
About This Presentation
Title:

Moving to PANDAS 3

Description:

PANDAS the NLA's software for managing the process of gathering, archiving and ... Pandas 3 redevelopment began September 2004, after limitations/issues with ... – PowerPoint PPT presentation

Number of Views:36
Avg rating:3.0/5.0
Slides: 27
Provided by: Matthew590
Category:
Tags: pandas | moving | pandas

less

Transcript and Presenter's Notes

Title: Moving to PANDAS 3


1
  • Moving to PANDAS 3
  • Future-proofing Institutional Websites
  • 19-20 January 2006
  • London
  • Matthew Walker
  • Deputy Director, Collection Infrastructure
  • IT Division
  • National Library of Australia

2
Background
  • NLA archiving web sites since October 1996.
  • Processes based on a toolkit of public domain
    software.
  • Need for management tool.
  • requirements specification began in 1998.
  • No suitable off-the-shelf products available at
    the time.
  • Development work commenced in 1999.
  • System named PANDAS.

3
Definitions
  • PANDORA the NLAs archive of website
    resources.
  • PANDAS the NLAs software for managing the
    process of gathering, archiving and publishing
    website resources.
  • PANDORA website the delivery mechanism for the
    PANDORA archive.

4
About PANDAS
  • Pandas 1 released in June 2001.
  • Pandas 2 released in August 2002.
  • Pandas 3 redevelopment began September 2004,
    after limitations/issues with Pandas 2 were
    identified and examined.
  • 1st release scheduled March 2006.

5
PANDAS 2
  • Title/resource management
  • Nominating/Identifying
  • Selecting
  • Gathering
  • Processing
  • Archiving
  • Publishing
  • Cataloguing
  • Permissions/Restrictions

6
PANDAS 2
  • Add/edit
  • users.
  • agencies/institutions.
  • collections.
  • subject categories.
  • Manage display of information on PANDORA delivery
    website.

7
PANDAS 2 Broad Software Architecture
8
PANDAS 2 System Architecture
Web Server
WebDAV
WebObjects Application Server
Resolver Service
Lucene Full-text Search Service
Gatherer
Scripter
PANDAS
Notifier
Restricter
Access Restricter
PANDORA
Reporter
HTTrack
Digital Object Storage System (DOSS)
9
PANDAS 3 System Architecture
Web Server
WebDAV
Java Application Server (Tomcat)
Resolver Service
Lucene Full-text Search Service
Gathering Web Service
Pandas Management
Pandas Delivery
HTTrack
Digital Object Storage System (DOSS)
10
PANDAS 3 Broad Software Architecture
11
PANDAS 3 Broad Software Architecture
  • PandasManagement functionality to support
    identification, selection, archiving, cataloguing
    and publication of titles/resources. Reporting is
    also associated with this system.
  • Gathering Web Service gathering/crawling and
    file system manipulation.
  • PandasDelivery display/delivery of published
    archived resources.

12
Pandas 3 Whats the difference?
  • More robust.
  • Implementation consistent with Apple WebObjects
    standards.
  • Improved data model.
  • Faster operation.
  • More efficient implementation of business logic.
  • New user interface focused on core workflows.
  • New worktray concept.

13
Pandas 3 Whats the difference?
  • All gather processing related functionality
    incorporated into Gathering Web Service.
  • Web Services implementation instead of RMI.
  • Database knowledge removed from processing
    scripts.
  • Restrictions notifications integrated into
    PandasManagement application.
  • Restrictions notifications are implemented as
    modules.

14
Pandas 3 Whats the difference?
  • Can run in standard servlet container/Java
    application server (e.g. Tomcat).
  • Previously required Apple WebObjects application
    server.
  • Improved error handling.
  • Easier to determine source of problems.

15
PANDAS 2 Technologies
  • Application Suite
  • Java
  • Apple WebObjects API
  • RMI
  • Perl
  • Lucene
  • System Environment
  • Apache web server
  • Apple WebObjects application server
  • WebDAV
  • Oracle

16
PANDAS 3 Technologies
  • Application Suite
  • Java
  • Apple WebObjects API
  • SOAP XML web services
  • Perl
  • Lucene
  • System Environment
  • Apache web server
  • Tomcat Java application server
  • WebDAV
  • Oracle

17
PANDAS 2 Login Page
18
PANDAS 3 Login Page
19
PANDAS 2 Homepage
20
PANDAS 3 Homepage
21
PANDAS 3 Homepage (a closer look)
Side Menu Bar
Status Bar
Search Bar
22
PANDAS 3 Homepage (a closer look)
23
PANDAS 2 Create Title
24
PANDAS 3 Add Title
25
PANDAS 2 - Search
26
PANDAS 3 Search
27
PANDAS 2 View Title
28
PANDAS 3 View Title
29
PANDAS 2 Edit Title
30
PANDAS 2 Edit Title
31
PANDAS 2 Edit Title
32
PANDAS 2 Edit Title
33
PANDAS 3 Edit Title
34
PANDAS 2 Gather Queue
35
PANDAS 3 Gather Worktray
36
Future functionality
  • Title hierarchies
  • Instance-level restrictions
  • Instances as subset of whole domain snapshot
  • More automation
  • Permission request processes
  • Bulk processing and review
  • Gather schedule management with priorities
  • Integration of Heritrix as gatherer/crawler
  • Abstraction of user/institution info and
    authentication

37
How do I use it?
  • Run locally.
  • Apache
  • Tomcat
  • Java 1.4.2
  • WebObjects 5.2 Java libraries and deployment
    licence
  • Free with Mac OS X server.
  • Purchase from Apple Store (492.77).
  • Perl
  • Lucene
  • Ask UKWAC to archive your website!
Write a Comment
User Comments (0)
About PowerShow.com