Preservation Rumination - PowerPoint PPT Presentation

About This Presentation
Title:

Preservation Rumination

Description:

... to maintain the viability, renderability, and understandability of ... Viability. Renderability. Revised Preservation Pyramid. Description. Secure storage ... – PowerPoint PPT presentation

Number of Views:116
Avg rating:3.0/5.0
Slides: 36
Provided by: brian660
Learn more at: https://www.oclc.org
Category:

less

Transcript and Presenter's Notes

Title: Preservation Rumination


1
Preservation Rumination
  • Priscilla Caplan,
  • FCLA
  • OCLC DSS
  • February 16, 2005

2
Preservation Basics
3
THE NEED FOR DIGITAL PRESERVATION
  • Number of academic/scholarly journals published
    online 15,757
  • Percent of U.S. federal government publications
    produced only online in 2003 65 percent
  • Estimated percent of U.S. federal government
    publications available only online by 2008 90
    percent
  • From California Digital Library
  • http//www.cdlib.org/inside/projects/preservation

4
The problem of abundance
5
The problem of ephemerality
  • Percent of web-based references in scientific
    articles from 3 major journals inaccessible
    within 2 years of publication 21
  • Proportion of websites in 1998 gone in 1999 44
  • Life of an average website 44 days

6
The problems of media life expectancy and
obsolescence
7
The problem of format obsolescence
8
(No Transcript)
9
The problem of rights
10
Preservation strategies
Renderability
Media management
Viability
Secure storage
Integrity

Identity
Description
Capture Selection
Availability
The Preservation Pyramid
11
Authenticity
12
Traditionally, preserving things meant keeping
them unchanged however if we hold on
to digital information without modifications,
accessing the information will become
increasingly more difficult, if not
impossible. From The Paradox of
Preservation, Su-Shing Chen
13
Preservation metadata ...is the information
necessary to maintain the viability,
renderability, and understandability of digital
resources over the long-term. OCLC/RLG Preservat
ion Metadata Framework Working Group
Understandability
14
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

Identity
Description
Availability
Capture Selection
Revised Preservation Pyramid
15
Who is doing preservation?
Research Libraries Government Archives Historical
Societies Individual Collectors
16
Who is doing digital preservation?
Research Libraries Government Archives Historical
Societies Individual Collectors National
Libraries Research Centers Public broadcasting
17
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

Identity
Description
DSPACE
Availability
Capture Selection
18
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

Identity
Description
Availability
Capture Selection
LOCKSS
19
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

OCLC Digital Archive
Identity
Description
Availability
Capture Selection
20
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

LC Minerva
Identity
Description
Availability
Capture Selection
21
FCLA Digital Archive
Understandability
Authenticity
Preservation strategies
Renderability
Media management
Viability
Integrity
Secure storage

Identity
Description
Availability
Capture Selection
22
Preservation in Action
23
(No Transcript)
24
State Universities
FCLA
25
  • Designed as a dark archive
  • Preservation repository functions only
  • Based on OAIS functional architecture
  • Bit-level and Full preservation
  • Format migration and normalization

26
OAIS Functional Architecture
27
DAITSS Functional Architecture
Reporting
L I B R A R Y
L I B R A R Y
Mgmt DB
Ingest
SIP
Access
DIP
AIP
Storage management
28
DAITSS Data Model
Information Package
Intellectual entity (1)
Data File (1..n)
Bitstream (0..n)
29
DAITSS Data File Object
DAITSS Bitstream Object
30
Risk Management
  • Storing multiple master copies of files
  • Calculating two message digests
  • Storing metadata as XML and in RDBMs
  • Normalizing when possible
  • Always retaining original
  • Action plans and background papers

31
Ingest Functions
  • METS validation and metadata extraction
  • Virus check and checksum verification
  • File format identification
  • Creation of Data File and Bitstream objects
  • Harvesting of external files
  • Normalization and Forward Migration
  • Technical, relationship and event metadata
  • AIP creation
  • Storage update
  • Data table update

32
Ingest Example A simple SIP
XML
SIP
PDF
AVI
33
XML
AIP
XML
XML
XML
SIP
TIFF
XML
TIFF
TIFF
PDF
AVI
XML
Database
XML
34
Future Plans
  • Find partners to install at other places
  • Finish DAITSS
  • Release under open source license
  • Build a community of developers for different
    formats

35
References
  • Priscilla Caplan www.fcla.edu/pcaplan,
    pcaplan_at_ufl.edu
  • FCLA Digital Archive www.fcla.edu/digitalArchive
  • Terry Kuny, A Digital Dark Ages?
    www.ifla.org/IV/ifla63/63kuny1.pdf
  • PREMIS Implementation Survey www.oclc.org/research
    /projects/pmwg/surveyreport.pdf
  • Roy Rosenzweig, Scarcity or Abundance?
    www.historycooperative.org/journals/ahr/108.3/rose
    nzweig.html
  • ONeil et al. Trends in the Evolution of the
    Public Web www.dlib.org/dlib/april03/lavoie/04lav
    oie.html
  • Clifford Lynch, Authenticity and Integrity in
    the Digital Environment www.clir.org/pubs/report
    s/pub92/lynch.html
Write a Comment
User Comments (0)
About PowerShow.com