Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study

Description:

Focus on UK universities (but more widely applicable) ... Not just DIY application neutral can cost for in-house archive, full or ... – PowerPoint PPT presentation

Number of Views:29
Avg rating:3.0/5.0
Slides: 18
Provided by: rcbma4
Category:

less

Transcript and Presenter's Notes

Title: Keeping Research Data Safe JISC Research Data Digital Preservation Costs Study


1
Keeping Research Data SafeJISC Research Data
Digital Preservation Costs Study
  • Neil Beagrie
  • Charles Beagrie Limited
  • Alliance for Permanent Access
  • Budapest Nov 2008

2
Overview
  • Aim investigate costs, develop model and
    recommendations
  • Project team Neil Beagrie, Julia Chruszcz,
    Brian Lavoie (OCLC), Cambridge, KCL, Southampton
  • Method detailed analysis of 2 cost models (LIFE
    NASA CET) in combination with OAIS and TRAC
    literature review12 interviews 4 case studies.

3
UK Background
  • Focus on UK universities (but more widely
    applicable)
  • Sustainability of research UK universities move
    to Full Economic Costs (FEC)
  • Data management can be charged as direct or
    indirect costs to research grants
  • Increasing volumes and complexity of research
    data in UK universities

4
What have we Produced?
  • A cost framework consisting of
  • activity model in 3 parts pre-archive, archive,
    support services
  • Key cost variables divided into economic
    adjustments and service adjustments
  • Resources template for Transparent Costing (TRAC)
  • Used in combination to generate cost/charging
    models
  • 4 detailed case studies (ADS, Cambridge, KCL,
    Southampton)
  • Data from other services.

5
Findings
6
Findings
  • Timing. costs c. 333 euros for the creation of a
    batch of 1000 records. Once 10 years have passed
    since creation it may cost 10,000 euros to
    repair a batch of 1000 records with badly
    created metadata (Digitale Bewaring Project)
  • Efficiency Curve effects start-up to
    operational
  • Economy of scale effects Accession rates of 10
    or 60 collections - 600 increase in accessions
    will only increase costs by 325 (ULCC)
  • First mover innovation costs of being first
    to solve a problem and how to finance this.

7
Findings
  • Unit costs examples in Case studies for
    Archaeology, Chemistry, Humanities
  • However costs depend on the adjustments (key cost
    variables)
  • Like restaurant meals final bill and unit costs
    depend on the choices and volume

8
Findings
  • National subject repositories costs (UKDA)

9
Findings
  • ADS projection of long-term preservation costs
  • Implications for sustainability via project
    charges/endowment
  • Preservation interventions (file format
    migrations)
  • Long-term storage costs
  • Assumptions of archive growth (economies of
    scale)
  • Assumptions on first mover innovation

10
Findings
  • NSB/NSF Long-lived data collections identifies 3
    research data collection types with different
    preservation, access, and cost requirements
  • Research collections used by research team
    only, often limited retention and preservation
    needs
  • Community collections used by a discipline,
    data validation and community standards,
    preservation medium to long-term
  • Reference collections used by many disciplines,
    major use of standards , quality control,
    long-term preservation
  • Data collections can move between levels
    (normally with substantial additional investment
    if upgraded)
  • Triage need to prioritise and restrain costs

11
Whats New?
  • FEC based not in or partial in other models but
  • Requirement for HEIs
  • Absence of FEC (a) distorts business cases eg for
    automation (b) cannot accurately compare in-house
    or out-source costs
  • Not just DIY application neutral can cost for
    in-house archive, full or partial shared
    service(s), national/subject data centre archive
    charges
  • Preservation archival storage, preservation
    planning, data management, first mover
    innovation
  • Tailored for research data different collection
    levels, documentation metadata, products from
    data, etc

12
Follow-up Application in UKRDS Feasibility Study
13
UKRDS
  • UK Research Data Service Feasibility Study
    exploring shared data services for UK
    universities
  • Has utilised Keeping Research Data Safe outputs
    and surveyed over 700 researchers
  • UKRDS study as illustration of importance of
  • Preservation/retention time period
  • Collection Levels
  • FEC and business cases

14
UKRDS
15
Current Spectrum of Subject Data Centre Provision
(1)
EBI
CERN
NERC
CCDC
UKDA
AHDS
16
Current Spectrum of Subject Provision (2)
Reference collections
CCDC
Community collections
CERN
UKDA
NERC
EBI
(Store)
Institutional responsibility
Research collections
17
UKRDS
18
Conclusions
  • Data preservation costs not just formula of
    function costs
  • Can illustrate effect of preservation choices on
    costs
  • first mover innovation costs v operational
    costs
  • Endowment archive funding model?
  • Not last word on costs....recommendations for
    future work

19
Further Information
  • Keeping Research Data Safe Final report and
    Executive Summary at http//www.jisc.ac.uk/publica
    tions/publications/keepingresearchdatasafe.aspx
  • Or email neil_at_beagrie.com
Write a Comment
User Comments (0)
About PowerShow.com