PREMIS - PowerPoint PPT Presentation

Provided by: nek46
Title: PREMIS


1
PREMIS
  • What is PREMIS?
  • Preservation Metadata Implementation Strategies
  • When is PREMIS used?
  • PREMIS is used for repository design,
    evaluation, and the exchange of archived
    information packages among preservation
    repositories
  • How is PREMIS used?
  • The PREMIS Data Dictionary provides guidelines
    regarding the information a repository uses to
    support the digital preservation process

2
PREMIS
  • What does preservation metadata refer to?
  • It is information that supports and documents
    the digital preservation process, including
    information such as
  • Provenance refers to who has had custody or
    ownership of the digital object
  • Authenticity refers to whether the digital
    object is what it claims to be
  • Preservation activity refers to the activities
    that have been carried out to preserve the
    digital object
  • Technical environment refers to what is
    required to interpret and use the digital object
  • Rights management refers to the intellectual
    property rights that must be observed

3
PREMIS Data Dictionary
  • Conventions for each entry in the PREMIS Data
    Dictionary
  • Name of the semantic unit: a descriptive name
    that refers to a piece of information or
    knowledge
  • Example: objectIdentifier under <object>
  • Semantic components refers to the sub-units
    held within a container.
  • Definition refers to the meaning of the semantic
    unit.
  • Rationale explains why the semantic unit is
    needed.
  • Data constraint indicates how the semantic unit
    should be encoded.
  • Container refers to an XML element that holds
    no value of its own and instead serves to group
    related semantic units
  • None indicates that the semantic unit can take
    a value of any form
  • Value should be taken from a controlled
    vocabulary: the PREMIS Data Dictionary does not
    specify what this authority list of values
    should be, and it is assumed that different
    repositories will use different vocabularies.
  • Extension containers are containers designed to
    hold non-PREMIS metadata
4
PREMIS Data Dictionary
  • Object category specifies the kind of object to
    which the semantic unit applies (a
    representation, file, or bitstream).
  • Applicability indicates whether the semantic
    unit applies to that category of object.
  • Example: sample values that the semantic unit
    may take
  • Repeatability indicates whether a semantic unit
    can take multiple values
  • Obligation indicates whether the value of the
    semantic unit is mandatory, meaning a repository
    must know this information
  • Creation/maintenance notes give further detail
    regarding how the values are created and/or
    updated
  • Usage notes provide information regarding the
    use of the semantic unit.

5
PREMIS Data Dictionary Mandatory Semantic Units
  • objectIdentifier
  • objectCategory
  • objectCharacteristics
  • format
  • storage
  • eventIdentifier
  • eventType
  • eventDateTime
  • agentIdentifier
  • Note indicates semantic units that are
    repeatable

6
PREMIS Data Model
7
PREMIS Intellectual Entity
  • Intellectual Entity refers to content that can
    be described as a unit (e.g., books, maps,
    articles)

8
PREMIS Object Entity
  • Objects refer to units of information in
    digital form. PREMIS defines three kinds of
    objects: file, bitstream, and representation
  • File refers to a computer file, such as a PDF,
    TXT, or JPEG
  • Bitstream refers to data within a file that
    has common properties for preservation
    purposes

9
PREMIS Object Entity
  • Representation refers to the set of files,
    including structural metadata, that must be
    identified, stored, and maintained in order to
    assemble a complete rendition of an Intellectual
    Entity.
  • For example, the text files and image files of
    a magazine together form a representation.

10
PREMIS Object Entity
  • Sample syntax
  • <object> </object>
  • The units of information that can be recorded
    include
  • Type of object (file, bitstream, or
    representation)
  • A unique identifier for the object under
    <objectIdentifier>
  • It stores a type and a value. The type refers
    to the classification of the domain that creates
    the object identifier; the value is the object
    identifier itself.
  • For example,
  • <object xsi:type="representation">
  • <objectIdentifier>
  • <objectIdentifierType>FDsys ACP</objectIdentifierType>
  • <objectIdentifierValue>R0b002ee180b003b0</objectIdentifierValue>
  • </objectIdentifier>
  • </object>
  • This segment states that the object is a
    representation (that is, a set of files) and
    that the representation has a unique identifier

11
PREMIS Object Example
  • Other units of information that can be recorded
    include
  • Information indicating the policy on the set of
    preservation functions to be applied to an
    object under <preservationLevel>
  • <object xsi:type="file">
  • <objectIdentifier>
  • <objectIdentifierType>FDsys ACP</objectIdentifierType>
  • <objectIdentifierValue>D09002ee180b003a9</objectIdentifierValue>
  • </objectIdentifier>
  • <preservationLevel>
  • <preservationLevelValue>full</preservationLevelValue>
  • </preservationLevel>

12
PREMIS Object Entity
  • Other units of information that can be recorded
    include
  • Information indicating whether the object is
    subject to one or more processes of decoding or
    unbundling under <compositionLevel>
  • Information used to verify whether an object has
    been changed in an undocumented or unauthorized
    way under <fixity>
  • Information contained within
    <messageDigestAlgorithm> refers to the algorithm
    used to produce the message digest for the
    digital object.
  • Information contained within <messageDigest>
    refers to the output of the message digest
    algorithm
  • Information contained within
    <messageDigestOriginator> refers to the agent
    that generated the original message digest
    against which later fixity checks are compared.
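
The fixity elements above can be illustrated with a minimal Python sketch. It assumes the recorded algorithm is SHA-256 and that the digest is stored as a hexadecimal string; the function names and the sample bytes are hypothetical and are not taken from FDsys.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 message digest of data as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def fixity_ok(data: bytes, recorded_digest: str) -> bool:
    """Compare a freshly computed digest with the value kept in <messageDigest>."""
    return sha256_hex(data) == recorded_digest.strip().lower()


# Hypothetical content standing in for a stored file.
content = b"hello"
recorded = sha256_hex(content)                # digest recorded by the originator
print(fixity_ok(content, recorded))           # content unchanged
print(fixity_ok(content + b"!", recorded))    # content altered after recording
```

A repository would normally read the file from storage in chunks rather than hold it in memory; the single call here only keeps the sketch short.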

13
PREMIS Object Entity
  • The size of the object under <size>
  • The format of the object under <format>
  • <formatDesignation> refers to the identification
    of the format of the object
  • Information contained within <formatName>
    classifies the format of the file or bitstream.
  • <formatRegistry> identifies additional
    information about the format by using an entry
    in a format registry.
  • Information contained within <formatRegistryName>
    identifies the format registry that was used.
  • Information contained within <formatRegistryKey>
    refers to the unique key used to reference an
    entry for this format in a format registry
  • <formatNote> contains additional information
    about the format
  • For example

14
PREMIS Object Example
  • <objectCharacteristics>
  • <compositionLevel>0</compositionLevel>
  • <fixity>
  • <messageDigestAlgorithm>SHA-256</messageDigestAlgorithm>
  • <messageDigest>4977070b92f0bb2642c6be368ad68a8d1d1c5dbbb3310544db781f56a860b0a1</messageDigest>
  • <messageDigestOriginator>FDsys</messageDigestOriginator>
  • </fixity>
  • <size>9326</size>
  • <format>
  • <formatDesignation>
  • <formatName>text/plain</formatName>
  • </formatDesignation>
  • <formatRegistry>
  • <formatRegistryName>PRONOM</formatRegistryName>
  • <formatRegistryKey>x-fmt/111</formatRegistryKey>
  • </formatRegistry>
  • <formatNote>Plain Text File</formatNote>
  • </format>
  • </objectCharacteristics>

15
PREMIS Object Entity
  • Other units of information that can be recorded
    include
  • The original name of the object (prior to being
    named by the repository) under <originalName>
  • Information about where and how a file is
    stored in the repository under <storage>
  • <contentLocation> stores information needed to
    retrieve a file from a storage system.
  • Information contained within <contentLocationType>
    refers to the way of accessing the location of
    the content.
  • Information contained within
    <contentLocationValue> refers to the location of
    the content used by the storage system.
  • The medium on which an object is stored is
    contained within <storageMedium>

16
PREMIS Object Entity
  • Information describing a relationship between an
    object and one or more other objects under
    <relationship>
  • <relationshipType> classifies the nature of the
    relationship.
  • <relationshipSubType> further characterizes the
    nature of the relationship.
  • <relatedObjectIdentification> refers to the
    identifier of the related resource.
  • Information contained within
    <relatedObjectIdentifierType> refers to the
    classification of the domain that creates the
    identifier.
  • Information contained within
    <relatedObjectIdentifierValue> refers to the
    value of the identifier.

17
PREMIS Object Example
  • <originalName>S3880IS.txt</originalName>
  • <storage>
  • <contentLocation>
  • <contentLocationType>URI</contentLocationType>
  • <contentLocationValue>file:/u02/app/emc/documentum/data/fdsysprod1/fdsysprod1/content_storage_01/00002ee1/80/55/b0/48.txt</contentLocationValue>
  • </contentLocation>
  • <storageMedium>hard disk</storageMedium>
  • </storage>
  • <relationship>
  • <relationshipType>structural</relationshipType>
  • <relationshipSubType>is part of</relationshipSubType>
  • <relatedObjectIdentification>
  • <relatedObjectIdentifierType>FDsys ACP</relatedObjectIdentifierType>
  • <relatedObjectIdentifierValue>R0b002ee180b003b0</relatedObjectIdentifierValue>
  • </relatedObjectIdentification>
  • </relationship>
  • </object>
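
As a rough illustration of how the object metadata above could be read programmatically, the following Python sketch parses a shortened copy of the file object with the standard library. It assumes the elements sit in the PREMIS version 2 namespace (info:lc/xmlns/premis-v2) as the default namespace; the function name summarize_object and the trimmed sample are hypothetical.

```python
import xml.etree.ElementTree as ET

PREMIS_NS = "info:lc/xmlns/premis-v2"  # assumed default namespace
XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"
NS = {"p": PREMIS_NS}

# Shortened copy of the file object shown in the slides.
SAMPLE = """\
<object xmlns="info:lc/xmlns/premis-v2"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:type="file">
  <objectIdentifier>
    <objectIdentifierType>FDsys ACP</objectIdentifierType>
    <objectIdentifierValue>D09002ee180b003a9</objectIdentifierValue>
  </objectIdentifier>
  <objectCharacteristics>
    <compositionLevel>0</compositionLevel>
    <fixity>
      <messageDigestAlgorithm>SHA-256</messageDigestAlgorithm>
      <messageDigest>4977070b92f0bb2642c6be368ad68a8d1d1c5dbbb3310544db781f56a860b0a1</messageDigest>
    </fixity>
    <size>9326</size>
    <format>
      <formatDesignation>
        <formatName>text/plain</formatName>
      </formatDesignation>
    </format>
  </objectCharacteristics>
</object>
"""


def summarize_object(xml_text):
    """Pull a few semantic units out of a PREMIS <object> element."""
    root = ET.fromstring(xml_text)
    return {
        # xsi:type carries the object category (file, bitstream, representation).
        "category": root.get("{%s}type" % XSI_NS),
        "identifier": root.findtext(
            "p:objectIdentifier/p:objectIdentifierValue", namespaces=NS
        ),
        "algorithm": root.findtext(".//p:messageDigestAlgorithm", namespaces=NS),
        "size": int(root.findtext(".//p:size", namespaces=NS)),
        "format": root.findtext(".//p:formatName", namespaces=NS),
    }


print(summarize_object(SAMPLE))
```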

18
PREMIS Event Entity
  • Events refer to actions that involve an object
    and an agent known to the system
  • Events are critical for maintaining the digital
    provenance of an object (they help demonstrate
    the authenticity of the object)
  • Examples of Events
  • Modifying a document
  • Actions that create new relationships
  • An object could become related to another object
    as a result of a particular event, for instance
    if a program takes file 1 and generates a
    different version known as file 2
  • Actions that check the validity and integrity of
    the objects (e.g., a virus scan)

19
PREMIS Event Entity
  • Sample syntax
  • <event> </event>
  • The information that can be recorded under event
    includes
  • A unique identifier for the event under
    <eventIdentifier>
  • The <eventIdentifierType> refers to the
    classification of the domain that creates the
    event identifier. The <eventIdentifierValue>
    refers to the value of the event identifier.
  • The type of event under <eventType>
  • Classifies the nature of the event.
  • Date and time of the event under
    <eventDateTime>

20
PREMIS Event Entity
  • Additional information that can be recorded
    under event includes
  • Detailed description of the event under
    <eventDetail>
  • The outcome of the event under
    <eventOutcomeInformation>
  • Indicates whether the event was a success,
    partial success, or failure.
  • Agents involved in the event and their specific
    roles under <linkingAgentIdentifier>
  • The <linkingAgentIdentifierType> refers to the
    classification of the domain that creates the
    linking agent identifier. The
    <linkingAgentIdentifierValue> refers to the value
    of the linking agent identifier. The
    <linkingAgentRole> indicates the role of the
    agent associated with the event.
  • Agent roles are defined here because agents can
    perform different roles in different events
  • Objects involved in the event and their specific
    roles under <linkingObjectIdentifier>
  • The <linkingObjectIdentifierType> refers to the
    classification of the domain that creates the
    linking object identifier. The
    <linkingObjectIdentifierValue> refers to the
    value of the linking object identifier. The
    <linkingObjectRole> indicates the role of the
    object associated with the event.

21
PREMIS Event Example
  • <event>
  • <eventIdentifier>
  • <eventIdentifierType>FDsysevent</eventIdentifierType>
  • <eventIdentifierValue>1cdd2b6c-5a2d-449b-b386-ebb15eb4af11</eventIdentifierValue>
  • </eventIdentifier>
  • <eventType>Rendition Submitted</eventType>
  • <eventDateTime>2010-10-06T19:38:47-04:00</eventDateTime>
  • <eventDetail>Rendition R0b002ee180b003b0,
    uploaded by hotfolderadmin, was submitted in the
    Submission Information package P0b002ee180b003af</eventDetail>
  • <eventOutcomeInformation>
  • <eventOutcome>Success</eventOutcome>
  • </eventOutcomeInformation>
  • <linkingAgentIdentifier>
  • <linkingAgentIdentifierType>FDsysagent</linkingAgentIdentifierType>
  • <linkingAgentIdentifierValue>hotfolderadmin</linkingAgentIdentifierValue>
  • <linkingAgentRole>implementer</linkingAgentRole>
  • </linkingAgentIdentifier>
  • <linkingObjectIdentifier>
  • <linkingObjectIdentifierType>FDsys</linkingObjectIdentifierType>
  • <linkingObjectIdentifierValue>R0b002ee180b003b0</linkingObjectIdentifierValue>
22
PREMIS Agent Entity
  • Agents refer to people, organizations, or
    software associated with events, more
    specifically the preservation events of an
    object
  • In the data model diagram, there is no arrow from
    the Agent entity to the Object entity because
    Agents influence Objects only indirectly,
    through Events.
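
The indirect link between agents and objects can be sketched in Python. This is an illustrative model only: the Event class and the agents_for_object function are hypothetical names, and the sample values are taken from the FDsys event example.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Event:
    identifier: str
    event_type: str
    linking_agents: List[str] = field(default_factory=list)   # agent identifier values
    linking_objects: List[str] = field(default_factory=list)  # object identifier values


def agents_for_object(object_id: str, events: List[Event]) -> Set[str]:
    """Find agents related to an object by going through the events that link both."""
    return {
        agent
        for event in events
        if object_id in event.linking_objects
        for agent in event.linking_agents
    }


events = [
    Event(
        identifier="1cdd2b6c-5a2d-449b-b386-ebb15eb4af11",
        event_type="Rendition Submitted",
        linking_agents=["hotfolderadmin"],
        linking_objects=["R0b002ee180b003b0"],
    ),
]
print(sorted(agents_for_object("R0b002ee180b003b0", events)))
```

There is deliberately no agent field on an object in this model; the only path from an object to an agent runs through an event, which mirrors the missing arrow in the diagram.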

23
PREMIS Agent Entity
  • Sample syntax
  • <agent> </agent>
  • The information that can be recorded under agent
    includes
  • A unique identifier for the agent under
    <agentIdentifier>
  • Information contained within <agentIdentifierType>
    refers to the classification of the domain that
    creates the agent identifier.
  • Information contained within
    <agentIdentifierValue> refers to the value of the
    agent identifier.
  • The agent's name under <agentName>
  • The type of agent (person, organization, or
    software) under <agentType>

24
PREMIS Agent Example
  • <agent>
  • <agentIdentifier>
  • <agentIdentifierType>FDsysagent</agentIdentifierType>
  • <agentIdentifierValue>hotfolderadmin</agentIdentifierValue>
  • </agentIdentifier>
  • <agentName>hotfolderadmin</agentName>
  • <agentType>Person</agentType>
  • </agent>

25
PREMIS Rights Entity
  • Rights refer to the rights and permissions that
    are directly relevant to preserving objects
  • Sample syntax
  • <rights> </rights>
  • The information that can be recorded under rights
    includes
  • A unique identifier for the rights statement
  • The action(s) that the rights statement allows
  • The object(s) to which the statement applies
  • The agents involved in the rights statement and
    their roles
  • Note: Keep in mind that FDsys does not use
    <rights>

26
premis.xml Header
  • xmlns declares a namespace, which is a unique
    identifier for the element names (Note: the XML
    parser does not use the namespace URI to look up
    information)
  • xmlns:xsi declares the XML Schema instance
    namespace, which lets the document tell a
    validating parser which schema to validate
    against
  • xsi:schemaLocation: the first value refers to the
    namespace that will be used and the second value
    refers to the location of the schema that will be
    used; in this case it is the PREMIS XML schema.
  • version refers to the PREMIS version
  • Example
  • <premis xmlns="info:lc/xmlns/premis-v2"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="info:lc/xmlns/premis-v2
    http://www.loc.gov/standards/premis/premis.xsd"
    version="2.0">
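
To make the header attributes concrete, the following Python sketch reads them with the standard library parser. It assumes the header is closed as an empty element so the fragment parses on its own; the function name describe_header is hypothetical. The parser treats the namespace purely as an identifier string and performs no network lookup.

```python
import xml.etree.ElementTree as ET

XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

# Header from the slide, closed as an empty element for this sketch.
HEADER = (
    '<premis xmlns="info:lc/xmlns/premis-v2" '
    'xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" '
    'xsi:schemaLocation="info:lc/xmlns/premis-v2 '
    'http://www.loc.gov/standards/premis/premis.xsd" '
    'version="2.0"/>'
)


def describe_header(xml_text):
    """Report the namespace, version, and schema location pairs of the root."""
    root = ET.fromstring(xml_text)
    # ElementTree exposes the tag as "{namespace}localname".
    namespace = root.tag[1:].split("}")[0]
    # schemaLocation is a whitespace-separated list of namespace/location pairs.
    pairs = root.get("{%s}schemaLocation" % XSI_NS).split()
    return {
        "namespace": namespace,
        "version": root.get("version"),
        "schema_for": dict(zip(pairs[0::2], pairs[1::2])),
    }


print(describe_header(HEADER))
```

Note that ElementTree does not validate against the schema; a validating parser would be needed to act on xsi:schemaLocation.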

27
Additional Information On Using premis.xml
  • When will premis.xml be used by METS (aip.xml)?
  • A PREMIS digital object whose components must be
    organized so that its integrity is preserved
    needs structural metadata, and METS is used to
    record it.
  • METS uses a pointer to the metadata that is
    located outside of the METS document. More
    specifically, it uses an xlink:href attribute to
    indicate the location of that file.
  • Example code from aip.xml
  • <!-- PREMIS OBJECT -->
  • <mets:amdSec ID="AMD_OTHER">
  • <mets:techMD ID="D09002ee180affcca-TEC">
  • <mets:mdRef ID="M09002ee180affcca-tdiv"
    MDTYPE="PREMIS" MIMETYPE="text/xml"
    LOCTYPE="URL" xmlns:xlink="http://www.w3.org/1999/xlink"
    xlink:href="file:/premis.xml" />
  • </mets:techMD>
  • <mets:digiprovMD ID="D09002ee180affcca-DIG">
  • <mets:mdRef ID="M09002ee180affcca-ddiv"
    MDTYPE="PREMIS" MIMETYPE="text/xml"
    LOCTYPE="URL" xmlns:xlink="http://www.w3.org/1999/xlink"
    xlink:href="file:/premis.xml" />
  • </mets:digiprovMD>
  • </mets:amdSec>
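
A short Python sketch of how the pointers in the administrative metadata section might be followed. It assumes the mets prefix is bound to the standard METS namespace (http://www.loc.gov/METS/), which the excerpt does not show, and that the stripped link value was file:/premis.xml; the function name premis_refs is hypothetical.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"        # assumed binding for the mets prefix
XLINK_NS = "http://www.w3.org/1999/xlink"

AMD_SEC = """\
<mets:amdSec xmlns:mets="http://www.loc.gov/METS/" ID="AMD_OTHER">
  <mets:techMD ID="D09002ee180affcca-TEC">
    <mets:mdRef ID="M09002ee180affcca-tdiv" MDTYPE="PREMIS" MIMETYPE="text/xml"
                LOCTYPE="URL" xmlns:xlink="http://www.w3.org/1999/xlink"
                xlink:href="file:/premis.xml"/>
  </mets:techMD>
  <mets:digiprovMD ID="D09002ee180affcca-DIG">
    <mets:mdRef ID="M09002ee180affcca-ddiv" MDTYPE="PREMIS" MIMETYPE="text/xml"
                LOCTYPE="URL" xmlns:xlink="http://www.w3.org/1999/xlink"
                xlink:href="file:/premis.xml"/>
  </mets:digiprovMD>
</mets:amdSec>
"""


def premis_refs(xml_text):
    """Return (mdRef ID, link target) for every mdRef that points to PREMIS metadata."""
    root = ET.fromstring(xml_text)
    href_attr = "{%s}href" % XLINK_NS
    return [
        (ref.get("ID"), ref.get(href_attr))
        for ref in root.iter("{%s}mdRef" % METS_NS)
        if ref.get("MDTYPE") == "PREMIS"
    ]


for ref_id, target in premis_refs(AMD_SEC):
    print(ref_id, "->", target)
```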

28
Additional Information On Using premis.xml
  • METS uses a structural map (structMap) to
    organize the components of the PREMIS object.
  • Example code from aip.xml
  • <mets:structMap>
  • <mets:div ID="R0b002ee180b0044c-div"
    LABEL="xml-submitted">
  • <mets:fptr FILEID="D09002ee180b00449" />
  • <mets:div ID="R0b002ee180b00452-div"
    LABEL="Graphic Support Documents">
  • <mets:fptr FILEID="D09002ee180affca3" />
  • <mets:fptr FILEID="D09002ee180b0045b" />
  • <mets:fptr FILEID="D09002ee180b00464" />
  • <mets:fptr FILEID="D09002ee180b0046e" />
  • <mets:fptr FILEID="D09002ee180b00477" />
  • <mets:fptr FILEID="D09002ee180b0047c" />
  • <mets:fptr FILEID="D09002ee180b00483" />
  • <mets:fptr FILEID="D09002ee180b00493" />
  • <mets:fptr FILEID="D09002ee180b00499" />
  • <mets:fptr FILEID="D09002ee180b004a4" />
  • <mets:fptr FILEID="D09002ee180b004af" />
  • <mets:fptr FILEID="D09002ee180b004bc" />
  • <mets:fptr FILEID="D09002ee180b004c6" />
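
The grouping a structural map expresses can be sketched in Python as follows. The excerpt above omits the closing tags, so the sample below assumes the two divisions are siblings and keeps only a few file pointers; the METS namespace binding and the function name files_by_label are likewise assumptions.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"   # assumed binding for the mets prefix
DIV = "{%s}div" % METS_NS
FPTR = "{%s}fptr" % METS_NS

# Shortened sample; sibling nesting of the divisions is an assumption.
STRUCT_MAP = """\
<mets:structMap xmlns:mets="http://www.loc.gov/METS/">
  <mets:div ID="R0b002ee180b0044c-div" LABEL="xml-submitted">
    <mets:fptr FILEID="D09002ee180b00449"/>
  </mets:div>
  <mets:div ID="R0b002ee180b00452-div" LABEL="Graphic Support Documents">
    <mets:fptr FILEID="D09002ee180affca3"/>
    <mets:fptr FILEID="D09002ee180b0045b"/>
  </mets:div>
</mets:structMap>
"""


def files_by_label(xml_text):
    """Map each division's LABEL to the FILEIDs of its direct file pointers."""
    root = ET.fromstring(xml_text)
    return {
        div.get("LABEL"): [ptr.get("FILEID") for ptr in div.findall(FPTR)]
        for div in root.iter(DIV)
    }


for label, file_ids in files_by_label(STRUCT_MAP).items():
    print(label, file_ids)
```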

29
References
  • Understanding PREMIS
  • http://www.loc.gov/standards/premis/understanding-premis.pdf
  • Data Dictionary for Preservation Metadata
  • http://www.oclc.org/research/activities/past/orprojects/pmwg/premis-final.pdf
  • W3Schools
  • http://www.w3schools.com/xml/default.asp