Skip to main content Link Search Menu Expand Document (external link)

Appendix 4: Notes on Adherence to Digital Preservation Standards

On this page

  1. Appendix 4: Notes on Adherence to Digital Preservation Standards
    1. National Digital Stewardship Alliance (NDSA) Levels of Digital Preservation
      1. Level 1 – Know your content
      2. Level 2 – Protector your content
      3. Level 3 – Monitor your content
      4. Level 4 – Sustain your content
    2. Digital Preservation Coalition Rapid Assessment Model
    3. Reference Model for an Open Archival Information System (OAIS)
    4. Audit and Certification of Trustworthy Digital Repositories (future area of work)

The following models and ancillary documents were used to frame the digital preservation program and specific actions within that program:

National Digital Stewardship Alliance (NDSA) Levels of Digital Preservation

The NDSA Levels of Digital Preservation is broken down into five functional areas and four levels, with each level indicating specific actions for storage, integrity, control, metadata, and content considerations. Adherence to the levels is indicated by green, yellow, and red highlighted notes. Green indicates adherence, yellow indicates in-progress work toward adherence, and red indicates work toward that item has not yet begun.

Level 1 – Know your content

Functional AreaActionNotes
StorageHave two complete copies in separate locationsAs documented in the Storage & Backups section, backups of the Z: drive are also stored at HSL.
StorageDocument all storage media where content is storedThis document outlines the processes for documenting incoming storage media and transmitting content to the Z: drive. Documentation and content is stored on the Z: drive.
StoragePut content into a stable storageContent is extracted from all unstable media where possible in-house; media that cannot be transferred is documented and used to advocate for additional equipment.
IntegrityVerify integrity information if it has been provided with the contentIntegrity information will be verified whenever it is provided using Teracopy.
IntegrityGenerate integrity information if not provided with the contentIntegrity information using md5 checksums is generated for content using siegfried upon accession.
IntegrityVirus check all content; isolate content for quarantine as neededAll content is scanned using clamAV upon accession, which includes content ingest from storage media. Content flagged is quarantined and monitored by the Digital Archivist before being uploaded to secure storage.
ControlDetermine the human and software agents that should be authorized to read, write, move, and delete contentA permissions document consolidates who has read and write (including move and delete) permissions. The Digital Archivist authorizes the use permissions given to new users.
MetadataCreate inventory of content, also documenting current storage locationsContent inventories are stored with content in a metadata folder and generated using siegfried. Storage locations are isolated to a single location: the Z: drive.
MetadataBackup inventory and store at least one copy separately from contentAn inventory of storage media, not necessarily a detailed listing of the files contained on that media (pending processing level), is in ArchivesSpace. A detailed inventory of file-level information is currently not stored separately from content, but options are being examined as of 10/2022.
ContentDocument file formats and other essential content characteristics including how and when these were identifiedFile formats and characteristics are generated using siegfried and are stored in a metadata folder with the content. The date is stored as part of the brunnhilde report generated at the same time.

Level 2 – Protector your content

Functional AreaActionNotes
StorageHave three complete copies with at least one copy in a separate geographic locationTwo complete copies are stored at the Downtown Library (see the Storage & Backups section). A third copy is planned to be deployed to Amazon Glacier but is in the planning stages.
StorageDocument storage and storage media indicating the resources and dependencies they require to functionResource and dependency information is maintained by Systems Infrastructure. Changes or issues are conveyed to the Digital Archivist and addressed in tandem.
IntegrityVerify integrity information when moving or copying contentIntegrity information is verified when copying or moving content using Teracopy. A record of the verified checksums is stored in the logs folder for a collection.
IntegrityUse write-blockers when working with original mediaWrite blockers are used with media upon content extraction.
IntegrityBack up integrity information and store copy in a separate location from the contentA detailed inventory of file-level information, including integrity information, is currently not stored separately from content. Options for separate storage are being examined as of 10/2022.
ControlDocument the human and software agents authorized to read, write, move, and delete content and apply theseA permissions document consolidates who has read and write (including move and delete) permissions. The Digital Archivist authorizes the use permissions given to new users.
MetadataStore enough metadata to know what the content is (this might include some combination of administrative, technical, descriptive, preservation, and structural)Metadata stored on Z: includes information not only about the files, but also about the media carriers, including technical, preservation, descriptive, and structural metadata. Additional administrative and descriptive metadata is stored in ArchivesSpace.
ContentVerify file formats and other essential content characteristicsFile formats are verified and any issues are currently flagged by siegfried. Issues are currently addressed at the time of scheduled access by a user, though options for addressing issues with file formats are currently being explored as part of processing.
ContentBuild relationships with content creators to encourage sustainable file choicesThis document includes a section on preferred file formats for donors. Where possible, discussions with content creators occur prior to transfer and accession using these preferred file formats.

Level 3 – Monitor your content

Functional AreaActionNotes
StorageHave at least one copy in a geographic location with a different disaster threat than the other copiesImplementation of Level 2 Storage actions as outlined in the Level 2 table will also meet this Level 3 Storage action.
StorageHave at least one copy on a different storage media typeImplementation of Level 2 Storage actions as outlined in the Level 2 table will also meet this Level 3 Storage action.
StorageTrack the obsolescence of storage and mediaServer hardware migrations and management is maintained by Systems Infrastructure. Changes, such as migrating to new servers and refreshing server hardware, are conveyed to the Digital Archivist and addressed in tandem according to a predetermined schedule. Collections content is extracted from all unstable storage media where possible in-house; media that cannot be transferred is documented and used to advocate for additional equipment.
IntegrityVerify integrity information of content at fixed intervalsAction will be implemented in the future—currently, collection processing and generation of checksums must occur before verifying integrity information.
IntegrityDocument integrity information verification processes and outcomesAction will be implemented in the future—currently, collection processing and generation of checksums must occur before verifying integrity information.
IntegrityPerform audit of integrity information on demandIntegrity information is verified on demand using Teracopy. A record of the verified checksums is stored in the logs folder for a collection.
ControlMaintain logs and identify the human and software agents that performed actions on contentAction will be implemented in the future—this information is currently held by Systems Infrastructure.
MetadataDetermine what metadata standards to applyMetadata standards and implementations are outlined in the Description section for born digital materials in this document.
MetadataFind and fill gaps in your metadata to meet those standardsCurrently in progress—legacy processed collections are being reprocessed and new collections processed according to the metadata standards and implementations outlined in the Description section for born digital materials in this document.
ContentMonitor for obsolescence, and changes in technologies on which content is dependentAction will be implemented in the future.

Level 4 – Sustain your content

Functional AreaActionNotes
StorageHave at least three copies in geographic locations, each with a different disaster threatImplementation toward Level 2 and Level 3 Storage actions will occur before Level 4.
StorageMaximize storage diversification to avoid single points of failureImplementation toward Level 2 and Level 3 Storage actions will occur before Level 4.
StorageHave a plan and execute actions to address obsolescence of storage hardware, software, and mediaServer hardware migrations and management is maintained by Systems Infrastructure. Changes, such as migrating to new servers and refreshing server hardware, are conveyed to the Digital Archivist and addressed in tandem according to a predetermined schedule. Collections content is extracted from all unstable storage media where possible in-house; media that cannot be transferred is documented and used to advocate for additional equipment.
IntegrityVerify integrity information in response to specific events or activitiesIntegrity information is verified on demand using Teracopy. A record of the verified checksums is stored in the logs folder for a collection.
IntegrityReplace or repair corrupted content as necessaryAction will be implemented in the future.
ControlPerform periodic review of actions/access logsAction will be implemented in the future.
MetadataRecord preservation actions associated with content and when those actions occurPreservation actions are documented in the PREMIS Spreadsheet for a collection, stored in the administration folder for that collection.
MetadataImplement metadata standards chosenMetadata standards and implementations are outlined in the Description section for born digital materials in this document.
ContentPerform migrations, normalizations, emulation, and similar activities that ensure content can be accessedMigrations, normalizations, emulations, and similar activities are primarily addressed at the time of scheduled access by a user. Typically, original files are retained to minimize irreversible interventions and support changing standards in providing access to file formats.

Digital Preservation Coalition Rapid Assessment Model

While the NDSA Levels of Digital Preservation outlines specific actions, it does not include organizational and service characteristics of an archive. In DPC RAM, a digital preservation program has organizational and service capabilities that are assessed at the following levels: minimal awareness, awareness, basic, managed, and optimized.

The below table outlines steps to get to the subsequent level as progress toward the desired level. These steps will be incorporated into the WVRHC Digital Preservation Strategic Priorities document. The first instance of this document will be created in 2023.

Organizational Capabilities
Current LevelDesired LevelSteps to Get to Desired Level
A. Organizational viability: Governance, organizational structure, staffing and resourcing of digital preservation activities.AwarenessOptimized
  1. Awareness->Basic
    • To do: Demonstrate some engagement by administration.
    • To do: Ensure staff have assigned responsibilities and the time to undertake them.
    • To do: Allocate a budget for digital preservation.
    • To do: Identify staff development requirements.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
B. Policy and strategy: Policies, strategies, and procedures which govern the operation and management of the digital archive. BasicOptimized
  1. Basic->Managed
    • Completed: A digital preservation policy is aligned to organizational policies and is reviewed on an agreed upon schedule.
    • Completed: A suite of documented processes and procedures for managing, and providing access to, content within the digital archive exists.
    • To do (in progress): Policy and procedure takes into account any relevant ethical issues.
    • To do (in progress): All relevant staff are aware of digital preservation policies, strategies, and procedures.
    • To do (in progress): Knowledge of current and future use cases for content informs policy and procedure (for example on collecting, preservation approaches, metadata and access).
  2. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
C. Legal basis: Management of legal rights and responsibilities, compliance with relevant regulation and adherence to ethical codes related to acquiring, preserving and providing access to digital content. AwarenessOptimized
  1. Awareness->Basic
    • To do: Key legal rights and responsibilities, together with their owners, have been identified and documented.
    • To do: Templates exist for necessary legal agreements and licenses.
    • To do: Relevant codes of conduct related to professional ethics are adhered to.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
D. IT capability: Information Technology capabilities for supporting digital preservation activities. BasicManaged
  1. Basic->Managed
    • Completed (by Systems Infrastructure, Systems Development, and the Digital Archivist where applicable): IT systems are regularly patched and updated.
    • Completed (current practice, processes can be improved): New tools and systems are deployed when required.
    • To do: Adequate IT support is available to the digital archive.
    • To do: IT roles and responsibilities are documented and regularly reviewed.
    • To do (done via this manual for software and some systems, needs improvement): IT systems are comprehensively documented.
    • To do: contracts and services with third party service providers are well managed and documented.
  2. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
E. Continuous Improvement: Processes for the assessment of current digital preservation capabilities, the definition of goals and the monitoring of progressAwarenessOptimized
  1. Awareness->Basic
    • To do: An initial benchmarking exercise has been carried out.
    • To do: Gaps in digital preservation capability have been identified.
    • To do: There is an understanding of where the organization is relative to peers.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
F. Community: Engagement with and contribution to the wider digital preservation community. AwarenessManaged
  1. Awareness->Basic
    • To do: Networks of relevant contacts have been established.
    • To do: Relevant community events can be accessed.
    • To do: There is commitment to learn from the experience of others.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
Service Capabilities
Current LevelDesired LevelSteps to Get to Desired Level
G. Acquisition, Transfer and Ingest: Processes to acquire or transfer content and ingest it into a digital archive. AwarenessOptimized
  1. Awareness->Basic
    • Completed (this document): A documented ingest process exists.
    • Completed (this and ingest document): Basic guidance for donors, depositors, and record creators is available where appropriate.
    • Completed (this and ingest document): Documentation and metadata is sometimes received or captured as part of the acquisition or transfer process.
    • Completed (accession portion of this document): Some content is appraised as part of a manual process in line with relevant policies.
    • To do: A documented process exists for selecting and capturing digital content where appropriate (for example, web archives, email archives, digitized content, etc.)
    • To do (in progress, 10/2022): A working area (physical or virtual) is available for pre-ingest and ingest activities.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
H. Bitstream Preservation: Processes to ensure the storage and integrity of digital content to be preserved.AwarenessOptimized
  1. Awareness->Basic
    • Completed (see this manual): Dedicated storage is available to meet current preservation needs.
    • Completed (see this manual): Staff know where content is stored.
    • Completed (see this manual): Replication is based on simple backup regimes.
    • Completed (see permissions document): There is an understanding of which staff members should be authorized to access the content.
    • To do (in progress, procedures completed): Checksums are generated for all content.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
I. Content Preservation: Processes to preserve the meaning or functionality of the digital content and ensure its continued accessibility and usability over time.AwarenessOptimized
  1. Awareness->Basic
    • To do (in progress, procedures completed): File formats are identified.
    • To do (in progress, procedures completed): Content is characterized and assessed for preservation and quality issues such as encrypted, broken, or incomplete content and invalid files.
    • To do: There is a basic understanding of current and future users and use cases for the content.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
J. Metadata Management: Processes to create and maintain sufficient metadata to support preservation, discovery and use of preserved digital content.AwarenessOptimized
  1. Awareness->Basic
    • Completed (this document): An appropriate minimum descriptive metadata requirement exists.
    • Completed (this document): Metadata and documentation acquired with content is retained and preserved.
    • To do (in progress, procedures created): Content is described at collection level in a digital asset register.
    • To do (in progress, procedures created): Basic preservation metadata is captured at item level.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.
K. Discovery and Access: Processes to enable discovery of digital content and provide access for users.AwarenessOptimized
  1. Awareness->Basic
    • Completed: Basic resource discovery exists for some digital content.
    • Completed: Users can view or access digital content and metadata either remotely or on-site.
    • To do: Users’ access to digital content is recorded.
    • To do (in progress, procedures created): Information on the accessibility of digital content is provided to users.
  2. Basic->Managed
    • Will be outlined once a Basic level is achieved.
  3. Managed->Optimized
    • Will be outlined once a Managed level is achieved.


Reference Model for an Open Archival Information System (OAIS)

To demonstrate OAIS compliance, it is critical to directly map our workflows to the functional entities and archival information package (AIP) as outlined in OAIS. Figures in this section are taken directly from the Reference Model for an Open Archival Information System (OAIS). David Giaretta has also written and created visualizations that link the discrete functional entities outlined below together in a way that is helpful for envisioning OAIS as a whole system. Images of the functional entity or package overview will come before a description of WVRHC adherence. Italicized areas are areas of improvement to meet OAIS standards.

Overview of OAIS functional entities

Figure 4-1 has been included to demonstrate a simplified visual representation of how figures 4-2, 4-3, 4-4, 4-5, 4-6, and 4-7 link together.

OAIS ingest functional entity

The Ingest functional entity concerns actions related to formally accepting Submission Information Packages (SIPs) and generating Archival Information Packages (AIPs). Broadly, these areas map to the accession, appraisal, processing, and description portions of the born digital archival processing cycle outlined in this manual. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

OAIS archival storage functional entity

The Archival Storage functional entity concerns actions related to storage, maintenance of content and storage infrastructure, and retrieval of AIPs. Broadly, these areas map to the access portion of the born digital archival processing cycle and the Digital Preservation Administration section in this manual. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

OAIS data management functional entity

The Data Management functional entity concerns actions related to populating, maintaining, and accessing Descriptive Information which identifies and documents Archive holdings and administrative data used to manage the archive. Broadly, these areas map to the accession, appraisal, processing, and description portions of the born digital archival processing cycle and the Digital Preservation Administration section in this manual. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

  • Administering the Archive database functions (maintaining schema and view definitions, and referential integrity)
    • Completed by the Digital Archivist as part of the annual review process for this document.
  • Performing database updates (loading new descriptive information or Archive administrative data)
  • Performing queries on the data management data to generate query responses
    • Completed by the Digital Archivist in response to requests/needs using ArchivesSpace and other structured data generated as part of the full born digital workflow.
  • Producing reports from these query responses
    • Completed by the Digital Archivist in response to requests/needs using ArchivesSpace and other structured data generated as part of the full born digital workflow.

OAIS administration functional entity

The Administration functional entity concerns actions related to providing the services and functions for the overall operation of the archive system. Broadly, these areas map to the accession and access portions of the born digital archival processing cycle and the Digital Preservation Administration section in this manual. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

  • Soliciting and negotiating submission agreements with producers/donors
    • Conducted as part of the pre-accessioning process.
  • Auditing submissions to ensure that they meet Archive standards
    • Still establishing documentation for well-formed SIPs; currently SIPs are accepted as a simple file transfer of zipped materials to maintain as much file metadata as possible with minimal donor expertise required.
    • Submissions are audited at the point of Accession and Appraised for whether they merit inclusion in the archive.
  • Maintaining configuration management of system hardware and software
    • Completed in coordination with Systems Infrastructure by the Digital Archivist.
  • System engineering functions to monitor and improve Archive operations
    • Completed by the Digital Archivist and, in terms of hardware, by the Digital Archivist in coordination with Systems Infrastructure.
  • To inventory, report on, and migrate/update the contents of the Archive
    • Completed by the Digital Archivist in coordination with relevant WVRHC employees.
  • Establishing and maintaining Archive standards and policies
    • Completed by the Digital Archivist in coordination with other WVRHC employees as needed.
  • Providing customer support
    • Completed by the Digital Archivist or authorized person.
  • Activating stored requests
    • Completed by the Digital Archivist or authorized person.

OAIS preservation planning functional entity

The Preservation Planning functional entity concerns actions related to monitoring the environment of the OAIS, providing recommendations and preservation plans to ensure that the information stored in the OAIS remains accessible to, and understandable by, the Designated Community over time. Broadly, these areas map to the processing portions of the born digital archival processing cycle and the Digital Preservation Administration section in this manual. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

  • Evaluating the contents of the archive and periodically recommending archival information updates
    • Accomplished by Digital Archivist as part of daily work.
  • Recommending the migration of current archive holdings
    • For file formats: accomplished by the Digital Archivist in response to changing needs. For hardware: accomplished by Systems Infrastructure in coordination with the Digital Archivist.
  • Developing recommendations for Archive standards and policies
    • Accomplished by the Digital Archivist in coordination with relevant WVRHC employees.
  • Providing periodic risk analysis reports
  • Monitoring changes in the technology environment and in the Designated Community’s service requirements and knowledge
    • Technology environment is actively monitored by the Digital Archivist, additional work needs to be done on articulating the Designated Community’s needs and knowledge.
  • Designs Information Package templates and provides design assistance and review to specialize these templates into SIPs and AIPs for specific submissions
    • Accomplished by the Digital Archivist and documented in this manual and ancillary documentation.
  • Develops detailed Migration plans, software prototypes and test plans to enable implementation of Administration migration goals
    • Instigated by the Digital Archivist in coordination with Systems Development, Systems Infrastructure, and relevant WVRHC employees.

OAIS access functional entity

The Access functional entity concerns actions related to providing the services and functions that support users/consumers in determining the existence, description, location and availability of information stored in the OAIS, and allowing users/consumers to request and receive information products. Broadly, these areas map to the processing, description, and access portions of the born digital archival processing cycle. Below is a detailed mapping of actions taken to adhere to these OAIS areas.

  • Communicating with users/consumers to receive requests
    • Accomplished by Reference Staff or the Digital Archivist as part of standard processes outlined in Access procedures within each section of this document.
  • Applying controls to limit access to specially protected information
  • Coordinating the execution of requests to successful completion
    • Accomplished by Reference Staff or the Digital Archivist as part of standard processes outlined in Access procedures within each section of this document.
  • Generating responses (Dissemination Information Packages, query responses, reports) and delivering the responses to users/consumers
    • Accomplished by the Digital Archivist as part of standard processes outlined in Access procedures within each section of this document. Access copies are generated based upon the type of AIP.

In addition to the above functional entities, the information objects and packages must include the following aspects and information

OAIS archival information package detailed view

The above diagram maps to our information package structure, outlined in the Accessioning Media Workflow, used internally as follows:

  • Package Description: The information intended for use by Access Aids.
    • This is the information uploaded to ArchivesSpace; information may be organized prior to being uploaded to ArchivesSpace using the Born Digital Processing Checklist.
  • Packaging Information: The information that is used to bind and identify the components of an Information Package. For example, it may be the ISO 9660 volume and directory information used on a CD-ROM to provide the content of several files containing Content Information and Preservation Description Information.
    • This information is initially documented in the Digital Media Inventory Template and expanded in the Brunnhilde report stored in the Metadata folder in the collection folder.
  • Content Information: A set of information that is the original target of preservation or that includes part or all of that information. It is an Information Object composed of its Content Data Object and its Representation Information.
    • Data Object: Either a Physical Object or a Digital Object.
      • This information is stored in the Content folder in the folder containing digital collections content and metadata on the Z: drive.
    • Representation Information: The information that maps a Data Object into more meaningful concepts. One example is JPEG software which is used to render a JPEG file.
      • Structure Information: The Representation Information that imparts meaning about how other information is organized.
        • Documented through description processes or self-documented through strategic file naming where applicable.
      • Semantic Information: The Representation Information that further describes the meaning beyond that provided by the Structure Information.
        • Documented using Siegfried as part of Brunnhilde for file characterization and stored in the Metadata folder for the collection.
  • Preservation Description Information: The information which is necessary for adequate preservation of the Content Information.
    • Reference Information: The information that is used as an identifier for the Content Information.
      • This information is initially documented at the media item or transfer level in the Digital Media Inventory Template and included in ArchivesSpace; single files are yet given individual identifiers beyond checksums.
    • Provenance Information: The information that documents the history of the Content Information.
      • Documented in accession records in ArchivesSpace at the time of transfer of ownership.
    • Context Information: The information that documents the relationships of the Content Information to its environment. This includes why the Content Information was created and how it relates to other Content Information objects.
      • Documented through processing and description processes and stored in ArchivesSpace.
    • Fixity Information: The information which documents the mechanisms that ensure that the Content Information object has not been altered in an undocumented manner.
      • Documented through Siegfried as part of Brunnhilde and stored in the Metadata folder for the collection.
    • Access Rights Information: The information that identifies the access restrictions pertaining to the Content Information, including the legal framework, licensing terms, and access control.
      • Documented in ArchivesSpace as part of processing and description processes.

As the digital preservation program is fairly new, only a high level overview of OAIS compliance is available. The portion of the OAIS standard related to repository responsibilities will be outlined in the Audit and Certification of Trustworthy Digital Repositories section, currently a work in progress.

Audit and Certification of Trustworthy Digital Repositories (future area of work)

In the future, this section will include an overview of WVRHC adherence to the Audit and Certification of Trustworthy Digital Repositories (CCSDS 652.0-M-1) set of recommended practices to implement the Open Archival Information System (OAIS) Reference Model (ISO 14721). As a complement to the former document, the Trustworthy Repositories Audit & Certification: Criteria and Checklist document by OCLC and The Center for Research Libraries was also used to determine compliance with CCSDS 652.0-M-1. Adherence to these standards will be used as a tool for institutional accountability.