
Scholars Portal Definition of AIP

1. Archival Information Package (AIP)

  • The information package consisting of the Content Information (CI), Preservation Description Information (PDI), Packaging Information (PI), and Descriptive Information (DI) that is archived at Scholars Portal.
  • The level of content in a Scholars Portal AIP can vary, depending on the amount of content provided by the publisher.
  • This description will use the OAIS Information Model to illustrate completeness of our conceptual model, and will describe, in general terms, what a Scholars Portal AIP looks like.


Things in [] are currently being implemented, but are intended to be a final part of the documentation. --sbm

1.1. Content Information (CI)

  • The Content Data Object (CDO) is generally stored separately from the primary preservation metadata file, which is held in the MarkLogic database.
  • The CDO is stored on a filesystem, with a reference to its location contained in the preservation metadata.
  • Representation Information is maintained for each CDO. It records the file format, the format version, and a reference to a format registry that provides information on how to interpret the file. See: registry of file formats

[Representation Information is provided by DROID identification of the CDO's format and a link to the entry for the format in PRONOM. Format characterization is provided by JHOVE. This information is stored in the METS file used to describe the object (see PI).]

1.2. Preservation Description Information (PDI)

  • Reference Information - Identifiers are stored for each article identifying it globally (e.g. DOI) and locally (e.g. URI).
  • Provenance Information - Provenance metadata is maintained for each object that provides a history of preservation events in the object's lifetime, beginning at ingest into the SP repository and referencing any preservation activities taken on the object (e.g., replacement due to corruption, format migration, etc.).
  • Context Information - As appropriate, information is maintained on how a CDO relates to other CDOs or to other conceptual entities. Examples of these relationships include a newer version of a document that supersedes an older one, or a journal article that is part of a journal issue.
  • Fixity Information - Fixity information is generated at the time of ingest so that it can later be determined whether the item remains in the same state as when it was ingested. This information can be used to verify the integrity of an object being copied within the system (as in the case of a change in storage location), or for periodic integrity checks.

[Checksum information is generated for the object at the time of ingest, just after format validation. Comprehensive fixity checks are done twice yearly.]
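
The fixity workflow described above (record a checksum at ingest, then recompute and compare it later) can be sketched roughly as follows. This is a minimal illustration only: the function names `compute_checksum` and `verify_fixity` are hypothetical, and SHA-256 is an assumed algorithm, since this page does not state which checksum algorithm Scholars Portal uses.

```python
import hashlib
from pathlib import Path


def compute_checksum(path: Path, algorithm: str = "sha256") -> str:
    """Return the hex digest of the file at `path`, read in 1 MiB chunks.

    The default algorithm is an assumption made for this sketch.
    """
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path: Path, recorded_checksum: str,
                  algorithm: str = "sha256") -> bool:
    """Recompute the checksum and compare it with the value recorded at ingest."""
    return compute_checksum(path, algorithm) == recorded_checksum
```

In such a sketch, `compute_checksum` would be called once at ingest and its result stored with the preservation metadata; `verify_fixity` would then be called on the copy after a change of storage location, or during a periodic integrity check.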

1.3. Packaging Information (PI)

  • The Scholars Portal preservation metadata packages both the descriptive and preservation metadata together. 

1.4. Descriptive Information (DI)

  • Depending on the type of CDO, the format of this descriptive metadata can vary, but is selected to maximize findability. In all cases, the descriptive metadata will be recreated within the preservation metadata.
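
As a rough picture of how the four components described in section 1 fit together, the sketch below models an AIP as a single record. It is an illustration under stated assumptions, not the Scholars Portal schema: every class name, field name, and example value is hypothetical, and the packaging role (PI) is represented simply by the outer record that binds the other parts together.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ContentInformation:
    cdo_location: str         # where the Content Data Object sits on the filesystem
    format_name: str          # Representation Information: file format
    format_version: str       # Representation Information: format version
    format_registry_ref: str  # reference to a format registry entry


@dataclass
class PreservationEvent:
    event_type: str           # e.g. "ingest", "format migration"
    timestamp: str
    detail: str = ""


@dataclass
class PreservationDescription:
    global_id: str            # Reference Information, e.g. a DOI
    local_id: str             # Reference Information, e.g. a URI
    provenance: List[PreservationEvent] = field(default_factory=list)
    context: Dict[str, str] = field(default_factory=dict)  # relationship -> target
    fixity: Dict[str, str] = field(default_factory=dict)   # algorithm -> checksum


@dataclass
class ArchivalInformationPackage:
    """Outer record playing the packaging role: it links CI, PDI, and DI."""
    content: ContentInformation
    pdi: PreservationDescription
    descriptive: Dict[str, str] = field(default_factory=dict)

    def record_event(self, event_type: str, timestamp: str, detail: str = "") -> None:
        """Append a preservation event to the provenance history."""
        self.pdi.provenance.append(PreservationEvent(event_type, timestamp, detail))


# All values below are placeholders invented for this example.
aip = ArchivalInformationPackage(
    content=ContentInformation(
        cdo_location="/archive/example/article.pdf",
        format_name="PDF",
        format_version="1.4",
        format_registry_ref="registry-entry-placeholder",
    ),
    pdi=PreservationDescription(
        global_id="doi:10.0000/example",
        local_id="http://example.org/article/1",
        context={"isPartOf": "example-journal-issue"},
    ),
    descriptive={"title": "An Example Article"},
)
aip.record_event("ingest", "2011-08-24T00:00:00Z")
```

The provenance list begins with the ingest event, and later events such as a format migration or a replacement due to corruption would be appended in the same way.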

2. References

2.1. OAIS (2002). CCSDS 650.0-B-1: Reference Model for an Open Archival Information System (OAIS). Blue Book, Issue 1, January 2002 (ISO 14721:2003). Accessed 2011.08.24.

3. Document History







  • Draft created: Aurianne Steinman
  • Draft formatted: Aurianne Steinman
  • Suggested edits: Aurianne Steinman
  • Minor edits: Steve Marks










Packaging Information (PI)

Packaging information is present in MarkLogic, where a reference to the CDO's location on the filesystem is stored.

[PI is provided by a METS file that describes linkages between the CI (stored in the filesystem), the PDI (as contained in a PREMIS-formatted document that will hopefully reside in both the filesystem and in MarkLogic) and the DI (as stored in MarkLogic).]
