The amount of content in a Scholars Portal AIP (Archival Information Package) can vary, depending on how much the publisher provides. This description uses the OAIS Information Model to illustrate the completeness of our conceptual model.

Things in red are currently being implemented, but are intended to be part of the final documentation. --sbm

Content Information (CI)

The Content Data Object (CDO) is stored in the format in which the publisher provided it, in its own directory on the filesystem.

Representation Information is provided by DROID's identification of the CDO's format, together with a link to the format's entry in PRONOM. Format characterization is provided by JHOVE. This information is stored in the METS file that describes the object (see PI).

Preservation Description Information (PDI)

Reference Information - Identifiers are stored for each article, identifying it both globally (DOI) and locally (URL).

Provenance Information - PREMIS-formatted metadata provides a history of preservation events in the object's lifetime, including ingest and any preservation actions taken on the object (e.g., replacement due to corruption, format migration).

Context Information - Information is stored regarding the journal and issue from which the article originated. This information is used to construct a table of contents of each journal issue.
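
As a rough illustration of how journal and issue context can drive a table of contents, the sketch below groups article records by journal and issue and orders them by starting page. The record fields (journal, issue, first_page, title) and the build_toc function are hypothetical names chosen for this example; they are not the actual Scholars Portal schema or code.

```python
from collections import defaultdict

# Hypothetical context records; field names are illustrative only.
articles = [
    {"journal": "Journal A", "issue": "12(3)", "first_page": 45, "title": "Second article"},
    {"journal": "Journal A", "issue": "12(3)", "first_page": 1, "title": "First article"},
    {"journal": "Journal B", "issue": "4(1)", "first_page": 10, "title": "Only article"},
]


def build_toc(records):
    """Group article titles by (journal, issue), ordered by starting page."""
    grouped = defaultdict(list)
    for record in records:
        grouped[(record["journal"], record["issue"])].append(record)
    return {
        key: [r["title"] for r in sorted(items, key=lambda r: r["first_page"])]
        for key, items in grouped.items()
    }


toc = build_toc(articles)
```

Each key in toc is a (journal, issue) pair and each value is that issue's ordered list of article titles.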

Fixity Information - Checksum information is generated for the object at the time of ingest, just after format validation. Comprehensive fixity checks are performed twice yearly.
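
The ingest-time checksum and the later fixity check can be sketched as follows. This is a minimal example under stated assumptions: the text above does not name the checksum algorithm, so SHA-256 is assumed here, and the function names compute_checksum and verify_fixity are hypothetical.

```python
import hashlib


def compute_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks.

    The algorithm is an assumption; the source does not say which one is used.
    """
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path, recorded_checksum, algorithm="sha256"):
    """Compare a freshly computed checksum with the one recorded at ingest."""
    return compute_checksum(path, algorithm) == recorded_checksum
```

At ingest, the value returned by compute_checksum would be recorded with the object; a periodic check would then call verify_fixity against that recorded value and flag any object for which it returns False.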

Packaging Information (PI)

Packaging information is present in Mark Logic, where a reference to the CDO's location on the filesystem is stored.

PI is also provided by a METS file located in the CDO's directory. This file describes the linkages between the CI (stored in the filesystem), the PDI (contained in a PREMIS-formatted document that is intended to reside in both the filesystem and Mark Logic), and the DI (stored in Mark Logic).
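
To make these linkages concrete, here is a minimal sketch of a METS document that points to a content file, a PREMIS document, and a descriptive metadata file. It is an illustration only, not the actual Scholars Portal METS profile: the file names, ID values, and the build_mets function are hypothetical, and the use of mdRef pointers (rather than embedded metadata) is an assumption.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)

HREF = f"{{{XLINK_NS}}}href"


def q(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(cdo_href, premis_href, descriptive_href):
    """Build a minimal METS document linking CI, PDI, and DI (hypothetical layout)."""
    mets = ET.Element(q("mets"))

    # DI: pointer to the descriptive (NLM) XML file.
    dmd = ET.SubElement(mets, q("dmdSec"), {"ID": "dmd1"})
    ET.SubElement(dmd, q("mdRef"), {
        "LOCTYPE": "URL", "MDTYPE": "OTHER", "OTHERMDTYPE": "NLM",
        HREF: descriptive_href,
    })

    # PDI: pointer to the PREMIS document.
    amd = ET.SubElement(mets, q("amdSec"))
    prov = ET.SubElement(amd, q("digiprovMD"), {"ID": "prov1"})
    ET.SubElement(prov, q("mdRef"), {
        "LOCTYPE": "URL", "MDTYPE": "PREMIS", HREF: premis_href,
    })

    # CI: the content data object as delivered by the publisher.
    file_sec = ET.SubElement(mets, q("fileSec"))
    group = ET.SubElement(file_sec, q("fileGrp"))
    file_el = ET.SubElement(group, q("file"), {"ID": "file1", "ADMID": "prov1"})
    ET.SubElement(file_el, q("FLocat"), {"LOCTYPE": "URL", HREF: cdo_href})

    # The structural map ties the file to its descriptive metadata.
    struct_map = ET.SubElement(mets, q("structMap"))
    div = ET.SubElement(struct_map, q("div"), {"DMDID": "dmd1"})
    ET.SubElement(div, q("fptr"), {"FILEID": "file1"})

    return ET.tostring(mets, encoding="unicode")


# Hypothetical file names, all relative to the CDO's directory.
mets_xml = build_mets("article.pdf", "premis.xml", "article-nlm.xml")
```

The ID and IDREF attributes (ADMID, DMDID, FILEID) are what express the linkages: the file entry refers to its provenance section, and the structural map refers to both the file and its descriptive metadata.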

Descriptive Information (DI)

Descriptive metadata about the object is stored in Mark Logic, the database driving Scholars Portal's search application. This information takes the form of an XML document conforming to the NLM Archiving and Interchange Tag Set. The same information is also present in an XML file located in the filesystem directory with the object and referenced in the METS file.

