Multiple Batches In Collection

From UA Libraries Digital Services Planning and Documentation
Revision as of 14:36, 25 June 2010 by Scpstu32 (talk | contribs)

When processing a very large or ongoing collection, it is not feasible to wait for completion of digitization prior to online delivery or archival storage.

Currently we are exporting Excel spreadsheets as tab-delimited for long-term storage of collection metadata, and for processing into MODS.

Assuming the collection is u0003_0000653. The first batch of content to be processed for delivery and storage is to be accompanied by an exported spreadsheet named u0003_0000653.1.txt.*

The second batch of content will be accompanied by an exported spreadsheet named u0003_0000653.2.txt, and so forth.

Each spreadsheet so exported will NOT include content from other spreadsheets.

The understanding is that by combining all the spreadsheets for this collection in the order designated by the number preceding the extension, the entire collection's metadata can be constructed.

* Note: Sometimes when beginning a collection we do not know that more will be forthcoming. Therefore, if there is a .2.txt exported spreadsheet in the archive, it is understood that the first version of the spreadsheet without a sequence number prior to the extension, is indeed the .1.txt for reconstruction purposes.