Collection Information

From UA Libraries Digital Services Planning and Documentation
Revision as of 18:03, 6 December 2013 by Kgmatheny (Talk | contribs)

If you are the first person to work on a manuscript collection, create an xml file in a plain-text editor ('Notepad' on the PC or 'TextEdit' on the Mac), saving it as ANSI/ASCII text on Windows or as UTF-8 on the Mac. The file must contain the following information, in this order and using the template below, with no carriage returns (newlines) or formatting within any element.

Please make sure that all text saved in these files (like the tab-delimited files from the Excel exports) is encoded as either ANSI/ASCII or UTF-8. That means that if any content was cut and pasted from Word or a PDF into these files, you must overwrite all the quotes, hyphens, apostrophes, and other non-keyboard characters with plain text or UTF-8 Unicode. Otherwise they will display as garbage. In addition, if you MUST use "&", encode it as &amp; so it won't break the xml.
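As a rough illustration of the cleanup described above, the following Python sketch replaces a few common Word/PDF punctuation characters with plain-text equivalents, collapses newlines into single spaces, and escapes "&". The function name clean_element_text and the replacement table are assumptions made for this example, not part of the existing workflow, and the table is not exhaustive.

```python
import re
from xml.sax.saxutils import escape

# Illustrative (not exhaustive) map of "smart" punctuation to plain text.
SMART_PUNCTUATION = {
    "\u2018": "'",    # left single quote
    "\u2019": "'",    # right single quote / apostrophe
    "\u201c": '"',    # left double quote
    "\u201d": '"',    # right double quote
    "\u2013": "-",    # en dash
    "\u2014": "-",    # em dash
    "\u2026": "...",  # ellipsis
    "\u00a0": " ",    # non-breaking space
}


def clean_element_text(raw: str) -> str:
    """Return raw text made safe to paste inside a single XML element."""
    text = raw
    for smart, plain in SMART_PUNCTUATION.items():
        text = text.replace(smart, plain)
    # Collapse newlines, tabs, and runs of spaces into one space.
    text = re.sub(r"\s+", " ", text).strip()
    # escape() converts &, <, and > to &amp;, &lt;, and &gt;.
    return escape(text)
```

Text that already contains &amp; would be escaped a second time, so this should only be run on raw values that have not yet been encoded.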

Save the file in the Admin folder for the collection, naming it for the collection itself. Thus, u0003_0000252.xml for the Cabaniss papers.

If more than one digital collection is created for a given analog collection, number the collection file names sequentially, as if they were pages. That is, if we return to the Cabaniss Papers and digitize some descriptions of tigers in Africa, the collection xml file for this second set of content would be saved as u0003_0000252_002.xml, and the original xml file should have "_001" appended to its name before the .xml extension. At that point, u0003_0000252.xml becomes the first in a sequence (u0003_0000252_001.xml), rather than a representation of the complete analog collection.
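The naming rule can be sketched in Python as follows. The function name collection_xml_filename is hypothetical, and the three-digit zero padding is inferred from the "_001" and "_002" examples above.

```python
from typing import Optional


def collection_xml_filename(collection_id: str, sequence: Optional[int] = None) -> str:
    """Build the collection xml file name.

    With no sequence number, the file represents the complete analog
    collection. With one, it is one of several digital collections.
    """
    if sequence is None:
        return f"{collection_id}.xml"
    if sequence < 1:
        raise ValueError("sequence numbers start at 1")
    # Three-digit padding is an assumption based on "_001" / "_002".
    return f"{collection_id}_{sequence:03d}.xml"
```

For the Cabaniss papers, collection_xml_filename("u0003_0000252") gives the single-collection name, and passing 1 or 2 as the second argument gives the sequenced names.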


<?xml version="1.0" encoding="UTF-8"?>

Step by step instructions:

 This section describes how to make Collection Information XML files from "scratch" using the file: 
 S:\Digital Projects\Organization\Digital Program\Selection.xlsx.
 In general, however, most of the values, such as Digital Collection Name, are already in the Descriptive Metadata spreadsheets handed to us by the archivists
 as well as the TrackingFilenames spreadsheet.
 That is to say, most of the work in determining such values has already been done by the archivists.
 See this page for information on matching values across documents.

All of this information should be available from S:\Digital Projects\Organization\Digital Program\Selection.xlsx, which was filled in by the archivists.

If this information is not available in the given spreadsheet, please contact the archivists to ask for it.


<Digital_Collection_Name>Enter value from "Project Name" column</Digital_Collection_Name>

  • NOTE! This value -- exactly! -- must be entered in the digital collection column of the metadata spreadsheet for each object, so that we can retrieve all the contents of a collection with a search.
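One way to check for exact matches is to scan a tab-delimited export of the metadata spreadsheet. The sketch below is only an illustration: the function name find_mismatched_rows and the column header "Digital Collection" are assumptions, so substitute the header your spreadsheet actually uses.

```python
import csv
import io
from typing import List


def find_mismatched_rows(tsv_text: str, collection_name: str,
                         column: str = "Digital Collection") -> List[int]:
    """Return spreadsheet row numbers whose collection value is not an exact match.

    The default column header is a hypothetical name, not one confirmed
    by the documentation.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    mismatches = []
    # Row 1 is the header, so data rows start at 2.
    for row_number, row in enumerate(reader, start=2):
        if row.get(column) != collection_name:
            mismatches.append(row_number)
    return mismatches
```

The comparison is deliberately case-sensitive and whitespace-sensitive, because the value must match exactly for the collection search to retrieve every object.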

<Alphabetized_By>Enter value from "Alphabetize" column</Alphabetized_By>

  • If this value is not available, enter what makes sense. This is NOT an optional field! It determines how things appear on our collections page.
  • DO NOT precede the value in the "Alphabetized By" element with "a", "an", or "the". We don't want to sort things by definite or indefinite articles.
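A minimal Python sketch of the article rule, assuming a hypothetical helper named strip_leading_article. It only removes a leading "a", "an", or "the" when it stands as a separate word, so names such as "Anderson" are left alone.

```python
import re

# Match a leading article only when it is followed by whitespace.
_LEADING_ARTICLE = re.compile(r"^(?:a|an|the)\s+", re.IGNORECASE)


def strip_leading_article(value: str) -> str:
    """Remove one leading 'a', 'an', or 'the' from a sort value."""
    return _LEADING_ARTICLE.sub("", value.strip(), count=1)
```

This does not decide what the sort value should be (for example, inverting a personal name); it only guards against a leading article.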

<Type_Of_Content>Enter value from "Genre/Type" column</Type_Of_Content>

  • Note that the acceptable types are these: book, image, text, audio, video, mixed media, finding aid, score, other. This value determines what icon will be displayed, if we do NOT have one for this collection -- so choose wisely. Sheet music should be "score" -- most manuscript content should be "text". USE ONE TYPE ONLY (for example, DO NOT enter 'image; text' -- choose ONE).
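The list of acceptable types lends itself to a simple check. In this sketch the function name validate_type_of_content is hypothetical, and lowercasing the input before comparison is an assumption based on the lowercase list above.

```python
ACCEPTABLE_TYPES = (
    "book", "image", "text", "audio", "video",
    "mixed media", "finding aid", "score", "other",
)


def validate_type_of_content(value: str) -> str:
    """Return the normalized type, or raise ValueError if it is not allowed."""
    candidate = value.strip().lower()
    if candidate not in ACCEPTABLE_TYPES:
        raise ValueError(
            f"{value!r} is not one of the acceptable types: "
            + ", ".join(ACCEPTABLE_TYPES)
        )
    return candidate
```

A combined value such as 'image; text' fails the check, which matches the rule that only ONE type may be entered.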

<Analog_Collection_Name>Enter value from "Name of Analog Collection" column</Analog_Collection_Name>

More information on the Analog Collection Title is here: Analog Collection Title

<Manuscript_Number>Enter value from "Manuscript Number" column (always give in format: MS ####)</Manuscript_Number>

<Finding_Aid_Link> </Finding_Aid_Link>

Search for the Analog Collection name in these 2 places:

  1. The new interface ... here, enter the collection number, for example, for MS 252, enter
  2. The old interface

If you find a live link to a finding aid, enter it in the above field. If no finding aid can be found, leave the field empty. NOTE: If entering the collection number in Acumen returns something that is not a finding aid, it is probably a default link to the very xml file you are trying to create. That means the collection is already in Acumen, and you do not need to create the file. Do not put this link in the Finding Aid Link field.

If you did not find a Manuscript Number in the spreadsheet, but you do find a finding aid online, use the finding aid to fill in the Manuscript Number field. If possible, use 4 digits for the MS number, left-padding with zeros so that MS 14 would be written MS 0014. Always give the Manuscript Number in this format: MS 1632

Alternative forms the Manuscript number may currently have, in the filename or elsewhere:

  1. ms_1952.pdf would be MS 1952
  2. Reference Code USALM1952 would be MS 1952
  3. MSS.0630 would be MS 0630 or MS 630
  4. u0003_0000252 would be MS 0252 or MS 252
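The conversions in the list above can be sketched in Python. This is a heuristic, not an established tool: the function name normalize_manuscript_number is hypothetical, and the pattern for collection identifiers (the letter u, four digits, an underscore, then seven digits) is inferred from the u0003_0000252 example. It always produces the four-digit, zero-padded form.

```python
import re

# Collection identifiers such as u0003_0000252 (pattern inferred from examples).
_COLLECTION_ID = re.compile(r"^u\d{4}_(\d{7})", re.IGNORECASE)
# Otherwise, take the last run of digits in the string.
_TRAILING_DIGITS = re.compile(r"(\d+)\D*$")


def normalize_manuscript_number(raw: str) -> str:
    """Convert an alternative form to 'MS ####' with left zero padding."""
    text = raw.strip()
    # Drop a letters-only file extension such as ".pdf" or ".xml".
    text = re.sub(r"\.[A-Za-z]{2,4}$", "", text)
    match = _COLLECTION_ID.match(text) or _TRAILING_DIGITS.search(text)
    if match is None:
        raise ValueError(f"no manuscript number found in {raw!r}")
    return f"MS {int(match.group(1)):04d}"
```

Checking the collection identifier pattern first means a sequenced file name such as u0003_0000252_002.xml still yields the collection's number rather than the sequence suffix.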

<Digital_Collection_Description> Enter value from "Blurb" column </Digital_Collection_Description>


Example:

<?xml version="1.0" encoding="UTF-8"?>


<Digital_Collection_Name>Septimus D. Cabaniss Papers</Digital_Collection_Name>

<Alphabetized_By>Cabaniss, S. D.</Alphabetized_By>


<Analog_Collection_Name>The S.D. Cabaniss Papers, 1820-1937</Analog_Collection_Name>

<Manuscript_Number>MS 0252</Manuscript_Number>


<Digital_Collection_Description>Materials from the papers of this nineteenth-century Madison County, Alabama, attorney who drafted a controversial will for wealthy planter Samuel Townsend which manumitted certain slaves and designated them as Townsend's primary heirs. Selected items include Townsend's will, a deposition given by S.D. Cabaniss concerning his role in the estate, and a report by Rev. William D. Chadick discussing the prospect of settling the newly-manumitted Townsend heirs in Ohio.</Digital_Collection_Description>

