Procedural Anomalies

From UA Libraries Digital Services Planning and Documentation
(Difference between revisions)
Jump to: navigation, search
(New page: Procedural Anomalies ==Scanning Object== ===Metal paperclips, pins, and other removable fasteners=== *If the fastener can be safely removed, do so…carefully [*When returning to box/fol...)
 
Line 4: Line 4:
 
===Metal paperclips, pins, and other removable fasteners===
 
===Metal paperclips, pins, and other removable fasteners===
 
*If the fastener can be safely removed, do so…carefully
 
*If the fastener can be safely removed, do so…carefully
[*When returning to box/folder, replace old metal fasteners with plastic clips]
+
*When returning to box/folder, throw away old metal fasteners and replace with plastic clips
  
 
===Staples, brads, and other hard-to-remove fasteners===
 
===Staples, brads, and other hard-to-remove fasteners===
 
*DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM: when in doubt, consult an archivist
 
*DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM: when in doubt, consult an archivist
*Scan the item as-is
+
*Scan the item carefully, with fasteners in place
 
*It should be possible to work around a corner or side fastener; laying open or folding back the paper is okay, just DON'T CREASE it
 
*It should be possible to work around a corner or side fastener; laying open or folding back the paper is okay, just DON'T CREASE it
  
Line 18: Line 18:
  
 
===Interleaved material found in bound item===
 
===Interleaved material found in bound item===
 +
*If material is found between pages or clipped to pages, scan it by itself against the background, and number it in sequence.
 +
**For example, if a check stub is found after page 0057, it should be numbered 0058, and the facing page will be 0059.
 +
*If this material is itself a bound item (such as a pamphlet or booklet), it should be given its own item number and added to the metadata.
 +
**For example, if an opera program is found inserted between pages 0042 and 0043 of item 0000034, it should be scanned separately and given the number 0000035, paginated starting at 0001, like any other item.
  
 
+
===Very large item===
===Item is very large===
+
 
*Take test images to evaluate lighting and focus at edges of items; if okay, capture
 
*Take test images to evaluate lighting and focus at edges of items; if okay, capture
 
*If too large for capture bed, consult with Jeremiah about how best to divide item over images
 
*If too large for capture bed, consult with Jeremiah about how best to divide item over images
  
===Paper or binding is fragile===
+
===Very fragile paper or binding===
 
*If item can be scanned with extreme care, do it
 
*If item can be scanned with extreme care, do it
 
**Handle with gloves
 
**Handle with gloves
Line 37: Line 40:
  
  
===Item not in metadata: Photos/pre-numbered by archivists===
+
===Item not in metadata: Photos/pre-numbered===
*Scan the item unless you've been told to leave out skipped items
+
*This item has been culled from the spreadsheet by April or someone else before it gets to us. Check to see if we're digitizing skipped items or not.
*Insert the missing number into the metadata spreadsheet and do your best to extrapolate metadata from nearby lines.
+
*If you do capture the item, insert the missing number into the metadata spreadsheet and do your best to extrapolate metadata from nearby lines.
  
 
===Item has fewer/more pages than metadata indicates===
 
===Item has fewer/more pages than metadata indicates===
 +
*Fewer pages than metadata
 +
**Check to see if anything is loose in the folder.
 +
**If you can't determine where the missing page might be, scan what you do have, amend the page count in the Format column, and add a note to the Description column explaining that a page is missing.
 +
*More pages than metadata
 +
**Check object against metadata to make sure the page actually belongs to that item.
 +
**If it belongs, correct the total in the spreadsheet's Format column.
  
 
+
===Multiple items seem to constitute one object===
===Multiple items actually constitute one object===
+
*With manuscripts, trust the instinct of the archivist and assume there's a reason these items are not combined in the metadata and clipped together.
 
+
*With photos, this happens because they are always numbered individually during processing, and our filenames are based on those image numbers. If this happens
 +
***assign the item number of the first page to the whole object, then amend the metadata line for the first page to reflect that this is a whole, not part of one -- in the Title, Description, and Format columns; and 
 +
***in TrackingFiles, explain why the remaining pre-assigned filenames have not been captured.
 +
**Example: Photos 0000123, 0000124, and 0000125 are pictures of a single document, The Bradley Contract, and would not make sense unless seen in context together. Assign 0000123 as the item number and treat these images like pages in a multi-page document: place a folder for that item in the Scans folder, with pages 0000123_0001, 0000123_0002, and 0000123_0003 inside it. In the metadata, relabel metadata line one (Bradley Contract page 1) to The Bradley Contract. In the TrackingFiles beside 0000124 and 0000125, note that the images were part of a single document and scanned as 0000123.
  
 
===Letter out of order: logical flow of letter does not match physical sequence of pages===
 
===Letter out of order: logical flow of letter does not match physical sequence of pages===

Revision as of 16:51, 20 September 2012

Procedural Anomalies

Contents

Scanning Object

Metal paperclips, pins, and other removable fasteners

  • If the fastener can be safely removed, do so…carefully
  • When returning to box/folder, throw away old metal fasteners and replace with plastic clips

Staples, brads, and other hard-to-remove fasteners

  • DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM: when in doubt, consult an archivist
  • Scan the item carefully, with fasteners in place
  • It should be possible to work around a corner or side fastener; laying open or folding back the paper is okay, just DON'T CREASE it

Things taped or glued to an item

  • DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM
  • Scan the item as-is
  • If the taped/glued object is hanging (not completely stuck down), it might be possible to push it back so that it blocks less of or less important parts of the item
  • If glue/tape works as a binding (side, corner), treat it as a hard-to-remove fastener (see above)

Interleaved material found in bound item

  • If material is found between pages or clipped to pages, scan it by itself against the background, and number it in sequence.
    • For example, if a check stub is found after page 0057, it should be numbered 0058, and the facing page will be 0059.
  • If this material is itself a bound item (such as a pamphlet or booklet), it should be given its own item number and added to the metadata.
    • For example, if an opera program is found inserted between pages 0042 and 0043 of item 0000034, it should be scanned separately and given the number 0000035, paginated starting at 0001, like any other item.

Very large item

  • Take test images to evaluate lighting and focus at edges of items; if okay, capture
  • If too large for capture bed, consult with Jeremiah about how best to divide item over images

Very fragile paper or binding

  • If item can be scanned with extreme care, do it
    • Handle with gloves
    • Don't put under glass or otherwise compress
    • Don't use flatbed scanner

Intervening with Tracking or Metadata

Item not in box/folder

Make a note in the TrackingFiles. Obviously, you can't capture what's not there.

Item not in metadata: Manuscripts/auto-numbered by us

Item not in metadata: Photos/pre-numbered

  • This item has been culled from the spreadsheet by April or someone else before it gets to us. Check to see if we're digitizing skipped items or not.
  • If you do capture the item, insert the missing number into the metadata spreadsheet and do your best to extrapolate metadata from nearby lines.

Item has fewer/more pages than metadata indicates

  • Fewer pages than metadata
    • Check to see if anything is loose in the folder.
    • If you can't determine where the missing page might be, scan what you do have, amend the page count in the Format column, and add a note to the Description column explaining that a page is missing.
  • More pages than metadata
    • Check object against metadata to make sure the page actually belongs to that item.
    • If it belongs, correct the total in the spreadsheet's Format column.

Multiple items seem to constitute one object

  • With manuscripts, trust the instinct of the archivist and assume there's a reason these items are not combined in the metadata and clipped together.
  • With photos, this happens because they are always numbered individually during processing, and our filenames are based on those image numbers. If this happens
      • assign the item number of the first page to the whole object, then amend the metadata line for the first page to reflect that this is a whole, not part of one -- in the Title, Description, and Format columns; and
      • in TrackingFiles, explain why the remaining pre-assigned filenames have not been captured.
    • Example: Photos 0000123, 0000124, and 0000125 are pictures of a single document, The Bradley Contract, and would not make sense unless seen in context together. Assign 0000123 as the item number and treat these images like pages in a multi-page document: place a folder for that item in the Scans folder, with pages 0000123_0001, 0000123_0002, and 0000123_0003 inside it. In the metadata, relabel metadata line one (Bradley Contract page 1) to The Bradley Contract. In the TrackingFiles beside 0000124 and 0000125, note that the images were part of a single document and scanned as 0000123.

Letter out of order: logical flow of letter does not match physical sequence of pages

  • Normally, when a letter is written on folded paper, we're easily able to reproduce the text in order by following it around the physical object. For example, some old letters that were written on folded paper (picture something like a greeting card) when unfolded begin on the right side of the page, move to the back, then wrap around to the left side of the front page: 4 1 | 2 3. The digital item pagination will reflect this order: 0004 0001 | 0002 0003.
  • If the flow of a letter's text does not match up in any straightforward way to its physical layout on the page, this is the rule of thumb: representing the object is more important than interpreting its content. If possible, begin with the first page, then scan the other pages in order. The image online, then, will approximate the researcher's experience of looking at the pages and trying to figure out how to follow the text.
  • Example
    • A letter might start on front right, jump to bottom of back right, continue on front left then back left, and end on top of back right: 3 1 | 4 5/2.
    • This would actually be scanned just like the simple example above: 0004 0001 | 0002 0003. Note that this results in 4 scans for 5 text parts. It's confusing and space-wasting to render the back right page (5/2 or 0003) twice. The researcher should be able to sort this out.
Personal tools