The following policies have been adopted to deal with procedural anomalies: irregularities encountered during the digitization process, either with the physical state of the materials or their intellectual organization. For metadata issues, see Metadata Anomalies.
In general, we are operating under these assumptions:
- Digitized items are surrogates for analog items, so the experience of looking at an item online should as much as possible approximate the experience of viewing an object in person.
- Digitization can be a means of conserving archival items, but only if the digitization process is not itself destructive.
Very large item
- Take test images to evaluate lighting and focus at edges of items; if okay, capture
- If too large for capture bed, consult with Jeremiah about how best to divide item over multiple images\
- If multiple images are required, you will need to combine them into one file using Photoshop, the basic procedures for which are described at Using Adobe Photoshop.
Very fragile paper or binding
- If item can be scanned with extreme care, do it
- Handle with gloves
- Don't put under glass or otherwise compress
- Don't use flatbed scanner
- When in doubt, get a second opinion from Jeremiah or consult an archivist
- Keep these guidelines for handling material in mind
- Opening a folded section of paper is okay
- Opening a creased section of paper is not okay
- What makes something creased? The severity of the folded edge or presence of multiple compounded folds, coupled with the fragility of old paper. If pulling back fold seems like it will damage the paper, it is creased, in our vernacular.
- DO NOT UN-CREASE ANYTHING. Capture the page anyway, unless you feel the area obscured by the crease is so great or important that the scan would be worthless. In that case, consult Jeremiah or Jody about whether to proceed.
- For example, if it's as bad as this...
...don't bother with capture.
- Place a sheet of tan card or folder stock paper underneath the translucent material being scanned.
- White paper can also be used if the material being digitized is a neutral gray
- In order to read text on a sheet of vellum that is bound up in a book, you must mask the text on any subsequent pages
- Black tends to be a bad choice because it lowers the contrast between the text and the paper of the page
- In general white or some other color paper is distracting because the hue of the vellum does not blend well with it.
- Attempt to capture as normal.
- If the image shows a lot of reflection (or is super dark), try turning the item 90 degrees. This should make it catch light in a different way.
- If you cannot get a good capture, skip the item.
Pages stuck together
- DO NOT SEPARATE THEM
Staples, brads, and other hard-to-remove fasteners
- DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM: when in doubt, consult an archivist
- Capture the item carefully, with fasteners in place
- It should be possible to work around a corner or side fastener; laying open or folding back the paper is okay, just DON'T CREASE it
Metal paperclips, pins, and other removable fasteners
- If the fastener can be safely removed, do so…carefully, then capture
- If it appears as though something was clipped/pinned into place to obscure the original text, capture both the clipped/pinned piece and the text underneath -- think about it: if you were holding the object in your hands, you'd be able to remove the clip/pin and get to that obscured text if you wanted to
- When returning to box/folder, throw away old metal fasteners and replace with plastic clips
Things taped or glued to an item
- DO NOT REMOVE ANYTHING THAT MIGHT DAMAGE THE ITEM
- Scan the item as-is
- If the taped/glued object is hanging (not completely stuck down), it might be possible to push it back so that it blocks less of or less important parts of the item
- If glue/tape works as a binding (side, corner), treat it as a hard-to-remove fastener (see above)
In a plastic sleeve
- If a good image can be captured through the sleeve, leave it on and shoot through it
- If a good image can't be captured through the sleeve...
- and the sleeve can't be removed or can't be removed without damaging the item, consult Jeremiah or a coworker about whether a capture is worth taking/keeping
- and the sleeve can be safelyremoved, it's okay to do so -- but ONLY if you can't get a good capture through the sleeve: the archivists put that sleeve on for a reason, after all
- This is the standard way to keep photos safe, not usually an indication of fragile quality -- except in the case of Photo Negatives, which are usually fragile
- It is usually perfectly okay to take the sleeve off; you will have to do so if it is translucent instead of transparent, anyway
- If the item appears to be fragile, follow the instructions above for manuscript items in plastic sleeves
Strange page ordering
Interleaved material found in bound item
- If material is found between pages or clipped to pages
- scan it by itself against the capture bed if possible; if not, use black paper around the edges to mask the object you're shooting it against
- number it in sequence.
- In the metadata for the bound item, include this in the Description field: "Interleaved materials were found in this item and scanned in place."
- For example, if a check stub is found after page 0057, it should be numbered 0058, and the facing page will be 0059.
- If this material is itself a bound item (such as a pamphlet or booklet)
- it should be given its own item number and
- added as a new line in the metadata; in the Description field, write: "This item was found within another item, Parent Item #, Title of Parent Item."
- For example, if an opera program is found inserted between pages 0042 and 0043 of item 0000034, it should be scanned separately and given the first available item number (look at the bottom of the spreadsheet), paginated starting at 0001, like any other item.
- it should be given its own item number and
More than one right side up
- Use the most common orientation found in an item for all captures taken from a single item.
- If a book has plates that are rotated sideways, do not rotate those few pages to make them view right side up. leave them sideways so they match the format orientation of the rest of the pages in the book.
- If a letter has an address written on the back page that is turned sideways leave this capture sideways and do not reorient.
- If a letter has a separate envelope, please orient the envelope so its text is right side up
- We want to maintain the consistency of an items overall presentation when all of its pages are viewed side by side.
- Typed pages that will be OCRed should be oriented correctly, regardless of the orientation of the rest of the pages
Logical flow of pages does not match physical sequence of pages
- Normally, when a letter is written on folded paper, we're easily able to reproduce the text in order by following it around the physical object. For example, some old letters that were written on folded paper (picture something like a greeting card) when unfolded begin on the right side of the page, move to the back, then wrap around to the left side of the front page: 4 1 | 2 3. The digital item pagination will reflect this order: 0004 0001 | 0002 0003.
- If the flow of a letter's text does not match up in any straightforward way to its physical layout on the page, this is the rule of thumb: representing the object is more important than interpreting its content. If possible, begin with the first page, then scan the other pages in order. The image online, then, will approximate the researcher's experience of looking at the pages and trying to figure out how to follow the text.
- A letter might start on front right, jump to bottom of back right, continue on front left then back left, and end on top of back right: 3 1 | 4 5/2.
- This would actually be scanned just like the simple example above: 0004 0001 | 0002 0003. Note that this results in 4 scans for 5 text parts. It's confusing and space-wasting to render the back right page (5/2 or 0003) twice. The researcher should be able to sort this out.
- Sometimes, a book will be partly filled in, then the person turned it over and started again from the back cover. In this case
- Shoot the pages in the correct intellectual order (for example, 1-50 then backward 100-51).
- Shoot the front and back covers as the first and last pages, as with a normal book; DO NOT SHOOT THE BACK COVER IN THE MIDDLE OF THE RUN, before the second part of the book
- Make a note in the Description field of the metadata indicating how the book presented and how the pages have been ordered; something like this: In the middle of this book, the writer turned the book over and continued writing from the back cover toward the middle. The pages are presented here in that order.
Missing things or mismatches with metadata
Item not in box/folder
Make a note in the log data. Obviously, you can't capture what's not there.
The item has blank pages you didn't scan
Include a note in the metadata spreadsheet's Description field: "Item had blank pages that were not scanned."
Multiple items seem to constitute one object
- With manuscripts, trust the instinct of the archivist and assume there's a reason these items are not combined in the metadata and clipped together.
- With photos, this happens because they are always numbered individually during processing, and our filenames are based on those image numbers. For example, if a letter is found in a photo collection, each page is likely numbered as a separate item, which from our perspective is a mistake. If this happens
- assign the item number of the first page to the whole object, then amend the metadata line for the first page to reflect that this is a whole, not part of one -- in the Title, Description, and Format columns; and
- in TrackingFiles, explain why the remaining pre-assigned filenames have not been captured.
- Example: Photos 0000123, 0000124, and 0000125 are pictures of a single document, The Bradley Contract, and would not make sense unless seen in context together. Assign 0000123 as the item number and treat these images like pages in a multi-page document: place a folder for that item in the Scans folder, with pages 0000123_0001, 0000123_0002, and 0000123_0003 inside it. In the metadata, relabel metadata line one (Bradley Contract page 1) to The Bradley Contract. In the TrackingFiles beside 0000124 and 0000125, note that the images were part of a single document and scanned as 0000123.
Multiple physical copies of the exact same printed material
- Look over the materials and choose the copy that is in the best condition
- Scan only the one copy
- If there is more than one line of metadata, for instance there are five lines of metadata for five copies. add the skipped notation to the master spreadsheet for the remaining undigitized copies