Documenting Invisible Digitization
There are various kinds of digitization that we perform, for which we have nothing to show.
- corrective digitization (this includes deleting captures as well as recapturing them or correcting numbering errors or duplication)
- OCR Transcript Tiffs (where we capture typed transcripts, then discard them after collecting OCR)
- weeded or deleted collections (think rights issues, and duplication)
- digitization for others (donors, publications, web exhibits, etc.)
For each of these types of work, employees must document the work completed in their daily log.
The automated reporter for your daily log (developed by Jeremiah Colonna-Romano, our Digitization Manager) keeps each day's report in an XML file in a particular directory on the share drive.
Our Repository Manager, Jessica Anderson, is developing a script to collect these counts to incorporate them in the appropriate spreadsheet located in S:\Digital Projects\Organization\Digital_Program_Logs\TrackingOddities under the current fiscal year.
Each month, any of these spreadsheets which have been updated since the previous month, need to be exported as tab-delimited text, overwriting the existing text file by the same name in that directory. The text files will be picked up and counted in our Monthly_Count.
Each fiscal year, a new directory needs to be created, and a new set of Excel files created using the templates in the Templates directory, incorporating the previous year's totals into the "Existing" columns for items and subitems, and then exported to create the first set of text files for count collection.