In October 2012, we modified how we collect the count, to capture a snapshot across multiple locations and note duplicates which are in the process of being repaired. We also developed a method of tracking digitization that never sees the light of day. This can include:
- digitization for donors, such as providing digital versions of content donated to Special Collections
- digitization for publication, at the dean's request, often for cost recovery
- weeded collections (such as those containing duplicates), and collections deleted due to rights issues
- digitization of typed transcripts in order to obtain OCR to enable searchability of hand-written documents (we delete the transcript tiffs)
- recapture of badly digitized content
- correction of errors, such as duplicate captures of a single page
- Starting in October 2012, we are capturing monthly snapshots (an example can be viewed here, with a second synopsis specifying GB counts, total digitization thus far, and what was accomplished this past month.
For October, 2012, the synopsis was as follows (not yet automated):
- Digitized 125 new items (8719 captures)
- Total items digitized: 89972
- Total captures: 323204
- Total items in Acumen: 78704
- Total captures in Acumen: 294350
- Total GB in archive: 13852
- Total GB on share: 474
- Total GB: 14326
The following information is deprecated as of 10/2012 (15:39, 2 November 2012 (CDT)Jlderidder)
To collect the count on the storage drive:
- Log onto libcontent1 via SSH and change into the
scriptsdirectory. (see Command-line_Work_on_Linux_Server)
- Run GBcount: type in `GBcount` and hit enter. It will output on the command line the total count in gigabytes for the archival storage directories on the server. It will also output a detailed report in the scripts/output directory.
- Run acumenCount: type in `acumenCount` and hit enter. It will output on the command line the total number of collections represented in Acumen, including the number which do not have EADs; the detailed report will be output in the scripts/output directory.
- Download the report created by acumenCount, and open it in Excel, with tabs as delimiters. This report lists collection identifiers, titles, item count, file count, and whether the collection has an EAD. At the end is the total counts for both items and files.
For the size of content on Share, use the monthly GB counter described here Monthly_gb_counter.
Please refer to this document for information on how we compile our monthly count File:Collecting numbers for use in the monthly count.docx