OCR List
From UA Libraries Digital Services Planning and Documentation
(Difference between revisions)
(New page: '''How to create an OCR List''' # Open the collection's TrackingFiles log file # For the files you are uploading, copy those rows to a new spreadsheet # Delete all columns EXCEPT: ''Filen...) |
m |
||
| Line 3: | Line 3: | ||
# Open the collection's TrackingFiles log file | # Open the collection's TrackingFiles log file | ||
# For the files you are uploading, copy those rows to a new spreadsheet | # For the files you are uploading, copy those rows to a new spreadsheet | ||
| - | # | + | # In the new spreadsheet, delete all columns EXCEPT: ''Filename'' and ''OCR?'' |
| + | #* This will results in a two-column list, with only one filename per line -- which is what the script is looking for | ||
#* ''OCR?'' should be filled in with 1 or 0 for each item (1=yes to OCR, more than half of the item is typewritten, 0=no to OCR) | #* ''OCR?'' should be filled in with 1 or 0 for each item (1=yes to OCR, more than half of the item is typewritten, 0=no to OCR) | ||
| - | #* | + | #* If it isn't, take the time to look over the items in Bridge and fill in that column |
# Save as tab delimited file called [collection number].ocrList.txt | # Save as tab delimited file called [collection number].ocrList.txt | ||
#* Example: u0003_0001577.ocrList.txt | #* Example: u0003_0001577.ocrList.txt | ||
# Put in collection's Admin folder (in Digital_Coll_Completed directory, of course!) | # Put in collection's Admin folder (in Digital_Coll_Completed directory, of course!) | ||
Revision as of 15:14, 23 October 2012
How to create an OCR List
- Open the collection's TrackingFiles log file
- For the files you are uploading, copy those rows to a new spreadsheet
- In the new spreadsheet, delete all columns EXCEPT: Filename and OCR?
- This will results in a two-column list, with only one filename per line -- which is what the script is looking for
- OCR? should be filled in with 1 or 0 for each item (1=yes to OCR, more than half of the item is typewritten, 0=no to OCR)
- If it isn't, take the time to look over the items in Bridge and fill in that column
- Save as tab delimited file called [collection number].ocrList.txt
- Example: u0003_0001577.ocrList.txt
- Put in collection's Admin folder (in Digital_Coll_Completed directory, of course!)
