<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/css" href="http://www.lib.ua.edu/wiki/digcoll/skins/common/feed.css?97"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
	<channel>
		<title>Transcription stats - Revision history</title>
		<link>http://www.lib.ua.edu/wiki/digcoll/index.php?title=Transcription_stats&amp;action=history</link>
		<description>Revision history for this page on the wiki</description>
		<language>en</language>
		<generator>MediaWiki 1.11.0</generator>
		<lastBuildDate>Sat, 25 May 2013 15:43:20 GMT</lastBuildDate>
		<item>
			<title>Jlderidder at 21:26, 18 December 2012</title>
			<link>http://www.lib.ua.edu/wiki/digcoll/index.php?title=Transcription_stats&amp;diff=4352&amp;oldid=prev</link>
			<description>&lt;p&gt;&lt;/p&gt;

			&lt;table style=&quot;background-color: white; color:black;&quot;&gt;
			&lt;col class='diff-marker' /&gt;
			&lt;col class='diff-content' /&gt;
			&lt;col class='diff-marker' /&gt;
			&lt;col class='diff-content' /&gt;
			&lt;tr&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black;&quot;&gt;←Older revision&lt;/td&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black;&quot;&gt;Revision as of 21:26, 18 December 2012&lt;/td&gt;
			&lt;/tr&gt;
		&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;-&lt;/td&gt;&lt;td style=&quot;background: #ffa; color:black; font-size: smaller;&quot;&gt;&lt;div&gt;''All of the below is on &lt;del style=&quot;color: red; font-weight: bold; text-decoration: none;&quot;&gt;libcontent1&lt;/del&gt;.lib.ua.edu.''&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;background: #cfc; color:black; font-size: smaller;&quot;&gt;&lt;div&gt;''All of the below is on &lt;ins style=&quot;color: red; font-weight: bold; text-decoration: none;&quot;&gt;libcontent&lt;/ins&gt;.lib.ua.edu.''&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background: #eee; color:black; font-size: smaller;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background: #eee; color:black; font-size: smaller;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background: #eee; color:black; font-size: smaller;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt; &lt;/td&gt;&lt;td style=&quot;background: #eee; color:black; font-size: smaller;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</description>
			<pubDate>Tue, 18 Dec 2012 21:26:14 GMT</pubDate>			<dc:creator>Jlderidder</dc:creator>			<comments>http://www.lib.ua.edu/wiki/digcoll/index.php/Talk:Transcription_stats</comments>		</item>
		<item>
			<title>Jlderidder: New page: ''All of the below is on libcontent1.lib.ua.edu.''   == Update on success of this round of transcriptions ==  All the TransLive scripts create an output file in the ./output directory in /...</title>
			<link>http://www.lib.ua.edu/wiki/digcoll/index.php?title=Transcription_stats&amp;diff=3728&amp;oldid=prev</link>
			<description>&lt;p&gt;New page: ''All of the below is on libcontent1.lib.ua.edu.''   == Update on success of this round of transcriptions ==  All the TransLive scripts create an output file in the ./output directory in /...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;''All of the below is on libcontent1.lib.ua.edu.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Update on success of this round of transcriptions ==&lt;br /&gt;
&lt;br /&gt;
All the TransLive scripts create an output file in the ./output directory in /srv/scripts/transcripts/mediawiki/.  These output files are dated tab-delimited text files intended to be opened in Excel or another spreadsheet software. &lt;br /&gt;
&lt;br /&gt;
The columns are: &lt;br /&gt;
&lt;br /&gt;
#&amp;quot;CollID&amp;quot; for the collection identifier, &lt;br /&gt;
#&amp;quot;NumItemsTranscribed&amp;quot; for the number of items transcribed for that collection thus far in this round, and &lt;br /&gt;
#&amp;quot;NumTranscriptions&amp;quot; for the total number of transcriptions for the collection.&lt;br /&gt;
&lt;br /&gt;
A collection with multi-page items may have many more transcriptions than items (up to one per page).  Only the latest transcription is counted, as that's what's retrieved when you run these scripts.  Note that if you are selectively running TransLive on one collection after another, these output files will all be named with today's date and will hence overwrite one another (at present) -- so you may want to retrieve the output files between processing each collection for deletion.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Total Transcripts in u0003 and u0002 thus far ==&lt;br /&gt;
&lt;br /&gt;
In /srv/scripts/transcripts/ there are two scripts of interest:  '''numPagesAndTrans''' and '''getTransCount'''.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Run numPagesAndTrans first;'''  it will traverse the directories of u0003 (Hoole Manuscripts) and u0002 (Hoole Rare Books) and count the number of pages per item, and the number of transcripts (and OCR files) per item and per collection.  This information is stored in the InfoTrack database numItemPages table.   &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''getTransCount utilizes this information, so it should be run second.'''  What it does is report the information gathered from the previous script and will also yank the title of the collection out of the allColls table, formatting the results in the ./output directory as a dated tab-delimited text file intended to be opened in Excel or another spreadsheet software.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The columns of information are:  &lt;br /&gt;
&lt;br /&gt;
#&amp;quot;Collnum&amp;quot; for the collection identifier; &lt;br /&gt;
#&amp;quot;Title&amp;quot; for the collection title; &lt;br /&gt;
#&amp;quot;NumItems&amp;quot; for the number of items in the collection; &lt;br /&gt;
#&amp;quot;NumItemsTranscribed and/Or OCRd&amp;quot; for the number of items transcribed or OCRd in that collection; &lt;br /&gt;
#&amp;quot;NumPages&amp;quot; for the total page count for that collection; &lt;br /&gt;
#&amp;quot;NumTranscriptions&amp;quot; and &lt;br /&gt;
#&amp;quot;NumOCRd&amp;quot; for a breakdown of the &amp;quot;NumItemsTranscribed&amp;quot; column.  Some things may have been OCR'd and then transcribed (perhaps the OCR has not been removed), but for the most part, this will give you a sense of where the gaps are in terms of what should still be transcribed.  Generally, if we can't OCR it, it should be transcribed.&lt;/div&gt;</description>
			<pubDate>Wed, 11 Apr 2012 17:51:38 GMT</pubDate>			<dc:creator>Jlderidder</dc:creator>			<comments>http://www.lib.ua.edu/wiki/digcoll/index.php/Talk:Transcription_stats</comments>		</item>
	</channel>
</rss>