<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/css" href="http://www.lib.ua.edu/wiki/digcoll/skins/common/feed.css?97"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
	<channel>
		<title>Transcription - Revision history</title>
		<link>http://www.lib.ua.edu/wiki/digcoll/index.php?title=Transcription&amp;action=history</link>
		<description>Revision history for this page on the wiki</description>
		<language>en</language>
		<generator>MediaWiki 1.11.0</generator>
		<lastBuildDate>Sat, 18 May 2013 19:14:53 GMT</lastBuildDate>
		<item>
			<title>Jlderidder: New page: For translations or transcriptions to be in a form that we can use:  # We need a separate digital text (txt extension, not doc, docx, or rtf ) file for each page, named the same as what th...</title>
			<link>http://www.lib.ua.edu/wiki/digcoll/index.php?title=Transcription&amp;diff=4150&amp;oldid=prev</link>
			<description>&lt;p&gt;New page: For translations or transcriptions to be in a form that we can use:  # We need a separate digital text (txt extension, not doc, docx, or rtf ) file for each page, named the same as what th...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;For translations or transcriptions to be in a form that we can use:&lt;br /&gt;
&lt;br /&gt;
# We need a separate plain-text file (.txt extension, not .doc, .docx, or .rtf) for each page, with the same name that the digital page will have.&lt;br /&gt;
# Each file must be encoded in UTF-8 (not what Windows calls &amp;quot;Unicode&amp;quot;, which is UTF-16). Enter diacritics as UTF-8 characters, using either the Windows Character Map or BabelMap. We recommend Notepad++ for creating each text file.&lt;br /&gt;
&lt;br /&gt;
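The two requirements above can be checked automatically before upload. The following is only a minimal sketch, not part of our workflow: the function name check_transcript and the constant ALLOWED_SUFFIX are hypothetical, and it assumes that a UTF-16 file starts with a byte order mark, which is typical of files saved from Windows editors but not guaranteed.&lt;br /&gt;

```python
from pathlib import Path

# Hypothetical constant: the only extension the guidelines accept.
ALLOWED_SUFFIX = ".txt"


def check_transcript(path):
    """Return a list of problems found in one transcript file (empty if none).

    Hypothetical helper for illustration only.
    """
    problems = []
    p = Path(path)

    # Requirement 1: plain text with a .txt extension (not doc, docx, or rtf).
    if p.suffix.lower() != ALLOWED_SUFFIX:
        problems.append("extension is not .txt")

    data = p.read_bytes()

    # Requirement 2: UTF-8, not UTF-16. A UTF-16 byte order mark suggests
    # the file was saved with the Windows "Unicode" option.
    if data.startswith((b"\xff\xfe", b"\xfe\xff")):
        problems.append("looks like UTF-16, not UTF-8")
    else:
        try:
            data.decode("utf-8")
        except UnicodeDecodeError:
            problems.append("not valid UTF-8")

    return problems
```

Calling check_transcript on a correctly saved file returns an empty list. Any other result names what to fix before the file goes into the Transcripts directory.&lt;br /&gt;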
&lt;br /&gt;
If the transcripts are in analog form, the process is this:&lt;br /&gt;
&lt;br /&gt;
# Digitize the transcripts&lt;br /&gt;
# OCR the images.  OCR captures only about 80% of plain text and will garble words that contain diacritics.  &lt;br /&gt;
# Review the OCR text, compare it with the original files that were transcribed, and divide the content by page.  Create a separate text file for each original handwritten page and give it the same name as that page, but ending in the extension &amp;quot;.ocr.txt&amp;quot; (see the naming requirement above).&lt;br /&gt;
# If you correct the OCR errors in these text files, remove &amp;quot;.ocr&amp;quot; from the file extension so that they end simply in &amp;quot;.txt&amp;quot;.&lt;br /&gt;
# Unless the transcript images are of research value themselves, delete them (as well as the intermediate OCR content).&lt;br /&gt;
# Place the results in the Transcripts directory for upload.&lt;/div&gt;</description>
			<pubDate>Wed, 03 Oct 2012 15:54:03 GMT</pubDate>			<dc:creator>Jlderidder</dc:creator>			<comments>http://www.lib.ua.edu/wiki/digcoll/index.php/Talk:Transcription</comments>		</item>
	</channel>
</rss>