Transcripts

From UA Libraries Digital Services Planning and Documentation
(Difference between revisions)
Jump to: navigation, search
Line 49: Line 49:
 
==FAQ==
 
==FAQ==
  
# But what if you have 2 .wav files and 3 pages of transcripts? That is, what happens when a transcript page contains the transcription for part of each of the 2 .wav files?
+
* What if you have 2 .wav files and 3 pages of transcripts? That is, what happens when a transcript page contains the transcription for part of each of the 2 .wav files?
 +
 
  
 
In that case you might have a scenario like this: _0001.wav goes with _0001_001.txt and _0001_002.txt while _0002.wav goes with _0002_001.txt and _0001_002.txt.  
 
In that case you might have a scenario like this: _0001.wav goes with _0001_001.txt and _0001_002.txt while _0002.wav goes with _0002_001.txt and _0001_002.txt.  
 
In this case, _0001_002.txt and _0002_001.txt are the SAME document, simply existing twice with different file names. This still allows people to know what transcripts correspond  
 
In this case, _0001_002.txt and _0002_001.txt are the SAME document, simply existing twice with different file names. This still allows people to know what transcripts correspond  
to what media items based simply on the file name. We understand that this calls for more storage space to be used (given that an analog item exists as two distinct files),  
+
to what media items based simply on the file name.  
 +
 
 +
We understand that this calls for more storage space to be used (given that an analog item exists as two distinct files),  
 
but the greater concern is the removal of confusion regarding relation of items to one another.
 
but the greater concern is the removal of confusion regarding relation of items to one another.
 +
 +
 +
* What about a scenario in which there are 2 tiffs (of original analog materials) and 1 transcript file (in tiff format) which contains the transcription for both original tiffs? How will someone know what part of the transcription tiff matches with the respective portion of the scans of the analog materials?
 +
 +
In this case, you would again, of course, have a duplicate of the transcription file. But as to the question, while a person won't know exactly what portion of the transcription file to look at, they will still have to seek for the matching information only across one transcript file.
 +
 +
In other words, there would be 4 files:
 +
 +
* 2 in the Scans folder (one for each analog item):
 +
_0001.tif
 +
_0002.tif
 +
 +
* 2 in the Transcripts folder (both files are of the same document):
 +
_0001.tif
 +
_0002.tif

Revision as of 09:11, 25 June 2009

Contents

Transcript Folders and File Names

2

File Names

Often tiff/wave files do note have a one to one match with transcript files or vice-versa.

Below are three scenarios and the naming rules devised on 6/19/09 to allow the file namees themselves to denote a correlation between different file types that point to the same information (i.e. an audio interview and a transcript of that interview).


Situation 1: One to Many (One media file to many text files)

ex: 1 .wav file and 3 .txt transcript files

_0001.wav

_0001_001.txt

_0001_002.txt

_0001_003.txt


Situation 2: Many to One (Many media files to one text file)

ex: 3 .wav files and 1 .txt transcript file

_0001_001.wav

_0001_002.wav

_0001_003.wav

_0001.txt


Situation 3: One to One (One media file to one text file)

ex: 1 .wav file and 1 .txt transcript file

_0001.wav

_0001.txt


FAQ

  • What if you have 2 .wav files and 3 pages of transcripts? That is, what happens when a transcript page contains the transcription for part of each of the 2 .wav files?


In that case you might have a scenario like this: _0001.wav goes with _0001_001.txt and _0001_002.txt while _0002.wav goes with _0002_001.txt and _0001_002.txt. In this case, _0001_002.txt and _0002_001.txt are the SAME document, simply existing twice with different file names. This still allows people to know what transcripts correspond to what media items based simply on the file name.

We understand that this calls for more storage space to be used (given that an analog item exists as two distinct files), but the greater concern is the removal of confusion regarding relation of items to one another.


  • What about a scenario in which there are 2 tiffs (of original analog materials) and 1 transcript file (in tiff format) which contains the transcription for both original tiffs? How will someone know what part of the transcription tiff matches with the respective portion of the scans of the analog materials?

In this case, you would again, of course, have a duplicate of the transcription file. But as to the question, while a person won't know exactly what portion of the transcription file to look at, they will still have to seek for the matching information only across one transcript file.

In other words, there would be 4 files:

  • 2 in the Scans folder (one for each analog item):

_0001.tif _0002.tif

  • 2 in the Transcripts folder (both files are of the same document):

_0001.tif _0002.tif

Personal tools