AllColls

From UA Libraries Digital Services Planning and Documentation
Revision as of 10:38, 28 July 2010 by Jlderidder (Talk | contribs)

Jump to: navigation, search

Here's the current database structure:

mysql> show columns from allColls;

 +-----------------+--------------+------+-----+---------+----------------+
 | Field           | Type         | Null | Key | Default | Extra          |
 +-----------------+--------------+------+-----+---------+----------------+
 | dnum            | int(30)      | NO   | PRI | NULL    | auto_increment | 
 | id_2009         | varchar(60)  | NO   |     | NULL    |                | 
 | title           | varchar(255) | YES  |     | NULL    |                | 
 | listLevel       | int(4)       | YES  |     | NULL    |                | 
 | parentID        | varchar(60)  | YES  |     | NULL    |                | 
 | sourceCollName  | varchar(255) | YES  |     | NULL    |                | 
 | mssNum          | varchar(10)  | YES  |     | NULL    |                | 
 | mssUrl          | varchar(80)  | YES  |     | NULL    |                | 
 | blurb           | blob         | YES  | MUL | NULL    |                | 
 | type            | varchar(30)  | YES  |     | NULL    |                | 
 | alphaBy         | varchar(100) | YES  |     | NULL    |                | 
 | AnalogOrDigital | char(1)      | YES  |     | NULL    |                | 
 | online          | char(1)      | YES  |     | NULL    |                | 
 | inAcumen        | char(1)      | YES  |     | NULL    |                | 
 | inCDM           | char(1)      | YES  |     | NULL    |                | 
 | iconLocation    | varchar(255) | YES  |     | NULL    |                | 
 | dateLive        | date         | YES  |     | NULL    |                | 
 | cannedLink      | varchar(255) | YES  |     | NULL    |                | 
 | cannedLink2     | varchar(255) | YES  |     | NULL    |                | 
 | hasPDFs         | char(1)      | YES  |     | NULL    |                | 
 | hasTranscripts  | char(1)      | YES  |     | NULL    |                | 
 | hasOCR          | char(1)      | YES  |     | NULL    |                | 
 | digFiles        | int(11)      | YES  |     | NULL    |                | 
 | objects         | int(11)      | YES  |     | NULL    |                | 
 | allBound        | char(1)      | YES  |     | NULL    |                | 
 | boundObjects    | int(11)      | YES  |     | NULL    |                | 
 | unboundDigFiles | int(11)      | YES  |     | NULL    |                | 
 | notes           | text         | YES  |     | NULL    |                | 
 | inLOCKSS        | date         | YES  |     | NULL    |                | 
 +-----------------+--------------+------+-----+---------+----------------+
 29 rows in set (0.23 sec)

The current query that pulls information for our collection browse page is:

 "SELECT title, iconLocation, blurb, online, cannedLink, mssUrl, id_2009, listLevel FROM InfoTrack.allColls WHERE alphaBy LIKE '".$sortVal."%' 
 AND online=1 AND listLevel=1 ORDER BY alphaBy LIMIT ".($page ? ($page*$_SESSION['max']).', ' : ).$_SESSION['max'].";"

-- where the LIMIT statement is PHP checking Session variables for the pagination feature.

This is our primary table for information about our collections, and as such, it is frequently altered and updated. The first entry of information here generally comes from the Collection_Information XML file created by Digital Services during digitization.

  1. dnum is simply an auto-incrementing number to serve as primary key.
  2. id_2009 is the identifier for this collection assigned according to our 2009 File_naming_schemes
  3. title is the title of the collection
  4. listLevel is currently either 1 (one) indicating it should appear in the online collection list, or NULL (it should not)
  5. parentID is the id_2009 - type identifier for the analog collection from which this digital collection was derived. This only applies to digital collections, and the analog collection may or may not be listed in this database.
  6. sourceCollName is the title of the analog collection from which this collection was derived.
  7. mssNum is the manuscript number of this collection, if applicable, in the form: MS xxxx where xxxx is left-padded with zeros to 4 places. For the u0003 (manuscript) collections, this number corresponds to the second set of numbers in the id_2009 identifier. That is to say, u0003_0000252 will have an mssNum of MS 0252.
  8. mssUrl is the link to the online finding aid from which this digital collection was derived. This link should NOT be entered in analog collections, as their URL will go into the cannedLink field instead.
  9. blurb is the description of this collection, most frequently derived from the abstract in the finding aid.
  10. type tells us what kind of icon to assign to this content, and how to count it in ARL preservation statistics. Currently supported options are: book, image, text, audio, video, mixed media, finding aid, score, other.
  11. alphaBy provides the information as to how to alphabetize this title. If, for example this is "John Smith's Papers", this value will likely be something like "Smith, John." From the EAD finding aids, this information is pulled from the "filing title" entry.
  12. AnalogOrDigital contains either "D" for digital, or "A" for analog. The latter is used for finding aids, or simply for reference to the manuscript collection.
  13. online equals 1 (one) if true, and NULL otherwise.
  14. inAcumen equals 1 (one) if this content is in Acumen, NULL otherwise.
  15. inCDM equals 1 (one) if this content is in CONTENTdm, NULL otherwise.
  16. iconLocation contains the URL to the icon assigned for this collection. If no icon was supplied, the icon assigned is based on the value of the type field above.
  17. dateLive is a yyyy-mm-dd form of date to indicate when this collection went live on the web. Early collections are estimated dates only.
  18. cannedLink is a URL to the collection online
  19. cannedLink2 is only used if the collection is online in more than one location. For example, the collections in CONTENTdm will have links here, for they are also in Acumen, and the Acumen display is primary.
  20. hasPDFs equals one(1) if this collection has PDFs associated with the content.
  21. hasTranscripts equals one(1) if this collection has Transcripts associated with the content.
  22. hasOCR equals one(1) if this collection has OCR associated with the content.
  23. digFiles is supposed to hold the final count of all archival digital files currently available for this collection, but this number is not frequently updated or used.
  24. objects is supposed to hold the final count of all intellectual items currently available in this collection, but this number is not frequently updated or used.
  25. allBound is for ARL preservation stats, to indicate that all the contents of this collection are bound items, and should be counted as such. One (1) means yes.
  26. boundObjects is for ARL preservation stats. If a collection is only partially composed of bound objects, then we must have a count of how many items are bound. That goes here.
  27. unboundDigFiles is for ARL preservation stats. If a collection is only partially composed of bound objects, then we must have a count of how many files in it come from the unbound items. That goes here.
  28. notes is a free-text field for additional information of any sort.
  29. inLOCKSS contains a date (yyyy-mm-dd) of when this collection was released to LOCKSS partners for harvesting (if indeed it has yet).


Additional fields will be added for the donor information at some point, and some of these fields will likely go away, but this is our current database setup as of July 2010.

--Jlderidder 10:38, 28 July 2010 (CDT)

Personal tools