Tagging Stats

From UA Libraries Digital Services Planning and Documentation


This script, located in /srv/scripts/tagging/, reads the extracted XML files in the ./XML directory (created by the extraction scripts). For each collection it counts the number of items that have tags and the number of tags per item (counting each tag once per item, regardless of how many times it was applied to that item), then checks the database for the total number of items in the collection. It writes a datestamped, tab-delimited text report to the ./output directory. The columns are as follows:

  1. Collnum: the collection number
  2. Total Items: total number of items in the collection (from the database)
  3. ItemsTagged: number of items that received at least one tag
  4. NumTags: number of distinct tags received
  5. TimesTagged: number of times something in this collection was tagged (includes duplicate tags)
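
The difference between the last three columns can be illustrated with a minimal Python sketch. This is not the script itself: the function names, the (item, tag) input format, and the choice to count NumTags as distinct item/tag pairs are all assumptions made for illustration. The XML parsing and the database lookup are left out, so the total item count is simply passed in.

```python
import csv
import io

# Column names as listed above.
COLUMNS = ["Collnum", "Total Items", "ItemsTagged", "NumTags", "TimesTagged"]


def collection_stats(collnum, total_items, taggings):
    """Build one report row for a collection.

    taggings: iterable of (item_id, tag) pairs, one pair per tagging event
    (hypothetical input format; the real script reads this from XML).
    total_items: item count for the collection (the real script gets this
    from the database).
    """
    events = list(taggings)
    # Assumption: a tag counts once per item, however often it was applied.
    distinct_pairs = set(events)
    items_tagged = {item_id for item_id, _tag in distinct_pairs}
    return {
        "Collnum": collnum,
        "Total Items": total_items,
        "ItemsTagged": len(items_tagged),
        "NumTags": len(distinct_pairs),
        "TimesTagged": len(events),  # duplicates included
    }


def write_report(rows, out):
    """Write the rows as tab-delimited text with a header line."""
    writer = csv.DictWriter(out, fieldnames=COLUMNS, delimiter="\t",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)


# Example: "dog" was applied twice to item1, so it counts once in NumTags
# but twice in TimesTagged. The collection number is made up.
sample = [("item1", "dog"), ("item1", "dog"), ("item1", "cat"), ("item2", "dog")]
row = collection_stats("u0001_0000001", 10, sample)
buffer = io.StringIO()
write_report([row], buffer)
print(buffer.getvalue())
```

If NumTags is instead meant to count distinct tags across the whole collection, the set would be built from the tag alone rather than from the (item, tag) pair.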