digioverload: In(di)gestion

Greetings from the borderlands of DSpace, where Rachel and I have been floating for the past week. Wish you were here!

After successfully digitizing dozens of documents, we turned eagerly to the "ingest" process--a.k.a. uploading digitized materials to their new forever home on the iSchool's digital repository. We gathered expert advice from Dr. Galloway about our organizational schema and consulted with students in her Problems in the Permanent Retention of Digital Records class about essential DSpace resources. We read through the DSpace manual and supporting documentation. We crossed our fingers and took deep breaths and practiced eating space food.

But we were still confused.

We decided we needed to see the ingest process in action before attempting it ourselves, so we talked with Sam, who suggested we talk with Sarah, who blessed us with several magical batch processing powers: the New Zealand Metadata Extraction tool and three marvelous perl scripts (which she originally received during her Problems project at the HRC), passed down through the DSpace generations like some elusive passcode.

Can we give Sarah a gold star for staying late with us in the IT lab to walk us through our first ingest pre-processing encounter? Despite the fact that she has this little dissertation thing to work on...

Once Sarah left, though, our luck ran out: we couldn't seem to launch the metadata extractor on the lab computers (something about missing Javascript), so we did what any NAS-space-hogging SOD student would do: we uploaded EVERYTHING to the network and prayed we'd be able to open the Extractor at home.

Fortunately, our dreams came true. The extractor launched beautifully on my laptop, and a few hurdles, several questions, one ActivePerl download--and about 8 hours of troubleshooting--later, all of our files are processed and waiting patiently on the NAS for the next chapter in their journey: ingest. We're hoping Sam (bless him!) will be available to walk us through these steps in the coming few days.

In the meantime, we're going to build the DSpace hierarchy according to our schema and get started on our reflection paper.

Updates to come, but for now, I'm a bit exhausted, so I'm going to pop a TUM for my in(di)gestion and try to get some DSleep.

p.s. We originally thought our hierarchy would look like this:

Sub-community: YEAR

Sub-sub community: Archives Week

Collection: YEAR Archives Week

Sub-sub community: Meeting Minutes

Collection: YEAR Meeting Minutes

Sub-sub community: Events

Collection: YEAR Events

Sub-sub community: Finances

Collection: YEAR Finances

Sub-sub community: Marketing

Collection: YEAR Marketing

Sub-sub community: Correspondence

Collection: YEAR Correspondence

Sub-sub community: Administrative Records

Collection: YEAR Administrative Records (includes Annual Reports?)

Then we decided it would look like this:

Sub-sub community: Events

Collection: YEAR

Collection: YEAR...etc.

Sub-sub community: Finances

Collection: YEAR

Collection: YEAR...etc.

And now we've decided it will look like this:

Sub community: SAA-UT Records

Collection: Administrative Records (including annual reports)

Collection: Meeting Minutes

Collection: Archives Week

Collection: Events

Collection: Finances

Collection: Marketing

Collection: Website

(with the years conveyed through the metadata)

Sub-sub community: Finances

Collection: YEAR

Collection: YEAR...etc.

Questions to be resolved:

-which materials should have access restrictions, based on the Briscoe's and iSchool's policies, and the FERPA regulations?

digioverload

Saturday, April 23, 2011

In(di)gestion

No comments:

Post a Comment

About Me

Followers