SLIDE 1 Metadata Reuse Workflows and Methods for DSpace Repositories Maureen P. Walsh Open Repositories 2013 Metadata Reuse Workflows and Methods for DSpace Repositories
Charlottetown, PEI July 12, 2013
SLIDE 2 Metadata Repurposing Workflows
- Background / Context
- Metadata Services and Archiving Options
- Metadata Repurposing Overview
- MARC Catalog Metadata
- Embedded Image Metadata
- EAD Finding Aid Metadata
- Printed Text Metadata
SLIDE 3 The Ohio State University’s Institutional Repository Knowledge Bank Mission: …to collect, preserve, and distribute the digitally formatted intellectual output
SLIDE 4
76 Communities
50,733 Items 108,440 Content Files
SLIDE 5
Knowledge Bank Archived Items
SLIDE 6 KB Metadata Application Profile
- Core set of metadata elements and
Dublin Core Metadata Element Set mappings to
- improve retrieval accuracy and resource
discovery
- facilitate multi-institutional
interoperability and quality control
- comply with the Open Archives Initiative
Protocol for Metadata Harvesting
- enable collection migration, import &
export between the Knowledge Bank and
- ther systems as necessary
SLIDE 7
KB Collection Core Element Set
SLIDE 8
Individual Item Submission
SLIDE 9
Customized Input Forms (Display)
SLIDE 10
Customized Input Forms (XML)
SLIDE 11
Customized Item Templates (Display)
SLIDE 12
Customized Item Templates (Dublin Core)
SLIDE 13
Importing Items in Batch
SLIDE 14
Batch Loading – DSpace Simple Archive Format
SLIDE 15
Batch Loading – Metadata CSV
SLIDE 16 Batch Loading – Dublin Core
DSpace Dublin Core XML DSpace Item Record
SLIDE 17 Building the Simple Archive Format
Custom Perl Scripts
http://hdl.handle.net/1811/46845
Stand-alone Java Tool
- Simple Archive Format Packager / SAFBuilder
https://wiki.duraspace.org/display/DSPACE/Simple+Archive+Format+Packager
SLIDE 18
Repurposing MARC Metadata
SLIDE 19 Repurposing MARC Metadata
MARC - Catalog Dublin Core - IR
SLIDE 20 Repurposing MARC Metadata
- XSLT Workflow
- Export Tab Delimited Records Workflow
SLIDE 21
XSLT Workflow
SLIDE 22 XSLT Workflow
[Truncated] XSLT
MarcEdit by Terry Reese Full example available at: http://hdl.handle.net/1811/47564
SLIDE 23 Export Tab Delimited Records Workflow
MarcEdit by Terry Reese
SLIDE 24
MarcEdit CSV Export
SLIDE 25
Batch Load CSV
SLIDE 26
SLIDE 27
Repurposing Embedded Image Metadata
SLIDE 28 Embedded Metadata Workflow
Adobe Photoshop
SLIDE 29 Extracting Metadata with ExifTool
ExifTool by Phil Harvey http://owl.phy.queensu.ca/~phil/exiftool/
Adobe Photoshop
SLIDE 30
ExifTool CSV Export
SLIDE 31
ExifTool CSV Export Mapping to DC
SLIDE 32
ExifTool ‘Targeted’ CSV Export
SLIDE 33
Batch Load CSV
SLIDE 34
Simple Archive Format Packager
SLIDE 35
SLIDE 36
Repurposing EAD Finding Aid Metadata
SLIDE 37 Repurposing EAD Metadata
- XSLT Workflow
- xml2csv Workflow
SLIDE 38 Repurposing EAD Metadata
EAD Online Finding Aid
SLIDE 39 XSLT Workflow
<oXygen/> XML Editor
<oXygen/> XML Editor
SLIDE 40 xml2csv Workflow
A7Soft xml2csv http://www.a7soft.com/xml2csv.html
SLIDE 41
SLIDE 42
Repurposing Printed Text Metadata
SLIDE 43
Delimited Text Workflow
SLIDE 44 Delimited Text Workflow
PSPad
SLIDE 45
Delimited Result
SLIDE 46 Thank You
Maureen P. Walsh
Associate Professor Institutional Repository Services Librarian The Ohio State University Libraries walsh.260@osu.edu