Maureen P. Walsh Open Repositories 2013 Charlottetown, PEI - - PowerPoint PPT Presentation

maureen p walsh open repositories 2013 charlottetown pei
SMART_READER_LITE
LIVE PREVIEW

Maureen P. Walsh Open Repositories 2013 Charlottetown, PEI - - PowerPoint PPT Presentation

Metadata Reuse Workflows and Methods for Metadata Reuse Workflows and Methods for DSpace Repositories DSpace Repositories Maureen P. Walsh Open Repositories 2013 Charlottetown, PEI


slide-1
SLIDE 1

Metadata Reuse Workflows and Methods for DSpace Repositories Maureen P. Walsh Open Repositories 2013 Metadata Reuse Workflows and Methods for DSpace Repositories

Charlottetown, PEI July 12, 2013

slide-2
SLIDE 2

Metadata Repurposing Workflows

  • Background / Context
  • Metadata Services and Archiving Options
  • Metadata Repurposing Overview
  • MARC Catalog Metadata
  • Embedded Image Metadata
  • EAD Finding Aid Metadata
  • Printed Text Metadata
slide-3
SLIDE 3

The Ohio State University’s Institutional Repository Knowledge Bank Mission: …to collect, preserve, and distribute the digitally formatted intellectual output

  • f the University…
slide-4
SLIDE 4

76 Communities

50,733 Items 108,440 Content Files

slide-5
SLIDE 5

Knowledge Bank Archived Items

slide-6
SLIDE 6

KB Metadata Application Profile

  • Core set of metadata elements and

Dublin Core Metadata Element Set mappings to

  • improve retrieval accuracy and resource

discovery

  • facilitate multi-institutional

interoperability and quality control

  • comply with the Open Archives Initiative

Protocol for Metadata Harvesting

  • enable collection migration, import &

export between the Knowledge Bank and

  • ther systems as necessary
slide-7
SLIDE 7

KB Collection Core Element Set

slide-8
SLIDE 8

Individual Item Submission

slide-9
SLIDE 9

Customized Input Forms (Display)

slide-10
SLIDE 10

Customized Input Forms (XML)

slide-11
SLIDE 11

Customized Item Templates (Display)

slide-12
SLIDE 12

Customized Item Templates (Dublin Core)

slide-13
SLIDE 13

Importing Items in Batch

slide-14
SLIDE 14

Batch Loading – DSpace Simple Archive Format

slide-15
SLIDE 15

Batch Loading – Metadata CSV

slide-16
SLIDE 16

Batch Loading – Dublin Core

DSpace Dublin Core XML DSpace Item Record

slide-17
SLIDE 17

Building the Simple Archive Format

Custom Perl Scripts

  • Examples

http://hdl.handle.net/1811/46845

Stand-alone Java Tool

  • Simple Archive Format Packager / SAFBuilder

https://wiki.duraspace.org/display/DSPACE/Simple+Archive+Format+Packager

slide-18
SLIDE 18

Repurposing MARC Metadata

slide-19
SLIDE 19

Repurposing MARC Metadata

MARC - Catalog Dublin Core - IR

slide-20
SLIDE 20

Repurposing MARC Metadata

  • XSLT Workflow
  • Export Tab Delimited Records Workflow
slide-21
SLIDE 21

XSLT Workflow

slide-22
SLIDE 22

XSLT Workflow

[Truncated] XSLT

MarcEdit by Terry Reese Full example available at: http://hdl.handle.net/1811/47564

slide-23
SLIDE 23

Export Tab Delimited Records Workflow

MarcEdit by Terry Reese

slide-24
SLIDE 24

MarcEdit CSV Export

slide-25
SLIDE 25

Batch Load CSV

slide-26
SLIDE 26
slide-27
SLIDE 27

Repurposing Embedded Image Metadata

slide-28
SLIDE 28

Embedded Metadata Workflow

Adobe Photoshop

slide-29
SLIDE 29

Extracting Metadata with ExifTool

ExifTool by Phil Harvey http://owl.phy.queensu.ca/~phil/exiftool/

Adobe Photoshop

slide-30
SLIDE 30

ExifTool CSV Export

slide-31
SLIDE 31

ExifTool CSV Export Mapping to DC

slide-32
SLIDE 32

ExifTool ‘Targeted’ CSV Export

slide-33
SLIDE 33

Batch Load CSV

slide-34
SLIDE 34

Simple Archive Format Packager

slide-35
SLIDE 35
slide-36
SLIDE 36

Repurposing EAD Finding Aid Metadata

slide-37
SLIDE 37

Repurposing EAD Metadata

  • XSLT Workflow
  • xml2csv Workflow
slide-38
SLIDE 38

Repurposing EAD Metadata

EAD Online Finding Aid

slide-39
SLIDE 39

XSLT Workflow

<oXygen/> XML Editor

<oXygen/> XML Editor

slide-40
SLIDE 40

xml2csv Workflow

A7Soft xml2csv http://www.a7soft.com/xml2csv.html

slide-41
SLIDE 41
slide-42
SLIDE 42

Repurposing Printed Text Metadata

slide-43
SLIDE 43

Delimited Text Workflow

slide-44
SLIDE 44

Delimited Text Workflow

PSPad

slide-45
SLIDE 45

Delimited Result

slide-46
SLIDE 46

Thank You

Maureen P. Walsh

Associate Professor Institutional Repository Services Librarian The Ohio State University Libraries walsh.260@osu.edu