The Fast & the FRDR Tuesday June 11th, 2019 Alex Garnett, Simon - - PowerPoint PPT Presentation

the fast amp the frdr
SMART_READER_LITE
LIVE PREVIEW

The Fast & the FRDR Tuesday June 11th, 2019 Alex Garnett, Simon - - PowerPoint PPT Presentation

The Fast & the FRDR Tuesday June 11th, 2019 Alex Garnett, Simon Fraser University, garnett@sfu.ca; Lee Wilson, Portage, ACENET, lee.wilson@ace- net.ca; Clara Turp, McGill University, clara.turp@mcgill.ca; Julienne Pascoe, Library &


slide-1
SLIDE 1

The Fast & the FRDR

Alex Garnett, Simon Fraser University, garnett@sfu.ca; Lee Wilson, Portage, ACENET, lee.wilson@ace- net.ca; Clara Turp, McGill University, clara.turp@mcgill.ca; Julienne Pascoe, Library & Archives Canada, julienne.pascoe@canadiana.ca

Tuesday June 11th, 2019

slide-2
SLIDE 2

Agenda

  • Background information
  • Discovery and Metadata
  • The Fast & the FRDR
slide-3
SLIDE 3

Background

slide-4
SLIDE 4

The Portage Network

  • The Portage Network:
  • Canadian Association of Research Libraries (CARL)
  • Shared stewardship of research data in Canada
  • Networks of experts and working groups
slide-5
SLIDE 5

Portage Expert Groups

  • Data Management Planning
  • Curation
  • Data Discovery
  • Preservation
  • Training
  • Research Intelligence
  • Data Repositories
slide-6
SLIDE 6

Data Discovery Expert Group

“The main purpose of the Data Discovery Expert Group (DDEG) is to support data creators and curators in planning, producing, and managing descriptive metadata to enable the effective discovery and reuse of research data across a wide range of disciplines.”

slide-7
SLIDE 7

FRDR Discovery Service Working Group

  • Federated Research Data Repository
  • Partnership with Compute Canada
  • 100 000 metadata records from over 40 repositories
  • https://www.frdr.ca/repo/
  • FRDR Discovery Service Working Group
  • Appropriate description and discoverability
  • Discoverability for the widest audience
slide-8
SLIDE 8

Discovery & Metadata

slide-9
SLIDE 9

Discovery

  • Discovery: research data to support new research
  • The role of metadata
  • FRDR as aggregation service
slide-10
SLIDE 10

Harvested metadata

  • Challenges in normalizing metadata
  • Crosswalks built for standard fields
  • Subject/Keywords
slide-11
SLIDE 11
slide-12
SLIDE 12
slide-13
SLIDE 13

The Fast & the FRDR

slide-14
SLIDE 14

FAST

  • "Faceted Application of Subject Terminology"
  • Derived from Library of Congress Subject Headings
  • Maintained by OCLC
  • Why did we choose FAST?
  • Fairly widely implemented with a mature basis
  • Sufficiently generic, wide-reach
  • Good open APIs, potential for linked data
slide-15
SLIDE 15

Open Refine

  • Open source Google project
  • Powerful for this use case:

Messy dataset

Reconciliation service using open APIs

Using a semi-mediated process https://github.com/cmharlow/fast-reconcile https://github.com/OpenRefine/OpenRefine/wiki/Reconciliation

slide-16
SLIDE 16
slide-17
SLIDE 17
slide-18
SLIDE 18

Challenges & Next Steps

slide-19
SLIDE 19

Challenges

  • Large-Scale and ongoing work
  • What about unmapped keywords?
  • Integrating this work with the FRDR harvesting process?
slide-20
SLIDE 20

Fast & FRDR W orking Group

  • Define guidelines for mapping subject keywords that do not

correspond directly with the FAST vocabulary

  • Work collaboratively and asynchronously using OpenRefine to map

subject keywords to FAST vocabulary

  • Peer review mappings to ensure consistency and quality
  • Develop a work plan for unmapped subject keywords
  • Develop a work plan to ingest those mappings back into the

repository and workflows to map new metadata harvests.

slide-21
SLIDE 21

Discussion Points

  • The Bilingual Canadian context

Mapping French keywords (RVM)

  • Choosing policies and practices:

Granularity: Royal canadian mounted police superannuaction account

Standardization

Abbreviations, names, numbers, multivalued data

slide-22
SLIDE 22

Questions?

Thank you!