validation tool Alecs Geuder SCAPE Information Day British Library, - - PowerPoint PPT Presentation

validation tool
SMART_READER_LITE
LIVE PREVIEW

validation tool Alecs Geuder SCAPE Information Day British Library, - - PowerPoint PPT Presentation

Flint a format and file validation tool Alecs Geuder SCAPE Information Day British Library, UK, 14 th July 2014 Introducing Flint: Presentation Structure Introduction What does Flint do? Flint-the-API Policy-focused


slide-1
SLIDE 1

Flint – a format and file validation tool

Alecs Geuder SCAPE Information Day British Library, UK, 14th July 2014

slide-2
SLIDE 2

Introducing Flint: Presentation Structure

  • Introduction
  • What does Flint do?
  • Flint-the-API
  • Policy-focused Validation
  • Flint-the-toolbox
  • Format-specific Implementations
  • How we are using it
  • Mini-demo
slide-3
SLIDE 3

Introduction

  • Flint facilitates [file/format validation against a policy]
  • the code centres on individual file format modules (pdf,

epub, ..)

  • Comes with a command line interface, GUIs and a

hadoop mapreduce program

slide-4
SLIDE 4

FLint – core features

Schematron Policy

  • categoryA – three tests
  • categoryB – two tests

Input file

  • f specific format

PolicyAware

(Uses schematron-utils)

categoryC – two tests

Format specific Implementation

  • canCheck
  • validationResult
  • ..

<checkresult file=“input file“ result=“passed”>

<categoryA result=“passed”/> <categoryB result=“failed”/> <testB.1 result=“failed”/> <testB.2 result=“failed”/> <categoryC result=“passed”/>

</checkresult>

configuration code Set of internal & third party tools

slide-5
SLIDE 5

The FLint ecosystem

config code

CLI GUIs hadoop PDF EPUB Geospatial data … Entry points Format/Feature specific Implementations

CORE

DRM-detection PDF/EPUB

Input file <checkResult>

slide-6
SLIDE 6

How we are using it

  • To deal with non print legal deposit

What’s next

  • Add additional format/feature modules (geospatial, etc..)
slide-7
SLIDE 7

Mini-demo