Adventures in Provenance Integration

SLIDE 1

Adventures in Provenance Integration

Elaine Angelino, Uri Braun, David A. Holland, Peter Macko, Daniel Margo, Margo Seltzer TaPP 2011

SLIDE 2

Integrating two provenance systems

  • StarFlow, a workflow environment for data analysis in Python
  • PASS, which manages provenance at the operating system level
  • Run StarFlow on top of PASS

SLIDE 3

The dream

We thought that PASS and StarFlow would get along like two peas in a pod ...

SLIDE 4

The reality

... But sometimes they were more like apples and oranges

SLIDE 5

Reconciliation

  • StarFlow and PASS each has its own world view
    – Different scopes, objects, granularity, goals
  • Each produces its own account of any particular execution
  • These accounts need to be reconciled semantically as well as physically

SLIDE 6

Non-existent objects

  • Problem: StarFlow records information about objects that do not yet exist
  • The OPM cannot express this
  • Solution: Placeholder objects
  • The new stands-for edge connects placeholders to reality
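
The placeholder idea can be sketched in a few lines of Python. This is only an illustration, not the PASS or StarFlow implementation: the `ProvGraph` class, the node names, the `wasGeneratedBy` label on the toy edge, and the direction of the stands-for edge (placeholder to real object) are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ProvGraph:
    """Toy provenance graph (hypothetical, for illustration only)."""
    nodes: dict = field(default_factory=dict)  # node id -> attributes
    edges: list = field(default_factory=list)  # (source, label, target)

    def add_node(self, node_id, **attrs):
        self.nodes[node_id] = attrs

    def add_edge(self, src, label, dst):
        # Both endpoints must already be in the graph, which is why a
        # not-yet-existing file needs a placeholder node.
        if src not in self.nodes or dst not in self.nodes:
            raise KeyError("both endpoints must exist in the graph")
        self.edges.append((src, label, dst))

    def resolve(self, node_id):
        """Follow a stands-for edge from a placeholder to the real object."""
        for src, label, dst in self.edges:
            if src == node_id and label == "stands-for":
                return dst
        return node_id  # not a placeholder, or not yet bound


g = ProvGraph()

# Before the run: the workflow already knows it will produce out.csv,
# so it records provenance against a placeholder.
g.add_node("proc:analyze")
g.add_node("placeholder:out.csv", placeholder=True)
g.add_edge("placeholder:out.csv", "wasGeneratedBy", "proc:analyze")

# After the run: the real file exists, and a stands-for edge binds
# the placeholder to it.
g.add_node("file:out.csv", placeholder=False)
g.add_edge("placeholder:out.csv", "stands-for", "file:out.csv")

real = g.resolve("placeholder:out.csv")
```

Queries that reach a placeholder can then call `resolve` to land on the real object once it exists, while records written before the run remain valid.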

SLIDE 7

Version disconnection

  • Problem: When StarFlow regenerates a file, it deletes the old file and writes a new copy
  • PASS sees no connection between the old and new objects
  • Solution: Explicit version edges
  • StarFlow must tell PASS about these relationships
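
A minimal sketch of the version-edge fix, under stated assumptions: `ToyStore` is a hypothetical stand-in for an OS-level provenance store that assigns a fresh object id to every created file, and the `version-of` label and the `regenerate` helper are invented for this example. Neither is a real PASS or StarFlow interface.

```python
import itertools


class ToyStore:
    """Hypothetical store: every created file gets a fresh object id."""

    def __init__(self):
        self._ids = itertools.count(1)
        self.current = {}  # path -> object id of the live file
        self.edges = []    # (newer id, label, older id)

    def create(self, path):
        obj = next(self._ids)
        self.current[path] = obj
        return obj

    def delete(self, path):
        # The store forgets the path; on its own it sees no link between
        # this object and whatever is created at the same path later.
        return self.current.pop(path)


def regenerate(store, path):
    """Delete and rewrite a file, then state the link explicitly."""
    old = store.delete(path)
    new = store.create(path)
    # The workflow layer knows these are versions of the same logical
    # file, so it has to tell the store.
    store.edges.append((new, "version-of", old))
    return old, new


store = ToyStore()
store.create("results.csv")
old, new = regenerate(store, "results.csv")
```

Without the appended edge, objects 1 and 2 would be unrelated as far as the store can tell, which is the disconnection described on this slide.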

SLIDE 8

Provenance of provenance

  • Problem: We let StarFlow add provenance records to the PASS database
  • PASS needs to keep track of where individual records came from
  • Solution: ???
  • What about the provenance of provenance of provenance of provenance of provenance …

SLIDE 9

Is this what provenance integration looks like?