workflow virtualiza on for data intensive computa on wvdic
play

WorkflowVirtualiza/onforData IntensiveComputa/on (WVDIC) - PowerPoint PPT Presentation

WorkflowVirtualiza/onforData IntensiveComputa/on (WVDIC) SreekanthPothanis Scien/ficWorkflowsandDataGrids Scien/ficworkflows Managecomplexscien/fic applica/ons


  1. Workflow
Virtualiza/on
for
Data
 Intensive
Computa/on
 (WVDIC)
 Sreekanth
Pothanis


  2. Scien/fic
Workflows
and
Data
Grids
 • Scien/fic
workflows
 – Manage
complex
scien/fic
 applica/ons 

 – Integrate
compute
and
data
 sources
 – Generate
large
amounts
of
data
 • Cactus
simula/ons
 • Data
grids
 – Provide
long
term
storage
 – Enable
collabora/on
and
sharing
 – Provide
context
for
recovery


  3. Integra/on
with
Data
Grids
 • Automates
execu/on
of
workflows
 • Allows
staging
and
post
processing
 • Enables
automa/on
of
archival
of
produced
 data
sets
 • Simplifies
environment
set‐up



  4. Workflow
Virtualiza/on
 • Management
of
the
proper/es
 • Manage
interac/ons
with
each
workflow
system
 for
input
and
output
of
files.
 • Provides
higher
control
 • Enables
execu/on
of
complex
workflows
 spanning
mul/ple
different
workflow
systems
 • External
to
the
environment
that
actually
runs
 the
workflow
 – Increases
generality



  5. Workflow
Virtualiza/on
Server
(WVS)
 • Stand
alone
and
modular

 • External
to
any
workflow


  6. WVS:
Authen/ca/on
and
Context
 Handling
 • Handled
at
two
levels
 – Grid
level
to
perform
grid
transac/ons
 – OS
level
to
execute
workflows
 • Data
grid
context
 – Provides
informa/on
about
data
grid

 • User
privileges,
quotas
 • Workflow
context
 – Generated
during
the
execu/on
 • List
of
output
files,
des/na/on,
metadata


  7. WVS:
Staging,
Execu/on
and
post
 Processing
 • Sets
up
the
working
environment
before
 ini/a/ng
the
interfacing
module
 • Decreases
execu/on
/me
by
pipelining
where
 possible
 • Executed
by
invoking
appropriate
modules
 – Modularity
allows
high
level
of
customiza/on
 – Provides
higher
control
 • Handles
custom
post
processing
scenarios


  8. Integra/on
with
iRODS
 • Implemented
through
micro‐ services
and
rules
 – Client
interface
 • Client
design
and
configura/on
 – Configura/on
file
and
rules
 WORKFLOW=MAKEFLOW

 CONFIG=/tempZone/home/wfuser/test.makeflow

 INPUT=/tempZone/home/wfuser/capitol.jpg

 INPUT=/tempZone/home/wfuser/local.jpg

 INPUT=/tempZone/home/wfuser/meta.jpg

 DEST=/tempZone/home/wfuser/test_dest/

 METADATA=NAME1=VAL1

 METADATA=NAME2=VAL2


  9. Integra/on
with
iRODS
 • Server
Configura/on
 [MAKEFLOW]
path=/usr/local/cctools/redhat5/ – Authen/ca/on
 bin/makeflow

 args=
‐T
condor

 – Data
Transfer
 [MAKEFLOW]

 [MAKEFLOW1]

 path=/usr/local/Makeflow/bin/makeflow
 – Metadata

 args=
‐p
9876
 [MAKEFLOW1]

 – Module
execu/on
 #[KEPLER]

 #path=path
to
kepler

 • Interacts
with
iRODS
 #args=‐t
–P

 #[KEPLER]

 server
as
an
admin
 [PEGASUS]
path=/usr/local/Pegasus/Pegasus‐ plan

 path_to_sites.xml
=
/usr/local/Pegasus/sites.xml

 path_to_rc.data
/usr/local/Pegasus/rc.data
 path_to_tc.data
=
/usr/local/Pegasus/tc.data

 [PEGASUS]


  10. Conclusion
–
WVDIC
 • Automates
execu/on
of
workflow
 • Orchestrates
at
sub‐workflow
levels
across
 mul/ple
workflow
systems
 • Provides
a
generic
solu/on
 – Implemented
with
iRODS,
Makeflow,
Pegasus


  11. Thank
you


Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend