Separating Execution and Data Management: A key to - - PowerPoint PPT Presentation

separating execution and data management a key to
SMART_READER_LITE
LIVE PREVIEW

Separating Execution and Data Management: A key to - - PowerPoint PPT Presentation

Separating Execution and Data Management: A key to Business-Process-as-a-Service (BPaaS) Yutian Sun Jianwen Su Jian Yang UC Santa Barbara Macquarie U Outline of the Talk The technical problem for supporting BPaaS


slide-1
SLIDE 1

Separating Execution and Data Management: A key to Business-Process-as-a-Service (BPaaS)

Yutian Sun Jianwen Su Jian Yang

UC Santa Barbara Macquarie U

slide-2
SLIDE 2

Outline of the Talk

The technical problem for supporting BPaaS Self-guided artifacts The SeGA Framework (support for BPaaS) Conclusions

2014/9/9 BPM '14 2

slide-3
SLIDE 3

A BP is an assembly of tasks to accomplish an objective Eg: Obtaining a Permit Each workflow model matches a BP model Each workflow activity is a software program ( ) that

interfaces one task in BP

WfM system manages executions, resources,

documents, etc.

Business Processes & Workflow Management

2014/9/9 BPM '14 3

Application Init rev Review Approval Fee Certificate Delivery

BP

Workflow Management (WfM) System

workflow

slide-4
SLIDE 4

Application Example: Property Management

A Housing Management Bureau (HMB

) manages

titles, licenses, permits, … for a region

Each HMB runs its own workflow system for its business 30 provinces in China, each province has 10-50 HMBs Those hundreds HMB workflow systems are individually

maintained—very costly!

Can one workflow system

support multiple HMBs?

Note:

(1) BPs are similar but not identical (provincal/local policies); data sets are mostly disjoint (e.g., data for buildings) (2) BPs change often (e.g., low income housing added in 2013)

2014/9/9 BPM '14 4

HMB 1

. . . Workflow System

Service Provider HMB 2

slide-5
SLIDE 5

A workflow system typically maintains execution states,

control flow related data, correlations, etc.

in a local data store

Workflow Systems and Local Data

2014/9/9 BPM '14 5

Execution engine

Local data store

Enterprise database Task wrapper

. . .

Task wrapper Task wrapper

Workflow System

[van der Aalst-van Hee 2004] Includes all data required for control flow decisions, correlations, …

slide-6
SLIDE 6

Workflow Mgmt as Services? Not Effective

With straight-forward approach, the service provider has

to run one WfM system for each client HMB due to disparate local data for each HMB’s workflow instances

Arguments for keeping data at local sites: Analytics with local enterprise data + WF control data Heterogeneity of data across different local sites HMB’s want to hold their own data

2014/9/9 BPM '14 6

Hangzhou HMB

Enterprise Data Store 1

Yiwu HMB

. . .

Local 1 WfM system 1 WfM system 2 Local 2 Enterprise Data Store 2

. . .

Service Provider

slide-7
SLIDE 7

Our Goal: Separate processing from data

Only one WfM system that maintains no data Keep each HMB’s data separated Independent decision where/how Hangzhou, Yiwu

maintain their data

A key enabling idea: For each WF step, ship relevant data to WfM System How to know which data is relevant? Use BP model with holistic notion of data + process

2014/9/9 BPM '14 7

Hangzhou HMB

Ent DS 1 & Local 1

. . .

Virtual WfM System

  • Ent. DS 2

& Local 2

Service Provider Yiwu HMB

slide-8
SLIDE 8

2014/9/9 BPM '14 8

Self-Guided Artifacts (sga)

A traditional business artifact (model):

(Entity information model, Entity lifecycle model)

A self-guided artifact:

(BP specification, Entity, States, Dependencies, L)

A document with the specification of the entity lifecycle model actual business data correlations resources current states ID of modeling language

A self-guided artifact contains everything an engine needs

slide-9
SLIDE 9

The SeGA Framework

Key idea: a process wrapper to supply

all data (i.e., self-guided artifact) when the engine needs to run

Both Barcelona and EZ-Flow are integrated with SeGA

2014/9/9 BPM '14 9

SeGA Dispatcher Barcelona Engine SeGA Repository

  • 1. incoming

event

  • 3. send the sga to

the mediator

SeGA Mediator

  • 2. fetch the

self-guided artifact (sga)

  • 4. decompose sga and

put them into the right places and send the incoming event

  • 5. engine performs

a step and possibly sending

  • utgoing event
  • 6. fetch all data and

assemble into an sga

  • 7. store sga back into

the repository

slide-10
SLIDE 10

Supporting BPaaS

WfM system maintains no local

data*

SeGA repository can be merged

into enterprise db

2014/9/9 BPM '14 10

SeGA Dispatcher Barcelona Engine SeGA Repository

  • 1. incoming

event

  • 3. send the sga to

the mediator

SeGA Mediator

  • 2. fetch the sga
  • 4. decompose sga and

put them into the right places and send the incoming event

  • 5. engine performs

a step and possibly sending

  • utgoing event
  • 6. fetch all data and

assemble into an sga

  • 7. store sga back

into the repository

SeGA Dispatcher SeGA Repository

  • 2. fetch the sga

Hangzhou HMB YiWu HMB WfM System (Service Provider)

[Sun-Su-Wu-Yang ICDE 2014]

* in current prototype some data stored in Server due to ID mgmt issue

slide-11
SLIDE 11

SeGA Prototype and Current State

A prototype developed that works

with Barcelona and EZ-Flow

Two operational test examples:

Hangzhou HMB’s (cooperative) BPs:

Early-sell permit approval (Barcelona via SeGA) Maintenance space check (EZ-Flow via SeGA) A new project funded by Ministry of Sci. & Tech (China),

Zhejiang Province, and Hangzhou City (> 1.2m RMB):

To implement an operational prototype with

a small number of HMBs

2014/9/9 BPM '14 11

slide-12
SLIDE 12

Summary

Main difficulty of providing BPaaS lies in data

management

Workflow systems need to have a holistic approach to

managing all of its data

Self-guided artifact is one such approach SeGA elevates existing systems to support:

Independence of Data and Execution Management

With the principle, BPaaS is actually easy to support Basis for moving workflow services to cloud while

enabling local sites to maintain their data

2014/9/9 BPM '14 12

slide-13
SLIDE 13

Some research challenges

Conceptual data modeling for BPs beneficial, but

SeGA raises many technical questions to be addressed:

Inefficient to ship large sega instance due to large data sets

Identify subset of artifact data needed for a given WF step?

Parallelism/txn consistency – what if 2 WfM steps require

  • verlapping sets of artifact instances?

Use locking, and also subsets of artifact data

Auto-ID issue: current BP systems typically use DBMS to

perform ID mgmt

Current SeGA prototype: retains some data in WfM server For general solution the virtual WfMS server needs to

explicitly manage artifact IDs Re-thinking WfMS arch. to include the above will enable

robust support for the data independence principle

2014/9/9 BPM '14 13