CS 644: Introduction to Big Data Chapter 8. Enabling Big-data Scientific Workflows in High-performance Networks
1
CS 644: Introduction to Big Data Chapter 8. Enabling Big-data - - PowerPoint PPT Presentation
CS 644: Introduction to Big Data Chapter 8. Enabling Big-data Scientific Workflows in High-performance Networks Chase Wu New Jersey Institute of Technology 1 Outline Introduction Challenges & Objectives A Three-layer
1
2
3
4
Visualization channel Visualization control channel Computation steering channel
small portion of parameters
visualization
analysis
Supercomputer or Cluster Client
Ø One day: 1 terabyte datasets
Ø About 8 hours
Ø TCP-based transfer tools: up to one week
5
Visualization: several hours or days
Energy Camera), etc.
6
(tiled display, powerwall, etc.), network infrastructure, etc.
7
8
9
10
11
12
An Overview of TCP/IP Stack
13
14
15
16
17
18
19
Bandwidth
Demand Dynamic Optical Bypass Unified Recovery
Unified Control Plane
Switch Abstraction Networking Applications
VIRTUALIZATION (SLICING) PLANE Underlying Data Plane Switching
Traffic Engineering Application
QoS
Packet Switch Packet Switch Wavelength Switch Time-slot Switch Multi-layer Switch
Packet & Circuit Switch Packet & Circuit Switch
20
Onix / Network OS
Network Hypervisor
Real States Logical States Abstractions Mapping Control Commands Distributes, Configures Network Info Base
Abstraction
Provides
Provides
OpenFlow
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39 Residual Bandwidth
[0] t [2] t [1] t [3] t
bandwidth
across multiple time slots
the same (fixed) bandwidth
varying bandwidths across multiple time slots
40
41
42
43
44
1 1
m
j
j
w
1 j
1 j
Computational complexity
j
1 j
Module
Recv data : Send data :
( , ) G V E = | | V n =
1 1
n
k
Link bandwidth:
, h k
h
Min link delay:
, h k
Node processing power
k
Network
, , ,
i i
w w i j i j i j
45
Link failure rate:
, h k
Node Failure rate
Workflow
k
c
wv w v V
Î
,
,
j i j
s f w e j w i j w
,
,
i j i
s f e w i w i j w
wv
wv
46
comp
w w w c v
f w s w
t v t
:( )( ) 0
f s w w w
v wv w V t t t t
Î
, , , tran , , , , , ,
i j i j h k i j h k h k i j w h k c h k
, , ,
, ,
f ei j s h k ei j
t i j h k l t
, , , ,
:( )( ) 0
h k i h j k f s i j w e e i j i j
l w v w v e E t t t t
Î
47
1
ED 1 2 comp tran ( , ) 1 [ ] , 1 [ ] , 1 1 [ ], [ 1] [ ], [ 1] [ ], [ 1]
i i i j j i
q q g e g g i i q w w j P i i j g j P i i i i i P i P i P i P i i P i P i
+
=
Î ³ + + + + = +
2 q-
48
, , ' ', ' ' ', '
BN ' comp ' ' , , , , ', ' tran , ', ' , , ', ' ', '
i i i w j k w i w j k w i c j k i c j k
w c w w i i i i i w V e E w V e E j k j k j k j k j k v V l Ec v V l Ec j k j k
Î Î Î Î Î Î Î Î
, , , ,
, mappedon mappedon , ,
i j h k i h i w h c i j w h k c
h k h e l w v w V v V e E l E
Î Î Î Î
49
ED all possible mappings
BN all possible mappings
50
Ø No Node Reuse (MED-NNR) Ø Contiguous Node Reuse (MED-CNR) Ø Arbitrary Node Reuse (MED-ANR)
Ø No Node Reuse or Share (MFR-NNR) Ø Contiguous Node Reuse and Share (MFR-CNR) Ø Arbitrary Node Reuse and Share (MFR-ANR)
51
A rbi t rary node reuse
Source Destination
N
C
M0 M1 Mn-5 Mn-4 Mn-2 Mn-1 Mn-3 Vn-2 M2 Vp+1 Vq+1 Vp Vq V1 V5 Vs Vd Source Destination M0 M1 Mn-5 Mn-4 Mn-2 Mn-1 Mn-3 Vn-2 M2 Vp+1 Vq+1 Vp Vq V1 V5 Vs Vd Source Destination M0 M1 Mn-5 Mn-4 Mn-2 Mn-1 Mn-3 Vn-2 M2 Vp+1 Vq+1 Vp Vq V1 V5 Vs Vd
52
No Node Reuse Contiguous Node Reuse Arbitrary Node Reuse Objective Function Constraints Minimum End-to-end Delay NP-complete NP-complete Polynomial (Dyn. Prog.) Maximum Frame Rate NP-complete NP-complete NP-complete
53
54
( ( ))
, , , , , , , , ( ), 1to 1, 1 ( ( )) ,
( , ) max 1 (1 ) (1 ) (1 ) if ( ) ( , ) max ( ) ( , ) max
j j u j i v V pre w mapping j j j
h u u j u j h i h i j u h i i h i w w j i i j w pre w j m h n v suc v i h u w w j h h
T z B e l d f f h i b z A w v T p T z A w v p l l
" Î
Î =
Î
æ ö ç ÷ ç ÷ ç ÷ × ç ÷ + Ù = -
¹ ç ÷ ç ÷ × = ç ÷ ç ÷ è ø × ! F F F F F 1 (1 ) (1 ) if
j u i
f h i æ ö ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ æ ö ç ÷ ç ÷ Ù = -
= ç ÷ ç ÷ ç ÷ ç ÷ ç ÷ è ø è ø F F F F F
55
The DP table with separated layers in . Layered workflow in a topological sorting.
vs=v0 vd=vn-1 v1 v2 v3 v4 ... w0 w1 w2 w3 w4 w5 …... ws wm-1 …... …... …... …... …... ... …... layer 0 vs=v0 vd=vn-1 v1 v2 v3 v4 ... w0 w1 w2 w3 w4 w5 …... ws wm-1 …... …... …... …... ... …... layer 1 vs=v0 vd=vn-1 v1 v2 v3 v4 ... w0 w1 w2 w3 w4 w5 …... ws wm-1 …... …... …... …... …... ... …... layer 2 vs=v0 vd=vn-1 v1 vy vp vq ... w0 w1 w2 w3 wl wt ws wm-1 …... …... …... …... …... ... …... layer l-2 vs=v0 vd=vn-1 w0 w1 w2 w3 wswm-1 …... …... …... …... ... …... layer l-1
…... T0,0 T1,1 T2,2 T2,4 T3,3 T4,5
v1 vy vp vq ... …...
Tq,s
Tn-1,m-1
…... …... …... wl wt
Ty,lTp,t
LDP-F
vs v3 v1 vd vq vx vp
…... …...
w0
wm-1
w2 wl wt
…... …... …...
Workflow Computer network
w1 ws vy v2 w3
…...
w5 w4
layer 0 layer 1 layer 2
…...
layer l-1 layer l-2
v4 v5
Jet air flow dynamics (pressure, raycasting) TSI explosion (density, raycasting)
56
57
58
59
60
61
62