SpeedLight: Synchronized Network Snapshots
Nofel Yaseen, John Sonchack, Vincent Liu
1
SpeedLight: Synchronized Network Snapshots Nofel Yaseen , John - - PowerPoint PPT Presentation
SpeedLight: Synchronized Network Snapshots Nofel Yaseen , John Sonchack, Vincent Liu 1 Network Measurements 2 Network Measurements Measurements are how we understand networks Operators: configuration, management and provisioning
Nofel Yaseen, John Sonchack, Vincent Liu
1
2
3
A B X Y
4
A B X Y
5
What is the reason for this packet drop?
A B X Y
6
7
8
9
Congestion
10
Congestion Poor Load Balancing
11
Congestion Poor Load Balancing
12
Congestion Poor Load Balancing
13
Congestion Poor Load Balancing
14
Congestion Poor Load Balancing
15
possible to an actual state (<RTT)
A set of data-plane measurements that capture the state of the network at ~(single point in time)
16
A set of data-plane measurements that capture the state of the network at ~(single point in time)
17
plane
A set of data-plane measurements that capture the state of the network at ~(single point in time)
18
19
Figure adapted from Linh T. X. Phan
Event B0 Event B1 Event B2 Event B3 Event A0 Event A1 Event A2 Event A3
A B
20
Figure adapted from Linh T. X. Phan
Event B0 Event B1 Event B2 Event B3 Event A0 Event A1 Event A2 Event A3
A B
Inconsistent cut
21
Figure adapted from Linh T. X. Phan
Event B0 Event B1 Event B2 Event B3 Event A0 Event A1 Event A2 Event A3
A B
Inconsistent cut Consistent cut
22
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1
23
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1
24
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1 SS# 2
25
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1 SS# 2
26
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1 SS# 2 SS# 2
27
A B C
Figure adapted from Linh T. X. Phan
SS# 1 SS# 1 SS# 1 SS# 2 SS# 2 SS# 2
28
29
30
Observer
Challenge 1: Chandy- Lamport provides no guarantee of synchrony
31
Observer
Challenge 1: Chandy- Lamport provides no guarantee of synchrony
32
Observer
Take SS# n at time t
Challenge 1: Chandy- Lamport provides no guarantee of synchrony
33
Observer
Take SS# n at time t
CPU ASIC
Challenge 1: Chandy- Lamport provides no guarantee of synchrony
34
Observer
Take SS# n at time t
CPU ASIC
Challenge 1: Chandy- Lamport provides no guarantee of synchrony
35
Challenge 2: CL assumes single-threaded nodes, FIFO channels
ASIC
Observer CPU
Figure from P4 language Specification
36
Challenge 2: CL assumes single-threaded nodes, FIFO channels
ASIC
Observer CPU
37
units and priority channels
Challenge 2: CL assumes single-threaded nodes, FIFO channels
ASIC
Observer CPU
units and priority channels
38
Challenge 2: CL assumes single-threaded nodes, FIFO channels
ASIC
Observer CPU
units and priority channels
Ethernet IP Snapshot TCP/UDP Data
39
Challenge 3: CL assumes general purpose CPUs
40
Challenge 3: CL assumes general purpose CPUs
41
42
43
44
0.2 0.4 0.6 0.8 1 1 10 100 1000 10000
CDF Synchronization (us)
Speedlight Polling
45
0.2 0.4 0.6 0.8 1 1 10 100 1000 10000
CDF Synchronization (us)
Speedlight Polling
Median: 6.4μs
46
0.2 0.4 0.6 0.8 1 1 10 100 1000 10000
CDF Synchronization (us)
Speedlight Polling
Median: 6.4μs Median: 3500 μs
47
48
20 40 60 80 100 10 100 1000 10000 Synchronization (us) Number of Routers
49
20 40 60 80 100 10 100 1000 10000 Synchronization (us) Number of Routers
length of the tail
50
51
52
Computational Resources Stateless ALUs 24 Stateful ALUs 11
53
Computational Resources Stateless ALUs 24 Stateful ALUs 11 Memory Resources SRAM 770 kB TCAM 244 kB
54
SpeedLight Snapshots Polling
55
SpeedLight Snapshots Polling
56
SpeedLight Snapshots Polling
57
SpeedLight Snapshots Polling
ECMP
58
59
0.2 0.4 0.6 0.8 1 50 100 150 200 250 CDF Standard Deviation (ms) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots
Hadoop
60
0.2 0.4 0.6 0.8 1 50 100 150 200 250 CDF Standard Deviation (ms) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots
Hadoop
between ECMP and flowlets.
stddev
61
0.2 0.4 0.6 0.8 1 50 100 150 200 250 CDF Standard Deviation (ms) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots 0.2 0.4 0.6 0.8 1 20 40 60 80 100 CDF Standard Deviation (us) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots
Hadoop Memcache
between ECMP and flowlets.
stddev
62
0.2 0.4 0.6 0.8 1 50 100 150 200 250 CDF Standard Deviation (ms) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots 0.2 0.4 0.6 0.8 1 20 40 60 80 100 CDF Standard Deviation (us) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots
Hadoop Memcache
between ECMP and flowlets.
stddev
imbalance
63
0.2 0.4 0.6 0.8 1 50 100 150 200 250 CDF Standard Deviation (ms) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots 0.2 0.4 0.6 0.8 1 20 40 60 80 100 CDF Standard Deviation (us) ECMP Polling ECMP Snapshots Flowlet Polling Flowlet Snapshots
Hadoop Memcache
Averaging shows perfect balance in both cases
between ECMP and flowlets.
stddev
imbalance
64
65
66
10 100 1000 10000 4 8 16 32 64 Maximum Rate (Hz) # of Ports/Router
67