KOLLAPS Decentralized and Dynamic Topology Emulation Paulo - - PowerPoint PPT Presentation

kollaps
SMART_READER_LITE
LIVE PREVIEW

KOLLAPS Decentralized and Dynamic Topology Emulation Paulo - - PowerPoint PPT Presentation

KOLLAPS Decentralized and Dynamic Topology Emulation Paulo Gouveia*, Joo Neves*, Carlos Segarra, Luca Liechti Shady Issa*, Valerio Schiavoni, and Miguel Matos* *: INESC-ID & IST, University of Lisbon, Portugal : University of


slide-1
SLIDE 1

Eurosys 2020 KOLLAPS 29.04.2020

KOLLAPS

Decentralized and Dynamic Topology Emulation

Paulo Gouveia*, João Neves*, Carlos Segarra†, Luca Liechti† Shady Issa*, Valerio Schiavoni†, and Miguel Matos* *: INESC-ID & IST, University of Lisbon, Portugal †: University of Neuchâtel, Switzerland

slide-2
SLIDE 2

EuroSys 2020 KOLLAPS 29.04.2020

2

slide-3
SLIDE 3

EuroSys 2020 KOLLAPS 29.04.2020

MOTIVATION

  • Performance depends heavily on underlying

network

  • Variability and Failures are the norm

3

slide-4
SLIDE 4

EuroSys 2020 KOLLAPS 29.04.2020

MOTIVATION

  • Performance depends heavily on underlying

network

  • Variability and Failures are the norm

3

  • Need for tools for systematic evaluation of distributed

applications

  • Ability to answer key questions:
  • What is the impact of halving the network latency in application

throughput?

  • What is the effect of packet loss?
  • What if …
slide-5
SLIDE 5

EuroSys 2020 KOLLAPS 29.04.2020

RELATED WORK

4

slide-6
SLIDE 6

EuroSys 2020 KOLLAPS 29.04.2020

RELATED WORK

4

Main limitations:

  • scalability/centralization
  • accuracy
  • dynamics
slide-7
SLIDE 7

EuroSys 2020 KOLLAPS 29.04.2020

  • Applications are concerned about

end-to-end network properties

  • bandwidth, latency, jitter, packet loss
  • Rather than the network state

leading to these properties

KOLLAPS IN A NUTSHELL

5

slide-8
SLIDE 8

EuroSys 2020 KOLLAPS 29.04.2020

  • Applications are concerned about

end-to-end network properties

  • bandwidth, latency, jitter, packet loss
  • Rather than the network state

leading to these properties

KOLLAPS IN A NUTSHELL

5

  • Emulate the emerging properties rather

than the internal state of the network

  • Allows decentralized highly scalable

emulation

slide-9
SLIDE 9

EuroSys 2020 KOLLAPS 29.04.2020

NETWORK COLLAPSING

6

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

slide-10
SLIDE 10

EuroSys 2020 KOLLAPS 29.04.2020

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

NETWORK COLLAPSING

6

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

slide-11
SLIDE 11

EuroSys 2020 KOLLAPS 29.04.2020

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

NETWORK COLLAPSING

6

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

slide-12
SLIDE 12

EuroSys 2020 KOLLAPS 29.04.2020

NETWORK COLLAPSING

7

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

slide-13
SLIDE 13

EuroSys 2020 KOLLAPS 29.04.2020

NETWORK COLLAPSING

7

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

Minimum bandwidth on all links Minimum bandwidth on all links

slide-14
SLIDE 14

EuroSys 2020 KOLLAPS 29.04.2020

NETWORK COLLAPSING

7

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

Sum of latencies

  • n all links

Sum of latencies

  • n all links

Sum of latencies

  • n all links

Minimum bandwidth on all links Minimum bandwidth on all links

slide-15
SLIDE 15

EuroSys 2020 KOLLAPS 29.04.2020

NETWORK COLLAPSING

7

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

Sum of latencies

  • n all links

Sum of latencies

  • n all links

Sum of latencies

  • n all links

Minimum bandwidth on all links Minimum bandwidth on all links Pre-computation of static properties

slide-16
SLIDE 16

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-17
SLIDE 17

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-18
SLIDE 18

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-19
SLIDE 19

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-20
SLIDE 20

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-21
SLIDE 21

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-22
SLIDE 22

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-23
SLIDE 23

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

8

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design

slide-24
SLIDE 24

EuroSys 2020 KOLLAPS 29.04.2020

EMULATION MANAGER (EM)

  • One instance per physical machine
  • Enforces topology properties
  • static properties
  • dynamic properties

9

slide-25
SLIDE 25

EuroSys 2020 KOLLAPS 29.04.2020

EM: DYNAMIC PROPERTIES

10

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

slide-26
SLIDE 26

EuroSys 2020 KOLLAPS 29.04.2020

EM: DYNAMIC PROPERTIES

10

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

slide-27
SLIDE 27

EuroSys 2020 KOLLAPS 29.04.2020

EM: DYNAMIC PROPERTIES

10

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

sv1

1 M b / s 10 Mb/s

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

sv2 C1

slide-28
SLIDE 28

EuroSys 2020 KOLLAPS 29.04.2020

EM: DYNAMIC PROPERTIES

10

10Mb/s 10ms 100Mb/s 20ms 50Mb/s 5ms 50Mb/s 5ms

sv1 sv2 s1 s2 c1 target topology

sv1

1 M b / s 10 Mb/s

c1 sv1 sv2

50Mb/s 10ms 10Mb/s 35ms 10Mb/s 35ms Node Router Throughput Latency

collapsed topology

10Mb/s

collapsed topology

sv2 C1

How to enforce properties under congestion?

slide-29
SLIDE 29

EuroSys 2020 KOLLAPS 29.04.2020

EM: DYNAMIC PROPERTIES

11

  • RTT-Aware Min-Max model:
  • Intuition
  • Available bandwidth is inversely

proportionally to the RTT

slide-30
SLIDE 30

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

12

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design emulation core shared memory Log emulation core emulation core

slide-31
SLIDE 31

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

12

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design emulation core shared memory Log emulation core emulation core Gather usage statistics

slide-32
SLIDE 32

EuroSys 2020 KOLLAPS 29.04.2020

ARCHITECTURE

12

host-C

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

host-B

  • em. core

!

!

!

tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc tcal

  • verlay,..

qdisc Aeron Media Driver Log shared memory

  • em. core!

!

!

  • em. core!

!

!

physical network

host-A emulation manager

container container container

host-B host-C

dashboard input deployment generator monitor design emulation core shared memory Log emulation core emulation core Gather usage statistics Enforce emulation constraints

slide-33
SLIDE 33

EuroSys 2020 KOLLAPS 29.04.2020

EVALUATION

  • Link-level emulation
  • Scalability and metadata overhead
  • Short- and long-lived connections
  • Cubic and Reno congestion control algorithms
  • Dynamic behavior
  • Large-scale topologies
  • Reproducing published results
  • Geo-replicated Systems
  • What-if use cases

13

slide-34
SLIDE 34

EuroSys 2020 KOLLAPS 29.04.2020

EVALUATION

  • Link-level emulation
  • Scalability and metadata overhead
  • Short- and long-lived connections
  • Cubic and Reno congestion control algorithms
  • Dynamic behavior
  • Large-scale topologies
  • Reproducing published results
  • Geo-replicated Systems
  • What-if use cases

14 check full paper

slide-35
SLIDE 35

EuroSys 2020 KOLLAPS 29.04.2020

LARGE-SCALE TOPOLOGIES

  • Scale-free networks with random ping requests
  • Mean-square error w.r.t. theoretical RTT:

15

Size (# nodes + # switches) KOLLAPS Mininet Maxinet 1000 0.0261 0.0079 28.0779 2000 0.0384 N/A 347.5303 4000 0.0721 N/A N/A

slide-36
SLIDE 36

EuroSys 2020 KOLLAPS 29.04.2020

GEO-REPLICATED SYSTEM

16

  • Cassandra on EC2 (replication factor: 2)
  • 4 replicas in Frankfurt, 4 replicas in Sydney
  • 4 YCSB clients in Frankfurt
  • Repeat experiment with Kollaps on a local cluster
slide-37
SLIDE 37

EuroSys 2020 KOLLAPS 29.04.2020

GEO-REPLICATED SYSTEM

16

  • Cassandra on EC2 (replication factor: 2)
  • 4 replicas in Frankfurt, 4 replicas in Sydney
  • 4 YCSB clients in Frankfurt
  • Repeat experiment with Kollaps on a local cluster
slide-38
SLIDE 38

EuroSys 2020 KOLLAPS 29.04.2020

GEO-REPLICATED SYSTEM

17

  • What is the impact of halving the network latency in

application throughput?

  • E.g. move replicas from Sidney to Seoul
slide-39
SLIDE 39

EuroSys 2020 KOLLAPS 29.04.2020

GEO-REPLICATED SYSTEM

17

  • What is the impact of halving the network latency in

application throughput?

  • E.g. move replicas from Sidney to Seoul
slide-40
SLIDE 40

EuroSys 2020 KOLLAPS 29.04.2020

CONCLUSION AND FUTURE WORK

  • KOLLAPS: a decentralized topology emulator
  • Emulation of emerging end-to-end properties
  • Allows decentralized highly scalable emulation
  • Future work:
  • Adding interactive control of experiments
  • Time-dilation to mitigate physical limitations
  • Event-based meta-data dissemination

18

https://github.com/miguelammatos/Kollaps