Ground Control to Major Faults: Towards Fault Tolerant and Adaptive - - PowerPoint PPT Presentation

ground control to major faults towards fault tolerant and
SMART_READER_LITE
LIVE PREVIEW

Ground Control to Major Faults: Towards Fault Tolerant and Adaptive - - PowerPoint PPT Presentation

Ground Control to Major Faults: Towards Fault Tolerant and Adaptive SDN Control Network Liron Schiff (Tel Aviv University) Stefan Schmid (TU Berlin, Germany & Aalborg University, Denmark) Marco Canini (Universit catholique de Louvain)


slide-1
SLIDE 1

Liron Schiff (Tel Aviv University) Stefan Schmid (TU Berlin, Germany & Aalborg University, Denmark) Marco Canini (Université catholique de Louvain)

Ground Control to Major Faults: Towards Fault Tolerant and Adaptive SDN Control Network

slide-2
SLIDE 2

Logically centralized control Control plane network Fast data plane

Software Defined Network (SDN)

slide-3
SLIDE 3
  • Main function:

– Connect the controller with each switch

Controller

SDN control-plane

slide-4
SLIDE 4
  • Main function:

– Connect the controller with each switch

  • Can be distributed

– Handle failures – Load balancing – Need synchronization

SDN control-plane

slide-5
SLIDE 5
  • Main functions:

– Connect the controller with each switch – Inter-connect the controllers

  • Can be distributed

– Handle failures – Load balancing – Need synchronization

SDN control-plane

slide-6
SLIDE 6
  • Main function:

– Connect the controller with each switch – Inter-connect the controllers

  • Can be distributed

– Handle failures – Load balancing – Need synchronization

  • Can be in-band

– Cheaper – More provisioned (redundancy) – More flexible (TE, unicast, etc.)

SDN control-plane

slide-7
SLIDE 7

Control Module In-band processing

  • Control traffic is sent in-

band.

  • The switch identifies and

forward it to the control module.

  • Supported by OpenFlow.

Switch Structure (Model)

slide-8
SLIDE 8

Challenge: Boot Up

  • Switches start as unmanaged.
  • Switches should be configured to forward control in-

band.

slide-9
SLIDE 9

Challenge: Boot Up

  • Switches start as unmanaged.
  • Switches should be configured to forward control in-

band.

slide-10
SLIDE 10

Challenge: Plug&Play

  • Support new links / switches / controllers
  • Switches can’t be configured with all possible controllers.
slide-11
SLIDE 11

Challenge: Plug&Play

  • Support new links / switches / controllers
  • Switches can’t be configured with all possible controllers.
slide-12
SLIDE 12

Challenge: Handle Failures

slide-13
SLIDE 13
  • Goal: Network should return to a good state.

Challenge: Handle Failures

slide-14
SLIDE 14

“Good network state” :=

  • Every switch is connected to a controller.
  • Controllers can communicate and make joint

decisions.

Model

slide-15
SLIDE 15

Our Contributions

A Plug & Play Distributed SDN Control Plane

  • Flexible controller membership (additions, removals,

failures)

  • Automatic switch discovery & topology awareness
  • Supports ONIX, ElastiCon, Beehive, STN, and more.

Self Adjusting

  • Converges to “good state” from unmanaged states.
  • Tolerates failures and delays: low re-convergence times
slide-16
SLIDE 16

The Medieval Scheme

  • Controllers aim to continuously grow their

management regions...

  • … and “conquer” unmanaged switches.
  • Management with two spanning tree types:

(1) Per-region spanning tree (bidirectional, owned by controller) (2) Network-wide spanning tree (to connect controllers)

slide-17
SLIDE 17

Switch States

x

1 2 2 1

  • 1. Controller traffic

is passed through

  • 2. Other controllers

are blocked Session established No keep-alive timeout

Unmanaged Managed

  • 1. Broadcast
  • 2. Any controller

can respond

slide-18
SLIDE 18

Switch State Configurations

Rules Properties Managed Priority 2, with timeout Unmanaged Priority 1, no timeout

A priori configured Maintained by controller

slide-19
SLIDE 19

Controller uses a managed switch, R, to detect and establish connection to a new switch S.

The Protocol

slide-20
SLIDE 20

The Medieval Scheme

  • Controllers aim to continuously grow their

management regions...

  • … and “conquer” unmanaged switches.
  • Management with two spanning tree types:

(1) Per-region spanning tree (bidirectional, owned by controller) (2) Network-wide spanning tree (to connect controllers)

slide-21
SLIDE 21

The Medieval Scheme

  • Controllers aim to continuously grow their

management regions...

  • … and “conquer” unmanaged switches.
  • Management with two spanning tree types:

(1) Per-region spanning tree (bidirectional, owned by controller)

slide-22
SLIDE 22

Controller to Switch Connectivity

Controllers “conquer” switches adjacent to their regions of control and build a spanning tree for controller-to-switch connectivity.

S1 S2 S7 S8 S3 S4 S5 S6 B A Switch Controller Other Link Spanning Tree Link

Unmanned switch Anchor switch pkt-in(ARP) ARP ARP

slide-23
SLIDE 23

The Medieval Scheme

  • Controllers aim to continuously grow their

management regions...

  • … and “conquer” unmanaged switches.
  • Management with two spanning tree types:

(1) Per-region spanning tree (bidirectional, owned by controller) (2) Network-wide spanning tree (to connect controllers)

slide-24
SLIDE 24

Controller to Controller Connectivity

S1 S2 S7 S8 S3 S4 S5 S6 B A

Per-controller global spanning trees provide controller-to-controller connectivity.

slide-25
SLIDE 25
  • Emulator in Java
  • OpenFlow switches and controllers: light-

weight threads

  • Links modelled by message queues
  • Fat-tree topology (k=4), 1-8 controllers
  • Measured time to manage switches

# ctrls 1 2 3 4 5 6 7 8 Time(ms) 9382 6983 6150 4224 6035 5104 3704 3680

Prototype Implementation

slide-26
SLIDE 26

Prototype Implementation

slide-27
SLIDE 27
  • Medieval: a robust distributed SDN control

plane.

  • Fully supported by OpenFlow.
  • Convergence can be proved and easily tested.
  • Extended analysis and simulation are coming

soon.

Conclusions

slide-28
SLIDE 28