XPANDER: TOWARDS OPTIMAL-PERFORMANCE DATACENTERS Asaf Valadarsky - - PowerPoint PPT Presentation

xpander towards optimal performance datacenters
SMART_READER_LITE
LIVE PREVIEW

XPANDER: TOWARDS OPTIMAL-PERFORMANCE DATACENTERS Asaf Valadarsky - - PowerPoint PPT Presentation

XPANDER: TOWARDS OPTIMAL-PERFORMANCE DATACENTERS Asaf Valadarsky (Hebrew University) Gal Shahaf (Hebrew University) Michael Dinitz (Johns Hopkins University) Michael Schapira (Hebrew University) DESIGNING A DATACENTER ARCHITECTURE Network


slide-1
SLIDE 1

XPANDER: TOWARDS OPTIMAL-PERFORMANCE DATACENTERS

Asaf Valadarsky (Hebrew University) Gal Shahaf (Hebrew University) Michael Dinitz (Johns Hopkins University) Michael Schapira (Hebrew University)

slide-2
SLIDE 2

DESIGNING A DATACENTER ARCHITECTURE

Network Topology? Routing? Congestion Control?

slide-3
SLIDE 3

DESIGNING A DATACENTER ARCHITECTURE

Performance

➡Throughput ➡Resiliency to failures ➡Path diversity ➡…

Deployability

➡Cabling complexity ➡Operations cost ➡Equipment costs ➡…

slide-4
SLIDE 4

WHAT IS THE “RIGHT” DATACENTER ARCHITECTURE?

DEPLOYABILITY

PERFORMANCE

Jellyfish Slim-Fly

????

Fat Tree SWDC, DCell, BCube, c-Through, Helios, …

slide-5
SLIDE 5

AGENDA

  • Reaching that upper-right corner entails

designing “expander datacenters”

  • Xpander: a tangible and near-optimal

datacenter design

slide-6
SLIDE 6

EXPANDER DATACENTERS

  • An expander datacenter architecture:

Utilizes an expander graph as its network topology (see next slide)

Employs (multi-path) routing and congestion control to exploit path diversity

slide-7
SLIDE 7

EXPANDER GRAPHS: INTUITION

S V\S

  • A graph is called an “expander graph” if it has

“good” edge expansion

  • Intuition: In an expander graph, the capacity

traversing each cut is “large”

Traffic is never bottlenecked at small set of links

High path diversity

slide-8
SLIDE 8

CONSTRUCTING EXPANDERS

  • Constructing expanders is a prominent research

area in mathematics and computer science

  • Applications in networking, computational

complexity, coding, and beyond

slide-9
SLIDE 9

Support higher traffic loads

More resilient to failures

Support more servers with less network devices

Multiple short-paths between hosts

Incrementally expandable

EXPANDER DATACENTERS ACHIEVE NEAR-OPTIMAL PERFORMANCE

slide-10
SLIDE 10

OUR EVALUATION

➡ Theoretical analyses ➡ Flow- and packet-level simulations ➡ Experiments on network emulator ➡ Experiments on an SDN-capable network

slide-11
SLIDE 11

EXPANDER DATACENTERS ARE THE STATE-OF-THE-ART

Low-Diameter Graph Random Graph

DEPLOYABILITY

PERFORMANCE

Jellyfish Slim-Fly

????

Fat Tree SWDC, DCell, BCube, c-Through, Helios, …

slide-12
SLIDE 12

CAN WE HAVE IT ALL?

A well structured design Near optimal performance

YES! :)

slide-13
SLIDE 13

XPANDER DATACENTER ARCHITECTURE

Near-Optimal Performance

➡Throughput ➡Resiliency to failures ➡Path diversity ➡…

Deployable

➡Cabling complexity ➡Operations cost ➡Equipment costs ➡…

Expander Datacenter Deployment- Oriented Construction

slide-14
SLIDE 14

ToR ToR ToR ToR

XPANDER DATACENTER ARCHITECTURE

Meta Node Meta Node

Same number

  • f ToRs within

any meta-node Same number of links between every two meta- nodes

Leverages a deterministic graph-theoretic construction of expanders [BL ’06]

ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR

No links within the same meta- node

slide-15
SLIDE 15

WHERE ARE MY PODS?

An Xpander can be divided into smaller “Xpander pods”

ToR ToR ToR ToR

slide-16
SLIDE 16

XPANDER DATACENTER ARCHITECTURE

Topology Routing

Multipath Routing (K-Shortest Paths)

Congestion Control

Multipath Congestion Control (Multipath-TCP)

ToR ToR ToR ToR
slide-17
SLIDE 17

Support higher traffic loads

More resilient to failures

Support more servers with less network devices

Multiple short-paths between hosts

Incrementally expandable

EXPANDER DATACENTERS ACHIEVE NEAR-OPTIMAL PERFORMANCE

slide-18
SLIDE 18

NEAR OPTIMAL ALL-TO-ALL THROUGHPUT

Theorem: In the all-to-all setting, the throughout of any d-regular expander G on n vertices is within a factor of O(logd) of that of the throughput-optimal d-regular graph

  • n n vertices

* 18-port

switches

0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 500 1000 1500 2000

Normelized Throughput Number Of Servers

All-to-All Throughput

Xpander Jellyfish LPS_54 LPS_62

*

slide-19
SLIDE 19

Theorem: In any d-regular expander, any two vertices are connected by exactly d edge-disjoint paths.

RESILIENCE TO FAILURES

slide-20
SLIDE 20
  • Expander datacenters empirically attain near-
  • ptimal throughput under skewed TMs (mice and

elephants)

  • We prove that expander datacenters are
  • ptimal with respect to adversarial traffic

conditions

NEAR-OPTIMAL THROUGHPUT UNDER SKEWED TRAFFIC MATRICES

slide-21
SLIDE 21

COST EFFICIENCY: XPANDER VS. FAT-TREE

Switch Degree #Switches All-to-All Throughput 8* 80% 121% 10 100% 157% 24 80% 111%

*Validated using Mininet experiments

slide-22
SLIDE 22

SEE PAPER FOR

  • Analysis of shortest-paths and diameter
  • Physical layout and costs
  • Incremental expansion of expander datacenters
  • Results for skewed traffic matrices
  • Results for Xpander vs. Jellyfish
  • Results for Xpander vs. Slim-Fly
  • Additional results for Xpander vs. Fat Tree
  • Experiments with the Mininet network emulator
  • Experiments on the OCEAN SDN-capable network testbed
slide-23
SLIDE 23

DEPLOYING XPANDER

Place ToRs of each meta-node in close proximity

Bundle cables between two meta-nodes

Use color-coding to distinguish between different meta-nodes and bundles of cables

No links within the same meta- node Same number of links between every two meta- nodes

ToR ToR ToR ToR

slide-24
SLIDE 24

DEPLOYING XPANDER

  • Analysed physical layout, cabling complexity,

#cables and cable length for both large-scale and “container” datacenters

Switch Ports

#Switches #Servers #Cables Cable Length Throughput

32

42 vs. 48 (87.5%) 504 vs. 512 (98.44%) 420 vs. 512 (82%) 4.2 km vs 5.12km (82%) 109%

48

66 vs. 72 (92%) 1056 vs. 1152 (92%) 1056 vs. 1152 (92%) 10.5 km vs 11.5km (92%) 142%

slide-25
SLIDE 25

CONCLUSION

  • We show that expander datacenters outperform traditional

datacenters

Sheds light on past results about random and low- diameter graphs based datacenters

  • We present Xpander, a novel datacenter architecture

Suggests a tangible alternative to today’s datacenter architectures

Achieves near-optimal performance

slide-26
SLIDE 26

QUESTIONS?

THANK YOU!

See project webpage at: https://husant.github.io/Xpander/