Performance Datacenters HotNets15 Xpander: Unveiling the Secrets of - - PowerPoint PPT Presentation

performance datacenters
SMART_READER_LITE
LIVE PREVIEW

Performance Datacenters HotNets15 Xpander: Unveiling the Secrets of - - PowerPoint PPT Presentation

Towards Optimal- Performance Datacenters HotNets15 Xpander: Unveiling the Secrets of High-Performance Datacenters Asaf Valadarsky 3 , Michael Dinitz 1 , Michael Schapira 3 CoNext16 Xpander: Towards Optimal-Performance Datacenters


slide-1
SLIDE 1

Towards Optimal- Performance Datacenters

HotNets’15 – Xpander: Unveiling the Secrets of High-Performance Datacenters Asaf Valadarsky 3, Michael Dinitz1, Michael Schapira3 CoNext’16 – Xpander: Towards Optimal-Performance Datacenters Asaf Valadarsky 3 , Michael Dinitz1, Gal Shahaf3, Michael Schapira3 SIGCOMM’17 – Beyond Fat-Trees Without Antennae, Mirrors, and Disco-Balls Simon Kassing2, Asaf Valadarsky 3, Gal Shahaf3, Michael Schapira3, Ankit Singla2 3 1 2

slide-2
SLIDE 2

Designing A Datacenter Architecture

Network topology? Routing? Congestion Control?

Reconfigurab

† † † † †

Abstract— figure µ fic fle flo fline flo 30–95% 25–40%.

  • Reconfigurablility
  • rks—electrical

topology—has

profit first specific

’16, Array of Micromirrors Diffracted beam Towards destination Received beam Input beam Lasers DMDs Photodetectors Mirror assembly Reflected beam Reconfig.

µ Firefly µ

econfigurable reconfigurable fic reconfigurable reconfiguration reconfigurable 1–3)

Reconfigurab

† † † † † † † †

fs—simple “f tree”-lik reconfigurable reconfigurability reconfigurable; significant benefits fle Reconfigurablility fic fic

profit first specific S IGCOMM’14, 17–22, Rack% 1% Rack% N% Rack% r% Steerable% % FSOs% Ceiling% mirror% ToR% switch% FireFly% Controller% Traffic% Pa= erns% Rule% change% FSO% reconf%

reconfigurable fle fle benefits fle fit fle first significant reconfigurable reconfigures fic fle econfigur

slide-3
SLIDE 3

Deployability

➡Cabling complexity ➡Operations cost ➡Equipment costs ➡”Easy to reason about” ➡…

Designing A Datacenter Architecture

Performance

➡Throughput ➡Resiliency to failures ➡Path diversity ➡Flow completion time ➡…

slide-4
SLIDE 4

What Is The “RIGHT” Datacenter Architecture?

PERFORMANCE DEPLOYABILITY

FatTree Small-World Datacenters, Dcell, Bcube, Legup, Hedera, c-Through, etc…

????

Jellyfish Slim-Fly

slide-5
SLIDE 5

In This (and the next) Talk

  • Reaching that upper-right corner entails

designing “expander datacenters”

  • Xpander: a tangible and near-optimal

datacenter design

  • Next talk: Theoretical advances in the field of

expander datacenters

slide-6
SLIDE 6

Expander Datacenters

  • An expander datacenter architecture:

➡Utilizes an expander graph as its network

topology (see next slide + Michael’s talk)

➡Employs multi-path routing to exploit path

diversity

slide-7
SLIDE 7

S V\S

  • A graph is called an “expander graph” if it has

“good” edge expansion

  • Intuition: In a d-regular graph, with constant edge

expansion c, there are at least |S|c links crossing any cut (S,V\S) ➡ We want high values of c (ideally ~d/2) ➡ Traffic is never bottlenecked at small set of links ➡ Many paths between any source/destination pairs

Expander Graphs: Intuition

slide-8
SLIDE 8

Support higher traffic loads

More resilient to failures

Support more servers with less network devices

Multiple short-paths between hosts

Incrementally expandable

Expander Datacenters Achieve Near-Optimal Performance

slide-9
SLIDE 9

Our Evaluation

Theoretical analyses

Flow- and packet-level simulations

Experiments on a network emulator

Experiments on an SDN-capable network

slide-10
SLIDE 10

Expander Datacenters ARE The State-Of-The-Art Datacenters

PERFORMANCE DEPLOYABILITY

FatTree Small-World Datacenters, Dcell, Bcube, Legup, Hedera, c-Through, etc…

????

Jellyfish Slim-Fly Random Graph Low-Diameter Graph

Breaking news! Large low- diameter graphs are good expanders

slide-11
SLIDE 11

CAN WE HAVE IT ALL?

A well structured design Near optimal performance YES! :)

slide-12
SLIDE 12

Deployability

➡Cabling complexity ➡Operations cost ➡Equipment costs ➡”Could reason about” ➡…

Designing A Datacenter Architecture

Performance

➡Throughput ➡Resiliency to failures ➡Path diversity ➡Flow completion time ➡…

Expander Datacenter Deployment- Oriented Construction

slide-13
SLIDE 13

ToR ToR ToR ToR Meta Node

Same number of ToRs within any meta- node Same number

  • f links

between every two meta- nodes

Leverages a deterministic graph-theoretic construction of expanders [BL ’06]

ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR ToR

No links within the same meta- node

Xpander Datacenter Architecture

Meta Node

slide-14
SLIDE 14

Xpander Datacenter Architecture

Topology Routing K-Shortest Paths Congestion Control DCTCP [SIGCOMM’10]

ToR ToR ToR ToR
slide-15
SLIDE 15

Support higher traffic loads

More resilient to failures

Support more servers with less network devices

Multiple short-paths between hosts

Incrementally expandable

Expander datacenters Achieve Near- Optimal performance

slide-16
SLIDE 16

Datacenter Throughput

  • How much traffic can a datacenter network

support?

  • The network is modelled as a capacitated graph

G=(V,E,c) coupled with a demand matrix D

  • The maximum-concurrent-flow aD is the maximum a

such that each commodity in D sends exactly an a

  • f its demand
  • Common selections of D: All-to-All, Permutation,

Many-to-One, and One-to-Many

slide-17
SLIDE 17

Near Optimal All-To-All Throughput

Theorem: In the all-to-all setting, the throughout of any d- regular expander G on n vertices is within a factor of O(logd) of that of the throughput-optimal d-regular graph

  • n n vertices

* 18-port

switches

0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 500 1000 1500 2000

Normelized Throughput Number Of Servers

All-to-All Throughput

Xpander Jellyfish LPS_54 LPS_62 *

slide-18
SLIDE 18

Observation: In any d-regular expander (with edge expansion >=1), any two vertices are connected by exactly d edge-disjoint paths.

Resilience To Failures

slide-19
SLIDE 19

Datacenter Traffic

  • Datacenter traffic is unpredictable
  • Different tenants want different things
  • Varying degree of mixture between long and short

flows

  • With different types of skewness (i.e.,

percentage of chatty servers)

  • Could range between a uniform to highly skewed

distributions

slide-20
SLIDE 20

Near-Optimal Throughput Even Against Adversarial Traffic!

Theorem 1: Throughput of any expander on n vertices is a logarithmic (in n) factor away from the optimum with respect to any traffic pattern Theorem 2: For any d-regular graph G on n vertices there is some traffic matrix under which the throughput of G is a logarithmic (in n) factor away from the optimum Distance from Optimum Xpander throughput<80% <1% 80% ≤ throughput <85% 2.3% 85% ≤ throughput <90% 16.14% 90% ≤ throughput <95% 44.48% 95% ≤ throughput 36.61%

slide-21
SLIDE 21

Dynamic Networks: Set Up Network Connections On The Fly

slide-22
SLIDE 22

Are Static Networks Irrelevant?

  • Are fewer but flexible ports better than many

cheaper static ones?

  • Do static networks need sophisticated

routing/congestion control schemes to match the performance of dynamic networks?

We show that Xpander attains performance comparable to state-of-the-art dynamic networks at a comparable cost! This and more in our new SIGCOMM paper 

slide-23
SLIDE 23

Deploying A New Datacenter Architecture

  • Need to address the concerns of IT managing the

datacenter, mainly:

  • Keeping changes to the protocol stack to a minimum:

DCTCP as the congestion control mechanism and K- Shortest paths routing

  • Minimize cabling complexity (see next slide)
  • Have the ability to increase the datacenter size

More on this in Michael’s talk (coming up next)

slide-24
SLIDE 24

Cabling Xpander

Place ToRs of each meta-node in close proximity

Bundle cables between two meta-nodes

Use color-coding to distinguish between different meta-nodes and bundles of cables

No links within the same meta- node Same number

  • f links

between every two meta- nodes

ToR ToR ToR ToR

slide-25
SLIDE 25

Conclusion

  • We show that expander datacenters outperform

traditional datacenters

Sheds light on past results about random and low- diameter datacenter networks

  • We present Xpander, a novel datacenter architecture

Suggests a tangible alternative to today’s datacenter architectures

Achieves near-optimal performance

slide-26
SLIDE 26

Thank you!

Questions?