Sincronia:
Near-Optimal Network Design for Coflows
Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat
Shijin Rajakrishnan
Joint work with
Sincronia: Near-Optimal Network Design for Coflows Shijin - - PowerPoint PPT Presentation
Sincronia: Near-Optimal Network Design for Coflows Shijin Rajakrishnan Joint work with Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat The Flow Abstraction FTP Email HTTP Traditional Applications : Care about
Saksham Agarwal Akshay Narayan Rachit Agarwal David Shmoys Amin Vahdat
Shijin Rajakrishnan
Joint work with
Optimized for Flow-level performance FTP Email HTTP … Traditional Applications: Care about performance
Good Match
Distributed Applications: Care about performance for a group of flows … FTP Email HTTP … Traditional Applications: Care about performance
Optimized for Flow-level performance
… … … … Coflow 1 Coflow 2 Coflow 3
▪
➢ Source-destination for each flow ➢ Size of each flow ➢ Coflow weight
Ingress ports Egress ports
1 2’ 2 1’
DC Fabric
Systems/ Theory State-of-the-art Performance Guarantees Runs on Existing Transport Work Conserving Starvation Avoiding Systems Varys [SIGCOMM ‘14] Theory On Scheduling Coflows [IPCO ‘17]
(4-apx)
Systems/ Theory Name Performance Guarantees Runs on Existing Transport Work Conserving Starvation Avoiding Systems Varys Theory On Scheduling Coflows Systems Sincronia
(4-apx) (4-apx)
▪ Bottleneck, Select, Scale, Iterate ▪ SRPT-first style algorithm
▪ Ordered last
1 2 1’ 2’
Ordering not important
▪ Ordered Last
1 2’ 2 1’ 1 2 1’ 2’
Order: Weights: Find port handling largest number of packets Select coflow with largest size-to-weight ratio Scale weight of each coflow (at bottleneck port) Iterate on unscheduled coflows Weight ← Weight(1 –
ൗ
Size Weight
ൗ
Size Weight)
Weight ← Weight(1 –
ൗ
Size Weight
ൗ
Size Weight)
ൗ
Size Weight = 3
ൗ
Size Weight = 4
ൗ
Size Weight = 1
ൗ
Size Weight = 8
ൗ
Size Weight = 4
1 3 4 Weight ← × (1 – )
#packets = 4 #packets = 8 #packets = 5 #packets = 7
1 2’ 2 1’ Order:
Host 1 Host 2 Transport Transport
▪ E.g. pFabric, pHost, TCP
▪ Network Topology ▪ Location of Congestion ▪ Paths of Coflows
▪ No reallocations upon coflow arrivals/departures
1 2 4 8 1 2 4 8
▪ Choose subset of unscheduled jobs ▪ Schedule in next epoch using offline alg.
▪ Evaluate impact of in-network congestion, and hardware constraints
▪ Coflows arrive at time 0 ▪ Coflows arrive at arbitrary times
▪ Sensitivity analysis
➢Coflow sizes, structure, # of coflows ➢Network topologies, Oversubscription ratios, Network load ➢…
1 2 3 4 5 6 7 8 9
Average 90th percentile 99th percentile Facebook trace 1000 coflow trace 2000 coflow trace
OCT: Completion time of a coflow in an unloaded network
Sincronia not only provides near-optimal guarantees, but also improves upon state-of-the-art design in practice
526 coflow trace [Varys]
0.5 1 1.5 2 2.5 3 3.5 4
1000 coflow trace 2000 coflow trace
Average 90th percentile 99th percentile
Slowdown
Network Load = 0.9 Even at such high network loads, Sincronia achieves CCT close to that of an unloaded network
▪ Full bisection bandwidth ▪ 20 PICA8 switches
➢ Supports 8 priority levels
20 40 60 80 100 120 140 160
+ Compare against existing designs
at mean and at tails
Average 90th percentile 99th percentile
Sincronia achieves significant improvements over existing network designs even with a small number of priority levels
Name Performance Guarantees Run on existing Transport Work Conserving Starvation Avoiding Varys On Scheduling Coflows Sincronia
(4-apx) (4-apx)