Raj Joshi1, Ti Ting Qu2, Mun Choon Chan1, Ben Leong1, Boon Thau Loo3
BurstRadar
Practical Real-time Microburst Monitoring for Datacenter Networks
1 2 3
BurstRadar Practical Real-time Microburst Monitoring for Datacenter - - PowerPoint PPT Presentation
BurstRadar Practical Real-time Microburst Monitoring for Datacenter Networks Raj Joshi 1 , Ti Ting Qu 2 , Mun Choon Chan 1 , Ben Leong 1 , Boon Thau Loo 3 1 2 3 Microbursts (bursts) Events of intermittent congestion lasting 10s or
Raj Joshi1, Ti Ting Qu2, Mun Choon Chan1, Ben Leong1, Boon Thau Loo3
Practical Real-time Microburst Monitoring for Datacenter Networks
1 2 3
Events of intermittent congestion lasting 10’s or 100’s of µs
Bursty UDP traffic TCP segment offloading
BurstRadar (APSys ‘18) 2
Senders …… Receiver
Small amounts of queueing (microbursts):
BurstRadar (APSys ‘18) 3
> 10 Gbps 10’s of µs
Latency
Performance
BurstRadar (APSys ‘18) 4
Detect the occurrence of µbursts & identify the contributing flows!
Detecting & characterizing µbursts is hard
Measurement study from FB’s datacenter
Traditional sampling-based techniques
Commercial Solutions
BurstRadar (APSys ‘18) 5
In-band Telemetry (INT)
to monitoring servers from the last-hop switches
BurstRadar (APSys ‘18) 6
Programmable dataplanes and dataplane telemetry
Challenges: Effective & real-time monitoring
BurstRadar (APSys ‘18) 7
Using INT to detect µbursts is wasteful
all packets
Since µbursts are unpredictable
Correlate monitoring data from different points in the network
BurstRadar (APSys ‘18) 8
Key Insight: Key Idea:
switch where it happens
Egress Port Queues Switch’s Queuing Engine
µbursts are localized to a switch’s egress port queue
BurstRadar (APSys ‘18) 9
does not provide any support to peek into the contents of any queue
Egress Processing Pipeline
BurstRadar (APSys ‘18) 10
Egress Port Queues
Ring Buffer Courier Pkt Generator Snapshot Algorithm
Egress Ports Egress Deparser Queuing Telemetry (metadata) Markbit (metadata)
Egress Processing Pipeline
BurstRadar (APSys ‘18) 11
Egress Port Queues
Ring Buffer Courier Pkt Generator Snapshot Algorithm
Egress Ports Egress Deparser Mirror Port Queue Courier Packet
Egress Processing Pipeline
BurstRadar (APSys ‘18) 12
Egress Port Queues
Ring Buffer Courier Pkt Generator Snapshot Algorithm
Egress Ports Egress Deparser Mirror Port Queue Courier Packet
Egress Processing Pipeline
BurstRadar (APSys ‘18) 13
Egress Port Queues
Ring Buffer Courier Pkt Generator Snapshot Algorithm
Egress Ports Egress Deparser Telemetry Info:
Mirror Port Courier Packet Mirror Port Queue
Egress Processing Pipeline
BurstRadar (APSys ‘18) 14
Egress Port Queues
Ring Buffer Courier Pkt Generator Snapshot Algorithm
Egress Ports Egress Deparser Mirror Port
BurstRadar (APSys ‘18) 15
BurstRadar (APSys ‘18) 16
“Snapshot” the telemetry info of only the
packets involved in µbursts
Telemetry info: 5-tuple (packet header) ingress/egress timestamps enqueue/dequeue queue depths
(metadata)
BurstRadar (APSys ‘18) 17
Latency Increase Threshold
Queue Snapshots Snapshot Algorithm
BurstRadar (APSys ‘18) 18
BurstRadar (APSys ‘18) 19
“Courier” Packets transport the telemetry
info via the switch’s mirror port (out-of-band)
All the data stays together Avoids the expensive correlation on the monitoring servers
BurstRadar (APSys ‘18) 20
Each marked packet generate new courier packet clone egress to egress, clone_e2e
BurstRadar (APSys ‘18) 21
BurstRadar (APSys ‘18) 22
“Ring Buffer” temporarily stores the
telemetry info of marked packets until they can be copied into the courier packets.
BurstRadar (APSys ‘18) 23
Hardware Testbed
Generated µburst Traffic Traces
Compare BurstRadar against
BurstRadar (APSys ‘18) 24
BurstRadar Prototype Send/Receive µburst Traffic
BurstRadar (APSys ‘18) 25
5
5% RTT 10 times less packets compared to INT
No Note: e: 5% RTT ≈ 1.25µs of queuing @10Gbps in our testbed
BurstRadar (APSys ‘18) 26
300 entries (8.7KB SRAM) 10 concurrent µbursts (< 0.5%)
No Note: e: 1000 entries (29KB SRAM) fully handle 10 concurrent µbursts
0.5
BurstRadar (APSys ‘18) 27
Reso source ce switch ch.p4* p4* Match Crossbar 50.13% Hash Bits 32.35% SRAM 29.79% TCAM 28.47% VLIW Actions 34.64% Stateful ALUs 15.63%
Tofino Resource Utilization (Ring Buffer = 1000 entries)
* resource utilization of a fully-featured datacenter ToR switch
Very little resources combined with switch.p4
BurstRadar 3.39% 4.83% 4.06% 0.69% 4.69% 12.5% Combined 53.52% 37.18% 33.85% 29.16% 39.33% 28.13%
BurstRadar (APSys ‘18) 28
effectively and continuously
packets involved in microbursts
ASICs have made it practical to detect and characterize microbursts at multi-gigabit line rates in high-speed datacenter networks.
BurstRadar (APSys ‘18) 29