STAR: Self-Tuning Aggregation for Scalable Monitoring
Navendu Jain, Dmitry Kit, Prince Mahajan, Praveen Yalagandula†, Mike Dahlin, and Yin Zhang University of Texas at Austin
†HP Labs
STAR: Self-Tuning Aggregation for Scalable Monitoring [On job - - PowerPoint PPT Presentation
STAR: Self-Tuning Aggregation for Scalable Monitoring [On job market next year] Navendu Jain, Dmitry Kit, Prince Mahajan, Praveen Yalagandula , Mike Dahlin, and Yin Zhang University of Texas at Austin HP Labs Motivating Application
†HP Labs
\
2
Frequencies
Flows Node 1 Node N Aggregate Sum 0.1% threshold
3
4
Adaptive filters [Olston SIGMOD ’03], Astrolabe [VanRenesse TOCS ’03], TAG [Madden OSDI ’02], TACT [Yu TOCS ’02]
5
Coordinator
6
PIER [Huebsch VLDB ‘03], SDIMS [Yalagandula SIGCOMM ’04], Astrolabe [VanRenesse TOCS ’03], TAG [Madden OSDI ’02]
18 19 37 Physical Nodes (Leaf sensors) L1 L2 L3 L0 7 11 7 12 3 4 2 9 6 1 9 3 SUM
8
9
L1 L2 L3 L0
10
11
13
Message Load
Error Budget
14
15
17 5
Coordinator
18
19
21
22
1.11A 1.1A 1.A A 1.A A A1 Uea U f id Ac c f cN m a f e ASN g-U a c m Til STAR
10x load reduction
23
Message Cost per second Error Budget to Noise ratio
1e-04 0.001 0.01 0.1 1 0.1 1 10 100 Error Budget to Noise ratio Uniform Allocation Adap-filters (freq = 5) Adap-filters (freq = 10) Adap-filters (freq = 50) STAR
24
Message Cost per second Error Budget to Noise ratio
1 10 100 1 100 10000 1e+06 CDF (% of flows) Flow value (KB) Flow value distribution 1 10 100 1 100 10000 CDF (% of flows) Number of updates Flow updates distribution
60% flows send < 1KB 40% flows send 1 IP pkt 99% flows send < 330KB 99% flows send < 2k pkt
25
0.01 0.1 1 10 100 5 10 15 20 Message Cost per second AI Error Budget (% max flow value) BW(Root_share=0%) BW(Root_share=50%) BW(Root_share=90%) BW(Root_share=100%)
3x load reduction
10x load reduction
26
27
28