Intro Software streams Case studies Conclusions
Software Streams Big Data Challenges in Dynamic Program Analysis
Irene Finocchi
- Dept. Computer Science – Sapienza U. Rome
1 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Software Streams Big Data Challenges in Dynamic Program Analysis - - PowerPoint PPT Presentation
Intro Software streams Case studies Conclusions Software Streams Big Data Challenges in Dynamic Program Analysis Irene Finocchi Dept. Computer Science Sapienza U. Rome 1 / 41 Irene Finocchi CiE 2013 special session on data streams and
Intro Software streams Case studies Conclusions
1 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
2 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
2 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
2 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
3 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
3 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
4 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
4 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
5 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
5 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
6 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
7 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
8 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
9 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
10 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
10 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
10 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
10 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Program analysis Static vs. dynamic Dynamic issues
11 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
12 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
13 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
14 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
14 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
14 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
15 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
j {routines}j
15 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
16 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
16 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
17 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
18 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60 70 80 90 100 cumulative frequency relative to total number of calls % of hot contexts sorted by rank cumulative frequency distributions audacity audacity (startup only) bzip2 gimp gnome-dictionary inkscape
19 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Execution traces Some properties
20 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
21 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
22 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
23 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
24 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
25 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
26 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
26 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
26 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
26 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
26 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
27 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
1 Sketch-based
27 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
1 Sketch-based
2 Counter-based
27 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
1 Sticky sampling Gibbons & Matias, SIGMOD 1998 – Manku &
ε · log 1 ϕδ)
2 Lossy counting [Manku & Motwani, VLDB 2002]
ε · log(εn))
3 Space saving [Metwally, Agrawal & El Abbadi, ACM TODS 2006]
ε) (provably optimal)
28 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
CCT (all contexts) false positives monitored M true hot H (ε,ϕ)-heavy hitters A
29 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
CCT (all contexts) false positives HCCT monitored M true hot H (ε,ϕ)-heavy hitters A
30 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
CCT (all contexts) (ε,ϕ)-HCCT false positives HCCT monitored M true hot H (ε,ϕ)-heavy hitters A
31 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
32 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
100 100 100 100 50 50 40 10 10 10 10 1 false positive false positive (a) (b) (c)
33 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
20 40 60 80 100 a m a r
a r k a u d a c i t y b l u e f i s h d
p h i n f i r e f
g e d i t g h e x 2 g i m p s u d
u g w e n v i e w i n k s c a p e
a l c
m p r e s s
r i t e r p i d g i n q u a n t a v l c Cold nodes / hot nodes / false positives (%) Classification of (φ,ε)-HCCT nodes
34 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
35 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
35 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
35 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
35 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
36 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 a m a r
a r k a u d a c i t y b l u e f i s h d
p h i n f i r e f
g e d i t g h e x 2 g i m p s u d
u g w e n v i e w i n k s c a p e
a l c
m p r e s s
r i t e r p i d g i n q u a n t a v l c a m a r
a r k a u d a c i t y b l u e f i s h d
p h i n f i r e f
g e d i t g h e x 2 g i m p s u d
u g w e n v i e w i n k s c a p e
a l c
m p r e s s
r i t e r p i d g i n q u a n t a v l c Avg/max error (%) Benchmarks Avg/max counter error among hot elements (% of the true frequency) LSS avg error LC avg error LSS max error LC max error Max Avg 37 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
38 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
39 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
39 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
39 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
40 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
40 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
40 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions Mining calling contexts Other applications
40 / 41 Irene Finocchi CiE 2013 special session on data streams and compression
Intro Software streams Case studies Conclusions
41 / 41 Irene Finocchi CiE 2013 special session on data streams and compression