A True Positives Theorem for a Static Race Detector
Nikos Gorogiannis, Peter O'Hearn, Ilya Sergey
Key Messages
- Unsound (and incomplete) static analyses can be principled, satisfying meaningful theorems that help to understand their behaviour and guide their design.
- One can have an unsound but effective static analysis, which has significant industrial impact, and which is supported by a meaningful theorem.
Context
1. We had a demonstrably effective industrial analysis: RacerD (OOPSLA'18); >3k fixes in the Facebook Java codebase
2. No soundness theorem
3. Architecture: compositional abstract interpreter
4. No heuristic alarm filtering
Just ad hoc? Our reaction: semantics/theory should understand and explain, not lecture.

Static Analyses for Bug Detection
Conjecture
True Positives Theorem: Under certain assumptions, the static bug detector reports no false positives.
Static Analyses for Program Validation

The Essence of Static Analysis
[Diagram: an "abstraction" α maps each program execution e to a property p of interest.]

Static Analysis
[Diagram: concreteSem(c) = a set of executions e1…e6; the abstraction α maps them to properties p1…p4.]

Static Analysis
[Diagram: the properties p1…p4 are partitioned into "correct" and "has bugs"; each execution lands on one side.]

Verifier or a Bug Detector?
Program Verifier
[Diagram: verdicts over executions e1…e6: true negatives, true positives, and a false positive.]

Sound Program Verifier
[Diagram: the analysis computes an abstract over-approximation of α(concreteSem(c)); all behaviours are covered (no false negatives), at the price of false positives.]

Sound Program Verifier
[Diagram: the verifier flags execution e6, which takes a branch the developer believes never happens:]

if (n == VERY_UNLIKELY_VALUE) {
    bug.explode();
} else {
    // do nothing
}

Developer: Go away, that never happens!
Unsound Program "Verifier"
[Diagram: silencing the report on the unlikely branch turns the false positive into a false negative: the bug on execution e6 is now missed.]

"Sound" Program Verifier
[Diagram: an abstract over-approximation of a concrete under-approximation of the semantics; execution e6, outside the under-approximation, becomes a false negative, while imprecision still yields false positives.]
Sound Static Verifiers
- False negatives (bugs missed) are bad
- False positives (non-bugs reported) are okay
- Constructed as an over-approximation (of an under-approximation)
- Soundness Theorem: under certain assumptions about the programs, the analyser has no false negatives.
Static Bug Finder
[Diagram: the "correct" / "has bugs" partition again; a bug finder's verdicts include true negatives, true positives, false negatives, and possibly false positives.]

Unsound Static Bug Finder
[Diagram: an unsound bug finder admits both false negatives and false positives.]

Sound (but Imprecise) Static Bug Finder
[Diagram: the analysis computes an abstract under-approximation; every report is a true positive, but imprecision causes false negatives.]
Loss of Precision in Static Bug Finders

if (n != VERY_UNLIKELY_VALUE) {
    // bug happens here
} else {
    // normal execution
}

[Diagram: executions e2 and e3 correspond to the two branches.]

Idea: over-approximate in the concrete semantics!
Sound (but Imprecise) Static Bug Finder
[Diagram: let's consider executions e2 and e3 equivalent, and merge them into one execution e23 that subsumes both.]

overApproxConcreteSem(c) = concreteSem(c) with e2 and e3 replaced by the merged execution e23, in which the condition becomes nondeterministic:

if (*) {
    // bug happens here
} else {
    // normal execution
}

Sound Static Bug Finder
[Diagram: an abstract under-approximation of the concrete over-approximation; the merged execution e23 is reported as a true positive, and there are no false positives.]
Towards Sound Static Bug Finders (this work)
- False negatives (bugs missed) are okay
- False positives (non-bugs reported) are bad
- Constructed as an under-approximation of an over-approximation
- Soundness (True Positives) Theorem: under certain assumptions about the programs, the analyser has no false positives.
A Recipe for a True Positives Theorem
1. Over-approximate semantic elements to make up for "difficult" dynamic execution aspects. Example: replace conditions and loops with their non-deterministic versions.
2. Pick an abstraction α for over-approximated executions that provably identifies "buggy" behaviours: ∀ e : execution, hasBug(α(e)) ⇒ execution e has a bug.
3. Design an abstract semantics asem so that it is complete wrt. α and the over-approximated concrete semantics: ∀ c : program, asem(c) = α(overApproxConcreteSem(c)).
4. Together, asem and hasBug provide a TP-sound static bug finder.
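Step 1 of the recipe can be made concrete with a minimal sketch (class and method names here are hypothetical, not the paper's formalisation): the over-approximated semantics of a branch keeps both arms, so it contains every concrete execution regardless of the guard's value.

```java
import java.util.List;

// Sketch of recipe step 1 (illustrative only): replacing a concrete branch
// condition with a nondeterministic choice "if (*)" makes the
// over-approximated semantics keep both arms of the branch.
public class OverApprox {
    static final int VERY_UNLIKELY_VALUE = 42; // hypothetical constant

    // Concrete semantics: exactly one arm of the branch executes.
    static List<String> concreteSem(int n) {
        return n == VERY_UNLIKELY_VALUE ? List.of("bug") : List.of("ok");
    }

    // Over-approximated concrete semantics: the guard becomes "*",
    // so both arms are possible executions.
    static List<String> overApproxConcreteSem(int n) {
        return List.of("bug", "ok");
    }

    public static void main(String[] args) {
        // The over-approximation contains every concrete execution.
        for (int n : new int[] {0, 42}) {
            if (!overApproxConcreteSem(n).containsAll(concreteSem(n)))
                throw new AssertionError("not an over-approximation");
        }
        System.out.println("ok");
    }
}
```

The price of this move is that an abstraction computed over the larger execution set must still pin down genuinely buggy behaviours, which is exactly what steps 2 and 3 demand.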
Case Study: RacerDX
- A provably TP-sound version of Facebook's RacerD concurrency analyser (Blackshear et al., OOPSLA'18)
- Buggy executions: data races in lock-based concurrent programs
- Syntactic assumptions: Java programs with well-scoped locking (synchronized); no recursion, reflection, or dynamic class loading; global variables are ignored
- Concrete over-approximation: loops and conditionals are non-deterministic
A True Race

class Bloop { public int f = 1; }

class Burble {
    public void meps(Bloop b) {
        synchronized (this) {
            System.out.println(b.f);
        }
    }
    public void reps(Bloop b) {
        b.f = 42;
    }
    public void beps(Bloop b) {
        b = new Bloop();
        b.f = 239;
    }
}
A False Race

(Same classes Burble and Bloop as above; consider meps(b) running in parallel with beps(b).)

The path prefix b is "unstable" ("wobbly") in beps, as it is reassigned before the write; hence the race is evaded.
Complete Abstraction for Race Detection

Summaries are triples (W, L, A):
- W: "wobbly" paths touched during execution
- L: locking level
- A: accesses/locks with formals/fields

asem(meps(b)) = ({b.f}, 0, {R(b.f, 1)})
asem(reps(b)) = ({b.f}, 0, {W(b.f, 0)})
asem(beps(b)) = ({b, b.f}, 0, {W(b, 0), W(b.f, 0)})
Analysing Summaries for Races

asem(meps(b)) = ({b.f}, 0, {R(b.f, 1)})
asem(reps(b)) = ({b.f}, 0, {W(b.f, 0)})
asem(beps(b)) = ({b, b.f}, 0, {W(b, 0), W(b.f, 0)})

meps(b) || reps(b) ⇒ can race, report a bug!
meps(b) || beps(b) ⇒ maybe they don't race, don't report a bug
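The check on summaries can be sketched in code. This is a simplified, hypothetical rendering (the record shapes and names are mine, and the wobbly component is reduced to reassigned prefixes; the paper's summaries carry more structure): two parallel accesses race if they reach the same stable path, at least one writes, and at least one holds no lock.

```java
import java.util.Set;

// Simplified, hypothetical sketch of the summary-based race check.
public class RaceCheck {
    // An access to an access path; locked = made while holding the lock.
    record Access(String path, boolean isWrite, boolean locked) {}
    // wobbly: path prefixes reassigned in the method (races on them are evaded)
    record Summary(Set<String> wobbly, Set<Access> accesses) {}

    // Report a race only if: same path, at least one write, at least one
    // unlocked access, and the path is stable in both summaries.
    static boolean canRace(Summary s1, Summary s2) {
        for (Access a1 : s1.accesses())
            for (Access a2 : s2.accesses())
                if (a1.path().equals(a2.path())
                        && (a1.isWrite() || a2.isWrite())
                        && (!a1.locked() || !a2.locked())
                        && stable(a1.path(), s1) && stable(a2.path(), s2))
                    return true;
        return false;
    }

    // A path is stable if none of its prefixes is wobbly (reassigned).
    static boolean stable(String path, Summary s) {
        for (String w : s.wobbly())
            if (path.equals(w) || path.startsWith(w + "."))
                return false;
        return true;
    }

    public static void main(String[] args) {
        Summary meps = new Summary(Set.of(),
                Set.of(new Access("b.f", false, true)));
        Summary reps = new Summary(Set.of(),
                Set.of(new Access("b.f", true, false)));
        Summary beps = new Summary(Set.of("b"),
                Set.of(new Access("b", true, false), new Access("b.f", true, false)));
        System.out.println(canRace(meps, reps)); // locked read vs unlocked write: race
        System.out.println(canRace(meps, beps)); // b reassigned in beps: race evaded
    }
}
```

On the slide's examples this reproduces both verdicts: meps || reps is reported, while meps || beps is suppressed because the prefix b is wobbly in beps.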
Formal Result
RacerDX enjoys the True Positives Theorem wrt. data race detection.
(Details in the paper)
Evaluation
What is the price to pay for having the TP Theorem?
(Reporting no bugs whatsoever is TP-sound.)
RacerD vs RacerDX

Target         LOC    D CPU   DX CPU   CPU ±%   D Reps   DX Reps   Reps ±%
avrora         76k    103     102      0.4%     143      92        36%
Chronicle-Map  45k    196     196      0.1%     2        2         0%
jvm-tools      33k    106     109      -3.6%    30       26        13%
RxJava         273k   76      69       9.2%     166      134       19%
sunflow        25k    44      44       -1.4%    97       42        57%
xalan-j        175k   144     137      5.0%     326      295       10%

Evaluation results. CPU columns are in seconds; Reps are distinct reports.
To Take Away: Theory
- A True-Positives-sound static bug finder never reports false positives. It can be designed as an under-approximation of an over-approximation.
- An abstraction α for TP-sound static bug detection can be very simple, but it has to be complete (i.e., sufficient) to report bugs.
To Take Away: Practice
- RacerDX is a TP-sound race detector whose precision and performance are comparable with Facebook's RacerD (Blackshear et al., OOPSLA'18)
- If RacerDX had been deployed initially rather than RacerD, it would have found 1000s of bugs, far outstripping all reported impact of previous concurrency analyses (counterfactual reasoning)
- Until now, static analysers for bug catching that are effective in practice but unsound have often been regarded as ad hoc; in the future, they can be principled, satisfying theorems that inform and guide their designs.

Thanks!