Static Analysis with Demand-Driven Value Refinement Benno Stein, - PowerPoint PPT Presentation

Static Analysis with   Demand-Driven Value Refinement Benno Stein, Benjamin Barslev Nielsen, Bor-Yuh Evan Chang & Anders Møller

Sound static analysis for JavaScript • Static analysis for JavaScript is very challenging o[m]() 2 /17

Sound static analysis for JavaScript • Static analysis for JavaScript is very challenging o[m]() Dynamic object structure 2 /17

Sound static analysis for JavaScript • Static analysis for JavaScript is very challenging o[m]() Dynamic object structure Dynamically computed property name 2 /17

Sound static analysis for JavaScript • Static analysis for JavaScript is very challenging o[m]() Dynamic dispatch Dynamic object structure Dynamically computed property name 2 /17

Sound static analysis for JavaScript • Static analysis for JavaScript is very challenging o[m]() Dynamic dispatch Dynamic object structure Dynamically computed property name • Critical precision losses renders analysis useless • Too much spurious data-flow 2 /17

State-of-the-art data-flow analyzers • Fail to analyze load of some very popular libraries • Critical precision losses occur • Common characteristics • Forwards whole program analysis • Tracks data-flow, e.g., strings, functions and other objects • Non-relational • Aims to mitigate critical precision losses by: Context sensitivity • Syntactic patterns and special-case techniques • 3 /17

Critical code example Example program Analysis state func = o1[name] . . . o2[name] = func . . . o2.foo(…) 4 /17

Critical code example Example program Analysis state func = o1[name] o1 = {foo: f1, bar: f2} . name = ⊤ str . . o2 = {} o2[name] = func . . . o2.foo(…) 4 /17

Critical code example Example program Analysis state func = o1[name] o1 = {foo: f1, bar: f2} . name = ⊤ str . . o2 = {} func = f1|f2 o2[name] = func . . . o2.foo(…) 4 /17

Critical code example Example program Analysis state func = o1[name] o1 = {foo: f1, bar: f2} . name = ⊤ str . . o2 = { ⊤ str : f1|f2} func = f1|f2 o2[name] = func . . . o2.foo(…) 4 /17

Critical code example Example program Analysis state func = o1[name] o1 = {foo: f1, bar: f2} . name = ⊤ str . . o2 = { ⊤ str : f1|f2} func = f1|f2 o2[name] = func . Resolves both f1 and f2 . . o2.foo(…) 4 /17

The Lodash library 5 /17

Critical code example Example program Analysis state func = o1[name] o1 = {foo: f1, bar: f2} . . name = ⊤ str . o2 = {} func = f1|f2 o2[name] = func . . . o2.foo(…) 6 /17

Demand-driven value refinement Regain relational information through refinement queries Without modifying base analysis domain Refinement query: What is x, when y ↦ ̂ v ? What value can variable x have, given that y has value ̂ v ? 7 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {} . func = f1|f2 . o2[name] = func . . . o2.foo(…) 8 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {} . func = f1|f2 . What is name, when func ↦ f1? o2[name] = func . What is name, when func ↦ f2? . . o2.foo(…) 8 /17

̂ ̂ ̂ ̂ ̂ ̂ ̂ ̂ ̂ Backwards abstract interpreter for value refinement • Backwards goal-directed from the query location • Separation logic based abstract domain Intuitionistic - constraints hold for all extensions • Special symbolic variable RES represents value being refined • symbolic variables z , RES ∈ x , ̂ Var y , ̂ ::= ̂ h ∧ π | φ 1 ∨ φ 2 symbolic stores φ ∈ Store x 3 | ̂ h 1 * ̂ ::= true | unalloc ( ̂ heap constraints x ) | x ↦ ̂ x 1 [ ̂ x 2 ] ↦ ̂ x | h 2 h pure constraints ::= true | e | π 1 ∧ π 2 π symbolic expressions ::= ̂ x | ̂ e 1 ⊕ ̂ v | e 2 e ∈ Expr 9 /17

Backwards abstract interpreter for value refinement • Based on refutation sound Hoare triples ⟨ φ ⟩ s ⟨ φ ′ � ⟩ • Refutation soundness: φ ′ � s For all concrete runs where holds after , the state s before must satisfy . φ • Encoding refinement queries: v ? ⇝ ⟨ x ↦ RES * y ↦ ̂ v ⟩ What is x, when y ↦ ̂ y ∧ ̂ y = ̂ 10 /17

Critical code example func = o1[name] o2[name] = func 11 /17

Critical code example Refinement query: What is name, when func ↦ f1? func = o1[name] o2[name] = func 11 /17

̂ ̂ Critical code example Refinement query: What is name, when func ↦ f1? func = o1[name] ⟨ name ↦ RES * func ↦ func = f1 ⟩ func ∧ o2[name] = func 11 /17

̂ ̂ ̂ ̂ ̂ ̂ Critical code example Refinement query: What is name, when func ↦ f1? ⟨ name ↦ RES * o1 ↦ func = f1 ⟩ o1 * o1 [ RES ] ↦ func ∧ func = o1[name] ⟨ name ↦ RES * func ↦ func = f1 ⟩ func ∧ o2[name] = func 11 /17

̂ ̂ ̂ ̂ Leveraging forwards analysis state Analysis state o1 = {foo: f1, bar: f2} ⟨ name ↦ RES * o1 ↦ func = f1 ⟩ o1 * o1 [ RES ] ↦ func ∧ Refinement result is the values of RES satisfying: o1 [ RES ] = f1 Refinement result: “foo” 12 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {} . func = f1|f2 . o2[name] = func . . . o2.foo(…) 13 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {} . func = f1|f2 . What is name, when func ↦ f1? o2[name] = func . . What is name, when func ↦ f2? . o2.foo(…) 13 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {} . func = f1|f2 . What is name, when func ↦ f1? o2[name] = func “foo” . . What is name, when func ↦ f2? . “bar” o2.foo(…) 13 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {foo: f1, bar: f2} . func = f1|f2 . What is name, when func ↦ f1? o2[name] = func “foo” . . What is name, when func ↦ f2? . “bar” o2.foo(…) 13 /17

Critical code example Analysis state Example program o1 = {foo: f1, bar: f2} func = o1[name] name = ⊤ str . o2 = {foo: f1, bar: f2} . func = f1|f2 . What is name, when func ↦ f1? o2[name] = func “foo” . Resolves only f1 . What is name, when func ↦ f2? . “bar” o2.foo(…) 13 /17

Implementation for JavaScript • TAJS VR : TAJS extended with demand-driven value refinement • TAJS is a state-of-the-art analyzer for JavaScript • Implemented in Java • Active research since 2009 • VR JS : Backwards abstract interpreter for JavaScript for answering refinement queries • Implemented in Scala from scratch 14 /17

Compared to state-of-the-art #tests TAJS CompAbs TAJS VR Underscore 1 182 0 % 0 % 95% (2.9s) Lodash3 1 176 0 % 0 % 98% (5.5s) Lodash4 1 306 0 % 0 % 87% (24.7s) Prototype 2 6 0 % 33% (23.1s) 83% (97.7s) Scriptaculous 2 1 0 % 100% (62.0s) 100% (236.9s) JQuery 3 71 7% (14.4s) 0 % 7% (17.2s) JSAI tests 4 29 86% (12.3s) 34% (32.4s) 86% (14.3s) “x% (y)” means succeeded x% of test cases with average time y 1 : Most popular functional utility libraries 2 : Wei et al. [2016] 3 : Andreasen and Møller [2014] 4 : Kashyap et al. [2014] & Dewey et al. [2015] 15 /17

Compared to state-of-the-art #tests TAJS CompAbs TAJS VR Underscore 1 182 0 % 0 % 95% (2.9s) Lodash3 1 176 0 % 0 % 98% (5.5s) Lodash4 1 306 0 % 0 % 87% (24.7s) succeeds analyzing 92% of Underscore and Lodash Prototype 2 6 0 % 33% (23.1s) 83% (97.7s) tests, which all are unanalyzable by existing analyzers Scriptaculous 2 1 0 % 100% (62.0s) 100% (236.9s) JQuery 3 71 7% (14.4s) 0 % 7% (17.2s) TAJS R V JSAI tests 4 29 86% (12.3s) 34% (32.4s) 86% (14.3s) “x% (y)” means succeeded x% of test cases with average time y 1 : Most popular functional utility libraries 2 : Wei et al. [2016] 3 : Andreasen and Møller [2014] 4 : Kashyap et al. [2014] & Dewey et al. [2015] 15 /17

Value refinement insights • Value refinement is triggered in few locations • In Lodash4, it is triggered in 7 locations in >17000 LoC • Almost all queries are solved successfully (>99%) • Queries are answered efficiently (Avg. ~10ms) • Answering a query requires visiting few locations • Typically below 40 • Many queries requires interprocedural reasoning 16 /17

Conclusion • New technique: Demand-Driven Value Refinement Relational reasoning on top of non-relational analysis • Eliminates critical precision loss on-the-fly • Uses backwards analysis for gaining relational precision • Exploiting forwards analysis state allows efficient refinements • • Experimental evaluation First analysis capable of analyzing most popular JavaScript library • No significant overhead for incorporating backwards analyzer • Open-source: https://www.brics.dk/TAJS/VR/ • 17 /17

Static Analysis with Demand-Driven Value Refinement Benno Stein, - PowerPoint PPT Presentation

Static Analysis with Demand-Driven Value Refinement Benno Stein, Benjamin Barslev Nielsen, Bor-Yuh Evan Chang & Anders Mller Sound static analysis for JavaScript Static analysis for JavaScript is very challenging o[m]() 2 /17

Static and Method Overloading static One per class, not per object static variables

Adaptive Mesh Refinement CS 101 - Meshing Winter 2007 1 Mesh Refinement Applications

Static and dynamic verification Static and dynamic V&V Software inspections Concerned

Effective Approaches to Abstraction Refinement for an Explicit Value Analysis Stefan Lwe

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Priority-Driven Scheduling of Periodic Tasks Priority-driven vs. clock-driven scheduling:

Static analysis of OpenAFS code base Cheyenne Wills OpenAFS 2019 Workshop Overview What is

A Brief Introduction to Static Analysis Sam Blackshear March 13, 2012 Outline A theoretical

Static Analysis of Haskell Neil Mitchell http://ndmitchell.com Static Analysis is getting

Quadratic Interval Refinement Nikolaos Arvanitopoulos Seminar on Computational Geometry and

WMSBF Event Business Solutions 1 Employer-Focused, Demand-Driven Demand Driven model was

Static Code Analysis of Complex PHP Application Vulnerabilities Johannes Dahse Static Code

False fasting is driven by pride False fasting is driven by pride False fasting is

Static Analysis Static Analysis they are read? Will the current value of a variable ever be

7 Refinement Options November 3, 2016 Overview Recap the HS Boundary Refinement Process

Crystallographic refinement Roberto A. Steiner roberto.steiner@kcl.ac.uk with many slides

Compiler aided optimization of the pimpl-idiom Alexander Richardson (alr48@cam.ac.uk) University

CSE 116: Fall 2019 Introduction to Functional Programming Higher-Order Functions Owen

1. More readable 2. (Usually) More overhead 3. We dont know which one we aspire to more

WRITING FASTER CODE 1 . 1 WRITING FASTER CODE AND NOT HATING YOUR JOB AS A SOFTWARE DEVELOPER

C++11/14/1z in CMake Ben Morgan Overview How do we ensure the compiler and standard library

So you think you got it? 59 Wait a minute May everything we did in our Bounded Types

The APGAS Library Resilient Parallel and Distributed Programming in Java 8 Olivier Tardieu IBM

Cloud-Based File Synchronization Services Exploding in popularity Numerous providers:

Sambuz

Useful Links

Newsletter

Mail Us

Static Analysis with Demand-Driven Value Refinement Benno Stein, - PowerPoint PPT Presentation

Static Analysis with Demand-Driven Value Refinement Benno Stein, Benjamin Barslev Nielsen, Bor-Yuh Evan Chang & Anders Mller Sound static analysis for JavaScript Static analysis for JavaScript is very challenging o[m]() 2 /17

Static and Method Overloading static One per class, not per object static variables

Adaptive Mesh Refinement CS 101 - Meshing Winter 2007 1 Mesh Refinement Applications

Static and dynamic verification Static and dynamic V&amp;V Software inspections Concerned

Effective Approaches to Abstraction Refinement for an Explicit Value Analysis Stefan Lwe

Nash demand game Julio D avila 2009 Julio D avila Nash demand game Nash demand game

Priority-Driven Scheduling of Periodic Tasks Priority-driven vs. clock-driven scheduling:

Static analysis of OpenAFS code base Cheyenne Wills OpenAFS 2019 Workshop Overview What is

A Brief Introduction to Static Analysis Sam Blackshear March 13, 2012 Outline A theoretical

Static Analysis of Haskell Neil Mitchell http://ndmitchell.com Static Analysis is getting

Quadratic Interval Refinement Nikolaos Arvanitopoulos Seminar on Computational Geometry and

WMSBF Event Business Solutions 1 Employer-Focused, Demand-Driven Demand Driven model was

Static Code Analysis of Complex PHP Application Vulnerabilities Johannes Dahse Static Code

False fasting is driven by pride False fasting is driven by pride False fasting is

Static Analysis Static Analysis they are read? Will the current value of a variable ever be

7 Refinement Options November 3, 2016 Overview Recap the HS Boundary Refinement Process

Crystallographic refinement Roberto A. Steiner roberto.steiner@kcl.ac.uk with many slides

Compiler aided optimization of the pimpl-idiom Alexander Richardson (alr48@cam.ac.uk) University

CSE 116: Fall 2019 Introduction to Functional Programming Higher-Order Functions Owen

1. More readable 2. (Usually) More overhead 3. We dont know which one we aspire to more

WRITING FASTER CODE 1 . 1 WRITING FASTER CODE AND NOT HATING YOUR JOB AS A SOFTWARE DEVELOPER

C++11/14/1z in CMake Ben Morgan Overview How do we ensure the compiler and standard library

So you think you got it? 59 Wait a minute May everything we did in our Bounded Types

The APGAS Library Resilient Parallel and Distributed Programming in Java 8 Olivier Tardieu IBM

Cloud-Based File Synchronization Services Exploding in popularity Numerous providers:

Sambuz

Useful Links

Newsletter

Mail Us

Static and dynamic verification Static and dynamic V&V Software inspections Concerned