Towards an Optimizing Compiler for Numerical Kernels (joint work) - PowerPoint PPT Presentation



SLIDE 1

Towards an Optimizing Compiler for Numerical Kernels

Eva Darulova eva@mpi-sws.org

joint work with: Heiko Becker, Anastasiia Izycheva, Debasmita Lohar, Viktor Kuncak, Magnus Myreen, Sylvie Putot, Eric Goubault, Helmut Seidl

Photo by Sebastian Bruggisser

SLIDE 2

Resources are Limited

Suppose you want to implement a heartbeat monitor:

Design (infinite resources):

  • perfect inputs
  • continuous arithmetic

Implementation (limited resources):

  • noisy inputs
  • finite-precision arithmetic

inspired by: A Methodology for Embedded Classification of Heartbeats Using Random Projections, DATE'13

SLIDE 3

Approximations

accuracy ↔ efficiency

SLIDE 4

Approximations

accuracy ↔ efficiency

Navigating the Tradeoff is Hard!

SLIDE 5

Programming with Approximations

state-of-the-art

Embedded systems and scientific computing

  • manual
  • costly
  • error-prone

Programming languages

  • automated
  • sound
  • limited point solutions
SLIDE 6

Vision: 'Approximating Compiler'

ideal real-valued program with accuracy & resource spec
  ↓ automatically
approximate finite-precision program with correctness certificate

SLIDE 7

Today:

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

SLIDE 8

Overview

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions
SLIDE 9

Overview

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

SLIDE 10

Daisy

real-valued specification:

def sine(x: Real): Real = {
  require(-1.5 <= x && x <= 1.5 && x +/- 1e-11)
  x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
} ensuring(res => res +/- 1.001e-11)

floating-point arithmetic:

def sine(x: Double): Double = {
  require(-1.5 <= x && x <= 1.5)
  x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
}

fixed-point arithmetic:

ap_fixed<64,3> sine(ap_fixed<64,2> x) {
  ap_fixed<64,4> _const0 = 6.0;
  ap_fixed<64,3> _tmp = (x * x);
  ap_fixed<64,3> _tmp1 = (_tmp * x);
  ap_fixed<64,1> _tmp2 = (_tmp1 / _const0);
  ap_fixed<64,3> _tmp3 = (x - _tmp2);
  ...

Daisy turns the real-valued specification into a finite-precision implementation.
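To make the fixed-point side concrete, here is a minimal Python sketch (not code Daisy generates) that simulates fixed-point rounding of the sine polynomial and compares it against exact rational evaluation; the 28 fractional bits and the sampling grid are arbitrary illustration choices.

```python
from fractions import Fraction

def quantize(v, frac_bits=28):
    """Round v to the nearest multiple of 2^-frac_bits,
    mimicking fixed-point storage of an intermediate result."""
    scale = 1 << frac_bits
    return Fraction(round(v * scale), scale)

def sine_fixed(x):
    """x - x^3/6 + x^5/120 with every intermediate result quantized."""
    x = quantize(Fraction(x))
    t = quantize(x * x)       # x^2
    t1 = quantize(t * x)      # x^3
    t2 = quantize(t1 / 6)     # x^3 / 6
    t3 = quantize(t1 * x)     # x^4
    t4 = quantize(t3 * x)     # x^5
    t5 = quantize(t4 / 120)   # x^5 / 120
    return quantize(x - t2 + t5)

def sine_real(x):
    x = Fraction(x)
    return x - x**3 / 6 + x**5 / 120

# sampled roundoff error over [-1.5, 1.5]
err = max(abs(sine_fixed(Fraction(i, 100)) - sine_real(Fraction(i, 100)))
          for i in range(-150, 151))
```

Each quantization step contributes at most half an ulp of the chosen format, so the sampled error stays a small multiple of 2^-29; a sound static analysis (like Daisy's) bounds this for all inputs rather than a sample grid.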

SLIDE 11

Worst-case Accuracy

for arithmetic expressions

absolute errors [TOPLAS'17]

  • static data-flow analysis with interval & affine arithmetic
  • interval subdivision

    max_{x ∈ I} |f(x) − f̃(x̃)|

relative errors [FMCAD'17]

  • global optimization
  • for floating-point only

    max_{x ∈ I} |f(x) − f̃(x̃)| / |f(x)|

def sine(x: Real): Real = {
  require(-1.5 <= x && x <= 1.5 && x +/- 1e-11)
  x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
} ensuring(res => res +/- 1.001e-11)

Challenge: tight bounds for nonlinear arithmetic

SLIDE 12

Certificates [FMCAD'18, FM'19]

real-valued specification:

def sine(x: Real): Real = {
  require(-1.5 <= x && x <= 1.5 && x +/- 1e-11)
  x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
} ensuring(res => res +/- 1.001e-11)

floating-point arithmetic:

def sine(x: Double): Double = {
  require(-1.5 <= x && x <= 1.5)
  x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
}

fixed-point arithmetic:

ap_fixed<64,3> sine(ap_fixed<64,2> x) {
  ap_fixed<64,4> _const0 = 6.0;
  ap_fixed<64,3> _tmp = (x * x);
  ap_fixed<64,3> _tmp1 = (_tmp * x);
  ap_fixed<64,1> _tmp2 = (_tmp1 / _const0);
  ap_fixed<64,3> _tmp3 = (x - _tmp2);
  ...

Daisy produces a formally verified finite-precision implementation.

SLIDE 13

Overview

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

SLIDE 14

Conditionals: Continuous Case

def sine(x: Real): Real = {
  require(-2.0 < x && x < 2.0 && x +/- 1e-11)
  if (x < 1.0) {
    0.95493 * x - 0.12901*(x*x*x)
  } else {
    x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
  }
} ensuring(res => res +/- 1.001e-11)

Control-flow may diverge between the reals and finite precision: the real execution may take branch f1 while the finite-precision execution takes f2.

    max_{x ∈ I} |f(x) − f̃(x̃)|

    |f1(x) − f̃2(x̃)| ≤ |f1(x) − f1(x̃)| + |f1(x̃) − f2(x̃)| + |f2(x̃) − f̃2(x̃)|

Challenge: complexity of the constraint

SLIDE 15

Conditionals: Continuous Case

def sine(x: Real): Real = {
  require(-2.0 < x && x < 2.0 && x +/- 1e-11)
  if (x < 1.0) {
    0.95493 * x - 0.12901*(x*x*x)
  } else {
    x - (x*x*x)/6.0 + (x*x*x*x*x)/120.0
  }
} ensuring(res => res +/- 1.001e-11)

Challenge: complexity of the constraint

  • break up the total error into manageable pieces [TOPLAS'17]

    max_{x ∈ I} |f(x) − f̃(x̃)|

Control-flow may diverge between the reals and finite precision (real execution takes f1, finite-precision execution takes f2):

    |f1(x) − f̃2(x̃)| ≤ |f1(x) − f1(x̃)| + |f1(x̃) − f2(x̃)| + |f2(x̃) − f̃2(x̃)|
                        (Lipschitz const.)  (real difference)   (roundoff error)
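A tiny Python check (illustrative only; f1 and f2 are the two branch bodies from the code above) makes the decomposition concrete: near the guard x < 1.0 an input error of 1e-11 can flip the branch, and the total divergence error then splits into the three pieces above.

```python
f1 = lambda x: 0.95493 * x - 0.12901 * x**3   # branch taken when x < 1.0
f2 = lambda x: x - x**3/6 + x**5/120          # branch taken when x >= 1.0

# a real input just below the guard whose 1e-11 perturbation lands above it
x, delta = 1.0 - 5e-12, 1e-11
assert x < 1.0 <= x + delta

lip = abs(f1(x) - f1(x + delta))                # |f1(x) - f1(x~)|: Lipschitz part
real_diff = abs(f1(x + delta) - f2(x + delta))  # |f1(x~) - f2(x~)|: real difference
gap = abs(f1(x) - f2(x + delta))                # total error across the divergence

# the triangle-inequality split holds (roundoff inside f2 is omitted here)
assert gap <= lip + real_diff + 1e-12
```

For this pair of polynomials the real difference at the guard is around 1.6e-2, so when the branches disagree at the boundary the divergence term, not the 1e-11 input error, dominates the bound.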

SLIDE 16

Conditionals: Discrete Case

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) raise_alarm() else continue()
}

reals vs. finite precision

SLIDE 17

Conditionals: Discrete Case

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) return 0 else return 1
}

reals vs. finite precision; worst-case analysis: maximum error = 1

SLIDE 18

Conditionals: Discrete Case

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) return 0 else return 1
}

reals vs. finite precision; worst-case analysis: maximum error = 1

How often will the program return the wrong answer?

SLIDE 19

Probabilistic Analysis

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) return 0 else return 1
}

Goal: compute 'wrong path probability' (WPP)

  • the probability of computing the wrong answer
SLIDE 20

Exact Symbolic Inference

encode the WPP as a probabilistic program:

x1 := gauss(-15.0, 15.0);
x2 := gauss(-15.0, 15.0);
x3 := gauss(-15.0, 15.0);
res := -x1*x2 - 2*x2*x3 - x1 - x3;
error := 0.2042266; // worst-case error computed with Daisy
assert(0.0 - error <= res && res <= 0.0 + error);

  1. compute an exact expression for the WPP with PSI [1]
  2. solve it numerically with Mathematica

[1] PSI: Exact Symbolic Inference for Probabilistic Programs, CAV, 2016
SLIDE 21

Exact Symbolic Inference

encode the WPP as a probabilistic program:

x1 := gauss(-15.0, 15.0);
x2 := gauss(-15.0, 15.0);
x3 := gauss(-15.0, 15.0);
res := -x1*x2 - 2*x2*x3 - x1 - x3;
error := 0.2042266; // worst-case error computed with Daisy
assert(0.0 - error <= res && res <= 0.0 + error);

  1. compute an exact expression for the WPP with PSI [1]
  2. solve it numerically with Mathematica (runtime: 20 min)

[1] PSI: Exact Symbolic Inference for Probabilistic Programs, CAV, 2016
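As a quick cross-check of such an encoding, the WPP can also be estimated by plain Monte Carlo sampling. This Python sketch uses uniform inputs for simplicity (the slide's encoding uses gaussians) and counts how often res lands inside the worst-case error band around the threshold.

```python
import random

def wpp_estimate(err=0.2042266, n=200_000, seed=0):
    """Fraction of sampled inputs for which res lands within +/- err of the
    branch threshold 0.0, i.e. where roundoff could flip the comparison."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n):
        x1, x2, x3 = (rng.uniform(-15, 15) for _ in range(3))
        res = -x1*x2 - 2*x2*x3 - x1 - x3
        hits += (-err <= res <= err)
    return hits / n

wpp = wpp_estimate()
```

With a fixed seed the estimate is reproducible; unlike PSI's exact inference it is only statistical, but it runs in seconds rather than 20 minutes, which makes it a useful sanity check on the sound analyses.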

SLIDE 22

Probabilistic Range Analysis

[figure: input distribution discretized into a weighted histogram]

[2] A Generalization of P-boxes to Affine Arithmetic, Computing, vol. 94, no. 2-4, pp. 189–201, 2012.

  • discretize the input distribution:

    {<[a1, b1], w1>, <[a2, b2], w2>, ..., <[an, bn], wn>}

  • the number of subdivisions determines the accuracy
  • propagation for independent variables: interval arithmetic
  • propagation for dependent variables:
      • LP problem
      • keep track of linear dependencies with affine arithmetic [2]
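The discretization step can be sketched as follows (a simplification of the p-box construction in [2]; the uniform distribution and four subdivisions are arbitrary illustration choices):

```python
def discretize(lo, hi, n, cdf):
    """Split [lo, hi] into n equal subintervals, each carrying the probability
    mass the cdf assigns to it: {<[a1,b1],w1>, ..., <[an,bn],wn>}."""
    step = (hi - lo) / n
    return [((lo + i*step, lo + (i+1)*step),
             cdf(lo + (i+1)*step) - cdf(lo + i*step)) for i in range(n)]

def propagate(boxes, f_interval):
    """Propagate every weighted interval through a sound interval extension
    of f (valid for independent variables; dependent ones need the
    LP / affine-arithmetic treatment)."""
    return [(f_interval(iv), w) for iv, w in boxes]

def square_interval(iv):
    # naive interval multiplication iv * iv: sound, but it loses the
    # x == x dependency (the result can dip below 0)
    a, b = iv
    ps = [a*a, a*b, b*a, b*b]
    return (min(ps), max(ps))

uni = discretize(-1.0, 1.0, 4, cdf=lambda x: (x + 1) / 2)  # uniform on [-1, 1]
sq = propagate(uni, square_interval)
```

The weights always sum to 1 because the cdf differences telescope; refining n tightens both the intervals and how sharply the output distribution is resolved.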
SLIDE 23

Computing WPP [EMSOFT'18]

input program
  → roundoff analysis: error := 0.2042266
  → probabilistic analysis: distribution of f(x̄)
  → compute intersection (WPP)

SLIDE 24

Computing Intersection

[figure: output distribution split into weighted intervals w1 ... w5]

error := 0.2042266, threshold T := 0.0

WPP = w1 + w2 (the weights of the intervals that intersect [T − error, T + error])
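The intersection step amounts to summing the weights of the output intervals that can reach the uncertainty band; a minimal sketch (the five boxes and weights below are hypothetical, not the analysis result from the slide):

```python
def wpp_from_boxes(boxes, threshold, err):
    """Sum the weights of all output intervals that intersect the band
    [threshold - err, threshold + err], where finite precision may flip
    the branch decision."""
    lo, hi = threshold - err, threshold + err
    return sum(w for (a, b), w in boxes if a <= hi and b >= lo)

# hypothetical output p-box (five weighted intervals, weights sum to 1)
boxes = [((-3.0, -1.0), 0.10), ((-1.0, -0.1), 0.20), ((-0.1, 0.1), 0.30),
         ((0.1, 1.0), 0.25), ((1.0, 3.0), 0.15)]
wpp = wpp_from_boxes(boxes, threshold=0.0, err=0.2042266)
```

Any interval that merely touches the band is counted in full, which is what makes the result an over-approximation of the true wrong-path probability.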

SLIDE 25

Probabilistic Range Analysis

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) return 0 else return 1
}

WPP = 1.0 (without subdivision)

SLIDE 26

Computing WPP II

input program
  → roundoff analysis: error := 0.2042266
  → probabilistic analysis, with interval subdivision
  → compute intersection (WPP)

SLIDE 27

Computing WPP III

input program
  → roundoff analysis: error := 0.2042266
  → probabilistic analysis, with interval subdivision and reachability check
  → compute intersection (WPP)

SLIDE 28

Probabilistic Range Analysis, with subdivision

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  val res = -x1*x2 - 2*x2*x3 - x1 - x3
  if (res <= 0.0) return 0 else return 1
}

WPP = 0.07060

SLIDE 29

Experimental Results

  • the analysis runs on the order of minutes
  • it computes different WPPs for gaussian and uniform inputs (as expected)
  • the over-approximation is modest (about one order of magnitude):

[figure: WPP of PSI vs. prob. + subdiv on sine, sineOrder3, sqroot, bspline0, bspline1, bspline2, bspline3; y-axis from 4.50e-05 to 1.80e-04]
SLIDE 30

Overview

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

SLIDE 31

Assigning Precision

Double precision is just not enough: 3.5e-13.
Quad satisfies the absolute error bound: 1.5e-28, but is significantly slower than double precision.

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  -D x1 *D x2 -Q (2 *D x2) *Q x3 -Q x1 -Q x3
} ensuring(res => res +/- 1.75e-13)

(*D, -D: double-precision operations; *Q, -Q: quad-precision operations)

SLIDE 32

Assigning Precision

def rigidBody(x1: Real, x2: Real, x3: Real): Real = {
  require(-15.0 ≤ x1 ≤ 15 && -15.0 ≤ x2 ≤ 15.0 && -15.0 ≤ x3 ≤ 15)
  -D x1 *D x2 -Q (2 *D x2) *Q x3 -Q x1 -Q x3
} ensuring(res => res +/- 1.75e-13)

Double precision is just not enough: 3.5e-13.
Quad satisfies the absolute error bound: 1.5e-28, but is significantly slower than double precision.
Mixed precision satisfies the absolute error bound and is 28% faster than uniform quad precision.

Challenge: large, complex search space

SLIDE 33

Mixed-Precision Tuning

Our solution:

  • incomplete search with static error analysis
  • static cost function
SLIDE 34

Mixed-Precision Tuning

Our solution:

  • incomplete search (delta debugging [3]) with static error analysis
  • static cost function

[figure: search over candidate precision assignments (e.g. 128/64 bits per operation), ranked by the cost function]

[3] Precimonious: Tuning assistant for floating-point precision, SC, 2013
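The search itself can be sketched as a greedy loop, a much-simplified stand-in for delta debugging, with a crude error model in place of Daisy's static analysis (the unit roundoffs are standard format parameters; the cost weights are made up for illustration):

```python
UNIT = {"Float": 2.0**-24, "Double": 2.0**-53, "Quad": 2.0**-113}  # unit roundoffs
COST = {"Float": 1, "Double": 2, "Quad": 8}                        # abstract op cost

def error_ok(assignment, bound):
    # stand-in for the static error analysis: model the whole-program error
    # as the largest unit roundoff occurring in the assignment (very crude)
    return max(UNIT[p] for p in assignment) <= bound

def tune(n_ops, bound):
    """Start uniformly at the highest precision and greedily lower
    individual operations while the error bound still holds."""
    assignment = ["Quad"] * n_ops
    for lower in ("Double", "Float"):
        for i in range(n_ops):
            trial = assignment.copy()
            trial[i] = lower
            if error_ok(trial, bound):
                assignment = trial
    return assignment

def static_cost(assignment):
    return sum(COST[p] for p in assignment)
```

For example, tune(4, 1e-15) settles on uniform Double while a bound of 1e-20 keeps everything in Quad; the real search also lowers groups of operations at once, which is where the delta-debugging idea [3] comes in.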

SLIDE 35

Mixed-Precision Tuning

Our solution:

  • incomplete search with static error analysis
  • static cost function

for floating-point arithmetic:

  • benchmarked (best for Float, Double)
  • simple (best for Float, Double, Quad)

for fixed-point arithmetic:

  • area-based
  • machine-learning based (for performance)
SLIDE 36

Rewriting

Goal: find a computation order which

  • incurs the smallest roundoff error (over the input range)
  • is equivalent under real-valued semantics

a + (b + c) ≠ (a + b) + c

Challenge: large, complex search space

Our solution: a genetic (heuristic) algorithm that iteratively evolves a population of expressions:

  • mutate expressions (associativity, distributivity etc. rules)
  • evaluate fitness (static roundoff error)
  • pick expressions from the population (smaller roundoff errors are more likely)

significant (up to 70%) improvements in error are possible
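Why the order matters, in two lines of Python, plus a toy exhaustive search over summation orders standing in for the genetic search (the values are contrived to make cancellation visible):

```python
import math
from itertools import permutations

# associativity does not hold in floating point:
a, b, c = 1e16, -1e16, 1.0
assert (a + b) + c == 1.0
assert a + (b + c) == 0.0   # b + c rounds back to b, and the 1.0 is lost

def sum_error(order):
    """Roundoff of left-to-right summation vs. the correctly rounded sum."""
    s = 0.0
    for v in order:
        s += v
    return abs(s - math.fsum(order))

vals = (1.0, 1e16, -1e16, -0.5)
best = min(permutations(vals), key=sum_error)  # toy stand-in for genetic search
```

Here the naive order loses the 1.0 entirely, while the best order (cancelling the two huge terms first) is exact; a genetic search scales the same idea to larger expressions, using the static roundoff analysis as the fitness function instead of fsum.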

SLIDE 37

Rewriting & Precision Tuning [ICCPS'18]

rewriting (improves error) + mixed precision (improves performance) = improves performance even more

def rigidBody(x1: Quad, x2: Quad, x3: Double): Double = {
  (-D(x1 *Q x2) -D (x1 +Q x3)) -D ((x2 *Q 2.0f) *D x3)
}

satisfies the absolute error bound, 43% faster than uniform quad precision

SLIDE 38

Experimental Results

[figure: relative running time (and average relative running time) for configurations F, F_0.5, F_0.1, F_0.01, D, D_0.5, D_0.1, D_0.01; series: Daisy mixed-only, Daisy rewriting-only, Daisy full, FPTuner, FPTuner rewritten]

Analysis time, FPTuner vs. Daisy:

  benchmark   FPTuner    Daisy
  doppler     12m 48s    5m 4s
  kepler      1h 26m 3s  2m 9s
  rigidBody   4m 45s     36s
  traincar    17m 17s    2m 11s

  • single precision just not enough: average runtime saving 20%
  • double precision just not enough: average runtime saving 60%
  • rewriting is generally helpful

SLIDE 39

Overview

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

SLIDE 40

Elementary Functions

def axisRotationX(x: Real, y: Real, theta: Real): Real = {
  require(-2 ≤ x ≤ 2 && -2 ≤ y ≤ 2 && 0.01 ≤ theta ≤ 1.5)
  x * cos(theta) + y * sin(theta)
} ensuring (res => res +/- 1.49e-6)

  • library functions provide limited choice of precisions
  • fixed-point implementations (for FPGAs) are inefficient

Goal: synthesize polynomial approximations

  • efficient specialized implementation
  • guaranteed end-to-end error bound
  • fixed-point arithmetic implementation

Challenge: distribution of error budget

SLIDE 41

Elementary Function Synthesis [ATVA’19]

High-level Algorithm:

  1. distribute the global error budget
  2. for each elementary function, distribute the local error budget between:
     • the polynomial approximation
     • the fixed-point arithmetic of the approximation
SLIDE 42

Global Error Distribution

Use mixed-precision tuning to assign a precision to each

  • arithmetic operation
  • elementary function call

then transform the precision assigned to function calls into a local error budget.

  • key idea: treat approximation errors as roundoff errors
  • an abstract cost function assigns 2x cost to elementary functions

High-level Algorithm:

  1. distribute the global error budget
  2. for each elementary function, distribute the local error budget between:
     • the polynomial approximation
     • the fixed-point arithmetic of the approximation
SLIDE 43

Local Error Distribution

Feedback loop between the two budgets:

  • start with an equal split, estimate the cost via the cost function
  • try increasing/decreasing the approximation budget

High-level Algorithm:

  1. distribute the global error budget
  2. for each elementary function, distribute the local error budget between:
     • the polynomial approximation
     • the fixed-point arithmetic of the approximation
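The local feedback loop can be sketched as a one-dimensional search over the split. The cost model below is hypothetical: it charges bits proportional to how tight each budget is, with the arithmetic bits weighted double, just to make the trade-off visible.

```python
import math

def split_budget(total, cost, steps=20):
    """Try splits total = approx + rounding and keep the cheapest one."""
    best = None
    for k in range(1, steps):
        approx = total * k / steps       # budget for the polynomial approximation
        rounding = total - approx        # budget for fixed-point roundoff
        c = cost(approx, rounding)
        if best is None or c < best[0]:
            best = (c, approx, rounding)
    return best

# hypothetical cost: a tighter budget needs more polynomial terms / more bits
cost = lambda approx, rounding: (math.ceil(-math.log2(approx))
                                 + 2 * math.ceil(-math.log2(rounding)))
c, approx_budget, rounding_budget = split_budget(1e-6, cost)
```

Under this model the cheapest point is the even split; with a real cost function (polynomial degree from Metalibm, bit widths from the fixed-point analysis) the optimum generally moves away from 50/50, which is exactly what the feedback loop searches for.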
SLIDE 44

Polynomial Approximation

Metalibm [4]: generator for piece-wise polynomial approximations

  • Remez' algorithm for best polynomial approximation
  • equal domain-splitting for piece-wise best approximation
  • efficient double-precision floating-point implementation

[4] Metalibm: A Mathematical Functions Code Generator, ICMS 2014

High-level Algorithm:

  1. distribute the global error budget
  2. for each elementary function, distribute the local error budget between:
     • the polynomial approximation
     • the fixed-point arithmetic of the approximation
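A rough Python stand-in for what Metalibm produces: interpolate sin at Chebyshev nodes (Remez' exchange algorithm would give the slightly better minimax polynomial) and estimate the approximation error by dense sampling. Degree 6 and the domain [0.01, 1.5] mirror the axisRotationX example; both are illustration choices.

```python
import math

def cheb_nodes(a, b, n):
    """n Chebyshev interpolation nodes on [a, b]."""
    return [(a + b)/2 + (b - a)/2 * math.cos((2*k + 1) * math.pi / (2*n))
            for k in range(n)]

def newton_coeffs(xs, ys):
    """Divided-difference coefficients of the interpolating polynomial."""
    c = list(ys)
    for j in range(1, len(xs)):
        for i in range(len(xs) - 1, j - 1, -1):
            c[i] = (c[i] - c[i-1]) / (xs[i] - xs[i-j])
    return c

def poly_eval(c, xs, x):
    """Horner-style evaluation in the Newton basis."""
    r = c[-1]
    for i in range(len(c) - 2, -1, -1):
        r = r * (x - xs[i]) + c[i]
    return r

a, b, degree = 0.01, 1.5, 6
xs = cheb_nodes(a, b, degree + 1)
cs = newton_coeffs(xs, [math.sin(x) for x in xs])

# approximation error, estimated on a dense grid
grid = [a + i * (b - a) / 10_000 for i in range(10_001)]
approx_err = max(abs(poly_eval(cs, xs, x) - math.sin(x)) for x in grid)
```

The sampled error comes out well below 1e-5 for this degree; in the pipeline this approximation error must fit inside the local budget, with the rest reserved for the fixed-point evaluation of the polynomial.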
SLIDE 45

Fixed-point Precision Assignment

[ATVA’19]

  • assign mixed or uniform precision to each polynomial approximation

def sin_0_01to1_5(x: Real): Real = {
  if (x < 1.3021) {
    c0 +(c1 +((c3 +((c4 +((c5 +((c7 +(c8 * x)) * (x*x)))* x))* x)) * (x*x))) * x
  } else {
    xh = x - s1
    b0 + b1 * (b2 + (b4 + (b6 + b7 * xh) * xh) * xh)
  }
}

High-level Algorithm:

  1. distribute the global error budget
  2. for each elementary function, distribute the local error budget between:
     • the polynomial approximation
     • the fixed-point arithmetic of the approximation
SLIDE 46

Experimental Results

[figure: % cycles of the original implementation (0.0 to 1.6) per benchmark: axisRotationX, axisRotationY, forwardk2jX, forwardk2jY, xu1, xu2, rodriguezRot., sinxx1, pendulum1, pendulum2, GaussianNB, SVC; shown for small and for large error bounds]

average: 45%

SLIDE 47

Daisy [TACAS’18]

real-valued specification with transcendental functions
  ↓ Daisy
fixed-point/floating-point implementation with polynomial approximations

Accuracy verification

  • arithmetic
  • conditionals

Optimization

  • finite-precision
  • elementary functions

Next Big Challenge: Scalability

SLIDE 48

SLIDE 49

Experimental Results WPP

[figure: WPP (0.25 to 1) per benchmark: doppler, sine, sineOrder3, sqroot, bspline0, bspline1, bspline2, bspline3, rigidBody1, rigidBody2, turbine1, turbine2, turbine3, traincar1, traincar2, traincar3, traincar4; series: PSI, probabilistic only, prob. + subdiv]
SLIDE 50

Worst-case is Pessimistic

def sine(x: Real): Real = {
  require(-2.0 < x && x < 2.0)
  0.95493 * x - 0.12901*(x*x*x)
}

Not all inputs are equally likely!

Worst-case error bounds can be too pessimistic:

  • not all errors are equally likely
  • applications may tolerate an occasional large error
SLIDE 51

Probabilistic Analysis

def sine(x: Real): Real = {
  require(-2.0 < x && x < 2.0)
  0.95493 * x - 0.12901*(x*x*x)
}

Not all inputs are equally likely!

Alternative error specification: error bound with a probability

  • probabilistic range analysis
  • probabilistic interval subdivision

2.67e-7 with probability 0.85
2.97e-7 with probability 0.85
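The idea behind such probabilistic bounds can be illustrated by sampling: evaluate the polynomial once in double precision and once in exact rational arithmetic, and look at the empirical error distribution (an illustration of the concept only, not the static analysis Daisy performs, and at double rather than the slide's precision).

```python
import random
from fractions import Fraction

def f_exact(x):
    """Real-valued semantics: exact rational evaluation with the
    decimal coefficients from the spec."""
    x = Fraction(x)
    return Fraction('0.95493') * x - Fraction('0.12901') * x**3

def f_double(x):
    return 0.95493 * x - 0.12901 * (x * x * x)

rng = random.Random(1)
errs = sorted(abs(float(Fraction(f_double(x)) - f_exact(x)))
              for x in (rng.uniform(-2.0, 2.0) for _ in range(10_000)))

worst = errs[-1]                     # empirical worst case over the sample
p85 = errs[int(0.85 * len(errs))]    # bound that holds for ~85% of inputs
```

The 85th-percentile bound is tighter than the worst case, which is exactly the shape of specification above: a smaller error bound that holds with probability 0.85 instead of always.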