Toward a Standard Benchmark Format and Suite for Floating-Point Analysis
Nasrine Damouche, Matthieu Martel, Pavel Panchekha, Chen Qiu, Alexander Sanchez-Stern, Zachary Tatlock.
Incredible progress
Automatic verification: Fluctuat, Rosa, FPTaylor
Optimization: STOKE
Improvement: Salsa, Herbie
Manual verification: wave equation, rounding error
Next: ???
Herbie: ulp(NaN, Inf) = UINT_MAX
STOKE: ulp(NaN, Inf) < UINT_MAX
Fluctuat: Poly, Inv, F1a, F1b, idem, …
FPTaylor: sine, sqrt, verhulst, …
Rosa: def example(x: Double): Double = …
Salsa: double example(double x) { … }
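The Herbie and STOKE lines above differ only in how a ULP comparison involving NaN is scored. The Python sketch below shows where such a convention enters a ULP-distance computation for IEEE 754 doubles. It is not taken from either tool: `ordinal`, `ulp_distance`, and the `nan_distance` parameter are hypothetical names chosen for illustration, and the default value is an assumption.

```python
import math
import struct


def ordinal(x: float) -> int:
    """Map a non-NaN double to an integer so that ordering is preserved."""
    (bits,) = struct.unpack("<q", struct.pack("<d", x))
    # Negative doubles have the sign bit set. Reflect them so the mapping is
    # monotonic and both -0.0 and +0.0 land on 0.
    return bits if bits >= 0 else -(bits & 0x7FFFFFFFFFFFFFFF)


def ulp_distance(a: float, b: float, nan_distance: int = 2**64 - 1) -> int:
    """Count the doubles between a and b.

    NaN handling is a convention, not a fact about the inputs. Here
    (an assumption for illustration) NaN vs. NaN scores 0 and NaN vs.
    anything else scores `nan_distance`.
    """
    if math.isnan(a) or math.isnan(b):
        return 0 if (math.isnan(a) and math.isnan(b)) else nan_distance
    return abs(ordinal(a) - ordinal(b))
```

Two tools that pick different values for `nan_distance` will report different numbers for the same benchmark, which is the kind of mismatch a shared definition is meant to remove.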
Compilers, HPC, SAT, SMT, … communities
Rosa, Salsa, FPTaylor, Fluctuat, STOKE, Herbie
Composition, evaluation, standardization
Benchmark suite
Common format
Named measures
Annotated example of the common format (callouts): arguments; S-expression syntax; preconditions; loops; common functions
S-expression syntax; purely functional; no control flow analysis; all C, Fortran functions; loops, conditionals; tools support parts; metadata properties; tool-specific metadata; input or output format
Generate from the higher-level, imperative FPImp language.
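To make the "S-expression syntax" and "simple to implement" points concrete, here is a minimal Python sketch of a reader for a benchmark in that style. The example string follows the FPCore syntax published by the FPBench project as best understood here (an `FPCore` form, an argument list, `:name` and `:pre` properties, then a body); treat those details as assumptions. The helper names `parse`, `split_core`, and `evaluate` are hypothetical, and only a small subset of operators is covered.

```python
import math
import re

# Illustrative benchmark: arguments, metadata properties, a precondition, a body.
EXAMPLE = '(FPCore (x) :name "sqrt difference" :pre (>= x 0) (- (sqrt (+ x 1)) (sqrt x)))'

# A token is a quoted string, a parenthesis, or a run of other characters.
TOKEN = re.compile(r'"[^"]*"|[()]|[^\s()]+')


def parse(text):
    """Parse one S-expression into nested lists of string tokens."""
    tokens = TOKEN.findall(text)
    pos = 0

    def read():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if tok == "(":
            items = []
            while tokens[pos] != ")":
                items.append(read())
            pos += 1  # consume the closing parenthesis
            return items
        return tok

    return read()


def split_core(core):
    """Split a parsed core into (arguments, properties, body)."""
    assert core[0] == "FPCore"
    args, rest = core[1], core[2:]
    props = {}
    # Properties are ":key value" pairs that precede the body expression.
    while len(rest) > 1 and isinstance(rest[0], str) and rest[0].startswith(":"):
        props[rest[0][1:]] = rest[1]
        rest = rest[2:]
    return args, props, rest[0]


# Only the operators used by EXAMPLE; a real tool would support many more.
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "sqrt": math.sqrt,
    ">=": lambda a, b: a >= b,
}


def evaluate(expr, env):
    """Evaluate an expression in double precision under variable bindings."""
    if isinstance(expr, list):
        return OPS[expr[0]](*(evaluate(e, env) for e in expr[1:]))
    return env[expr] if expr in env else float(expr)


args, props, body = split_core(parse(EXAMPLE))
if evaluate(props["pre"], {"x": 3.0}):
    print(args, evaluate(body, {"x": 3.0}))
```

Because the format is purely functional, the evaluator needs no control flow analysis: the body is a single expression tree, and the precondition is just another expression over the same arguments.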
Benchmark suite
Common format: simple to implement; covers all existing uses; simple to extend, specialize
Named measures
Existing programs: FPTaylor 29, Herbie 28, Rosa 6, Salsa 9
Rich features: Arith 72, Expt 16, Trig 11, Loop 12, Branch 3
Diverse domains: Textbook 59, Math Alg 6, Emb Sys 4, Sci Comp 3
Benchmark suite: from existing projects; covers many domains; grows over time
Common format: simple to implement; covers all existing uses; simple to extend, specialize
Named measures
Scaling vs. non-scaling
Forward vs. backward
Maximum vs. average
Sound vs. statistical
Improvement
Absolute, relative, ULPs, bits, …
Fixed input error vs. fixed output error
Formal guarantees vs. mathematical accuracy
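The distinctions listed above change the number a tool reports for the same program. As a rough Python sketch, the code below measures one expression four ways: absolute vs. relative error, each summarized as a maximum and as an average over sampled inputs. The function names, the sample inputs, and the use of 50-digit `decimal` arithmetic as the reference are all assumptions made for illustration, not measures defined by the benchmark suite.

```python
import math
from decimal import Decimal, getcontext

getcontext().prec = 50  # reference precision, chosen arbitrarily for this sketch


def reference(x: float) -> Decimal:
    """High-precision value of sqrt(x + 1) - sqrt(x)."""
    d = Decimal(x)  # exact conversion of the double
    return (d + 1).sqrt() - d.sqrt()


def naive(x: float) -> float:
    """Direct double-precision evaluation; cancels badly for large x."""
    return math.sqrt(x + 1) - math.sqrt(x)


def measures(f, inputs):
    """Return max and average of absolute and relative error over inputs."""
    abs_errs, rel_errs = [], []
    for x in inputs:
        exact = reference(x)
        err = abs(Decimal(f(x)) - exact)
        abs_errs.append(float(err))
        rel_errs.append(float(err / exact))
    return {
        "max_absolute": max(abs_errs),
        "avg_absolute": sum(abs_errs) / len(abs_errs),
        "max_relative": max(rel_errs),
        "avg_relative": sum(rel_errs) / len(rel_errs),
    }


report = measures(naive, [1.0, 1e8, 1e16])
for name, value in report.items():
    print(f"{name}: {value:.3e}")
```

At `x = 1e16` the double-precision sum `x + 1` rounds back to `x`, so the naive result is 0 and the relative error is 1 even though the absolute error is tiny. A tool reporting maximum relative error and one reporting average absolute error would describe this program very differently, which is why the measures need names.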
Benchmark suite: from existing projects; covers many domains; grows over time
Common format: simple to implement; covers all existing uses; simple to extend, specialize
Named measures: terms for measuring error; standard across tools; flexible but rigorous
Benchmark suite
Common format
Named measures