

SLIDE 1

11th International Satisfiability Modulo Theories Competition SMT-COMP 2016

Sylvain Conchon, David Déharbe, Matthias Heizmann, Tjark Weber

SLIDE 2

The Numbers

◮ 17 teams participated
◮ Solvers:

  Main track:        25 (+ 2 non-competitive)
  Application track:  8 (+ 3 non-competitive)
  Unsat-core track:   1 (+ 4 non-competitive)

◮ Logics:

  Main track:        40
  Application track: 14
  Unsat-core track:  40
  Unknown track:     26

◮ Benchmarks:

  Main track:        154,424
  Application track:   9,856
  Unsat-core track:   93,241
  Unknown track:      29,724

Record numbers of solvers and benchmarks!

SLIDE 3

Job Pairs

◮ 1,562,544 job pairs executed (+ some repeats)

[Bar chart: total job pairs executed in SMT-COMP 2014, 2015, and 2016; scale 300,000 to 1,500,000]

SLIDE 4

Job Pairs by Track

  Main track:        64.2 %
  Unknown track:     22.0 %
  Unsat-core track:  11.7 %
  Application track:  2.1 %

SLIDE 5

StarExec

◮ All job pairs executed on StarExec
◮ Timeout: 40 minutes (unknown track: 10 minutes)
◮ ∼ 12 days × 100 nodes × 2 processors/node of compute time

StarExec worked even better than last year

◮ Thanks to Aaron Stump for prompt help when problems or questions arose
◮ Only very few (and minor) bug reports submitted to the StarExec developers

SLIDE 6

Machine Specifications

Hardware:

◮ Intel Xeon CPU E5-2609 @ 2.4 GHz, 10 MB cache
◮ 2 processors per node, 4 cores per processor
◮ Main memory capped at 60 GB per job pair

Software (upgraded in 2016):

◮ Red Hat Enterprise Linux Server release 7.2
◮ Kernel 3.10.0-327, gcc 4.8.5, glibc 2.17
◮ Virtual machine image available before the competition

SLIDE 7

Benchmarks and Logics

◮ Number of benchmarks in SMT-LIB almost unchanged since 2015
◮ Very few new benchmarks
◮ Some non-conforming benchmarks were removed
◮ No new logics
◮ Thanks to Clark Barrett for curation and uploading

SLIDE 8

Eligible Benchmarks

[Bar charts: eligible benchmarks for the main track (scale up to 200,000) and the application track (scale up to 10,000), with benchmarks excluded for unknown status or partial operations]

All eligible benchmarks were used for the competition. There was no further selection.

SLIDE 9

Important Rule Changes

◮ SMT-LIB 2.5 instead of 2.0

◮ SMT-LIB not fully migrated yet
◮ Fortunately, largely backwards-compatible

◮ Size-based weighting of benchmark families within divisions: a family F is weighted 1 + log_e |F|. Small benchmark families are more important than before.

◮ Unsat-core track reinstated
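The family weighting above can be sketched in a few lines. This is an illustration only: the per-benchmark split (family weight shared equally among its members) is an assumed aggregation, not necessarily how the official scoring scripts combine results.

```python
import math

def family_weight(size):
    """Weight of a benchmark family F under the 2016 rule: 1 + log_e |F|."""
    return 1.0 + math.log(size)

def per_benchmark_weight(family_size):
    # Assumed aggregation: the family weight is split evenly over its members.
    return family_weight(family_size) / family_size

# A singleton family keeps weight 1.0, while each benchmark in a
# 1000-benchmark family contributes far less to the division score:
# per_benchmark_weight(1) = 1.0, per_benchmark_weight(1000) ≈ 0.0079
```

The effect is the one stated on the slide: a division can no longer be dominated by one huge benchmark family.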

SLIDE 10

Competition Tools Improved

◮ New unsat-core track tools (scrambler and post-processor)
◮ New scrambling algorithm that makes it harder to identify the original benchmark (cf. yesterday’s talk)
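To give the flavor of what scrambling does, here is a toy sketch: consistent symbol renaming plus assertion shuffling. The real SMT-COMP scrambler works on full SMT-LIB syntax and does considerably more; everything below (token lists, the keyword set, the "v" prefix) is a simplification for illustration.

```python
import random

def scramble(assertions, seed):
    """Toy scrambler sketch (NOT the SMT-COMP tool): consistently rename
    user symbols and shuffle assertion order, so a benchmark can no longer
    be recognized by its symbol names or assertion layout.
    Each assertion is given as a list of tokens."""
    rng = random.Random(seed)
    keywords = {"assert", "and", "or", "not", "=", "<", ">", "+", "-", "*"}
    symbols = sorted({t for a in assertions for t in a
                      if t not in keywords and not t.isdigit()})
    shuffled = rng.sample(symbols, len(symbols))
    rename = {s: "v" + str(i) for i, s in enumerate(shuffled)}
    out = [[rename.get(t, t) for t in a] for a in assertions]
    rng.shuffle(out)
    return out

# e.g. scramble([["assert", ">", "x", "y"], ["assert", "=", "y", "0"]], seed=7)
# renames x and y to v0/v1 (in some order) and may swap the two assertions.
```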
SLIDE 11

Solvers

SLIDE 12

Semi-Deciding QF_NIA with AProVE via Bit-Blasting
Giesl, Aschermann, Brockschmidt, Emmes, Frohn, Fuhs, Hensel, Otto, Plücker, Schneider-Kamp, Ströder, Swiderski, Thiemann

◮ AProVE is primarily a (non-)termination and complexity bounds prover, but also provides an SMT-LIB 2 front-end for QF_NIA
◮ Uses bit-blasting for binary arithmetic; back-end: MiniSat
◮ Fixed bit-length for unknowns; bit-lengths for constants, sums, products etc. as needed
◮ Details on the SAT encoding: [Fuhs, Giesl, Middeldorp, Schneider-Kamp, Thiemann, Zankl, SAT ’07]
◮ Back-end for proof techniques for termination and complexity bounds; search space & time-out fixed in a “tactics” approach for SMT-COMP

Strategy:
  start with a small search space
  if MiniSat says satisfiable: return with model
  else: retry with a larger search space until satisfiable (or out of resources)
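The retry strategy above can be sketched as a simple loop. Here `solve_with_bits` is a hypothetical stand-in for the bit-blasting plus MiniSat call; AProVE's actual implementation and bit-width schedule differ.

```python
def semi_decide_qf_nia(formula, solve_with_bits, max_bits=32):
    """Iterative bit-width enlargement: encode integer unknowns with a
    fixed bit-length, bit-blast, and hand the result to a SAT solver.
    `solve_with_bits(formula, bits)` is a hypothetical helper that
    returns a model (dict) if the encoding is satisfiable, else None."""
    bits = 2                      # start with a small search space
    while bits <= max_bits:
        model = solve_with_bits(formula, bits)
        if model is not None:
            return model          # satisfiable: return with model
        bits *= 2                 # retry with a larger search space
    return None                   # out of resources: result unknown
```

Because a model found at any bit-width is a genuine model of the nonlinear formula, this procedure is sound for SAT but can only semi-decide QF_NIA, matching the slide's title.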
SLIDE 13

OpenSMT2

◮ OpenSMT2 is an MIT-licensed SMT solver written in C++, developed at Università della Svizzera italiana, Switzerland
◮ By Antti, Leo & Matteo
◮ Check it out from http://verify.inf.usi.ch/opensmt
◮ Version 2 has been under development since 2012
◮ Currently supports QF_UF and QF_LRA
◮ Labeled interpolation on Boolean, QF_UF and QF_LRA with proof compression
◮ Multicore and cluster/cloud based parallelization
◮ Provides C and Python API through a library
◮ Support for incrementality
◮ Compact size (55,000 LoC)
◮ Compact representation and efficient memory management for the data types
◮ An object-oriented design which (hopefully) makes the development of theory support easier

SLIDE 14

raSAT – an SMT Solver for Polynomial Constraints

◮ raSAT: ICP + Testing + Intermediate Value Theorem (IVT)
  ◮ ICP (Interval Constraint Propagation) = Interval Arithmetic + Constraint Propagation + Box Decomposition
  ◮ Testing to boost SAT detection of inequalities
  ◮ Generalized IVT for (non-constructive) SAT detection of equalities
◮ Sound, but incomplete
  ◮ Outward rounding (ICP); confirmation by iRRAM (testing)

Vu Xuan Tung, Mizuhito Ogawa @ JAIST, To Van Khanh @ VNU-UET

Download: http://www.jaist.ac.jp/~s1310007/raSAT/, or google “raSAT SMT”
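The interval-arithmetic core of ICP can be illustrated with a minimal sketch. This uses plain endpoint arithmetic, without the outward rounding raSAT performs to stay sound under floating-point error.

```python
def interval_add(a, b):
    # [a] + [b]: add lower endpoints and upper endpoints.
    return (a[0] + b[0], a[1] + b[1])

def interval_mul(a, b):
    # [a] * [b]: min/max over the four endpoint products.
    p = [a[0] * b[0], a[0] * b[1], a[1] * b[0], a[1] * b[1]]
    return (min(p), max(p))

# Evaluate x*y + x over the box x in [1, 2], y in [-1, 3]:
x, y = (1.0, 2.0), (-1.0, 3.0)
r = interval_add(interval_mul(x, y), x)  # (-1.0, 8.0)
# A constraint x*y + x > 8 is refuted for the whole box at once;
# when the interval straddles the bound, raSAT decomposes the box
# and uses testing to search for a satisfying sample point.
```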

SLIDE 15


http://www.veriT-solver.org
Haniel Barbosa, David Déharbe and Pascal Fontaine

Loria, INRIA, Université de Lorraine (France), ClearSy and UFRN (Brazil)

What is new:

◮ cleaning, efficiency improvements, e.g. UF (space for improvement)
◮ (much) improved quantifier handling
◮ other w.i.p.: (N|L)RA (Redlog), quantifier handling, proofs

Goals:

◮ clean, small SMT for UF(N|L)IRA with quantifiers and proofs
◮ for verification platforms B, TLA+

SLIDE 16

Selected Results

SLIDE 17

Results: QF_BV (Main Track)

Solver            Error Score   Solved Score (Parallel)   Unsolved
Boolector (pre)         0.000                 24473.995        149
Boolector               0.000                 24468.395        150
Minkeyrink              0.000                 24434.194        193
smt-cms-mt              0.000                 24244.599        216
smt-cms-st              0.000                 24165.007        214
CVC4                    0.000                 23820.707        231
Z3                      0.000                 23732.215        304
smt-cms-exp             0.000                 23640.669        270
ABC glucose             0.000                 23078.931        477
Yices2                  0.000                 22687.777        638
MathSat5                0.000                 22496.779        544
MapleSTP-mt             0.000                 22487.264        395
MapleSTP                0.000                 21764.885        450
smt-minisat-st          0.000                 20582.614       1058
ABC default             0.000                 18528.788       1354
Q3B                   719.723                 10397.757       4430

SLIDE 18

Results: Competition-Wide Scoring (Main Track)

Rank   Solver                             Score (sequential)   Score (parallel)
       Z3                                             185.09             185.09
1      CVC4                                           180.95             181.19
2      Yices                                          119.29             119.29
3      veriT                                           75.11              75.11
5      Vampire parallel (best newcomer)                65.36              65.62

SLIDE 19

Results: Application Track (Summary)

Logic       Order
ANIA        Z3; CVC4
QF_ANIA     Z3; CVC4
QF_ALIA     Z3; SMTInterpol; Yices2; MathSat5; CVC4
QF_UFNIA    Z3; CVC4
LIA         Z3; CVC4
ALIA        Z3; CVC4
QF_UFLRA    Z3; Yices2; SMTInterpol; CVC4; MathSat5
UFLRA       Z3; CVC4
QF_UFLIA    Z3; CVC4; Yices2; SMTInterpol; MathSat5
QF_NIA      CVC4; Z3
QF_BV       MathSat5; Yices2; smt-cms-st; smt-cms-mt; smt-cms-exp; CVC4; MapleSTP; MapleSTP-mt; smt-minisat-st; Z3
QF_LRA      MathSat5; SMTInterpol; Z3; Yices2; CVC4
QF_LIA      Yices2; Z3; SMTInterpol; MathSat5; CVC4
QF_AUFLIA   Yices2; Z3; SMTInterpol; MathSat5; CVC4

SLIDE 20

Selected Results: Unsat-Core Track

Solver        Errors   Reductions
SMTInterpol              1166535
toysmt                     35886
veriT             26       68811
MathSat5         190     1527159
Z3             17079     4597883

◮ 182,367 job pairs
◮ In total, 83,450 (45.8%) unsat cores generated
◮ ... but also 17,097 (9.4%) wrong sat answers
◮ Each unsat core was checked with three solvers (CVC4, MathSat5 and Z3). 198 cores (2.4%) were found satisfiable by at least one solver.

SLIDE 21

Selected Results: Unknown Track

Most benchmarks solved:

Solver       Benchmarks solved   Benchmarks attempted
Yices2                   18593                  20473
Minkeyrink               16724                  17504
CVC4                     16646                  29509

In total, 21,542 benchmarks (72.5%) were solved. However, there were disagreements on 79 benchmarks!

SLIDE 22

Further Thoughts

Benchmarks:

◮ Still more benchmarks needed, especially for small divisions
◮ Resolve semantics of partial operations, e.g., bvdiv, fp.min
◮ Benchmark curation deserves better tool support

Competition:

◮ Benchmark weights: good or bad?
◮ Integration of benchmarks with unknown status?
◮ Trophies? (T-shirts? Dinner? Funding?!)

Teams:

◮ Congratulations on your accomplishments!
◮ Thanks for your participation!