

slide-1
SLIDE 1

Hammering towards Qed

Cezary Kaliszyk Josef Urban

University of Innsbruck Radboud University

July 18, 2014

1 / 21

slide-2
SLIDE 2

Outline

Automation for Interactive Proof: Translations, Evaluation, Machine Learning, Reconstruction
Towards Qed: Strength, Logics, Knowledge



slide-4
SLIDE 4

Interactive proofs

■ Formal proof skeleton + filling in the gaps

■ Searching for needed theorems
■ Tedious properties

■ Proof structure is lost

■ Uninteresting parts overshadow interesting ones

■ Automation for Interactive Proof

■ Tableaux: Itaut, Tauto, Blast
■ Rewriting: Simp, Subst, HORewrite
■ Decision Procedures: Congruence Closure, Ring, Omega, Cooper

■ Large-theory ATP and translation techniques

■ Mizar: MaLARea
■ Isabelle/HOL: Sledgehammer
■ HOL(y)Hammer

slide-5
SLIDE 5

MizAR demo

https://www.youtube.com/watch?v=4es4iJKtM3I



slide-9
SLIDE 9

AI-ATP systems (⋆-Hammers)

[Diagram: the Proof Assistant passes the current goal to the ⋆Hammer, which produces a first-order problem for the ATP; the resulting ATP proof is translated back into an ITP proof.]

How much can it do?

■ Flyspeck (including core HOL Light and Multivariate)
■ Mizar / MML
■ Isabelle (Auth, Jinja)

≈ 45%


slide-10
SLIDE 10

Translation Overview

■ Various exports to FOF

■ MESON-style monomorphisation
■ TFF-style type tagging
■ Isabelle-style type guards

■ Export to TFF1

■ Additional provers (Alt-Ergo)
■ Tools that monomorphise TPTP (Why3, tptp2X)

■ Export to THF0

■ Satallax, Leo-II, ...
■ Monomorphisation makes the problems big and slow

■ SMT solvers

■ Reconstruction

■ Export to other ITPs

■ Rarely better
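The tag and guard type encodings listed above can be sketched in a few lines. This is an illustrative mock-up, not the actual HOL(y)Hammer output: the symbols `t` and `has_type` and the TPTP-ish string syntax are assumptions for the example.

```python
# Sketch of two classic FOF type encodings. With "tags", the type travels
# with every occurrence of a term: t(tau, term). With "guards", terms stay
# bare and quantifiers are relativized by a has_type predicate.
# The symbols `t` and `has_type` are illustrative.

def tag(term: str, ty: str) -> str:
    return f"t({ty}, {term})"

def guard_forall(var: str, ty: str, body: str) -> str:
    return f"![{var}]: (has_type({ty}, {var}) => {body})"

# Encoding `!x:num. x = x` both ways:
tagged = f"![X]: ({tag('X', 'num')} = {tag('X', 'num')})"
guarded = guard_forall("X", "num", "X = X")
print(tagged)   # ![X]: (t(num, X) = t(num, X))
print(guarded)  # ![X]: (has_type(num, X) => X = X)
```

Tags keep the translation complete at every term position but bloat the problem; guards only pay at quantifiers, which is one reason the two encodings trade off speed against completeness.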

slide-11
SLIDE 11

Translation overview (HOL)

1 Heuristic type instantiation

■ Similar for induction

2 Eliminate ε
3 Remove λ-abstractions

■ lifting, combinators, ...

4 Optimizations

■ if..then..else, ∃!

5 Separate predicates and terms

■ Consider cases, introduce bool variables

6 NNF, Skolemize
7 Use apply functor to make all applications first-order
8 Encode remaining types

■ monomorphisation, tags, guards

9 Various optimizations (incomplete)
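Step 7 (the apply functor) can be illustrated with a minimal sketch; the tuple representation of terms here is an assumption made for the example, not the translator's actual data structure.

```python
# Sketch of the apply functor: thread every application through a binary
# `app` symbol, so the curried higher-order application `f x y` becomes
# the first-order term app(app(f, x), y), and `f` itself can be passed
# around as an ordinary argument.

def to_first_order(term):
    """term: a string (constant/variable) or a pair (function, argument)."""
    if isinstance(term, str):
        return term
    fn, arg = term
    return f"app({to_first_order(fn)}, {to_first_order(arg)})"

fo = to_first_order((("f", "x"), "y"))   # `f x y` in curried form
print(fo)  # app(app(f, x), y)
```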



slide-13
SLIDE 13

Re-proving (Flyspeck, 30sec)

Prover      Theorem%   CounterSat%   SOTAC
E-par       38.4       0.0           69.12
Z3-4        36.1       0.0           61.51
E           32.6       0.0           45.44
Leo-II      31.0       0.0           45.77
Vampire     30.5       0.0           45.75
CVC3        28.9       0.0           43.36
Satallax    26.9       0.0           48.75
Yices1      25.3       0.0           33.32
iProver     24.5       0.6           29.50
Prover9     24.3       0.0           29.98
SPASS       22.9       0.0           26.22
leanCoP     21.4       0.0           26.98
Alt-Ergo    19.8       0.0           26.82
Paradox-4   0.0        18.2          0.06
any         50.2

slide-14
SLIDE 14

Machine learning techniques

Algorithms

■ Syntactic methods

■ Neighbours using various metrics, Recursive (MePo)

■ Sparse Naive Bayes

■ Variable prior, Confidence

■ k-Nearest Neighbours

■ TF-IDF, Dependency weighting

■ Neural Networks

■ Winnow, Perceptron

■ Linear Regression

■ Needs feature and theorem space reduction

Combining original and ATP dependencies

■ Added value depends on the precision of human deps
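The k-NN variant with dependency weighting mentioned above can be sketched as follows: neighbours of the goal vote both for themselves and for the premises used in their own proofs. Feature overlap stands in for the TF-IDF-weighted similarity of the real systems, and all names, counts, and weights are illustrative.

```python
from collections import defaultdict

# Sketch of k-NN premise selection with dependency weighting.
# facts: {fact name: its feature set}
# deps:  {fact name: premises used in its proof}

def knn_premises(goal_feats, facts, deps, k=2):
    neighbours = sorted(facts, key=lambda f: -len(goal_feats & facts[f]))
    scores = defaultdict(float)
    for f in neighbours[:k]:
        w = len(goal_feats & facts[f])
        scores[f] += w                  # the neighbour itself is a candidate
        for d in deps.get(f, []):
            scores[d] += w / 2          # ... and so are its dependencies
    return sorted(scores, key=lambda p: -scores[p])

facts = {"add_comm": {"plus", "num"}, "mul_comm": {"times", "num"}}
deps = {"add_comm": ["add_assoc"]}
ranked = knn_premises({"plus", "num"}, facts, deps, k=1)
print(ranked)  # ['add_comm', 'add_assoc']
```

Note that `add_assoc` is recommended even though it shares no features with the goal; that is exactly the added value of folding proof dependencies into the vote.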


slide-15
SLIDE 15

Features for Machine Learning

■ A function that given a goal or premise returns a sparse vector

■ Optional weights for kinds of features
■ Internal TF-IDF

■ Types and type variables
■ Constants
■ Subterms / Patterns

■ No variable normalization
■ De Bruijn indices
■ Types of variables
■ Normalization of type variables

■ Meta information: theory name, kind of rule, contains ∃, ...
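The extraction described above can be sketched as a walk over a term tree that emits every (normalized) subterm as a feature. Variables are collapsed to a single symbol "V", one of the normalization options on the slide; the term representation is made up for the example.

```python
# Sketch of feature extraction: every subterm of the goal becomes a
# feature, with variables normalized away.

def show(term):
    if term[0] == "var":
        return "V"                     # variable normalization
    if term[0] == "const":
        return term[1]
    return f"{show(term[1])}({show(term[2])})"

def features(term, acc=None):
    """term: ('const', name) | ('var', name) | ('app', fun, arg)."""
    if acc is None:
        acc = set()
    acc.add(show(term))                # every subterm is a feature
    if term[0] == "app":
        features(term[1], acc)
        features(term[2], acc)
    return acc

goal = ("app", ("const", "even"), ("var", "x"))
feats = sorted(features(goal))
print(feats)  # ['V', 'even', 'even(V)']
```

The resulting feature set is what the sparse learners below consume, typically reweighted by TF-IDF so that rare features count more.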


slide-16
SLIDE 16

Naive Bayes

■ Each predictor

■ Given a vector of features of a goal g and a set of facts
■ Returns the predicted relevance of each fact f

■ Assume independence between the features

P(f is relevant for proving g)
  = P(f is relevant | g's features)
  = P(f is relevant | f1, ..., fn)
  ∝ P(f is relevant) · ∏_{i=1..n} P(fi | f is relevant)
  ∝ #(f is a proof dependency) · ∏_{i=1..n} #(fi appears when f is a proof dependency) / #(f is a proof dependency)

■ Efficient

■ Fast predictions
■ Fast updates
■ Small models
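The count-based ranking above can be sketched directly in log space. All counts, the example fact names, and the smoothing constant for unseen features are illustrative, not taken from the actual evaluation data.

```python
import math

# Sketch of sparse naive Bayes ranking.
# dep_count[f] = how often f was a proof dependency
# cooc[f][x]   = how often feature x occurred when f was a dependency

def nb_score(goal_feats, dep_count, cooc, fact):
    score = math.log(dep_count[fact])          # prior: #(f is a dependency)
    for x in goal_feats:
        c = cooc[fact].get(x, 0)
        # P(x | f relevant) ~ c / dep_count[fact], smoothed when unseen
        score += math.log(c / dep_count[fact]) if c else math.log(0.02)
    return score

dep_count = {"add_comm": 10, "mul_comm": 4}
cooc = {"add_comm": {"plus": 9, "num": 10}, "mul_comm": {"num": 4}}
goal = {"plus", "num"}
ranked = sorted(dep_count, key=lambda f: -nb_score(goal, dep_count, cooc, f))
print(ranked)  # ['add_comm', 'mul_comm']
```

Because the model is just a table of counts, updating it with a newly proved theorem is a handful of increments, which is why predictions and updates are fast and the models stay small.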


slide-18
SLIDE 18

Success Rates

[Plot: success rate (%) against number of selected facts (16–1024), comparing Naive Bayes with the syntactic (MePo) filter.]


slide-19
SLIDE 19

Proof Reconstruction

■ Existing reconstruction mechanisms

■ Metis, SMT
■ Mizar: by
■ MESON, Prover9

■ Parse TSTP/SMT proofs

■ Create subgoals that match ATP intermediate steps
■ Automatically solve all simple ones

■ High reconstruction rates give confidence in our techniques

■ Naive reconstruction: 90% (of Flyspeck solved)
■ MESON, SIMP, ?_ARITH_TAC
■ With TSTP parsing: 96%
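The TSTP-parsing step can be sketched very roughly: pull the inferred formulas out of an ATP derivation so each can be restated as an intermediate subgoal and discharged by a simple tactic. Real TSTP parsing handles full first-order syntax; this mock-up only grabs single-line `fof` steps with role `plain`, and the derivation itself is made up.

```python
import re

# Heavily simplified sketch of extracting intermediate steps from a
# TSTP derivation. Each inferred formula becomes a candidate subgoal.

TSTP = """
fof(c1, axiom, p(a)).
fof(c2, plain, q(a), inference(resolution, [], [c1, ax2])).
fof(c3, plain, $false, inference(cn, [], [c2])).
"""

def subgoals(tstp):
    return [(m.group(1), m.group(2))
            for m in re.finditer(r"fof\((\w+),\s*plain,\s*(.+?),\s*inference", tstp)]

steps = subgoals(TSTP)
print(steps)  # [('c2', 'q(a)'), ('c3', '$false')]
```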

slide-20
SLIDE 20

Outline

Automation for Interactive Proof: Translations, Evaluation, Machine Learning, Reconstruction
Towards Qed: Strength, Logics, Knowledge



slide-22
SLIDE 22

Improve Percentage

■ Is 100% possible?

■ Granularity of steps also increases

■ Premise selection

■ Good machine learning algorithms are still slow

■ Encodings

■ Efficient yet complete

■ ATP-systems

■ Strategies and combinations

■ Reconstruction

■ Formalized decision procedures

slide-23
SLIDE 23

ITP logics

■ MizAR

■ Set theory, dependent types, (almost) first order

■ Sledgehammer, HOL(y)Hammer, ...

■ HOL, shallow polymorphism

■ ACL2

■ Structure Irrelevance, Logic as lists

■ Isabelle/ZF, ...

■ All features of meta-logic necessary

■ Coq

■ Good machine learning, but encodings are hard

slide-24
SLIDE 24

Sharing parts among systems

■ Machine Learning Predictors

■ Already many shared

■ Feature extraction

■ Given common data format

■ Certain Transformations

■ λ-lifting, combinators, apply functor
■ Monomorphisation, heuristic instantiation
■ Type encodings (tags, guards, soft types, ...)

■ Knowledge management

■ Namespaces, Browsing, Search, Refactoring, Change management

■ Readable proof reconstruction


slide-25
SLIDE 25

Common Functionality

■ TPTP hierarchy: FOF, TFF1, THF0, ?
■ THF1 already used

■ Sledgehammer & HOL(y)Hammer
■ HOL4

■ Type-classes

■ Property of a universally quantified type
■ Already in some Isabelle/HOL version of THF1

com_ring : $tType > $o

■ Dependent types and intersection types

■ Already in MPTP

![X : int, K : matrix(X)]: ...
![X : t1 & t2]: ...

■ Universes

![X : int]: $type(X) : $tType

■ General Π- and Σ-types

![W : ![X]: X = X]: ...

■ ...


slide-26
SLIDE 26

Matching concepts across libraries

■ Same concepts in different proof assistants

■ Problem for proof translation
■ Manually found 7–70 pairs

■ Same properties

■ Patterns like associativity, distributivity, ...
■ The same algebraic structures may still differ

■ Automatically finds 400 pairs of identical concepts

■ In HOL Light, HOL4, Isabelle/HOL ■ Coq: so far only lists analyzed

■ Could proof advice be universal?
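Matching by shared properties can be sketched by abstracting the concrete constant names out of a statement, so that, for example, associativity of `plus` in one library and of `add` in another normalize to the same pattern. The statements and constant names below are illustrative, not drawn from the actual libraries.

```python
# Sketch of property-based concept matching: replace each library's
# constants with canonical slots C0, C1, ... and compare the patterns.

def normalize(stmt: str, constants):
    for i, c in enumerate(sorted(constants, key=len, reverse=True)):
        stmt = stmt.replace(c, f"C{i}")
    return stmt

hol_light = normalize("plus(plus(x,y),z) = plus(x,plus(y,z))", ["plus"])
isabelle  = normalize("add(add(x,y),z) = add(x,add(y,z))", ["add"])
print(hol_light)              # C0(C0(x,y),z) = C0(x,C0(y,z))
print(hol_light == isabelle)  # True
```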


slide-27
SLIDE 27

Conclusion and Future work

■ Hammer-systems

■ Until recently unappreciated by developers
■ A large number of top-level proofs found automatically
■ Try it!

■ Interoperation between HOL Light, HOL4 and Isabelle/HOL

■ Cross-Prover Advice Service

■ More logics, ITPs, and ATPs; greater effectiveness


slide-28
SLIDE 28

HOL(y) Hammer

Machine learning based premise selection for HOL Light

http://cl-informatik.uibk.ac.at/software/hh/


slide-29
SLIDE 29

References

  • C. Kaliszyk and J. Urban.

MizAR 40 for Mizar 40. CoRR, abs/1310.2805, 2013.

  • C. Kaliszyk and J. Urban.

PRocH: Proof reconstruction for HOL Light. In M. P. Bonacina, editor, CADE, volume 7898 of Lecture Notes in Computer Science, pages 267–274. Springer, 2013.

  • C. Kaliszyk and J. Urban.

HOL(y)Hammer: Online ATP service for HOL Light. Mathematics in Computer Science, 2014. http://dx.doi.org/10.1007/s11786-014-0182-0.

  • C. Kaliszyk and J. Urban.

Learning-assisted automated reasoning with Flyspeck. Journal of Automated Reasoning, 2014. http://dx.doi.org/10.1007/s10817-014-9303-3.

  • D. Kühlwein, J. C. Blanchette, C. Kaliszyk, and J. Urban.

MaSh: Machine learning for Sledgehammer. In S. Blazy, C. Paulin-Mohring, and D. Pichardie, editors, Proc. of the 4th International Conference on Interactive Theorem Proving (ITP’13), volume 7998 of LNCS, pages 35–50. Springer, 2013.

  • C. Tankink, C. Kaliszyk, J. Urban, and H. Geuvers.

Formal mathematics on display: A wiki for Flyspeck. In J. Carette, D. Aspinall, C. Lange, P. Sojka, and W. Windsteiger, editors, MKM/Calculemus/DML, volume 7961 of Lecture Notes in Computer Science, pages 152–167. Springer, 2013.