Designing Games of Theorems Who am I? What do I like? automating - PowerPoint PPT Presentation

Designing Games of Theorems Who am I? What do I like? automating proof search in expressive logic (HOL) using heuristics / ML What did I develop? PSL/PaMpeR for Isabelle/HOL

Proof Strategy Language (PSL) for Isabelle/HOL meta-tool tactics approach quickcheck sledgehammer programming language runtime tactic extensible generation (Eisbach) PSL efficient proof generation extensive parallel native Isabelle proof search search proof script low memory t no code clutter!! usage easy installation s o m a l

try_hard: the default strategy strategy Try_Hard = Ors [Thens [Subgoal, Basic], Thens [DInductTac, Auto_Solve], strategy Basic = Thens [DCaseTac, Auto_Solve], Ors [ Thens [Subgoal, Advanced], Auto_Solve, Thens [DCaseTac, Solve_Many], Blast_Solve, Thens [DInductTac, Solve_Many] ] FF_Solve, Thens [IntroClasses, Auto_Solve], Thens [Transfer, Auto_Solve], Thens [Normalization, IsSolved], Thens [DInduct, Auto_Solve], Thens [Hammer, IsSolved], Thens [DCases, Auto_Solve], Thens [DCoinduction, Auto_Solve], Thens [Auto, RepeatN(Hammer), IsSolved], Thens [DAuto, IsSolved]]

try_hard vs sledgehammer The percentage of automatically proved obligations out of 1526 proof obligations (timeout = 300s) 100% 73% 75% 16% 57% 20% 50% 25% DEMO! 0% try_hard sledgehammer

try_hard: the default strategy strategy Try_Hard = Ors [Thens [Subgoal, Basic], Thens [DInductTac, Auto_Solve], strategy Basic = Thens [DCaseTac, Auto_Solve], Ors [ Thens [Subgoal, Advanced], Auto_Solve, Thens [DCaseTac, Solve_Many], Blast_Solve, Thens [DInductTac, Solve_Many] ] FF_Solve, Thens [IntroClasses, Auto_Solve], Thens [Transfer, Auto_Solve], Thens [Normalization, IsSolved], Thens [DInduct, Auto_Solve], Thens [Hammer, IsSolved], Thens [DCases, Auto_Solve], Thens [DCoinduction, Auto_Solve], Thens [Auto, RepeatN(Hammer), IsSolved], Thens [DAuto, IsSolved]]

database full feature extractor preparation phase large proof corpora :: ( tactic_name, [ bool ] ) preprocess PaMpeR decision tree construction fast feature extractor recommendation phase feature vector DEMO! proof ? state lookup proof engineer proof method recommendation

from try_hard to try_smart try_smart ? PSL & try_hard: more computation PaMpeR: get smart using heuristics

abstract big data ML algorithm notion https://googleblog.blogspot.jp/2012/06/using-large-scale-brain-simulations-for.html https://duckduckgo.com/?q=cat&t= ff ab&iar=images&iax=images&ia=images

many abstract ML algorithm proofs proof Higher-Order functions concise formula that can cover type class lots of concrete cases polymorphism dependent types small data set for each problem universal quantifier lambda abstraction different proof for general case

Large Proof Corpora? Transfer learning? (?) ? The seL4 proofs in Isabelle http://www.gilith.com/opentheory/ Goldbach’s conjecture https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg “Every even integer greater than 2 can be expressed as the sum of two primes.” The Kepler “conjecture” in HOL Light http://annals.math.princeton.edu/wp-content/uploads/annals-v162-n3-p01.pdf Four color theorem in Coq https://en.wikipedia.org/wiki/File:Four_Colour_Map_Example.svg different logics in different proof corpora different provers about different problems

poor proof automation artificial intelligence for expressive logics for theorem proving! only small dataset available really? We need big data! because of expressiveness

Really? Self-play? http://cl-informatik.uibk.ac.at/teaching/ss18/mltp/02.pdf I want to train my prover using self-play so that it can prove Goldbach’s conjecture. But how? Proof search is not a 2-player game. The one that finds a proof of Goldbach’s conjecture first is the winner. If one prover finds a proof, that’s it. It is only 1 iteration. But how do you train provers, so that one prover can eventually find a proof. For each iteration, I create a set of not-so-difficult conjectures. The one that proves more conjectures is the winner. But how do you create not-so-difficult conjectures? random? But randomly created conjectures are not always good training data. Conjectures with difficult proofs are important ones. Not really. You need a mechanism to create many conjectures that are relevant to Goldbach’s conjecture. I can produce conjectures by mutating Goldbach’s conjecture. That might work for a small number of conjectures. Not for many conjectures. How? The more conjectures you create, the more valuable they should be.

Really? Self-play? http://cl-informatik.uibk.ac.at/teaching/ss18/mltp/02.pdf I want to train my prover using self-play so that it can prove Goldbach’s conjecture. Research hypothesis: But how? Proof search is not a 2-player game. subgoals proved during heuristic The one that finds a proof of Goldbach’s conjecture first is the winner. (incomplete) proof search are If one prover finds a proof, that’s it. It is only 1 iteration. useful to train provers. But how do you train provers, so that one prover can eventually find a proof. For each iteration, I create a set of not-so-difficult conjectures. The one that proves more conjectures is the winner. Game of Theorems! But how do you create not-so-difficult conjectures? random? But randomly created conjectures are not always good training data. Conjectures with difficult proofs are important ones. Not really. You need a mechanism to create many The more iterations it goes through, conjectures that are relevant to Goldbach’s conjecture. the higher the quality of problems should be! I can produce conjectures by mutating Goldbach’s conjecture. That might work for small number of conjectures. Not for many conjectures. How? The more conjectures you create, the more valuable they should be.

Game of Theorems 1 proved subgoals search tree big conjecture Coq https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg Coq vs 70% di ff erent problems Parrot vs Crow for di ff erent prover? Parrot Crow https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg 80% 60%

Game of Theorems 2 proved subgoals search tree big conjecture https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg only one prover can survive? Owl vs Crow https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg 70% 80%

Game of Theorems 3 https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg 70% Coq vs Parrot vs Crow What if ? > >> 60% 80%

Game of Theorems 4 https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg using VS > > > using VS > using VS No ordering, no casualty. <

Game of Theorems 4 future work? https://en.wikipedia.org/wiki/File:Goldbach_partitions_of_the_even_integers_from_4_to_50_rev4b.svg academic Research hypothesis: conference? subgoals proved during heuristic (incomplete) proof search are using VS > > useful to train provers. > using VS > using VS No ordering, no casualty. <

Thanks,

Designing Games of Theorems Who am I? What do I like? automating - PowerPoint PPT Presentation

Designing Games of Theorems Who am I? What do I like? automating proof search in expressive logic (HOL) using heuristics / ML What did I develop? PSL/PaMpeR for Isabelle/HOL Proof Strategy Language (PSL) for Isabelle/HOL meta-tool tactics

Designing Training and Assessment Games guided by the Evaluation Framework Ivan Boo Chairperson,

L. von Ahn and L. Dabbish . Designing games with a purpose. CACM, 2008 E. Law and L. von Ahn.

Uniform Folk Theorems in Repeated Anonymous Random Matching Games Joyee Deb 1 az 2 ome Renault 3

Video Games: Designing and Coding Oliver Bonham-Carter and Janyl Jumadinova Dept. of Computer

Immersive Game Design A look at three pilot programs focusing on designing digital video games

Principles of Engagement designing games for maximum player engagement Chuck Clanton 24 May

Designing Rhythm Games for Touchscreen Devices Project Progress Report Members: Philip H. Peng

ECE700.07: Game Theory with Engineering Applications Le Lecture 6: Re Repeated Games Seyed

Many Theorems and a Few Stories John Garnett Seoul 5/12/17 1 OUTLINE I. Extension Theorems.

a language for designing board games Lauren Pully (Project Manager) Jesse Bentert (Tools &

Hierarchy theorems Evgenij Thorstensen V18 Evgenij Thorstensen Hierarchy theorems V18 1 / 18

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

A Machine-Assisted Proof of Gdel's Incompleteness Theorems Lawrence C. Paulson, Computer

Sylow Theorems Andrew Clarey Looking at the Structure of Arbitrary Groups Definitions/ Theorems

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Designing for differences Dan Smith 2 Designing for Differences - Goals Be familiar with

UPDATED TELE-WORK GUIDANCE FOR VR UNIT ADAPTED FROM PRESENTATION BY AMANDA GERSON COORDINATOR

A First Step Towards Leveraging Commodity Trusted Execu;on

Learning a SAT Solver from Single-Bit Supervision Daniel Selsam 1 Matthew Lamm 1 unz 1 Benedikt

Office of Research Services (ORS) 1 VPR ORS Post- Pre-Award IGS Award 2 Pr e - Awar d

Murrelet under the Oregon ESA Photo: Martin Raphael, USFS Kevin Blakely, Wildlife Division Deputy

**,n = + r) (i * i,t o,n .{-,he (-r)'"i; (r.J) ^lr,B (r,b j F) = I tl .n (p) n' (+:z) /,

SAVE HELVETIA Exceptions to Metro Ordinance No. 11-1255 and Washington County Ordinance No. 740

Message Security Mahmoud Yehia, Riham AlTawy and T. Aaron Gulliver Department of Electrical and

Designing Games of Theorems Who am I? What do I like? automating - PowerPoint PPT Presentation

Designing Games of Theorems Who am I? What do I like? automating proof search in expressive logic (HOL) using heuristics / ML What did I develop? PSL/PaMpeR for Isabelle/HOL Proof Strategy Language (PSL) for Isabelle/HOL meta-tool tactics

Designing Training and Assessment Games guided by the Evaluation Framework Ivan Boo Chairperson,

L. von Ahn and L. Dabbish . Designing games with a purpose. CACM, 2008 E. Law and L. von Ahn.

Uniform Folk Theorems in Repeated Anonymous Random Matching Games Joyee Deb 1 az 2 ome Renault 3

Video Games: Designing and Coding Oliver Bonham-Carter and Janyl Jumadinova Dept. of Computer

Immersive Game Design A look at three pilot programs focusing on designing digital video games

Principles of Engagement designing games for maximum player engagement Chuck Clanton 24 May

Designing Rhythm Games for Touchscreen Devices Project Progress Report Members: Philip H. Peng

ECE700.07: Game Theory with Engineering Applications Le Lecture 6: Re Repeated Games Seyed

Many Theorems and a Few Stories John Garnett Seoul 5/12/17 1 OUTLINE I. Extension Theorems.

a language for designing board games Lauren Pully (Project Manager) Jesse Bentert (Tools &amp;

Hierarchy theorems Evgenij Thorstensen V18 Evgenij Thorstensen Hierarchy theorems V18 1 / 18

CSC2556 Lecture 11 Noncooperative Games 2: Zero-Sum Games, Stackelberg Games CSC2556 - Nisarg

A Machine-Assisted Proof of Gdel's Incompleteness Theorems Lawrence C. Paulson, Computer

Sylow Theorems Andrew Clarey Looking at the Structure of Arbitrary Groups Definitions/ Theorems

Digital Games An Introduction What are Digital Games? Commonly referred to as video games

Designing for differences Dan Smith 2 Designing for Differences - Goals Be familiar with

UPDATED TELE-WORK GUIDANCE FOR VR UNIT ADAPTED FROM PRESENTATION BY AMANDA GERSON COORDINATOR

A First Step Towards Leveraging Commodity Trusted Execu;on

Learning a SAT Solver from Single-Bit Supervision Daniel Selsam 1 Matthew Lamm 1 unz 1 Benedikt

Office of Research Services (ORS) 1 VPR ORS Post- Pre-Award IGS Award 2 Pr e - Awar d

Murrelet under the Oregon ESA Photo: Martin Raphael, USFS Kevin Blakely, Wildlife Division Deputy

**,n = + r) (i * i,t o,n .{-,he (-r)'&quot;i; (r.J) ^lr,B (r,b j F) = I tl .n (p) n' (+:z) /,

SAVE HELVETIA Exceptions to Metro Ordinance No. 11-1255 and Washington County Ordinance No. 740

Message Security Mahmoud Yehia, Riham AlTawy and T. Aaron Gulliver Department of Electrical and

a language for designing board games Lauren Pully (Project Manager) Jesse Bentert (Tools &

**,n = + r) (i * i,t o,n .{-,he (-r)'"i; (r.J) ^lr,B (r,b j F) = I tl .n (p) n' (+:z) /,