Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. - PowerPoint PPT Presentation

Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. Lamprecht Course Program Semantics and Verfication 2020, Utrecht University September 23, 2020

Lecture Notes “Automated Reasoning” by Gerard A.W. Vreeswijk. Available for download on the course website. My slides are largely based on them.

In This Course • Propositional theorem proving (last Monday), Chapter 2 of the lecture notes • First-order theorem proving (today), Chapter 3 of the lecture notes • Clause sets and resolution (next Monday), Chapters 4 and 5 of the lecture notes • Satisfiability checkers, SAT/SMT (next Wednesday), Chapter 6 of the lecture notes, additional material

Recap: Propositional Theorem Proving • The nature of theorem proving. • Searching for counterexamples, refutation trees, semantic tableaux. • Turning refutation trees into proofs. • NP-completeness of propositional theorem proving.

Recap FOL This lecture assumes familiarity with the syntax and semantics of first-order logics. In particular: - well-formed formulas, - interpretation of constants, - scope of a quantifier, function symbols and predicate symbols, - free and bound variables, - interpretation of well-formed formulas, - closed well-formed formulas (sentences), - variable assignments, - fair substitutions, - first-order models, - first-order domains, - first-order countermodels (Recap was homework.)

Recap FOL • Predicate logic : first-order logic without further restrictions on the semantics of the of the formulas • Completeness of predicate logic has been proven. • Church’s thesis on computability, connecting algorithms and symbol-manipulating mechanisms. • Undecidability of predicate logic has been proven. • Semi-decidability of predicate logic : There exist algorithms that prove precisely all formulas that are valid in the predicate logic. (Basis of ATP!) • Incompleteness of arithmetic : Any first-order logic that is expressive as arithmetic cannot be axiomatized.

Recap FOL • “first order”: quantifiers range over variables • “second order”: quantifiers can also range over predicate variables • ... • Higher-order logics are more difficult to manage. • Most higher-order theories can be translated into first-order theories. • FOL suffices for the expression of most mathematical theories. • Dealing with FOL is difficult enough. • Most efforts of ATP research in this area.

Reduction Rules for FOL (Additional) refutation rules that describe how quantified formulas can be made true or false: Reduction rules for building a refutation tree If ( ∀ x ) φ , then φ [ t / x ] for all If ( ∀ x ) φ is false, then φ [ c / x ] terms t . is false for some constant c . If ( ∃ x ) φ , If ( ∃ x ) φ is false, then φ [ t / x ] then φ [ c / x ] for some constant c . is false for all terms t .

FOL Proofs: Idea The idea behind making a universal statement ( ∀ x )( P ) true is that we make all instances true, one at a time: ( ∀ x )( P ) ≡ ( ∀ x )( P ) ∧ P [ a / x ] (spawn formula with t = a ) ≡ ( ∀ x )( P ) ∧ P [ a / x ] ∧ P [ b / x ] (spawn with t = b ) ≡ ( ∀ x )( P ) ∧ P [ a / x ] ∧ P [ b / x ] ∧ P [ f ( a ) / x ] (spawn with t = f ( a ) ) ≡ ( ∀ x )( P ) ∧ P [ a / x ] ∧ P [ b / x ] ∧ P [ f ( a ) / x ] ∧ P [ f ( b ) / x ] (spawn with t = f ( b ) ) . . .

Herbrand Domain • ( ∀ x ) p on the LHS may generate a potentially infinite number of different terms. • A ground term is a term without variables. • The Herbrand domain of a formula is the set of all possible ground terms that can be made with constants and function symbols that occur in the formula. • If a term has no constants, then a fresh constant c 0 is used to prevent the Herbrand domain from being empty. • Generalizable to sets of formulas and terms.

Herbrand Domain (Examples) Set of formulas Constants and Herbrand domain and terms function symbols { ( ∀ x )( p x , a ) } { a } { a } { ( ∀ x )( pf ( x ) , a ) } { a , f } { a , f ( a ) , f ( f ( a )) , . . . } { ( ∀ x )( pf ( x )) } { f } { c 0 , f ( c 0 ) , f ( f ( c 0 )) , . . . } { ( ∀ x )( pg ( x , y )) } { g } { c 0 , g ( c 0 , c 0 ) , g ( g ( c 0 , c 0 ) , c 0 ) , . . . } { ( ∀ x )( pg ( x , y )) , qf ( c 3 ) } { f , g , c 3 } { c 3 , f ( c 3 ), g ( c 3 , c 3 ), g ( f ( c 3 ) , c 3 ), g ( c 3 , f ( c 3 )), f ( g ( c 3 , c 3 )), g ( g ( c 3 , c 3 ) , c 3 ), . . . }

Exercise Determine the Herbrand domain of the following formulas. 1 Px , a 2 ( ∀ x )( Px ⊃ Qf ( x ) , a ) 3 Rg ( f ( x ) , y ) , z

Solution 1 D ( Px , a ) = { a } 2 D (( ∀ x ) Px ⊃ Qf ( x ) , a ) = { f n ( a ) | n ≥ 0 } = { a , f ( a ) , f ( f ( a )) , . . . } 3 D ( Rg ( f ( x ) , y ) , z ) is the set H such that c 0 ∈ H , f ( t ) ∈ H if t ∈ H , and g ( t 1 , t 2 ) ∈ H if { t 1 , t 2 } ⊆ H . I.e., H = { c 0 , f ( c 0 ) , g ( c 0 , c 0 ) , f ( c 0 ) , f 2 ( c 0 ) , f ( g ( c 0 , c 0 )) , . . . .

Analytic Refutation Rules for FOL Previous rules, plus:

Gentzen System for FOL Previous rules, plus:

FOL Reduction and Complexity • Reduction rules of propositional logic reduce the complexity of the formula (sub-formula property). • “left- ∀ ” and “right- ∃ ” lack this property, they do not reduce the complexity of the formula they operate on. • In fact, reductions may go on forever and branches may grow indefinitely, which is inherent to the undecidability of predicate logic. • However, never-ending scenarios are a worst-case scenario. • In many cases, it is possible to guess with substitutions must be made to steer the refutation to an end.

Cases In the following we will look at FOL theorem proving with: • No functions and no equality • Functions and no equality • Functions and equality

No Functions and No Equality • If a sentence contains no function and no equality symbols, its Herbrand domain is a finite but non-empty set of constants. • Set may grow, but does not do so excessively. • It is no problem to substitute all variables and constants that have been encountered in the refutation so far.

Example: ( ∀ x )( p x ) ⊢ pa • Only applicable rule: left- ∀ • Herbrand domain: { a } , thus t = a

Example: p ⊢ ( ∀ x )( qx ) Only applicable rule: right- ∀ , with a fresh constant c 1 : LHS ∩ RHS = ∅ , so that we have found a counterexample model M with domain D = { 1 } , such that c 1 and all other constants are mapped to 1, and Predicate Extension p true q ∅

Example: ( ∀ x )( p x ) ⊢ ( ∃ x )( p x ) • Two reductions possible: “left- ∀ ,” and “right- ∃ ”. • For both, need to choose a term from the Herbrand domain. • No such term, since the Herbrand domain of is empty. • Use an arbitrary constant c 0 to kick off the refutation:

Example: ( ∃ x )( p x ) ⊢ ( ∀ x )( p x ) Refutation of the converse direction: Counterexample model M with: Predicate Extension { c 1 } p and domain D = { 1 , 2 } , such that c 1 and c 2 are interpreted as 1 and 2. Then M | = p ( c 1 ) but M � p ( c 2 ).

Many-on-One Variants of left- ∀ and right- ∃ Instead of using “left- ∀ ” and “right- ∃ ,” use: • “ left + - ∀ ”, meaning one or more applications of “left- ∀ ”, • “ right + - ∃ ”, meaning one or more applications of “right- ∃ ”. Do not enable reductions that would otherwise be impossible, but can reduce the size of the refutation trees.

Example

Sound- and Completeness • Soundness: if a sequent is falsifiable, then all refutation trees for that sequent have at least one branch that cannot be closed. • Completeness: if a sequent is valid (i.e., not falsifiable), then every refutation tree closes. • Sound- and completeness: a sequent is valid if and only if all refutations close. • Proof sketch in the lecture notes.

Functions (No Equality) • Function symbols complicate theorem proving, because it is possible to produce many terms with the help of only a few function symbols: HerbrandDomain ( pf ( a )) = { a , f ( a ) , f ( f ( a )) , f ( f ( f ( a ))) , . . . } • All terms thus generated could, in principle, be used by left- ∀ or right- ∃ as long as at least one branch remains open.

Example • Situation: two constants a and b , and a one-place function symbol f . • The formula ( ∀ x )( p x ) may be “unfolded” as follows: ( ∀ x )( p x ) ≡ ( ∀ x )( p x ) ≡ ( ∀ x )( p x ) ∧ pa ≡ ( ∀ x )( p x ) ∧ pb ∧ pa ≡ ( ∀ x )( p x ) ∧ pf ( a ) ∧ pb ∧ pa ≡ ( ∀ x )( p x ) ∧ pf ( b ) ∧ pf ( a ) ∧ pb ∧ pa ≡ ( ∀ x )( p x ) ∧ pf ( f ( a )) ∧ pf ( b ) ∧ pf ( a ) ∧ pb ∧ pa ≡ ( ∀ x )( p x ) ∧ . . . ∧ pf ( f ( a )) ∧ pf ( b ) ∧ pf ( a ) ∧ pb ∧ pa

Number of Generated Formulas • Problem: left + - ∀ or right + - ∃ do not reduce the formula they operate on. • Candidate terms in a general first-order language: c 1 , x 1 , f 1 1 ( c 1 ) , f 1 1 ( x 1 ) , c 2 , x 2 , f 1 1 ( c 2 ) , f 1 1 ( x 2 ) , f 1 2 ( c 1 ) , f 1 2 ( x 1 ) , . . . • Countably infinite, but it will take a while to encounter the right terms to close a refutation tree (if at all possible). • But: counterexamples need only be constructed from the Herbrand domain of the formula!

Example: ( ∀ x )( p x ⊃ pf ( x )); pa ⊢ pf ( f ( a )) • Herbrand domain (infinite): { a , f ( a ) , f ( f ( a )) , . . . } • Impossible to substitute all terms at once. • Take care when applying left + - ∀ or right + - ∃

Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. - PowerPoint PPT Presentation

Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. Lamprecht Course Program Semantics and Verfication 2020, Utrecht University September 23, 2020 Lecture Notes Automated Reasoning by Gerard A.W. Vreeswijk. Available for

Instantiation-Based Automated Theorem Proving for First-Order Logic Konstantin Korovin The

Automated Theorem Proving 1/4: Introduction and Propositional Theorem Proving A.L. Lamprecht

Visual theorem proving with the Incredible Proof Machine The idea Theorem Proving without

Artificial Intelligence in Theorem Proving Cezary Kaliszyk VTSA Overview Last Lecture theorem

Automated Theorem Proving in First-Order and Higher-Order Logic Christoph Benzm uller

First Order Logic: Resolution-based Automated Theorem Proving Valentin Goranko DTU Informatics

Herbrands Revenge SAT Solving for First-Order Theorem Proving Stephan Schulz

Combining Automated and Interactive Theorem Proving in Agda Anton Setzer (Joint work with Karim

Automated Theorem Proving in Real Applications John Harrison Intel Corporation The cost of

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Theorem-Proving Environments Nathan Ng CSC2547: Learning to Search Theorem Proving What is a

On Theorem Proving for Program Checking Historical perspective and recent developments Maria

Formal Verification Methods 4: Theorem Proving John Harrison Intel Corporation Need for

Symbolic Computation and Theorem Proving in Program Analysis Laura Kov acs Chalmers

Saturation-based Theorem Proving and ML Course Machine Learning and Reasoning 2020 MLR 2020 1 1

Large Scale Deep Learning for Theorem Proving in HOList: First Results and Future Directions

Mathematical Structure of Computation What is computation? Great Ideas in Computing

Primitive Recursive Bars Are Inductive Jon Sterling Carnegie Mellon University August 17, 2016

(In)Efficiency and Reasonable Cost Models Beniamino Accattoli INRIA & Ecole Polytechnique

Human effective computability and Absolute Horsten University of Undecidability Bristol

http://lcs.ios.ac.cn/znj/DM2017 Naijun Zhan April 5, 2017 1 Special thanks to Profs Hanpin Wang

INF4130 Algoritmer: Design og effektivitet 30th August 2018 Petter Kristiansen Praktisk

Lecture Slides for MAT-60556 Part IX: Recursion theory Henri Hansen November 7, 2013 1

Speedup phenomena in subrecursive settings Andrea Asperti DISI, University of Bologna Mura Anteo

Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. - PowerPoint PPT Presentation

Automated Theorem Proving 2/4: First-Order Theorem Proving A.L. Lamprecht Course Program Semantics and Verfication 2020, Utrecht University September 23, 2020 Lecture Notes Automated Reasoning by Gerard A.W. Vreeswijk. Available for

Instantiation-Based Automated Theorem Proving for First-Order Logic Konstantin Korovin The

Automated Theorem Proving 1/4: Introduction and Propositional Theorem Proving A.L. Lamprecht

Visual theorem proving with the Incredible Proof Machine The idea Theorem Proving without

Artificial Intelligence in Theorem Proving Cezary Kaliszyk VTSA Overview Last Lecture theorem

Automated Theorem Proving in First-Order and Higher-Order Logic Christoph Benzm uller

First Order Logic: Resolution-based Automated Theorem Proving Valentin Goranko DTU Informatics

Herbrands Revenge SAT Solving for First-Order Theorem Proving Stephan Schulz

Combining Automated and Interactive Theorem Proving in Agda Anton Setzer (Joint work with Karim

Automated Theorem Proving in Real Applications John Harrison Intel Corporation The cost of

Automated Reasoning: Some Successes and New Challenges Predrag Jani ci c

Theorem-Proving Environments Nathan Ng CSC2547: Learning to Search Theorem Proving What is a

On Theorem Proving for Program Checking Historical perspective and recent developments Maria

Formal Verification Methods 4: Theorem Proving John Harrison Intel Corporation Need for

Symbolic Computation and Theorem Proving in Program Analysis Laura Kov acs Chalmers

Saturation-based Theorem Proving and ML Course Machine Learning and Reasoning 2020 MLR 2020 1 1

Large Scale Deep Learning for Theorem Proving in HOList: First Results and Future Directions

Mathematical Structure of Computation What is computation? Great Ideas in Computing

Primitive Recursive Bars Are Inductive Jon Sterling Carnegie Mellon University August 17, 2016

(In)Efficiency and Reasonable Cost Models Beniamino Accattoli INRIA &amp; Ecole Polytechnique

Human effective computability and Absolute Horsten University of Undecidability Bristol

http://lcs.ios.ac.cn/znj/DM2017 Naijun Zhan April 5, 2017 1 Special thanks to Profs Hanpin Wang

INF4130 Algoritmer: Design og effektivitet 30th August 2018 Petter Kristiansen Praktisk

Lecture Slides for MAT-60556 Part IX: Recursion theory Henri Hansen November 7, 2013 1

Speedup phenomena in subrecursive settings Andrea Asperti DISI, University of Bologna Mura Anteo

(In)Efficiency and Reasonable Cost Models Beniamino Accattoli INRIA & Ecole Polytechnique