HOList: An Environment for Machine Learning of Higher-Order Theorem - - PowerPoint PPT Presentation

holist an environment for machine learning of higher
SMART_READER_LITE
LIVE PREVIEW

HOList: An Environment for Machine Learning of Higher-Order Theorem - - PowerPoint PPT Presentation

HOList: An Environment for Machine Learning of Higher-Order Theorem Proving Kshitij Bansal, Christian Szegedy Can we create a human level AI to reason about mathematics? Can we create a human level AI to reason about mathematics? Without


slide-1
SLIDE 1

HOList: An Environment for Machine Learning of Higher-Order Theorem Proving

Kshitij Bansal, Christian Szegedy

slide-2
SLIDE 2

Can we create a human level AI to reason about mathematics?

slide-3
SLIDE 3

Without relying on informal human mathematics

  • No need for autoformalization (requires high

level of natural language understanding)

  • Need to formalize the notion of

“interestingness”.

  • User needs to learn an “alien” language just

to communicate a theorem to it

  • Can’t communicate its discoveries
  • May be hard to bootstrap (little training data)

Relying on informal human mathematics

  • Needs auto-formalization
  • Requires no formalization on user side
  • Could learn the human notion of

“interestingness”.

  • Lot of training data to bootstrap from

Can we create a human level AI to reason about mathematics?

slide-4
SLIDE 4

Vision of joint proving and auto-formalization

Proof Assistant Formal Reasoning Agent (Neural) Language Model

Formal Corpus Informal Corpus

slide-5
SLIDE 5

Which Proof Assistant?

  • Coq
  • Lean
  • Isabelle
  • HOL4
  • HOL Light
  • Mizar
slide-6
SLIDE 6

Which Proof Assistant?

  • Coq
  • Lean
  • Isabelle
  • HOL4
  • HOL Light
  • Mizar
slide-7
SLIDE 7

Proof Assistant Formal Reasoning Agent (Neural) Language Model

Formal Corpus Informal Corpus

Vision of joint proving and auto-formalization

slide-8
SLIDE 8

Formal Corpus

  • Theorems
  • Proofs: tree of (goal,

tactic) to (subgoals) Trained model predicting tactic applications.

AITP'18

slide-9
SLIDE 9

Formal Corpus

  • Theorems
  • Proofs: tree of (goal,

tactic) to (subgoals) Trained model predicting tactic applications.

Proof Assistant Formal Reasoning Agent

slide-10
SLIDE 10

Formal Corpus

  • Theorems

(Benchmark) Training data, model, trained checkpoints.

Proof Assistant Formal Reasoning Agent

APIs for ML researchers and theorem prover developers.

HOList

An Environment for Machine Learning of Higher-Order Theorem Proving Later: Initial experiments, results, discussion.

slide-11
SLIDE 11

(Proof) Assistant Proof Search APIs for Theorem Prover Developers and ML Researchers

Machine Learning

One goal/subgoal to prove One proof step: Tactic application, relevant premises Subgoals

  • r *proved*

One goal/subgoal to prove Ranking of tactics and premises

Formal Reasoning Agent

slide-12
SLIDE 12

ApplyTactic

Apply a tactic to a goal, potentially generating new subgoals.

  • Request:

○ Goal ○ Tactic

  • Response: one of

○ Subgoals ○ Error

RegisterTheorem

Register a new theorem for use as premise in later proofs.

  • Request:

○ Theorem

  • Response: one of

○ TheoremFingerprint ○ Error

Proof Assistant Service

slide-13
SLIDE 13
  • Apply a tactic to any goal at any time.
  • Controlled by any algorithm, e.g. neural algorithms.
  • Automated merging of identical goals.
  • On the fly tracking of:

○ Goals that are closed ○ Subgoals that can’t help closing the main goal

  • Collects statistics (e.g. running time, error codes).
  • Serialized as ProofLog.

Proof Search Tree API

slide-14
SLIDE 14

Proof Search Tree

slide-15
SLIDE 15

Proof Search Tree

slide-16
SLIDE 16
  • Our prover: simple BFS Prover built on this tree API, with limits on branching.

○ max_top_suggestions (default: 20) ○ max_successful_branches (default: 2) ○ max_explored_nodes (default: 100) ○ max_theorem_parameters (we used: 16)

  • Built on Tree API, easy to extend for more interesting proof search.

Proof Search

slide-17
SLIDE 17

(Proof) Assistant Proof Search APIs for Theorem Prover Developers and ML Researchers

Machine Learning

One goal/subgoal to prove One proof step: Tactic application, relevant premises Subgoals

  • r *proved*

One goal/subgoal to prove Ranking of tactics and premises

Formal Reasoning Agent

slide-18
SLIDE 18
  • Predictions API integrating with the proof search.

○ (Goal, Tactic ID) -> Score ○ (Goal, Premise) -> Score

  • Our models, experiments: more in the next talk.

Machine Learning

slide-19
SLIDE 19

Assistant Proof Search

Machine Learning

APIs for Theorem Prover Developers and ML Researchers

RegisterTheorem ApplyTactic Given:

  • Current goal

Score:

  • Tactic applied
  • Premises used

HOL-Light

  • Manages the state of the

proof search tree.

  • Allows arbitrary nodes to

be explored.

slide-20
SLIDE 20

Making available to researchers

Benchmark Theorem Database

Theorems Definitions Core required for creating in-built tactics 2,320 240 Complex separated into training, validation, testing 16,623 396 FlySpeck for evaluating generalization 10,519 1,563

slide-21
SLIDE 21

Making available to researchers

Data

  • Proof Logs:

Synthetic proofs

Human proofs

  • Proof Logs as TF Examples

Features:

Goal (string)

Labels:

Tactic applied (int)

Premises used (string)

Model

  • Checkpoints of two-tower

architecture from imitation learning and reinforcement learning.

  • Sample training code.
slide-22
SLIDE 22

Making available to researchers

Code HOL Light (with our modifications) http://github.com/ brain-research/hol-light DeepHOL prover http://github.com/ tensorflow/deepmath Docker images HOL Light (server) gcr.io/deepmath/hol-light DeepHOL prover gcr.io/deepmath/deephol

slide-23
SLIDE 23

http://deephol.org

Code is on GitHub. Training data, checkpoints, docker images also being made available.