Neural Program Synthesis
Rishabh Singh, Google Brain


SLIDE 1

Neural Program Synthesis

Rishabh Singh, Google Brain

SLIDE 2

Great Collaborators!

SLIDES 3-5

Deep Learning and Evolutionary Progression

Vision, Speech, Language, Programming

Perceptual Tasks → Algorithmic Tasks

SLIDES 6-9

Neural Program Learning

  • More Complex Tasks
  • Generalizability
  • Interpretability

SLIDES 10-13

Human Programmers

Spec: I/O Examples, Natural Language, Partial Programs

Logic, Basics, Experience, Samples

SLIDE 14

Neural Programmers

Spec: I/O Examples, Natural Language, Partial Programs

Logic, Basics, Experience, Samples

SLIDES 15-19

Some Properties of Neural Programmers

  • Limited search
  • Learn from few examples/tests
  • Make mistakes
  • Improve over time

SLIDES 20-24

Long-term Vision

Agent to win programming contests

  • Program Representations
  • Program Repair [ICSE'18, ICLR'18]
  • Fuzzing/Security Testing [ASE'17]
  • Program Optimization
slide-25
SLIDE 25

Neural Program Induction

slide-26
SLIDE 26

Differentiable Neural Computer [Graves et al. Nature 2016]

slide-27
SLIDE 27

Neural RAM [Kurach et al. ICLR 2016]

An LSTM Controller choosing modules and arguments

slide-28
SLIDE 28

Neural RAM [Kurach et al. ICLR 2016]

An LSTM Controller choosing modules and arguments

14 modules

slide-29
SLIDE 29

Neural RAM [Kurach et al. ICLR 2016]

An LSTM Controller choosing modules and arguments

14 modules

Differentiable Semantics

slide-30
SLIDE 30

Neural RAM [Kurach et al. ICLR 2016]

An LSTM Controller choosing modules and arguments

14 modules

Differentiable Semantics

SLIDES 31-33

Neural Program Induction
  • Differentiable memory, stack
  • Lots of examples
  • Single-task learning
  • Non-interpretable programs
  • Examples: NTM, DNC, etc.
  • Difficult to generalize

Neural Program Synthesis
  • Functional abstractions
  • Lots of examples
  • Single-task learning
  • Interpretable programs
  • Example: QuickSort
  • Generalizes better

Meta-Neural Program Synthesis
  • Functional abstractions
  • Few examples
  • Multi-task learning
  • Interpretable programs
  • Strong generalization

SLIDE 34

Neuro-Symbolic Program Synthesis [ICLR 2017]

Emilio Parisotto, Abdelrahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

SLIDES 35-36

FlashFill in Excel 2013

Gulwani, Harris, Singh [CACM Research Highlight 2012]

SLIDE 37

FlashFill DSL

SLIDES 38-41

Example FlashFill Task

Input (v)               Output
William Henry Charles   Charles, W.
Larry Page              Page, L.
Sergey Brin             Brin, S.
Martha D. Saunders      Saunders, M.

Concat(f1, ConstStr(", "), f2, ConstStr("."))
  f1 = SubStr(v, (Word, -1, Start), (Word, -1, End))
  f2 = SubStr(v, CPos(0), CPos(1))
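The synthesized program above has executable semantics. As a minimal sketch, its three operators can be interpreted in Python; the `substr_word` helper is a simplification of the DSL's (Word, k, Start/End) position logic, not FlashFill's actual implementation:

```python
import re

def substr_word(v, k):
    """Return the k-th word of v (negative k counts from the end)."""
    words = re.findall(r"[A-Za-z]+", v)
    return words[k]

def flashfill_program(v):
    # Concat(f1, ConstStr(", "), f2, ConstStr("."))
    f1 = substr_word(v, -1)   # last word: (Word, -1, Start)..(Word, -1, End)
    f2 = v[0:1]               # first character: CPos(0)..CPos(1)
    return f1 + ", " + f2 + "."

for inp, out in [("William Henry Charles", "Charles, W."),
                 ("Larry Page", "Page, L."),
                 ("Sergey Brin", "Brin, S."),
                 ("Martha D. Saunders", "Saunders, M.")]:
    assert flashfill_program(inp) == out
```

Running the loop confirms the single program is consistent with all four example rows, which is exactly the synthesis target.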

SLIDES 42-45

General Methodology

DSL → Sampler (training data) → Neural Model → Synthesizer

3 Key Properties: Syntax, Semantics, Executable

Rishabh Singh, Pushmeet Kohli. Artificial Programming. SNAPL 2017
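The sampler stage can be sketched on a toy arithmetic DSL (a hypothetical stand-in for the FlashFill DSL): sample a random program from the grammar, execute it on inputs, and emit (I/O examples, program) pairs as supervised training data:

```python
import random

# Toy grammar: e -> x | 1 | 0 | e + e  (illustrative stand-in DSL).
def sample_program(depth=2, rng=random):
    """Sample a random program from the grammar."""
    if depth == 0 or rng.random() < 0.5:
        return rng.choice(["x", "1", "0"])
    return ("+", sample_program(depth - 1, rng), sample_program(depth - 1, rng))

def run(prog, x):
    """Execute a sampled program on input x."""
    if prog == "x":
        return x
    if prog in ("0", "1"):
        return int(prog)
    return run(prog[1], x) + run(prog[2], x)

def make_example(rng=random):
    """One supervised pair: I/O examples plus the program that produced them."""
    prog = sample_program(rng=rng)
    ios = [(x, run(prog, x)) for x in range(3)]
    return ios, prog

ios, prog = make_example(random.Random(0))
```

Because the data is generated from the DSL itself, every training pair is syntactically valid, semantically consistent, and executable, which is what the three key properties above require.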

SLIDES 46-48

Synthetic Training Data

SLIDES 49-52

Real-world Test Data

SLIDES 53-54

Neural Architecture

I/O Encoder (examples) → Tree Decoder

SLIDES 55-59

Key Idea: Guided Enumeration

CFG/DSL:
  S -> e + e
  e -> x
  e -> 1
  e -> 0
Non-terminals = {S, e}; Terminals = {x, 1, 0, +}

Enumerating a derivation of f(x) = x + 1: apply a1: S -> e + e, then each
remaining leaf e offers actions (a1: e -> x, a2: e -> 1, a3: e -> 0 on the
left; a4: e -> x, a5: e -> 1, a6: e -> 0 on the right). Choosing e -> x on
the left and e -> 1 on the right yields x + 1.

Problem: How to assign probabilities to each action ai such that the global
tree state is taken into account?
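The enumeration the slides describe can be sketched as a best-first search over partial derivation trees of the grammar above; the constant `score` below is a placeholder where the neural model's action probabilities would plug in (a toy stand-in, not the paper's system):

```python
# Best-first enumeration over S -> e + e, e -> x | 1 | 0.
def expansions(tree):
    """Yield copies of `tree` with its leftmost nonterminal 'e' expanded once."""
    if tree == "e":
        for rhs in ("x", "1", "0"):
            yield rhs
    elif isinstance(tree, tuple):
        op, a, b = tree
        expanded_left = False
        for a2 in expansions(a):
            expanded_left = True
            yield (op, a2, b)
        if not expanded_left:          # left subtree complete: expand right
            for b2 in expansions(b):
                yield (op, a, b2)

def complete(tree):
    return tree != "e" and (not isinstance(tree, tuple)
                            or all(complete(c) for c in tree[1:]))

def run(tree, x):
    if tree == "x":
        return x
    if tree in ("0", "1"):
        return int(tree)
    return run(tree[1], x) + run(tree[2], x)

def guided_search(ios, score):
    frontier = [("+", "e", "e")]                # start from S -> e + e
    while frontier:
        frontier.sort(key=score, reverse=True)  # neural model would rank here
        tree = frontier.pop(0)
        if complete(tree):
            if all(run(tree, x) == y for x, y in ios):
                return tree                     # consistent with all examples
        else:
            frontier.extend(expansions(tree))
    return None

prog = guided_search([(0, 1), (2, 3)], score=lambda t: 0)  # spec: f(x) = x + 1
```

With a constant score this degenerates to blind enumeration; a learned score prunes the same search dramatically, which is the whole point of neural guidance.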

SLIDES 60-62

Neural-Guided Enumeration

A neural model f(partial tree, I/O examples) assigns a probability to each candidate expansion.

SLIDES 63-65

2 Key Challenges

  • Program Representation
  • Example (I/O) Representation

SLIDES 66-67

Recursive-Reverse-Recursive Neural Network (R3NN)

SLIDES 68-74

Recursive Pass

Input: distributed representations of each leaf's symbol.
Output: global root representation.

SLIDES 75-82

Reverse-Recursive Pass

Input: root representation from the recursive pass.
Output: global leaf representations.
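A minimal sketch of the two passes, using elementwise sums as toy stand-ins for the learned per-production networks (an assumption for brevity; the real R3NN learns a separate network per grammar rule):

```python
# Two-pass R3NN-style sketch on a tiny expression tree.
class Node:
    def __init__(self, symbol, children=()):
        self.symbol, self.children = symbol, list(children)
        self.up = self.down = None    # bottom-up / top-down representations

EMBED = {"x": [1.0, 0.0], "1": [0.0, 1.0], "+": [0.5, 0.5]}

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def recursive_pass(node):
    """Bottom-up pass: fold children's representations into the parent."""
    node.up = EMBED[node.symbol][:]
    for child in node.children:
        node.up = add(node.up, recursive_pass(child))
    return node.up                    # the root ends up summarizing the tree

def reverse_pass(node, parent_down=None):
    """Top-down pass: push the global root state back out to every leaf."""
    node.down = node.up if parent_down is None else add(node.up, parent_down)
    for child in node.children:
        reverse_pass(child, node.down)

tree = Node("+", [Node("x"), Node("1")])   # tree for x + 1
root = recursive_pass(tree)
reverse_pass(tree)
```

After both passes, each leaf's `down` vector depends on the entire tree, which is exactly the "global tree state" that the expansion probabilities need.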

SLIDES 83-85

Cross-Correlation I/O Encoder
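The idea behind a cross-correlation encoder can be illustrated on raw characters: slide the input string across the output and count agreements at each relative shift, so substrings shared by input and output produce a spike. This discrete version is an illustrative assumption; the actual encoder correlates learned embeddings of the two strings:

```python
# Discrete cross-correlation of an I/O pair: one feature per relative shift.
def cross_correlate(inp, out):
    feats = []
    for shift in range(-len(inp) + 1, len(out)):
        matches = sum(1 for i, ch in enumerate(inp)
                      if 0 <= i + shift < len(out) and out[i + shift] == ch)
        feats.append(matches)
    return feats

f = cross_correlate("Larry Page", "Page, L.")
```

The peak of 4 occurs at the shift that aligns "Page" in the input with "Page" in the output, which is exactly the copy signal a substring-extracting program needs.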

SLIDE 86

Synthetic Data Results (programs with < 13 AST nodes)

SLIDES 87-89

FlashFill Benchmarks

  • Batching trees for larger programs
  • R3NN for contextual program embeddings

SLIDES 90-93

RobustFill [ICML 2017]

J. Devlin, J. Uesato, S. Bhupatiraju, R. Singh, A. Mohamed, P. Kohli

SLIDES 94-96

Multiple I/O Examples

SLIDE 97

Extended DSL

SLIDE 98

92% Generalization Accuracy

SLIDE 99

Robustness with Noise

SLIDE 100

Incorrect Generalization

SLIDE 101

Program Induction Model

SLIDE 102

Induction vs. Synthesis

Other Synthesis Domains

More Complex DSLs

slide-104
SLIDE 104

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs

slide-105
SLIDE 105

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow)

slide-106
SLIDE 106

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow) Python & R Scripts (Stateful Variables)

slide-107
SLIDE 107

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow) Python & R Scripts (Stateful Variables) Grammar Learning (CFG s & CSGs)

slide-108
SLIDE 108

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow) Python & R Scripts (Stateful Variables) Grammar Learning (CFG s & CSGs) Specification Modalities

slide-109
SLIDE 109

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow) Python & R Scripts (Stateful Variables) Grammar Learning (CFG s & CSGs) Specification Modalities Natural Language (NL2SQL)

slide-110
SLIDE 110

Other Synthesis Domains

FlashFill (Functional) More Complex DSLs Karel (Imperative with Control Flow) Python & R Scripts (Stateful Variables) Grammar Learning (CFG s & CSGs) Specification Modalities Natural Language (NL2SQL) Partial Programs (Sketching)

SLIDE 111

Synthesizing Karel Programs [NIPS 2017, ICLR 2018]

R. Bunel, M. Hausknecht, J. Devlin, R. Singh, P. Kohli

SLIDES 112-113

Karel the Robot

Input → Output, with a program mapping inputs to outputs

SLIDE 114

Karel DSL

SLIDE 115

Synthesis Architecture

CNNs for the encoder, LSTMs for the decoder
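To make the contrast with FlashFill's functional DSL concrete, here is a toy interpreter for a Karel-like imperative fragment with control flow (hypothetical mini-syntax; the real Karel DSL also has conditionals and sensors such as frontIsClear, plus markers):

```python
# Toy Karel-like interpreter. State: (row, col, heading) on a grid.
DIRS = [(-1, 0), (0, 1), (1, 0), (0, -1)]   # N, E, S, W as (row, col) steps

def run(prog, state):
    """Execute a list of ops on state = (row, col, heading)."""
    r, c, d = state
    for op in prog:
        if op == "move":
            dr, dc = DIRS[d]
            r, c = r + dr, c + dc
        elif op == "turnRight":
            d = (d + 1) % 4
        elif op == "turnLeft":
            d = (d - 1) % 4
        elif isinstance(op, tuple) and op[0] == "repeat":
            _, n, body = op
            for _ in range(n):                # bounded loop: control flow
                r, c, d = run(body, (r, c, d))
    return r, c, d

# repeat(4) { move; turnRight } traces a square and returns to the start.
end = run([("repeat", 4, ["move", "turnRight"])], (0, 0, 1))
```

The synthesizer's job is the inverse of `run`: given input and output grids, recover a program like the `repeat` block above.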

SLIDE 116

Supervised Learning

Model        Top-1   Top-5
Supervised   71.91   80.00

SLIDE 117

Multiple Consistent Programs

Input → Output

Reinforcement Learning

slide-119
SLIDE 119

Reinforcement Learning

  • 1. First Supervised Training
slide-120
SLIDE 120

Reinforcement Learning

  • 1. First Supervised Training
  • 2. Sample Program from the model
slide-121
SLIDE 121

Reinforcement Learning

  • 1. First Supervised Training
  • 2. Sample Program from the model
  • 3. Run the program on I/O
slide-122
SLIDE 122

Reinforcement Learning

  • 1. First Supervised Training
  • 2. Sample Program from the model
  • 3. Run the program on I/O
  • 4. Positive Reward if Output matches
slide-123
SLIDE 123

Reinforcement Learning

Top-1 Top-5 Supervised 71.91 80.00 REINFORCE 71.99 74.11 Beam REINFORCE 77.68 82.73

  • 1. First Supervised Training
  • 2. Sample Program from the model
  • 3. Run the program on I/O
  • 4. Positive Reward if Output matches
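Steps 2-4 above can be sketched with a toy REINFORCE loop: a categorical policy over two candidate one-step "programs", rewarded only when execution matches the I/O examples. This is an illustrative reduction, not the paper's sequence model over the Karel DSL:

```python
import math, random

tokens = ["add1", "add2"]                     # hypothetical candidate programs
SEMANTICS = {"add1": lambda x: x + 1, "add2": lambda x: x + 2}
ios = [(1, 3), (4, 6)]                        # consistent only with add2
logits = [0.0, 0.0]

def probs():
    z = [math.exp(l) for l in logits]
    return [p / sum(z) for p in z]

rng = random.Random(0)
for _ in range(200):
    p = probs()
    i = 0 if rng.random() < p[0] else 1       # 2. sample a program
    good = all(SEMANTICS[tokens[i]](x) == y for x, y in ios)
    reward = 1.0 if good else 0.0             # 3-4. execute, reward on match
    for j in range(len(logits)):              # REINFORCE: grad of log p_i
        grad = (1.0 if j == i else 0.0) - p[j]
        logits[j] += 0.5 * reward * grad
```

The policy concentrates its mass on `add2`, the program consistent with the spec; in the paper the same reward signal fine-tunes a beam-decoded sequence model (Beam REINFORCE).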
slide-124
SLIDE 124

Stanford CS106a Test

7/16 problems = 43%

slide-125
SLIDE 125

Stanford CS106a Test

7/16 problems = 43%

slide-126
SLIDE 126

Stanford CS106a Test

7/16 problems = 43% Neural Symbolic

SLIDE 127

Neural Representations for Program Understanding/Analysis

SLIDES 128-130

Neural Program Repair

Sahil Bhatia, Pushmeet Kohli, Rishabh Singh. Neuro-Symbolic Program Corrector. ICSE 2018

SLIDES 131-132

Dynamic Runtime Traces

SLIDES 133-134

Embedding Program Traces

Ke Wang, Rishabh Singh, Zhendong Su. Dynamic Program Embeddings. ICLR 2018

SLIDES 135-138

Fuzzing for Security Bugs

Seed Input → Random Mutations → Execute Binary → Crash!

Coverage-guided: AFL
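The Seed → Mutate → Execute loop can be sketched in a few lines; the `target` function below is a hypothetical buggy parser standing in for the binary, and no coverage feedback is modeled (coverage feedback is AFL's key addition on top of this loop):

```python
import random

def target(data: bytes):
    # Hypothetical buggy parser: crashes on a magic header + flag byte.
    if data[:2] == b"PK" and len(data) > 4 and data[4] == 0xFF:
        raise RuntimeError("crash!")

def mutate(seed: bytes, rng) -> bytes:
    """Flip 1-3 random bytes of the seed."""
    buf = bytearray(seed)
    for _ in range(rng.randint(1, 3)):
        buf[rng.randrange(len(buf))] = rng.randrange(256)
    return bytes(buf)

def fuzz(seed: bytes, iters=100000, rng=None):
    rng = rng or random.Random(0)
    for i in range(iters):
        candidate = mutate(seed, rng)      # random mutation of the seed
        try:
            target(candidate)              # execute the "binary"
        except RuntimeError:
            return i, candidate            # crash found
    return None

result = fuzz(b"PK\x03\x04\x00data")
```

Blind mutation eventually stumbles onto the crashing byte pattern; coverage guidance (AFL) and the learned input grammars on the next slides both exist to reach such states far faster.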

SLIDES 139-141

Neural Grammar-based Fuzzing

Patrice Godefroid, Hila Peleg, Rishabh Singh. Learn&Fuzz: Machine Learning for Input Fuzzing. ASE 2017

More coverage, bugs!

SLIDE 142

Neural Programmer

Spec: Natural Language, Input/Output Examples, Partial Programs

  • Neural Synthesis [ICLR 2017, ICML 2017]
  • Neural Repair [ICSE 2018, ICLR Workshop 2018]
  • Program Induction [NIPS 2017]
  • Neural Fuzzing [ASE 2017, arXiv 2017]

Neural Architectures for Program and Spec Representation

Rishabh Singh, rising@google.com