Conditional Program Generation for Bimodal Program Synthesis
(Joint work with Chris Jermaine, Vijay Murali, and Letao Qi)
Swarat Chaudhuri
Rice University
www.cs.rice.edu/~swarat
Program synthesis [Simon 1963, Summers 1977, Manna-Waldinger 1977, Pnueli-Rosner 1989]
Specification → Synthesizer → Program + Correctness certificate

Specification: a logical constraint that must be satisfied exactly.
Algorithm: search for a program that satisfies the specification.
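For contrast with what follows, here is a minimal sketch of this classical recipe as enumerative search, assuming a toy expression grammar and a specification given as input-output pairs. Everything here (the grammar, `programs`, `synthesize`) is a hypothetical illustration, not the talk's system.

```python
# Minimal sketch of classical enumerative synthesis (hypothetical example):
# search a tiny expression grammar for a program that exactly satisfies
# a specification, here given as input-output examples.

LEAVES = ["x", "1"]                       # terminals of the toy grammar
OPS = ["+", "*"]                          # binary operators

def programs(depth):
    """Enumerate expression strings of at most the given depth."""
    if depth == 0:
        yield from LEAVES
        return
    yield from programs(depth - 1)
    for op in OPS:
        for left in programs(depth - 1):
            for right in programs(depth - 1):
                yield f"({left} {op} {right})"

def synthesize(spec_examples, max_depth=2):
    """Return the first enumerated program satisfying every example."""
    for prog in programs(max_depth):
        if all(eval(prog, {"x": x}) == y for x, y in spec_examples):
            return prog
    return None

# Specification: f(x) = 2x + 1, given as examples.
print(synthesize([(0, 1), (1, 3), (2, 5)]))  # e.g. "((x + x) + 1)"
```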
An idealized program, described by ambiguous “evidence” + logical requirements
→ Synthesizer, with a prior distribution learned from a real-world code corpus
→ Posterior distribution over candidate implementations

Neural Sketch Learning for Conditional Program Generation. Murali, Qi, Chaudhuri, and Jermaine. arXiv 2017.
http://bit.ly/2zgP5fj
Assume random variables X and Prog, over labels and programs respectively, following a joint distribution Q(X, Prog).
Offline: Given a corpus {(X_i, Prog_i)} drawn from Q, learn from this a function g that maps evidence to programs.
Online: Given X, produce g(X).
Ideally, g should maximize the expectation of the indicator
    I = 1 if g(X) ≡ Prog, and 0 otherwise.
The map g is probabilistic. Learning is maximum conditional likelihood estimation:
    θ* = arg max_θ ∑_i log P(Prog_i | X_i; θ)
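To ground the objective, here is a toy illustration of maximum conditional likelihood estimation: a softmax model P(Prog | X; θ) over a small finite set of programs, trained by gradient ascent on ∑_i log P(Prog_i | X_i; θ). The model class, data, and dimensions are all made up for illustration.

```python
# Illustrative only: maximum conditional likelihood estimation for a toy
# conditional program generator. Programs come from a small finite set,
# evidence is a feature vector, and P(Prog | X; theta) is a softmax.
import numpy as np

rng = np.random.default_rng(0)
NUM_PROGRAMS, NUM_FEATURES = 4, 8

# Hypothetical training corpus: (evidence vector, index of true program).
X = rng.normal(size=(100, NUM_FEATURES))
progs = rng.integers(0, NUM_PROGRAMS, size=100)

theta = np.zeros((NUM_FEATURES, NUM_PROGRAMS))

def log_p(theta, X):
    """log P(Prog | X; theta): row-wise log-softmax of X @ theta."""
    logits = X @ theta
    return logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

for step in range(200):                     # gradient ascent on the
    lp = log_p(theta, X)                    # conditional log-likelihood
    probs = np.exp(lp)
    onehot = np.eye(NUM_PROGRAMS)[progs]
    grad = X.T @ (onehot - probs) / len(X)  # d/dtheta of sum_i log P(...)
    theta += 0.5 * grad

print("log-likelihood:", lp[np.arange(len(X)), progs].sum())
```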
Language capturing the essence of API usage in Java.
Prog  ::= skip | Prog1 ; Prog2 | call Call | let x = Call
        | if Exp then Prog1 else Prog2 | while Exp do Prog1
        | try Prog1 Catch
Exp   ::= Sexp | Call | let x = Call : Exp1
Sexp  ::= c | x
Call  ::= Sexp0.a(Sexp1, ..., Sexpk)
Catch ::= catch(x1) Prog1 ... catch(xk) Progk

Here a is an API method name, and Sexp0.a(Sexp1, ..., Sexpk) is an API call.
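One way to make the grammar concrete is as an AST; the following Python dataclasses are a hypothetical rendering of the productions above (the talk defines only the grammar, not this code).

```python
# Hypothetical AST for the core API-usage language above; the names mirror
# the grammar, not any released implementation.
from dataclasses import dataclass, field
from typing import List, Tuple, Union

Sexp = str                            # a constant c or a variable x

@dataclass
class Call:                           # Sexp0.a(Sexp1, ..., Sexpk)
    receiver: Sexp
    method: str                       # API method name a
    args: List[Sexp] = field(default_factory=list)

Exp = Union[Sexp, Call]               # eliding "let x = Call : Exp1" for brevity

@dataclass
class Skip:                           # skip
    pass

@dataclass
class Seq:                            # Prog1 ; Prog2
    first: "Prog"
    second: "Prog"

@dataclass
class Let:                            # let x = Call
    var: str
    call: Call

@dataclass
class If:                             # if Exp then Prog1 else Prog2
    cond: Exp
    then: "Prog"
    orelse: "Prog"

@dataclass
class While:                          # while Exp do Prog1
    cond: Exp
    body: "Prog"

@dataclass
class Try:                            # try Prog1 catch(x1) P1 ... catch(xk) Pk
    body: "Prog"
    handlers: List[Tuple[str, "Prog"]] = field(default_factory=list)

Prog = Union[Skip, Seq, Call, Let, If, While, Try]
```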
The evidence about the desired program can take several forms:
Set of API calls
Set of API datatypes
Set of keywords that may appear while describing program actions in English (example: “split”)
Directly learning over source code simply doesn't work: real code must satisfy many structural and semantic constraints, such as type safety. Learning to satisfy these constraints is hard.
Language abstractions to the rescue! Learn not over programs, but over typed, syntactic models of programs.
The sketch of a program is obtained by applying an abstraction function α. From sketch Y to program Prog: a fixed concretization distribution P(Prog | Y). The learning goal changes to: given {(X_i, Y_i)}, solve
    θ* = arg max_θ ∑_i log P(Y_i | X_i; θ)
Y     ::= Call | skip | while Cond do Y1 | Y1 ; Y2
        | try Y1 Catch | if Cond then Y1 else Y2
Catch ::= catch(τ1) Y1 ... catch(τk) Yk
Cond  ::= {Call1, ..., Callk}
Call  ::= a(τ1, ..., τk)

Here a(τ1, ..., τk) is an abstract API call: a method name a applied to argument types τi.
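To connect the two grammars, here is a guess at what the abstraction function α might look like over the hypothetical AST classes sketched earlier: it keeps API names and types and throws away variable names and concrete data. `type_of` is an assumed helper mapping a simple expression to its API datatype τ; the encoding of sketches as tuples is likewise an assumption.

```python
# A guess at the abstraction function alpha: core-language Prog -> sketch.
# Keeps API method names and types; drops variable names and concrete data.
# Reuses the hypothetical AST classes above. Requires Python 3.10+.

def alpha(prog, type_of):
    """Abstract a Prog into a sketch term, here encoded as nested tuples."""
    match prog:
        case Skip():
            return ("skip",)
        case Call(method=a, args=args):
            return ("call", a, tuple(type_of(s) for s in args))
        case Let(call=c):
            return alpha(c, type_of)              # the bound name is dropped
        case Seq(first=p1, second=p2):
            return ("seq", alpha(p1, type_of), alpha(p2, type_of))
        case If(cond=c, then=p1, orelse=p2):
            return ("if", abstract_cond(c, type_of),
                    alpha(p1, type_of), alpha(p2, type_of))
        case While(cond=c, body=p):
            return ("while", abstract_cond(c, type_of), alpha(p, type_of))
        case Try(body=p, handlers=hs):
            return ("try", alpha(p, type_of),
                    tuple((type_of(x), alpha(h, type_of)) for x, h in hs))

def abstract_cond(exp, type_of):
    """A sketch Cond is a set of abstract calls appearing in the guard."""
    return frozenset([alpha(exp, type_of)] if isinstance(exp, Call) else [])
```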
Evidence X + Logical requirement φ
→ End-to-end differentiable neural architecture, learned from (X_j, Y_j) pairs: gives P(Y | X); sample sketches from it
→ Combinatorial “concretization” (sketch → executable code) by a type-directed, compositional synthesizer
→ Implementations satisfying φ
Not all sketches may be realizable as executable programs
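Read as pseudocode, the pipeline might look like the sketch below; `sample_sketch`, `concretize`, and `satisfies` are hypothetical stand-ins for the neural sampler, the type-directed synthesizer, and the requirement check, respectively.

```python
# The end-to-end pipeline as pseudocode, with hypothetical helpers.

def bimodal_synthesize(evidence, phi, n_sketches=100):
    """Sample sketches from P(Y | X); concretize; keep programs meeting phi."""
    implementations = []
    for _ in range(n_sketches):
        sketch = sample_sketch(evidence)       # neural:  Y ~ P(Y | X)
        # Not every sketch is realizable as executable code: the
        # type-directed search may yield nothing, and we just move on.
        for program in concretize(sketch):     # combinatorial concretization
            if satisfies(program, phi):        # check the logical requirement
                implementations.append(program)
    return implementations
```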
Learning using a probabilistic encoder-decoder
X: Evidence.  Y: Sketches.  Z: Latent “intent”.

X → Encoder f → Z → Decoder g → Y

The encoder f maps evidence X to a representation f(X) of the latent intent Z; the prior on Z is used for regularization.
P(Z) = Normal(0, I)        P(f(X) | Z) = Normal(Z, σ²I)

During learning, use Jensen's inequality to get a smooth loss function.
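Written out (my reconstruction, assuming the model factorizes so that Y depends on X only through Z), the Jensen step lower-bounds the intractable marginal likelihood:

```latex
% Reconstruction of the Jensen step. Assumes Y is independent of X given Z,
% i.e. the sketch depends on the evidence only through the latent intent.
\log P(Y \mid X)
   = \log \int P(Z \mid X)\, P(Y \mid Z)\, dZ
   \;\ge\; \int P(Z \mid X)\, \log P(Y \mid Z)\, dZ
   = \mathbb{E}_{Z \sim P(Z \mid X)}\!\left[\log P(Y \mid Z)\right].
```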
During inference, get P(Z | X) using normal-normal conjugacy.
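Because a normal prior and a normal likelihood are conjugate, the posterior over Z given the evidence encodings has a closed form. Below is a small numpy sketch of that computation; the dimensions, σ², and encoding values are made up.

```python
# Normal-normal conjugacy for the latent intent (illustrative numbers).
# Prior: Z ~ Normal(0, I). Likelihood: each encoding f(X_j) ~ Normal(Z, s2*I).
import numpy as np

def posterior_over_Z(encodings, s2):
    """Closed-form P(Z | f(X_1), ..., f(X_n)): returns (mean, variance)."""
    n = len(encodings)
    precision = 1.0 + n / s2              # prior precision 1 plus n / sigma^2
    mean = np.sum(encodings, axis=0) / s2 / precision
    return mean, 1.0 / precision          # isotropic posterior covariance

encodings = [np.array([0.5, -1.0]), np.array([0.7, -0.8])]  # two f(X_j), d=2
mu, var = posterior_over_Z(encodings, s2=0.25)
z = np.random.default_rng(0).normal(mu, np.sqrt(var))  # sample Z ~ P(Z | X)
print(mu, var, z)
```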
The decoder defines a distribution on the rules that can be fired at a point, given the history so far (e.g., probabilities 0.3 and 0.7 over two candidate rules). The history is encoded as a real vector.
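A single decoder step might look like the following sketch; the shapes, parameters, and masking are illustrative assumptions, not the talk's architecture.

```python
# One decoder step, sketched: score the grammar rules that may fire at the
# current point from a real-vector encoding of the history.
import numpy as np

RULES = ["Y -> Call", "Y -> skip", "Y -> while Cond do Y1", "Y -> Y1 ; Y2",
         "Y -> try Y1 Catch", "Y -> if Cond then Y1 else Y2"]

def rule_distribution(history_vec, W, b, legal_mask):
    """Softmax over rules, with rules that cannot fire here masked out."""
    logits = W @ history_vec + b
    logits[~legal_mask] = -np.inf         # rules that cannot fire at this point
    p = np.exp(logits - logits.max())
    return p / p.sum()

rng = np.random.default_rng(1)
h = rng.normal(size=32)                    # encoded history, d = 32
W, b = rng.normal(size=(len(RULES), 32)), np.zeros(len(RULES))
mask = np.ones(len(RULES), dtype=bool)
probs = rule_distribution(h, W, b, mask)
next_rule = rng.choice(RULES, p=probs)     # fire a rule, extend the history
```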
Evaluation: a corpus of real-world Java methods, covering ~1500 API types. Ill-typed candidate programs are ruled out by the type system during concretization, and results are compared against a baseline generative model.
Thank you! Questions?
swarat@rice.edu http://www.cs.rice.edu/~swarat
(Research funded by the DARPA MUSE award #FA8750-14-2-0270)