

SLIDE 1

Learning Execution through Neural Code Fusion

Zhan Shi, Kevin Swersky, Danny Tarlow, Parthasarathy Ranganathan, Milad Hashemi

SLIDE 2

Overview

  • Motivation
  • Background
  • Neural Code Fusion
  • Experimental Results
  • Conclusion

SLIDE 3

Motivation

SLIDE 4

2% Performance/Year is the New Normal

Source: Parthasarathy Ranganathan, More Moore: Thinking Outside the (Server) Box

SLIDE 5

Motivation

  • Dynamic speculative execution
    ○ Branch prediction, value prediction, cache replacement, prefetching...

SLIDE 6

Motivation

  • Dynamic speculative execution
    ○ Branch prediction, value prediction, cache replacement, prefetching...
  • Static source code
    ○ Variable naming, finding bugs, algorithm classification, program synthesis…
    ○ Performance-related tasks: device mapping, thread coarsening, throughput prediction...

SLIDE 7

Motivation

  • Dynamic speculative execution
    ○ Branch prediction, value prediction, cache replacement, prefetching...
  • Static source code
    ○ Variable naming, finding bugs, algorithm classification, program synthesis…
    ○ Performance-related tasks: device mapping, thread coarsening, throughput prediction...
  • Both views provide useful features

SLIDE 8

Example: a “Simple” Case for Branch Prediction

    for (i = 0; i < k; i++) { }

SLIDE 9

Example: a “Simple” Case for Branch Prediction

    for (i = 0; i < k; i++) { }

Highly biased

SLIDE 10

Example: a “Simple” Case for Branch Prediction

    for (i = 0; i < k; i++) { }

Highly biased. Branch history doesn’t help.

SLIDE 11

Example: a “Simple” Case for Branch Prediction

    while (...) {
        generate k;
        for (i = 0; i < k; i++) { }
    }

Highly biased. Branch history doesn’t help.

SLIDE 12

Example: a “Simple” Case for Branch Prediction

    while (...) {
        generate k;
        for (i = 0; i < k; i++) { }
    }

Highly biased. Branch history doesn’t help.

  • Jump out when “close enough”
  • Predictable if we knew the relation

[Static] i and k are compared
[Dynamic] values of i and k
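To make the intuition concrete, here is a minimal Python sketch (names and values are illustrative, not from the talk) of why fusing the two views helps: the static view says that i is compared with k, the dynamic view supplies their current values, and together they determine the branch outcome exactly.

    # Hypothetical illustration: the loop-exit branch of
    # `for (i = 0; i < k; i++)` is a deterministic function of the
    # fused features, even though its taken/not-taken history looks
    # irregular when k changes every outer iteration.

    def branch_taken(i: int, k: int) -> bool:
        """True if the loop back-edge is taken (stay in the loop)."""
        return i < k

    # A history-based predictor only sees this stream of outcomes:
    print([branch_taken(i, k=5) for i in range(7)])
    # [True, True, True, True, True, False, False]
    # A model that also sees the values of i and k can predict the
    # final False ("jump out") on every outer iteration, for any k.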

SLIDE 13

Background: Graph Neural Networks

SLIDE 14

Background: Graph Neural Networks

  • Typical deep learning operates on n IID data points.

SLIDE 15

Background: Graph Neural Networks

  • What if the data points had relational information?

Battaglia et al., 2018

SLIDE 16

Background: Graph Neural Networks

  • Message passing

[Figure: input graph]

SLIDE 17

Background: Graph Neural Networks

  • Message passing

[Figure: input graph; message passing step 0]

SLIDE 18

Background: Graph Neural Networks

  • Message passing

[Figure: input graph; message passing steps 0-1]

SLIDE 19

Background: Graph Neural Networks

  • Message passing

[Figure: input graph; message passing steps 0-2]

SLIDE 20

Background: Graph Neural Networks

  • Message passing

[Figure: input graph; message passing steps 0-2, with node states updated by a GRU]
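As a rough sketch of the message-passing loop pictured above (a gated GNN in the GGNN style; the toy graph and all sizes are assumptions, not the talk's actual model): at each step every node sums messages from its in-neighbors and folds them into its state with a GRU.

    # Minimal GGNN-style propagation sketch (hypothetical sizes and graph).
    import torch
    import torch.nn as nn

    num_nodes, hidden = 5, 16
    edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]  # toy input graph

    msg_fn = nn.Linear(hidden, hidden)   # per-edge message function
    update = nn.GRUCell(hidden, hidden)  # gated state update (the "GRU" box)

    h = torch.randn(num_nodes, hidden)   # node states at step 0
    for step in range(3):                # steps 0, 1, 2 as in the figure
        m = torch.zeros(num_nodes, hidden)
        for src, dst in edges:           # each edge carries one message
            m[dst] = m[dst] + msg_fn(h[src])
        h = update(m, h)                 # new state from (messages, old state)
    print(h.shape)                       # torch.Size([5, 16])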

SLIDE 21

Programs as Graphs

Allamanis et al., 2017

SLIDE 22

Representing Static and Dynamic Information

  • Graphs are an effective representation for static code
  • How do we generally represent dynamic information in a model?

SLIDE 23

Neural Code Fusion

SLIDE 24

Full System

SLIDE 25

Assembly vs Source Code

  • Highly structured

SLIDE 26

Assembly vs Source Code

  • Highly structured

SLIDE 27

Assembly vs Source Code

  • Highly structured
  • Directly relate data to program semantics

SLIDE 28

Assembly vs Source Code

  • Highly structured
  • Directly relate data to program semantics
  • Easy to use for architecture tasks

SLIDE 29

Code Fusion Graph Representation

SLIDE 30

Dynamic Tasks: Control Flow and Data Flow

  • Control flow (branch prediction)
    ○ Predict whether a branch statement will be taken or not taken.
    ○ Set the branch instruction node to be the target node.
    ○ Binary classification

SLIDE 31

Dynamic Tasks: Control Flow and Data Flow

  • Control flow (branch prediction)
    ○ Predict whether a branch statement will be taken or not taken.
    ○ Set the branch instruction node to be the target node.
    ○ Binary classification
  • Data flow (prefetching)
    ○ Predict which address will be accessed next.
    ○ Set the src node to be the target node.
    ○ Predict a 64-bit address
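A hedged sketch of how the two target-node readouts might look (the head shapes are assumptions; the talk does not spell them out): branch prediction reads a single taken/not-taken logit off the branch instruction node, while prefetching reads 64 per-bit logits off the src node.

    # Hypothetical task heads over fused-graph node states.
    import torch
    import torch.nn as nn

    hidden = 16
    branch_head = nn.Linear(hidden, 1)    # control flow: taken/not-taken
    address_head = nn.Linear(hidden, 64)  # data flow: one logit per bit

    h_branch = torch.randn(hidden)  # state of the branch instruction node
    h_src = torch.randn(hidden)     # state of the src node

    p_taken = torch.sigmoid(branch_head(h_branch))  # binary classification
    addr_bits = (torch.sigmoid(address_head(h_src)) > 0.5).long()  # 64 bits
    print(p_taken.item(), addr_bits.tolist()[:8])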

SLIDE 32

Multi-Task Representation

  • Many other static/dynamic tasks can be defined on the graph simultaneously
    ○ Value prediction, indirect branch prediction, memory disambiguation, caching…

SLIDE 33

Dynamic Snapshots

  • Snapshots
    ○ The values of the set of variable nodes
    ○ Captured during program execution
  • Used to initialize the graph neural network
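One plausible reading of this step (the exact encoding is not shown on the slide, so treat this as an assumption): each variable node's captured value is binarized and projected to the GNN's hidden size to form that node's initial state.

    # Hypothetical snapshot-to-initial-state encoding.
    import torch
    import torch.nn as nn

    hidden, bits = 16, 64
    embed = nn.Linear(bits, hidden)  # value bits -> initial node state

    def to_bits(value: int, width: int = bits) -> torch.Tensor:
        """Binary encoding (little-endian) of one captured value."""
        return torch.tensor([(value >> b) & 1 for b in range(width)],
                            dtype=torch.float32)

    snapshot = {"i": 3, "k": 7}  # variable-node values captured at runtime
    h0 = {name: embed(to_bits(v)) for name, v in snapshot.items()}
    print(h0["i"].shape)  # torch.Size([16])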

SLIDE 34

Representation Study

  • Number “3” in different representations
    ○ Categorical: [1, 0, 0, 0]
    ○ Scalar: 3
    ○ Binary: 11
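For concreteness, a small sketch of the three encodings; the one-hot vocabulary below is an illustrative assumption, chosen so the outputs match the slide.

    # The number 3 in the three candidate encodings (illustrative widths).

    def categorical(n: int, vocab: list[int]) -> list[int]:
        """One-hot over a fixed vocabulary of values."""
        return [1 if v == n else 0 for v in vocab]

    def scalar(n: int) -> float:
        """Raw value as a single real-valued feature."""
        return float(n)

    def binary(n: int, width: int) -> list[int]:
        """Bit-vector encoding; generalizes across magnitudes."""
        return [(n >> b) & 1 for b in reversed(range(width))]

    print(categorical(3, vocab=[3, 5, 7, 9]))  # [1, 0, 0, 0]
    print(scalar(3))                           # 3.0
    print(binary(3, width=2))                  # [1, 1]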

SLIDE 35

Representation Study

  • Correctly predict when to jump out
  • Sample k values as training data

    for (k = 0; k < n; k += 3) {
        for (i = 0; i < k; i++) { }
    }
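A hedged sketch of the data-generation step (helper names are hypothetical): run the nested loop for sampled values of k and label, for each inner iteration, whether the branch jumps out; the (i, k) pair is stored in the chosen encoding.

    # Hypothetical training-data generation for the representation study.
    def to_bits(n: int, width: int = 8) -> list[int]:
        return [(n >> b) & 1 for b in reversed(range(width))]

    def make_dataset(ks, width=8):
        data = []
        for k in ks:
            for i in range(k + 1):           # includes the jump-out iteration
                x = to_bits(i, width) + to_bits(k, width)  # binary encoding
                y = int(not i < k)           # 1 = jump out of the inner loop
                data.append((x, y))
        return data

    train = make_dataset(ks=range(0, 30, 3))  # sampled k values: 0, 3, 6, ...
    print(len(train), train[1])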

SLIDE 36

Representation Study

  • Results
    ○ Binary > scalar > categorical

SLIDE 37

Experimental Results

SLIDE 38

Experimental Setup

  • Benchmarks
    ○ SPEC06 INT
  • Tasks
    ○ Dynamic: control flow (branch prediction) and data flow (prefetching)
    ○ Static: algorithm classification
  • Offline evaluation for both NCF and baselines
    ○ 70% training
    ○ 30% testing

SLIDE 39

Control-flow (Branch Prediction) and Data-flow (Prefetching)

SLIDE 40

Algorithm Classification

  • Test the usefulness of the learned representation
  • We pre-train our GNN on the control-flow task
  • A simple linear SVM model
  • We get 96% vs 95.3% (50M lines of LLVM IR) using 200k lines of assembly with no external data sources.
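As a sketch of this transfer setup (the embedding function below is a placeholder; the real one would pool node states from the frozen, control-flow-pre-trained GNN):

    # Hypothetical transfer pipeline: frozen GNN embeddings + linear SVM.
    import numpy as np
    from sklearn.svm import LinearSVC

    def embed_program(program: str) -> np.ndarray:
        """Placeholder for: run the pre-trained GNN on the program's
        graph and mean-pool node states into one fixed-size vector."""
        rng = np.random.default_rng(abs(hash(program)) % (2**32))
        return rng.normal(size=16)  # stand-in for a learned embedding

    programs = ["prog_a", "prog_b", "prog_c", "prog_d"]
    labels = [0, 1, 0, 1]           # algorithm class per program

    X = np.stack([embed_program(p) for p in programs])
    clf = LinearSVC().fit(X, labels)
    print(clf.score(X, labels))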

SLIDE 41

Summary

  • NCF combines static and dynamic information
    ○ Creates useful representations
  • Different from traditional dynamic models in architecture
    ○ Data is usually purely dynamic
    ○ Model is history-based
  • Enhances static models with dynamic program behavior
    ○ Learned representation can also transfer to an unseen static task

SLIDE 42

Thank you! Questions?