Machine Programming, Justin Gottschlich, Intel Labs, December 12th, 2018 (PowerPoint presentation transcript)
SLIDE 1

Machine Programming

Justin Gottschlich, Intel Labs
December 12th, 2018, TVM Conference, University of Washington

SLIDE 2

Motivation

We have a software programmer resource problem

SLIDE 3

Motivation

We have a software programmer resource problem

https://www.bloomberg.com/news/articles/2018-03-08/demand-for-programmers-hits-full-boil-as-u-s-job-market-simmers

SLIDE 4

Motivation

We have a software programmer resource problem

2019 developers: 26.4M; 2019 human population: 7,714M; percentage of programmers: ~0.34%

http://www.worldometers.info/world-population/world-population-projections/
https://www.future-processing.com/blog/worldwide-software-developers-number-264-million-2019/

SLIDE 5

Motivation

We have a software programmer resource problem

2019 developers: 26.4M of 7,714M people (~0.34% of the population are programmers)
2019 drivers: 1,200M of 7,714M people (~15.56% of the population are drivers)

SLIDE 6

Motivation

2019 developers: 26.4M of 7,714M people (~0.34% of the population are programmers)
2019 drivers: 1,200M of 7,714M people (~15.56% of the population are drivers)

SLIDE 7

Motivation

What if programming could be as simple as driving? How can we simplify programming (mostly with machine learning)? (1) Reduce the intention challenge; (2) delegate most of the work to machines.

SLIDE 8

Human programming vs machine programming

https://channels.theinnovationenterprise.com/articles/the-future-of-digital-marketing-ai-vs-human-copywriters

SLIDE 9

Human Programming

The process of developing software, principally by one or more humans.

▪ Examples

– Writing code in <your favorite language here>

▪ Pros

– Near-complete control over the software created and its exact behaviors

▪ Cons

– Expensive, slow, error-prone, limited by human resources

SLIDE 10

Machine Programming

The process of developing software where some or all of the steps are performed autonomously.

▪ Examples

– Classical: compiler transformations
– Emerging: verified lifting [1], AutoTVM [2], Sketch [3], DeepCoder [4], SapFix/Sapienz [5]

▪ Pros

– Resources are constrained by machines, not people; most humans could create software

▪ Cons

– Immature; may lack full control; may be partially stochastic

[1] http://www.cs.technion.ac.il/~shachari/dl/pldi2016.pdf
[2] https://arxiv.org/pdf/1805.08166.pdf
[3] https://people.csail.mit.edu/asolar/papers/thesis.pdf
[4] https://arxiv.org/abs/1611.01989
[5] https://research.fb.com/finding-and-fixing-software-bugs-automatically-with-sapfix-and-sapienz/

SLIDE 11

The Three Pillars of Machine Programming (MP)



2nd ACM SIGPLAN Workshop on Machine Learning and Programming Languages (MAPL), PLDI’18, arxiv.org/pdf/1803.07244.pdf

(Diagram) The three pillars, built on a foundation of data:
– Intention: program synthesis, inductive programming
– Invention: HW design, algorithm creation
– Adaptation: holistic compiler optimizations, optimizing code generators, reconfigurable HW/SW co-designs
– Foundation: data

Justin Gottschlich (Intel), Armando Solar-Lezama (MIT), Nesime Tatbul (Intel), Michael Carbin (MIT), Martin Rinard (MIT), Regina Barzilay (MIT), Saman Amarasinghe (MIT), Joshua B. Tenenbaum (MIT), Tim Mattson (Intel)

SLIDE 12

Examples of the Three Pillars of MP

▪ Intention

– “Automating String Processing in Spreadsheets using Input-Output Examples” (Sumit Gulwani)
– “Program Synthesis by Sketching” (Armando Solar-Lezama; adviser: R. Bodik)

▪ Invention

– “The Case for Learned Index Structures” (Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis)

▪ Adaptation

– “Precision and Recall for Time Series” (Nesime Tatbul, TJ Lee, Stan Zdonik, Mejbah Alam, Justin Gottschlich)


▪ Adaptation

– Anomaly detection interpretability (Xin Sheng, Mejbah Alam, Justin Gottschlich, Armando Solar-Lezama)

SLIDE 13

Flash Fill

Flash Fill (POPL 2011): https://www.microsoft.com/en-us/research/wp-content/uploads/2016/12/popl11-synthesis.pdf
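Flash Fill's core idea, programming by input-output examples, can be illustrated with a deliberately tiny enumerative synthesizer. This is a sketch only: the DSL (split a string and take one token) and the brute-force search are stand-ins, not Gulwani's actual string-transformation language or version-space algorithm.

```python
# Toy programming-by-example: search a tiny DSL of "split on a separator,
# take the i-th token" programs for one consistent with every example.
def candidates():
    progs = []
    for sep in [" ", ",", "@", "."]:
        for idx in range(3):
            def prog(s, sep=sep, idx=idx):
                parts = s.split(sep)
                return parts[idx] if idx < len(parts) else None
            progs.append((f"split({sep!r})[{idx}]", prog))
    return progs

def synthesize(examples):
    # Return the first program that agrees with all input-output pairs.
    for name, prog in candidates():
        if all(prog(i) == o for i, o in examples):
            return name, prog
    return None

# Learn "extract the user name" from two examples, then apply it to new data.
name, prog = synthesize([("alice@intel.com", "alice"), ("bob@mit.edu", "bob")])
print(name, prog("carol@uw.edu"))  # → split('@')[0] carol
```

Once learned, the program generalizes to unseen rows, which is exactly the spreadsheet workflow Flash Fill automates.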


SLIDE 14

Sketch

“Program Synthesis by Sketching”: https://people.csail.mit.edu/asolar/papers/thesis.pdf
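The sketching idea in one sentence: the programmer writes a partial program with holes, and a solver finds hole values that make a specification hold. A minimal sketch of that idea follows; real Sketch uses counterexample-guided SAT-based search over a rich hole language, whereas this toy simply enumerates small integers.

```python
# A "sketch" is a program with a hole; the synthesizer finds a hole value
# that makes the spec hold on all tested inputs (illustration only).
def sketch(x, hole):
    return x * hole              # partial program: multiply by an unknown

def spec(x, y):
    return y == x + x            # desired behavior: doubling

def fill_hole(search_space=range(16), inputs=range(10)):
    for h in search_space:
        if all(spec(x, sketch(x, h)) for x in inputs):
            return h
    return None

print(fill_hole())  # → 2
```

The programmer supplies structure (multiply by *something*); the machine supplies the tedious detail, which is the division of labor sketching proposes.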


SLIDE 15

Learned Index Structures

“The Case for Learned Index Structures”: https://arxiv.org/abs/1712.01208
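The learned-index idea replaces a tree traversal with a model that predicts a key's position in sorted data, plus a bounded correction step. Below is a minimal linear-model sketch of that idea; the paper uses staged neural/linear models and carefully maintained error bounds, none of which appear here.

```python
# Minimal learned index: fit a linear model from sorted, non-constant keys
# to their positions, then correct the prediction with a bounded search.
def build(keys):
    n = len(keys)
    xm = sum(keys) / n
    ym = (n - 1) / 2
    slope = (sum((k - xm) * (i - ym) for i, k in enumerate(keys))
             / sum((k - xm) ** 2 for k in keys))
    intercept = ym - slope * xm
    # Max prediction error bounds the local search window.
    err = max(abs(i - (slope * k + intercept)) for i, k in enumerate(keys))
    return slope, intercept, int(err) + 1

def lookup(keys, model, key):
    slope, intercept, err = model
    guess = int(slope * key + intercept)
    for i in range(max(0, guess - err), min(len(keys), guess + err + 1)):
        if keys[i] == key:
            return i
    return -1

keys = [2 * i for i in range(100)]   # nearly linear key distribution
model = build(keys)
print(lookup(keys, model, 42))  # → 21
```

When the key distribution is learnable, the model's prediction plus a tiny window replaces the O(log n) descent of a B-tree.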


SLIDE 16

Time Series Anomalies and Interpretability


Range-based Anomalies


“Precision and Recall for Time Series” (NIPS ’18, Spotlight): https://arxiv.org/abs/1803.03639

SLIDE 17


Adaptation

Software that automatically evolves (e.g., repairs, optimizes, secures) itself. The adaptation work presented here centers on range-based anomaly detection.

SLIDE 18

Time Series Anomaly Detection

▪ How do we define TPs, TNs, FPs, FNs?

(Figure: example time series contrasting range-based and point-based anomalies, with true positives, false positives, and false negatives marked.)

SLIDE 19

(Prior) State of the Art

▪ Classical recall/precision

– Point-based anomalies
– Recall penalizes FNs; precision penalizes FPs
– Fβ-measure to combine and weight them

▪ Numenta Anomaly Benchmark (NAB)’s Scoring Model [1]

– Point-based anomalies
– Focuses specifically on early-detection use cases
– Difficult to use in practice (irregularities, ambiguities, magic numbers) [2]

▪ Activity recognition metrics

– No support for flexible time bias

[1] Lavin and Ahmad, “Evaluating Real-Time Anomaly Detection Algorithms – The Numenta Anomaly Benchmark”, IEEE ICMLA, 2015.
[2] Singh and Olinsky, “Demystifying Numenta Anomaly Benchmark”, IEEE IJCNN, 2017.

β: the relative importance of recall vs. precision
– β = 1: evenly weighted (harmonic mean)
– β = 2: weights recall higher (penalizes FNs more)
– β = 0.5: weights precision higher (penalizes FPs more)
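The classical point-based metrics and the Fβ combination described above can be computed directly from per-point binary labels; a short, self-contained sketch:

```python
# Point-based precision, recall, and F-beta over binary label sequences.
# beta > 1 favors recall; beta < 1 favors precision.
def prf(real, pred, beta=1.0):
    tp = sum(1 for r, p in zip(real, pred) if r and p)
    fp = sum(1 for r, p in zip(real, pred) if not r and p)
    fn = sum(1 for r, p in zip(real, pred) if r and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    b2 = beta * beta
    f = ((1 + b2) * precision * recall / (b2 * precision + recall)
         if precision + recall else 0.0)
    return precision, recall, f

real = [0, 1, 1, 0, 1]
pred = [0, 1, 0, 1, 1]
print(prf(real, pred))  # precision 2/3, recall 2/3, F1 2/3
```

For point anomalies this is the whole story; the next slides argue it breaks down once anomalies are ranges.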

SLIDE 20

(Prior) State of the Art


A new accuracy model is needed


SLIDE 21

New Evaluation Model

Expressive, Flexible, Extensible

▪ Superset of:

– The classical model
– Other state-of-the-art evaluators (e.g., NAB)

▪ NeurIPS ‘18 Spotlight
▪ Key: evaluate anomaly detectors with practical meaningfulness

https://ai.intel.com/precision-and-recall-for-time-series/

SLIDE 22

Precision & Recall for Time Series

https://ai.intel.com/precision-and-recall-for-time-series/

(Figure: the range-based Recall and Precision formulas, with customizable weights and functions.)
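The flavor of range-based recall can be sketched in a few lines. Note the hedges: this uses a flat positional bias and omits the cardinality factor and the customizable γ()/δ() functions of the published model, so it approximates the paper's definition rather than reproducing it.

```python
# Simplified range-based recall: each real anomaly range contributes an
# existence reward (was it detected at all?) blended with an overlap
# reward (how much of it was covered?), weighted by alpha.
def overlap(r, p):
    lo, hi = max(r[0], p[0]), min(r[1], p[1])
    return max(0, hi - lo + 1)          # ranges are inclusive (start, end)

def range_recall(real_ranges, pred_ranges, alpha=0.5):
    total = 0.0
    for r in real_ranges:
        existence = 1.0 if any(overlap(r, p) > 0 for p in pred_ranges) else 0.0
        size = r[1] - r[0] + 1
        cover = min(1.0, sum(overlap(r, p) for p in pred_ranges) / size)
        total += alpha * existence + (1 - alpha) * cover
    return total / len(real_ranges)

# One real anomaly fully covered, one missed entirely:
print(range_recall([(2, 5), (10, 12)], [(1, 6)]))  # → 0.5
```

Range-based precision is the symmetric computation over predicted ranges; the full model additionally lets users bias the score toward early or late detection within a range.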

SLIDE 23

TSAD-Evaluator Overview


▪ A tool that implements our customizable evaluation model
▪ Can be used in two modes:

– -c: compute classical (point-based) metrics
– -t: compute time series (range-based) metrics

▪ Input:

– Two files with anomaly labels (e.g., simple.real, simple.pred)
– Evaluator parameters

▪ Output:

– Precision, Recall, F-score

▪ A library of pre-defined choices for γ() and δ(), plus templates for user-defined extensions
▪ Example:

./evaluate -t simple.real simple.pred 1 0 reciprocal flat front

https://github.com/IntelLabs/TSAD-Evaluator/

SLIDE 24

New Evaluation Model – Helps Intel


▪ Positioned to benefit Intel internally

– Cyber-security, data centers, SW/HW vulnerabilities

SLIDE 25

Anomaly Detection Interpretability

Analysis of an anomaly:

1. Where/when is the anomaly?
– Existing work can achieve this
2. Why is this an anomaly?
– Partial solutions exist in this space
3. How do we fix the anomaly?
– Mostly an open problem

The “How” and “Why” are open questions for anomaly detection & neural networks

SLIDE 26

AutoPerf: Zero-Positive Learning (ZPL) Using an Autoencoder

Used to detect parallel-software performance anomalies.

• The encoder compresses the input data to a reduced dimension
• The decoder reconstructs the input data as the target of the network
• Reconstruction error: anomalous data cannot be reconstructed well from a representation learned only on non-anomalous data

Workflow:
1. Train the autoencoder on a non-anomalous dataset
2. Determine an anomaly threshold from the reconstruction errors
3. Run inference on new inputs: if the reconstruction error exceeds the threshold, an anomaly is detected; otherwise the input is non-anomalous

AutoPerf: A Generalized Zero-Positive Learning System to Detect Software Performance Anomalies (https://arxiv.org/abs/1709.07536)
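The zero-positive thresholding workflow above can be sketched end to end. Heavy hedging applies: the "autoencoder" below is a stand-in (reconstruction against the training mean), and the 10% threshold margin is an arbitrary illustrative choice; AutoPerf trains a real neural autoencoder on hardware telemetry.

```python
# Zero-positive learning sketch: train only on non-anomalous data, derive
# a reconstruction-error threshold, flag inputs that reconstruct poorly.
def fit(train):
    dim = len(train[0])
    # Stand-in "learned representation": the per-feature training mean.
    mean = [sum(x[i] for x in train) / len(train) for i in range(dim)]
    def err(x):                      # reconstruction error
        return sum((a - b) ** 2 for a, b in zip(x, mean))
    # Threshold: a margin above the worst error seen on clean data.
    threshold = max(err(x) for x in train) * 1.1
    return err, threshold

def is_anomaly(err, threshold, x):
    return err(x) > threshold

train = [[1.0, 2.0], [1.1, 1.9], [0.9, 2.1]]   # all non-anomalous
err, thr = fit(train)
print(is_anomaly(err, thr, [1.0, 2.0]))   # → False
print(is_anomaly(err, thr, [9.0, 9.0]))   # → True
```

The key property is that no anomalous examples are ever needed for training, which matters because real performance anomalies are rare and expensive to label.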

SLIDE 27

Interpreting Neural Network Judgments

Polaris: corrections as explanations for neural network judgments [2]

• Judgment problem: a binary classification problem where one output is preferred
• Example domains: vehicle collisions, software performance and correctness bugs, security vulnerabilities
• Proposed solution: corrections as actionable explanations
• Desired properties: minimal, stable, symbolic

[2] “Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections”, Xin Zhang (MIT), Armando Solar-Lezama (MIT), Rishabh Singh (Google Brain), NIPS ’18 (to appear).

(Figure annotation: “You are here” → “To get out of the anomalous state you must be here.”)
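The "correction as explanation" interface can be conveyed with a toy: given an input judged anomalous, search for a small input change that flips the judgment to the preferred outcome. Everything here is hypothetical for illustration (the linear judge, its weights, the step size); Polaris computes minimal, stable, symbolic corrections for real neural networks via symbolic reasoning, not brute force.

```python
# Toy judgment: anomalous iff a weighted sum of features exceeds 1.
def judge(x):
    w = [0.5, 2.0]                   # hypothetical judge weights
    return sum(wi * xi for wi, xi in zip(w, x)) > 1.0

# Search for the smallest single-feature change that flips the judgment.
def minimal_correction(x, step=0.1, max_steps=100):
    for k in range(1, max_steps + 1):        # grow the change magnitude
        for i in range(len(x)):              # try each feature
            for sign in (-1, 1):
                y = list(x)
                y[i] += sign * step * k
                if not judge(y):             # judgment flipped
                    return i, sign * step * k
    return None

x = [0.4, 0.5]                       # judged anomalous: 0.2 + 1.0 > 1
print(minimal_correction(x))         # shrink feature 1 slightly
```

The returned (feature, delta) pair is the explanation: it tells the user which knob to turn, and by how much, to leave the anomalous state, which is exactly the actionable form of interpretability the slide argues for.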

SLIDE 28

IL+MIT: Interpreting AutoPerf using Polaris

Pipeline: input features → AutoPerf → anomalous data points → Polaris → explanation of anomalous events → analysis → potential actions for autonomous correction → action

Team: Xin Sheng (MIT), Mejbah Alam (IL), Justin Gottschlich (IL), Armando Solar-Lezama (MIT)

Goal: automatic identification & correction of adaptation-like anomalies

SLIDE 29

Early Results: Anomaly Detection Interpretability

(Figure: a learned non-anomalous space.)

SLIDE 30

Early Results: Anomaly Detection Interpretability

(Figure: AutoPerf detects an anomaly outside the non-anomalous space.)


SLIDE 32

Early Results: Anomaly Detection Interpretability

(Figure: Polaris examines the anomaly detected by AutoPerf.)

SLIDE 33

Early Results: Anomaly Detection Interpretability

(Figure: Polaris proposes an action: move back into the non-anomalous space by reducing L3 HITMs.)

SLIDE 34

AutoTVM (NeurIPS ’18 Spotlight)

Learning to Optimize Tensor Programs: https://arxiv.org/pdf/1805.08166.pdf
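AutoTVM's loop — propose candidate schedule configurations, measure some on hardware, fit a cost model, and let the model steer further search — can be caricatured in a few lines. Everything below is a stand-in: the `measure` function fakes hardware timing, and the "cost model" is just distance to the best configuration seen, whereas the real system extracts features from tensor programs and trains gradient-boosted-tree or neural cost models.

```python
# Miniature AutoTVM-style tuning loop over a 1-D space of tile sizes.
def measure(tile):                    # stand-in for timing on hardware
    return abs(tile - 16) + (tile % 3) * 0.01   # pretend tile=16 is optimal

def auto_tune(candidates, budget=8):
    step = max(1, len(candidates) // budget)
    # Explore: measure a coarse sample of the search space.
    history = [(measure(c), c) for c in candidates[::step]]
    _, best = min(history)
    # "Cost model": rank untried configs by distance to the best seen.
    ranked = sorted(candidates, key=lambda c: abs(c - best))
    # Exploit: spend the remaining budget near the model's favorites.
    history += [(measure(c), c) for c in ranked[:budget]]
    return min(history)[1]

print(auto_tune(list(range(1, 65))))  # → 16
```

The point of the real cost model is the same as this toy's ranking step: avoid paying for a hardware measurement on every candidate in an enormous schedule space.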

SLIDE 35

AutoTVM (NeurIPS ’18 Spotlight)

Learning to Optimize Tensor Programs: https://arxiv.org/pdf/1805.08166.pdf

(Figure: benchmark results; lower is faster.)


SLIDE 37

Intel + TVM

Research by Intel Labs (Anand Venkat, Michael Anderson, Alexander Heinecke, Evangelos Georganas)

(Figure: performance results; higher is faster.)

SLIDE 38

(Figure: additional performance results from the same Intel Labs work; higher is faster.)

SLIDE 39

Conclusion

▪ Machine programming is coming!

– Interested in collaborating? Please contact me!
– Teaching a machine programming course @ Penn (Spring 2020)

▪ Machine Learning and Programming Languages (MAPL) Workshop

– Please consider submitting a paper to MAPL ‘19 (@ PLDI ‘19)

– 10-page ACM SIGPLAN published proceedings (submission: Jan/Feb)

– General Chair: Tim Mattson (Intel Labs)
– Program Chair: Armando Solar-Lezama (MIT)

justin.gottschlich@intel.com

SLIDE 40

Questions?

justin.gottschlich@intel.com
