Runtime Error Analysis - A Machine Learning Perspective Praful - PowerPoint PPT Presentation

Runtime Error Analysis - A Machine Learning Perspective Praful Mangalath University of Colorado, Boulder Center for Computational Language and EducAtion Research (CLEAR) April 29th, 2009

Project Summary ◮ Runtime Error Analysis ◮ 5535 Deliverables ◮ developed developing* a bug finding toolkit for C ◮ Benchmarks on Siemens Test Suite ◮ Applied machine learning techniques to detect runtime errors

Outline of this talk ◮ Setup background and explain the problem ◮ Demo ◮ Details of Implementation ◮ Experimental data

Finding Errors in Code - Static Properties ◮ Check for syntactic and static semantic rules ◮ Errors to Warnings ratio low ◮ Cheap and easy to use ◮ Tools : FindBugs, Splint

Finding Errors in Code - Dynamic Properties ◮ Code verification with Abstract Interpretation. ◮ Without executing program investigate program behavior ◮ Derive dynamic properties from source code ◮ mature and sound mathematical basis ◮ Tools : BLAST, SLAM (Static Driver Verifier)

Finding Errors in Code - Dynamic Properties ◮ Test driven code verification ◮ Identifies only symptoms not cause of error ◮ Tracing anomaly to root cause manual time-consuming process ◮ Effectiveness limited to test case coverage

Verifying Dynamic Properties - Analogy ◮ Goal - predict the trajectory of a projectile mid-air ◮ Abstract Interpretation ◮ laws of physics (gravity, initial speed, air braking coeff) ◮ transform problem into set of equations ◮ solve by mathematical rules, formal or numeric ◮ Test driven ◮ launch many projectiles and record observations ◮ derive empirical laws of motion and error margins ◮ estimate trajectory and report a confidence parameter ◮ Mathworks White Paper: ’Verifying Code When Software Reliability is Critical.’ , Paul Barnard, Marc Lalo, & Jim Tung. 2008

Cooperative Bug Isolation (CBI) Project ◮ "Scalable Statistical Bug Isolation" Ben Liblit, Mayur Naik, Alice Zheng, Alex Aiken & Michael Jordan (PLDI 2005) ◮ bug-finding post-deployment ◮ application in the wild >> writing test cases ◮ "Interesting program behavior is expressible as a predicate on a state at a particular program point" ◮ Sample predicates from users running these applications ≈ Yields best test case coverage

Cooperative Bug Isolation (CBI) Project - Architecture Predicates Source Instrumented Sampler Code Application Compiler BUGS predicate Statistical log Debugging reports

Modeling Program Behavior with Predicates <CODE> upward_preferred = Inhibit_Biased_Climb() > Down_Separation; if (upward_preferred) { result = !(Own_Below_Threat()) || ((Own_Below_Threat()) && (!(Down_Separation >= ALIM()))); } else <CODE> tcas.c

Modeling Program Behavior with Predicates <CODE> upward_preferred = Inhibit_Biased_Climb() > Down_Separation; Branch Predicate if (upward_preferred) { result = !(Own_Below_Threat()) || ((Own_Below_Threat()) && (!(Down_Separation >= ALIM()))); } else <CODE> tcas.c Figure: For each conditional, count how many times the branch predicate is false or true. Each branch induces one instrumentation point with a pair of counters.

Modeling Program Behavior - Execution Profiles ◮ Instrumentation sites ◮ branches - pair of counters (branch false ,branch true ) ◮ bounds - at each assignment site we record max and min values ◮ function-calls - count function entries ◮ Collect predicate values with some sampling period ◮ Collect execution profile ◮ A set of execution profiles (failed & successful runs) is the input to the machine learning component

Machine Learning Components predicate log reports Component Classifier Mixture Model Nested Support Chinese Vector Restaurant Machine Process Heuristics BUGS

Classifier Design - Support Vector Machine ◮ Goal to use predicates as features to determine failed/successful execution profiles ◮ Linear algorithm in feature space is equivalent to non-linear algorithm in input space ◮ Ranks predicate features that were significant in making fail/pass decision

Hierarchical Mixture Model - Nested Chinese Restaurant Process ◮ Goal to enable predicates to share clusters ◮ Number of clusters varies for each report and needs to be inferred automatically ◮ For complex source code with library dependencies clusters could be hierarchical 1 2 3 4 � 3 2 1 6 + � 6 + � 6 + � 6 + �

Data ◮ Siemens Test Suite ◮ 132 known expert induced bugs ◮ supporting test cases

Conclusion ◮ Machine learning approach to runtime error analysis ◮ Tool requires no specialized annotation or expertise to tune/run ◮ More data = ⇒ better performance in ML ◮ Instrument real-world application

Runtime Error Analysis - A Machine Learning Perspective Praful - PowerPoint PPT Presentation

Runtime Error Analysis - A Machine Learning Perspective Praful Mangalath University of Colorado, Boulder Center for Computational Language and EducAtion Research (CLEAR) April 29th, 2009 Project Summary Runtime Error Analysis 5535

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

Introduction to Machine Learning Evaluation: Test Error Learning goals training error 0.06

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits

Introduction to Machine Learning Evaluation: Training Error compstat-lmu.github.io/lecture_i2ml

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Task scheduling over Heterogeneous Multicore Machines: a Runtime Perspective Raymond Namyst

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Error Detection Codes Error Detection Two types Nave scheme Error Detection Codes

Tax Townhall 02.05.18 2 irishfunds.ie Agenda Opening Gareth Bryan, KPMG AEOI (FATCA

Learning Recursive Segments for Discourse Parsing Stergos D. Afantenos Pascal Denis

Proposed changes to the Spokesperson election procedures Report from the Reflection

182.694 Microcontroller VU Martin Perner SS 2017 Featuring Today: Assembler Programming Weekly

CMB: How you see it Mike Peel, 19 November 2009 Discovery - Penzias & Wilson Image from NASA

Congressional Budget Office March 7, 2017 The 2017 Budget and Economic Outlook National

The EDM measured at BNL Becky Chislett UCL Workshop on future muon EDM searches at Fermilab and

Pipeline to the Chief Business Officer Challenges, Opportunities and Calls to Action Tuesday,

Runtime Error Analysis - A Machine Learning Perspective Praful - PowerPoint PPT Presentation

Runtime Error Analysis - A Machine Learning Perspective Praful Mangalath University of Colorado, Boulder Center for Computational Language and EducAtion Research (CLEAR) April 29th, 2009 Project Summary Runtime Error Analysis 5535

Chapter 11: The R.M.S. Error for Regression Errors: A has a large positive error B has a large

Introduction to Machine Learning Evaluation: Test Error Learning goals training error 0.06

ERROR DETECTON &amp; CORRECTION Error Detection EDC= Error Detection and Correction bits

Introduction to Machine Learning Evaluation: Training Error compstat-lmu.github.io/lecture_i2ml

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Task scheduling over Heterogeneous Multicore Machines: a Runtime Perspective Raymond Namyst

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

Human Error and Human Error Identification Techniques adapted from an IE 545 presentaton by

An Overview of Human Error Drawn f rom J . Reason, Human Error , Cambridge, 1990 Aaron Brown CS

Questions From Chapter 1 Figure 1.1: Testing life cycle Ch 12 Error vocabulary 1

Error Detection Codes Error Detection Two types Nave scheme Error Detection Codes

Tax Townhall 02.05.18 2 irishfunds.ie Agenda Opening Gareth Bryan, KPMG AEOI (FATCA

Learning Recursive Segments for Discourse Parsing Stergos D. Afantenos Pascal Denis

Proposed changes to the Spokesperson election procedures Report from the Reflection

182.694 Microcontroller VU Martin Perner SS 2017 Featuring Today: Assembler Programming Weekly

CMB: How you see it Mike Peel, 19 November 2009 Discovery - Penzias &amp; Wilson Image from NASA

Congressional Budget Office March 7, 2017 The 2017 Budget and Economic Outlook National

The EDM measured at BNL Becky Chislett UCL Workshop on future muon EDM searches at Fermilab and

Pipeline to the Chief Business Officer Challenges, Opportunities and Calls to Action Tuesday,

ERROR DETECTON & CORRECTION Error Detection EDC= Error Detection and Correction bits

CMB: How you see it Mike Peel, 19 November 2009 Discovery - Penzias & Wilson Image from NASA