Using Invariant Analysis for Improving Instrumentation-based - PowerPoint PPT Presentation

Using Invariant Analysis for Improving Instrumentation-based Performance Evaluation of SPECjvm2008 Benchmarks Michael Kuperberg, Martin Krogmann, Ralf Reussner Karlsruhe Institute of Technology SOFTWARE DESIGN AND QUALITY GROUP INSTITUTE FOR PROGRAM STRUCTURES AND DATA ORGANIZATION, FACULTY OF INFORMATICS KIT – University of the State of Baden-Wuerttemberg and www.kit.edu National Research Center of the Helmholtz Association

Motivation � Cross-platform performance prediction [KKR2008a] for systematic engineering of component-based software � Performance in our case: execution duration of component services � Performance prediction e.g. for following scenarios: Exec. platform 1 Exec. platform 2 � Relocation of an application to another execution platform A E A E ? � Sizing : choosing appropriate Exec. platform 4 Exec. platform 3 execution platform to fulfil F D changed perf. requirements ? Exec. platform 5 2 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Bytecode-based Performance Prediction � Context of presented work: bytecode-based performance prediction [KKR2008a] for existing components: � Performance of a component on other execution platform � Bytecode instructions counts as a performance metric number of intructions 1. 2. execution duration NEWARRAY Count Benchmark NEWARRAY LMUL DUP bytecode bytecode IADD IADD LMUL DUP instructions instructions 3. Predict performance: combine counts and benchmark results � Counting must be performed at runtime, since static analysis or symbolic execution not sufficient � Must be applicable to sourceless and legacy components 3 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

ByCounter: Runtime Bytecode Instruction Counting using Application Instrumentation � ByCounter collects runtime counts of Java bytecode instructions and method invocations Bytecode classes Application Workload Aggregated of application instruction counts ... IINC ByCounter Method a(): meth1() ... IMUL 27865*LLOAD meth2() 976*meth1() ISTORE ... LLOAD LLOAD Method b(): ... Settings ... � Counts different instruction types individually � Configurable parameter recording for array-related instructions � Not constrained by timer accuracies and costs (cf. short methods) � Based on JVM-independent application instrumentation 4 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Overview over the ByCounter Process Instrument bytecode before execution ... ... ... ... ... 101 11 ... ... 101 11 ... 101 11 ILOAD 101 1 ILOAD 101 11 1 3. Convert 101 1 111 110 11 IINC C1 1. Parse 2. Instrument 1 110 11 110 11 IADD 110 1 into IADD 110 11 1 110 1 program parsed program 111 ... IINC C8 1 ... executable ... ... ... ... ... bytecode representation ... bytecode Execute instrumented bytecode 5. Replace 6. Run 4. Create original instrumented ... testbed with instru- bytecode, 27865*ILOAD if needed 11108*IADD mented collect 8764*meth1 (parameters, bytecode counting () ... etc.) classes results 5 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Idea and Advantages of ByCounter � Idea : instrument the application, not the virtual machine � Insert counters into existing bytecode, preserve method signatures � Advantages : � Instrumentation transparent to the application: no functional side-effects (but: runtime overhead) � Method invocations by the bytecode of the instrumented method: configurable and extendable treatment � No dependence on native interfaces, works on any JVM � Idea applicable to Dalvik, CLR etc. � Previous approaches: use modified JVMs or JVMTI etc. � Insufficient portability; not desirable in production environments 6 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Example: SOR Part of the Scimark Benchmark in SPECjvm2008 � No jumps, loops, method invocations or other control flow è The number of executed bytecode instructions... � ... is independent of the input parameter values of num_flops � ... is independent of the state of the invocation target � ... can be determined statically 7 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Switching to Bytecode Instruction Sequences � Since counting bytecode instructions individually... � ... is costly in terms of runtime overhead (CPU, memory) � ... limits scalability, offers room for improvement � Solution: identify and use performance-invariant bytecode instruction sequences (PIBISes) � Decreases amount of inserted instrumentation � Maintains existing precision of counting results � Similar to basic blocks (and dictionaries in data compression) � We extended ByCounter and studied the effects using workloads of the SPECjvm2008 benchmark 8 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

PIBISes: Treatment in ByCounter � PIBISes are not identical to basic blocks: � As with basic blocks: no jumps etc. allowed � Additionally: a PIBIS may not contain instructions with parameter-dependent performance (which can change between executions: cf. size parameter of newarray ) � Extended ByCounter: identifies PIBISes � Instead of 1 counter incrementation for every single executed instruction: 1 incrementation per PIBIS exec. � Note that some PIBISes still contain just one instruction 9 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Implementation of ByCounter for Java � Analysable, easily modifiable representation 1. Parse program Obtained using ASM framework � bytecode Insert counting instrumentation into application � Counters are long -typed bytecode local variables � 2. Instrument parsed (invisible outside the instrumented method), program Counters initialised when method execution starts � representation and run Each execution of instruction/PIBIS: counter is � resulting also incremented bytecode Report counters at method exit points (write to a log � file or report to a central „collector “ daemon) Instrumented .class files: persistable, usable by any ClassLoader � Existing workloads, harnesses, scripts and configurations can be used � 10 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Preliminary Results 160.00 Durations in seconds 139.10 � 140.00 Median values based on 21 120.00 � 100.00 measurements using java. 80.00 58.40 56.90 55.30 lang.System.nanoTime() 60.00 48.02 40.00 Durations include result � 6.09 20.00 6.10 5.79 4.26 aggregation and storage 0.00 Crypto.AES Derby MPEG Audio JITting takes place (proof: � Uninstrumented -XX:+PrintCompilation Instrumented(Original Method) JVM flag to enable logging) Instrumented (PIBIS Analysis) Evaluation platform (runs Mac OS X 10.6.4, 64 bit): 2.8 GHz Intel Core 2 Duo, 4 GB of 1067 MHz DDR3 main memory � JVM 1.6.0_20 provided by Apple (default mode, equals –server ) � - Xmx768M JVM flag to allocate 768 MB of heap memory � 11 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Related Work Concerning SPECjvm98: � [Gregg et al., 2002] modified JVM to benchmarking methods and bytecode � instructions, no research on counting overhead [Lambert and Power, 2005] static/dynamic frequencies of basic blocks � [Li et al., 2000] complete system simulation: not addressing bytecode-level � basic blocks or precise bytecode counts SPECjvm2008 � [Oi, 2009], [Oi, 2010] compared other performance metrics, different JVMs � [Shiv et al., 2009] impact of hardware architecture details on � SPECjvm2008 performance in comparison to other SPEC benchmarks JVM-internal basic block analysis for Just-in-Time compilation etc. � Analysis results not available to platform-independent counting tools � Program optimisers, escape analysis and control flow graph analysis of � basic blocks have different objectives 12 Oct 8th, 2010 Kuperberg et al. - Invariant Analysis for Performance Evaluation Software Design and Quality Group Institute for Program Structures and Data Organization

Using Invariant Analysis for Improving Instrumentation-based - PowerPoint PPT Presentation

Using Invariant Analysis for Improving Instrumentation-based Performance Evaluation of SPECjvm2008 Benchmarks Michael Kuperberg, Martin Krogmann, Ralf Reussner Karlsruhe Institute of Technology SOFTWARE DESIGN AND QUALITY GROUP INSTITUTE FOR

Dynamic Binary Instrumentation: Introduction to Pin Instrumentation A technique that injects

Beam Instrumentation Hermann Schmickler (CERN Beam Instrumentation Group) Hermann Schmickler

MPIfR APEX Instrumentation MPIfR APEX Instrumentation Bernd Klein Bernd Klein bklein@mpifr.de

Loom Weaving Instrumentation for Program Analysis Brian Kidney (Presenter) Jonathan Anderson

Instrumentation best practices in Brewing Slide 1 Ola Wesstrom Instrumentation best practices in

Analog Electronics for Beam Instrumentation Jeroen Belleman CERN June 4-5, 2018 Jeroen Belleman

Pin Tutorial What is Instrumentation? A technique that inserts extra code into a program to

Outline Last time: local invariant features, scale invariant detection Lecture 14:

Invariant Variational Calculus Irina Kogan North Carolina State University & IMA December

T-duality Invariant Formalisms at the Quantum Level Daniel Thompson Queen Mary University of

Cross-ISA Machine Instrumentation Cross-ISA Machine Instrumentation using Fast and Scalable

State-based Testing Using Dynamic Instrumentation Vijay Upadya Microsoft Agenda What is

Loop-invariant code motion Example Two steps: analysis & transformation x := 3 Step 1: find

1 Algorithm for Identifying Loop Invariant Code Algorithm for Identifying Loop Invariant Code

Improving Improving Finances, Finances, Improving Improving Lives Lives www.jeanchatzky.com

protoDUNE single-phase Cryogenic and Argon Instrumentation - Overview Stephen Pordes,

CSE 331 Composite Layouts; Decorators slides created by Marty Stepp based on materials by M.

Perf rform rmance ance Inter terfer ference ence on Multico ticore Processor essors

I N C LOUD C OMPUTING Christina Delimitrou Stanford University Defense May 26 th

Architecture-based Dependability Prediction for Service-oriented Computing Vincenzo Grassi

Digging Into The Core of Boot Yuriy Bulygin @c7zero Oleksandr Bazhaniuk @ABazhaniuk Agenda

Porting D3JS to kotlin multiplaform Gatan Zoritchak Pierre Mariac @gz_k @imtam5

Cross-Platform Development CS 4720 Mobile Application Development CS 4720 Which Platform?

Building and Distributing SDK Add-ons Dave Smith NewCircle, Inc. @devunwired +DaveSmithDev

Using Invariant Analysis for Improving Instrumentation-based - PowerPoint PPT Presentation

Using Invariant Analysis for Improving Instrumentation-based Performance Evaluation of SPECjvm2008 Benchmarks Michael Kuperberg, Martin Krogmann, Ralf Reussner Karlsruhe Institute of Technology SOFTWARE DESIGN AND QUALITY GROUP INSTITUTE FOR

Dynamic Binary Instrumentation: Introduction to Pin Instrumentation A technique that injects

Beam Instrumentation Hermann Schmickler (CERN Beam Instrumentation Group) Hermann Schmickler

MPIfR APEX Instrumentation MPIfR APEX Instrumentation Bernd Klein Bernd Klein bklein@mpifr.de

Loom Weaving Instrumentation for Program Analysis Brian Kidney (Presenter) Jonathan Anderson

Instrumentation best practices in Brewing Slide 1 Ola Wesstrom Instrumentation best practices in

Analog Electronics for Beam Instrumentation Jeroen Belleman CERN June 4-5, 2018 Jeroen Belleman

Pin Tutorial What is Instrumentation? A technique that inserts extra code into a program to

Outline Last time: local invariant features, scale invariant detection Lecture 14:

Invariant Variational Calculus Irina Kogan North Carolina State University &amp; IMA December

T-duality Invariant Formalisms at the Quantum Level Daniel Thompson Queen Mary University of

Cross-ISA Machine Instrumentation Cross-ISA Machine Instrumentation using Fast and Scalable

State-based Testing Using Dynamic Instrumentation Vijay Upadya Microsoft Agenda What is

Loop-invariant code motion Example Two steps: analysis &amp; transformation x := 3 Step 1: find

1 Algorithm for Identifying Loop Invariant Code Algorithm for Identifying Loop Invariant Code

Improving Improving Finances, Finances, Improving Improving Lives Lives www.jeanchatzky.com

protoDUNE single-phase Cryogenic and Argon Instrumentation - Overview Stephen Pordes,

CSE 331 Composite Layouts; Decorators slides created by Marty Stepp based on materials by M.

Perf rform rmance ance Inter terfer ference ence on Multico ticore Processor essors

I N C LOUD C OMPUTING Christina Delimitrou Stanford University Defense May 26 th

Architecture-based Dependability Prediction for Service-oriented Computing Vincenzo Grassi

Digging Into The Core of Boot Yuriy Bulygin @c7zero Oleksandr Bazhaniuk @ABazhaniuk Agenda

Porting D3JS to kotlin multiplaform Gatan Zoritchak Pierre Mariac @gz_k @imtam5

Cross-Platform Development CS 4720 Mobile Application Development CS 4720 Which Platform?

Building and Distributing SDK Add-ons Dave Smith NewCircle, Inc. @devunwired +DaveSmithDev

Invariant Variational Calculus Irina Kogan North Carolina State University & IMA December

Loop-invariant code motion Example Two steps: analysis & transformation x := 3 Step 1: find