Data-centric Profiling Working Group Outbrief

Oct 03, 2023

Basic Concept
• Associating performance data with data objects (arrays), beyond code contexts (loops, procedures)
  – PMU support
  – data-centric attribution
  – use of data-centric profiling
Data-centric Profiling WG
• Current PMU support
  – Intel PEBS, AMD IBS, IBM Mark events to sample memory accesses
    • effective address, latency, memory layers
  – monitoring loads alone is not enough; stores and prefetch instructions also need to be monitored
    • use the L1D replacement event (https://software.intel.com/en-us/forums/intel-performance-bottleneck-analyzer/topic/326007)
  – better to monitor evicted cache lines
    • Jeff's paper: http://www.cs.umd.edu/~hollings/papers/ijhpca06.pdf
  – LBR: use call stack mode (monitoring calls/returns) to reconstruct the call stack
    • 16 frames on average with 32 LBR slots
  – Intel PT
    • ptwrite (Goldmont), a lightweight printf, triggers LBR. Calling ptwrite inside malloc can obtain the call path from LBR
  – page fault events, a hardware event (Goldmont)
    • could possibly measure the first-touch location
  – limitations
    • no PID or TID; the OS kernel needs to supply this information
    • PEBS latency_above_threshold may produce biased results
      – sample MEM_LOAD/MEM_RETIRED
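The LBR call stack mode mentioned above can be pictured with a small simulation. This is a minimal sketch, not a model of real hardware behavior in every detail: it assumes a call pushes an entry, a return pops the most recent entry, and the oldest entry is dropped once all slots are full. The event format and the name `reconstruct_call_stack` are hypothetical and not taken from the slides.

```python
from collections import deque

def reconstruct_call_stack(events, slots=32):
    """Simulate an LBR operating in call stack mode.

    events: iterable of ("call", callee_name) or ("ret", None) tuples
            (a hypothetical trace format, for illustration only).
    Returns the surviving frames, outermost first.
    """
    lbr = deque(maxlen=slots)  # a full deque silently drops its oldest entry
    for kind, callee in events:
        if kind == "call":
            lbr.append(callee)
        elif kind == "ret" and lbr:
            lbr.pop()  # a return removes the most recent call
    return list(lbr)

trace = [("call", "main"), ("call", "solve"), ("call", "helper"),
         ("ret", None), ("call", "malloc")]
stack = reconstruct_call_stack(trace)             # enough slots for every frame
shallow = reconstruct_call_stack(trace, slots=2)  # outer frames are lost
```

In this model, frames beyond the available slots are lost from the outer end, so a reconstructed call path can be truncated.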
• Handle attribution to data structures
  – static: easy to handle from the symbol table
    • need Dyninst to extract allocation source lines from DWARF
  – heap
    • high overhead if malloc/free are called frequently
      – probably use ptwrite to reduce the overhead
    • the call stack is important
      – merge the objects allocated in the same call path
      – (David) the meaningful allocation site may be a few frames above the "malloc"
  – stack
    • (Xiaozhu) Dyninst supports extracting this information from DWARF
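The heap attribution idea above (map each sampled effective address to an allocated object, then merge objects that share an allocation call path) can be sketched as follows. This is an illustrative sketch under stated assumptions: allocation ranges do not overlap, the call path is already available as a tuple of frame names, and the function name `attribute_samples` and both record layouts are hypothetical.

```python
import bisect
from collections import defaultdict

def attribute_samples(allocations, samples):
    """Attribute sampled addresses to heap objects, merged by allocation call path.

    allocations: list of (start_address, size, call_path), non-overlapping ranges.
    samples:     list of (effective_address, latency_cycles).
    Returns {call_path: [sample_count, total_latency]}.
    """
    ordered = sorted(allocations)
    starts = [start for start, _, _ in ordered]
    totals = defaultdict(lambda: [0, 0])
    for address, latency in samples:
        # Find the last object whose start address is <= the sampled address.
        index = bisect.bisect_right(starts, address) - 1
        if index < 0:
            continue  # address lies below every tracked object
        start, size, call_path = ordered[index]
        if address < start + size:
            totals[call_path][0] += 1
            totals[call_path][1] += latency
    return dict(totals)

grid_path = ("main", "build_grid", "malloc")
input_path = ("main", "read_input", "malloc")
allocations = [(0x1000, 0x100, grid_path),
               (0x2000, 0x100, grid_path),   # same call path: merged with the first
               (0x3000, 0x40, input_path)]
samples = [(0x1010, 200), (0x2080, 50), (0x3004, 10), (0x5000, 99)]
profile = attribute_samples(allocations, samples)
```

Keying the result by call path rather than by individual object keeps the profile small when many objects come from the same allocation site. A real tool might trim the path to the frame that is meaningful to the developer, as noted above.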
• Use of data-centric profiling
  – locality optimization
    • data layout optimization
      – David has some work on helping developers change data layout
    • temporal locality
  – false sharing
    • HITM events for loads
      – may miss store-store false sharing
    • Intel PTU, toplev, Feather (Xu's group) identify false sharing
  – NUMA optimization
    • lightweight pattern analysis across threads
  – structure splitting
    • identify how different fields of a data structure are accessed
      – structslim from Xu's group: https://dl.acm.org/citation.cfm?id=2854053
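The false-sharing use case above can be illustrated with a simple check over address samples. This sketch does not describe how Intel PTU, toplev, or Feather work. It only shows the underlying idea, assuming a 64-byte cache line, samples that record both loads and stores, and access offsets that ignore access width. The function name `find_false_sharing` and the sample layout are hypothetical.

```python
from collections import defaultdict

CACHE_LINE_BYTES = 64  # assumed line size

def find_false_sharing(samples):
    """Flag cache lines written to and touched by several threads at disjoint offsets.

    samples: list of (thread_id, address, is_store).
    Returns a sorted list of cache line base addresses that look falsely shared.
    """
    per_line = defaultdict(lambda: defaultdict(set))  # line -> thread -> offsets
    written = set()
    for thread_id, address, is_store in samples:
        offset = address % CACHE_LINE_BYTES
        line = address - offset
        per_line[line][thread_id].add(offset)
        if is_store:
            written.add(line)
    suspects = []
    for line, by_thread in per_line.items():
        if line not in written or len(by_thread) < 2:
            continue  # read-only or single-thread lines cannot be falsely shared
        offset_sets = list(by_thread.values())
        all_offsets = set().union(*offset_sets)
        # Disjoint per-thread offsets: no offset is touched by two threads.
        if sum(len(offsets) for offsets in offset_sets) == len(all_offsets):
            suspects.append(line)
    return sorted(suspects)

samples = [(0, 0x1000, True), (1, 0x1008, True),    # different offsets, stores
           (0, 0x2000, True), (1, 0x2000, False),   # same offset: true sharing
           (0, 0x3000, False), (1, 0x3010, False)]  # read-only line
suspects = find_false_sharing(samples)
```

Because stores are included in the samples, this check also covers the store-store case that load-only HITM sampling may miss.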
Challenges
• Stephane: how to do data profiling offline
  – collect all raw data online with low overhead
  – perform data attribution offline
  – timestamp information
• Michael: automate the fix
  – Joseph (UPenn)'s approach of detecting and fixing false sharing
  – Intel PGO, used to guide global data reorganization on Itanium, can improve a DB workload by 25%
• Stephane: compiler support to annotate each memory access instruction
  – which type is accessed
  – the offset
• Michael: data-centric profiling on small cores
  – insights for temporal locality
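The role of timestamps in offline attribution can be sketched as follows: when an address range is freed and later reused, only the timestamp tells which object a sample belongs to. This is a minimal sketch with hypothetical log layouts and a hypothetical function name `attribute_offline`. A linear scan is used for clarity, and a real tool would likely use an indexed structure.

```python
def attribute_offline(alloc_log, sample_log):
    """Match each sample to the object that was live at the sample's timestamp.

    alloc_log:  list of (object_name, start_address, size, alloc_time, free_time);
                free_time is None for objects that were never freed.
    sample_log: list of (timestamp, effective_address).
    Returns a list of object names (or None) in sample order.
    """
    result = []
    for timestamp, address in sample_log:
        owner = None
        for name, start, size, alloc_time, free_time in alloc_log:
            in_range = start <= address < start + size
            live = alloc_time <= timestamp and (
                free_time is None or timestamp < free_time)
            if in_range and live:
                owner = name
                break
        result.append(owner)
    return result

# The same address range is reused by two objects at different times.
alloc_log = [("particles", 0x1000, 0x100, 10, 50),
             ("mesh", 0x1000, 0x80, 60, None)]
sample_log = [(20, 0x1040), (55, 0x1040), (70, 0x1040), (70, 0x10C0)]
owners = attribute_offline(alloc_log, sample_log)
```

Only the raw allocation and sample records need to be collected online. The matching shown here can run after the program has finished.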
