Effective Data-Race Detection for the Kernel — presentation transcript



SLIDE 1

Effective Data-Race Detection for the Kernel

John Erickson, Madanlal Musuvathi, Sebastian Burckhardt, Kirk Olynyk
Microsoft Research
Symposium on Operating Systems Design and Implementation (OSDI), October 2010

SLIDE 2-3

Motivation

- Finding data races is hard
- Analysing them is even harder
- Races often indicate problems
- Kernel code "operates at a lower concurrency abstraction" than user code

    struct {
        int status  : 4;
        int pktRcvd : 28;
    } st;

    Thread 1:              Thread 2:
    st.status = 1;         st.pktRcvd++;

SLIDE 4

Data Races

Two operations that access main memory are called conflicting if

- the physical memory they access is not disjoint,
- at least one of them is a write, and
- they are not both synchronization accesses.

A program has a data race if it can be executed on a multiprocessor in such a way that two conflicting memory accesses are performed simultaneously (by processors or any other device).
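The three conditions can be written down directly as a predicate; the `struct access` model and all names below are illustrative, not from the paper:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative model of a single memory access. */
struct access {
    uintptr_t addr;    /* start address of the accessed range */
    size_t    size;    /* number of bytes accessed            */
    bool      is_write;
    bool      is_sync; /* true for synchronization accesses   */
};

/* Conflicting: ranges overlap, at least one is a write, and they are
   not both synchronization accesses. */
static bool conflicting(const struct access *a, const struct access *b)
{
    bool overlap = a->addr < b->addr + b->size &&
                   b->addr < a->addr + a->size;
    return overlap && (a->is_write || b->is_write)
                   && !(a->is_sync && b->is_sync);
}
```

Note that two reads never conflict, and neither do two interlocked (synchronization) operations, however they overlap.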

SLIDE 5-7

Race Detection

Precision
- Missed race: no warning from the detector
- Benign race: a race without negative effects on program behaviour
- False race: an error is reported even though there is no race

Detection Techniques
- Static: analyse source or byte code
- Dynamic: instrument the program and monitor its execution
  - Happens-before tracking: record the ordering of events and synchronisation operations
  - Lock sets: examine the lock set (held locks) during each data access

SLIDE 8

DataCollider

- Detects data races in existing Windows kernel code (x86)
- Independent of synchronisation protocols
- Extra debugging information about the race (stack trace, "context information")
- Runtime overhead below 5 % due to sampling
- Post-processing to prune and prioritise found races

SLIDE 9

Sampling Algorithm

1. Identify instructions that access data
2. Prune synchronisation instructions (volatile accesses, hardware synchronisation instructions)
3. Choose breakpoints uniformly from the sampling set, initially and after each race detection
4. Periodically readjust according to the number of fired breakpoints per second

⇒ Effective at low sampling rates
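Step 4 leaves the adjustment rule open; one simple possibility (my assumption, not the paper's exact formula) is to scale the breakpoint budget by the ratio of the target firing rate to the observed one:

```c
#include <assert.h>

/* Hypothetical readjustment for step 4: if breakpoints fire faster than
 * the target rate, arm fewer of them next period; if they fire more
 * slowly, arm more.  The exact rule used by DataCollider may differ. */
static int readjust_budget(int k, double fired_per_sec, double target_per_sec)
{
    if (fired_per_sec <= 0.0)
        return 2 * k;                      /* nothing fired: arm more */
    int next = (int)(k * target_per_sec / fired_per_sec);
    return next < 1 ? 1 : next;            /* keep at least one armed */
}
```

Because rarely executed code keeps its breakpoints armed until they fire, cold paths are sampled as surely as hot ones, just more slowly.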

SLIDE 10

Algorithm

    AtPeriodicIntervals() {
        // determine k based on the desired
        // memory access sampling rate
        repeat k times {
            pc = RandomlyChosenMemoryAccess();
            SetCodeBreakpoint(pc);
        }
    }

    OnCodeBreakpoint(pc) {
        // disassemble the instruction at pc
        (loc, size, isWrite) = disasm(pc);
        DetectConflicts(loc, size, isWrite);
        // set another code breakpoint
        pc = RandomlyChosenMemoryAccess();
        SetCodeBreakpoint(pc);
    }

SLIDE 11

Algorithm

    DetectConflicts(loc, size, isWrite) {
        temp = read(loc, size);
        if (isWrite) {
            SetDataBreakpointRW(loc, size);  // sampled write: trap on any access
        } else {
            SetDataBreakpointW(loc, size);   // sampled read: trap on writes only
        }
        delay();
        ClearDataBreakpoint(loc, size);
        temp' = read(loc, size);
        if (temp != temp' || data breakpoint fired) {
            ReportDataRace();
        }
    }
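The value re-read half of DetectConflicts can be sketched in user-mode C (hardware data breakpoints, the other detection path, are left out; all names are illustrative). The delay callback stands in for delay(), during which a conflicting write may land:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* temp = read(loc); delay(); temp' = read(loc); race if temp != temp'. */
static bool changed_while_paused(const void *loc, size_t size,
                                 void (*delay_fn)(void))
{
    unsigned char before[16], after[16];
    assert(size <= sizeof before);
    memcpy(before, loc, size);   /* temp  = read(loc, size)                 */
    delay_fn();                  /* delay(): window for a conflicting write */
    memcpy(after, loc, size);    /* temp' = read(loc, size)                 */
    return memcmp(before, after, size) != 0;
}

/* Toy "second thread": the delay callback performs a conflicting write. */
static int shared;
static void delay_with_write(void) { shared = 42; }
static void delay_no_write(void)   { }
```

This check alone misses a conflicting write that restores the original value, which is exactly why DetectConflicts also arms a data breakpoint for the duration of the delay.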

SLIDE 12-13

Data Race Detection

Hardware Data Breakpoints (x86)
- Based on virtual addresses
- IPI used to update the breakpoint registers atomically on all cores
- Sampled write → trap on read/write
- Sampled read → trap on write

Repeated Reads
- No detection of conflicting reads, or of conflicting writes that store the same last value
- Detects concurrent DMA writes
- Fallback when out of hardware data breakpoints
- Workaround for different virtual addresses mapping to the same physical address
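For reference, the two trap conditions map onto fields of the x86 debug-control register DR7. The helper below only composes the DR7 bits for one breakpoint slot (field layout per the Intel SDM); actually writing DR0-DR3/DR7 — and broadcasting the update to all cores with an IPI — is kernel-only and omitted:

```c
#include <assert.h>
#include <stdint.h>

/* DR7 field values from the Intel SDM.  R/W = 01 traps on writes
   (used for a sampled read); R/W = 11 traps on reads and writes
   (used for a sampled write). */
enum { DR_RW_WRITE = 0x1, DR_RW_READWRITE = 0x3 };
enum { DR_LEN_1 = 0x0, DR_LEN_2 = 0x1, DR_LEN_4 = 0x3 };

/* Compose the DR7 bits for breakpoint slot 0..3: local-enable bit Ln
   lives at bit 2n, R/Wn at bits 16 + 4n, LENn at bits 18 + 4n. */
static uint32_t dr7_for_slot(int slot, uint32_t rw, uint32_t len)
{
    return (1u  << (2 * slot))
         | (rw  << (16 + 4 * slot))
         | (len << (18 + 4 * slot));
}
```

With only four such slots per processor, running out of hardware breakpoints is easy, hence the repeated-read fallback above.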

SLIDE 14

Pruning Benign Races

- Statistics counters: counters that maintain low-fidelity statistical data
- Safe flag updates: a bit is read while a different bit is updated
- Special variables: races are expected, e.g. on the current time

⇒ ~90 % of detected data races are benign
⇒ Still reported, but deprioritised
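A "safe flag update" benign race, sketched in C (the flag names are invented): one thread tests one bit while another sets a different bit in the same word. The accesses conflict under the earlier definition, yet the reader's answer is unaffected:

```c
#include <assert.h>

#define F_READY 0x1u   /* bit the reader cares about          */
#define F_BUSY  0x2u   /* bit the writer updates concurrently */

static unsigned flags = F_READY;

/* Reader: only bit 0 matters, so a concurrent update of bit 1 in the
   same word is a data race -- but a benign one. */
static int is_ready(void) { return (flags & F_READY) != 0; }

/* Writer: read-modify-write of the whole word, touching bit 1 only. */
static void mark_busy(void) { flags |= F_BUSY; }
```

Contrast this with the bit-field example from the motivation slide: there both sides *write* the shared word, so an interleaving can lose an update, which is why that race is real rather than benign.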

SLIDE 15-16

Evaluation — Effectiveness

"[A]pplied DataCollider on several modules in the Windows operating system [...] class drivers, various PnP drivers, local and remote file system drivers, storage drivers, and the core kernel executive itself"

"[B]enign data races pruned heuristically and manually"

    Data Races Reported          Count
    Fixed                           12
    Confirmed and Being Fixed       13
    Under Investigation              8
    Harmless                         5
    Total                           38

SLIDE 17-18

Evaluation — Overhead

"[W]e repeatedly measured the time taken for the boot–shutdown sequence for different sampling rates and compared against a baseline Windows kernel running without DataCollider. These experiments were done on the x86 version of Windows 7 running on a virtual machine with 2 processors and 512 MB memory. The host machine is an Intel Core2-Quad 2.4 GHz machine with 4 GB memory running Windows Server 2008."

[Figure omitted: measured overhead for different sampling rates]

SLIDE 19-20

Evaluation — Efficacy of Pruning

"We enabled DataCollider while running kernel stress tests for 2 hours sampling at approximately 1000 code breakpoints per second."

    Data Race Category              Count
    Benign — Heuristically Pruned
      Statistics Counter               52
      Safe Flag Update                 29
      Special Variable                  5
    Benign — Manually Pruned
      Double-Checked Locking            8
      Volatile                          8
      Write Same Value                  1
      Other                             1
    Real
      Confirmed                         5
      Investigating                     4
    Total                             113

SLIDE 21-22

Conclusion

Summary
- DataCollider detects and reports data races on x86
- Uses hardware breakpoints and sampling for low overhead
- Automatically prunes most false positives
- Suitable for existing (kernel) code

Discussion
- Astonishingly simple approach
- Evaluation of overhead using a virtual machine!?
- volatile vs. synchronisation