Mark Batty University of Kent It is time for mechanised industrial - PowerPoint PPT Presentation

Mechanised industrial concurrency specification: C/C++ and GPUs Mark Batty   University of Kent  

It is time for mechanised industrial standards Specifications are written in English prose: this is insufficient Write mechanised specs instead (formal, machine-readable, executable) This enables verification, and can identify important research questions Writing mechanised specifications is practical now 2

A case study: industrial concurrency specification 3

Shared memory concurrency Multiple threads communicate through a shared memory Thread Thread … … Shared memory 4

Shared memory concurrency Multiple threads communicate through a shared memory Thread Thread … … Shared memory Most systems use a form of shared memory concurrency: 5

An example programming idiom data , flag , r initially zero Thread 1: Thread 2: data = 1; while (flag==0) flag = 1; {}; r = data; In the end r==1 Sequential consistency: … Thread 1 Thread 2 simple interleaving of concurrent accesses … data, flag, r Reality: more complex 6

An example programming idiom data , flag , r initially zero Thread 1: Thread 2: data = 1; while (flag==0) flag = 1; {}; r = data; In the end r==1 Sequential consistency: … Thread 1 Thread 2 simple interleaving of concurrent accesses … data, flag, r Reality: more complex 7

Relaxed concurrency Memory is slow, so it is optimised (buffers, caches, reordering…) e.g. IBM’s machines allow reordering of unrelated writes (so do compilers, ARM, Nvidia…) data , flag , r initially zero Thread 1: Thread 2: data = 1; while (flag==0) flag = 1; {}; r = data; In the end r==1 Sometimes, in the end r==0 , a relaxed behaviour Many other behaviours like this, some far more subtle, leading to trouble 8

Relaxed concurrency Memory is slow, so it is optimised (buffers, caches, reordering…) e.g. IBM’s machines allow reordering of unrelated writes (so do compilers, ARM, Nvidia…) data , flag , r initially zero Thread 1: Thread 2: flag = 1; while (flag==0) data = 1; {}; r = data; In the end r==1 Sometimes, in the end r==0 , a relaxed behaviour Many other behaviours like this, some far more subtle, leading to trouble 9

Relaxed behaviour leads to problems Bugs in deployed processors Power/ARM processors: unintended relaxed behaviour Many bugs in compilers observable on shipped machines Bugs in language specifications [AMSS10] Bugs in operating systems 10

Relaxed behaviour leads to problems Bugs in deployed processors Errors in key compilers (GCC, LLVM): compiled programs could Many bugs in compilers behave outside of spec. Bugs in language specifications [MPZN13, CV16] Bugs in operating systems 11

Relaxed behaviour leads to problems The C and C++ standards had Bugs in deployed processors bugs that made unintended behaviour allowed. Many bugs in compilers Bugs in language specifications More on this later. Bugs in operating systems [BOS+11, BMN+15] 12

Relaxed behaviour leads to problems Bugs in deployed processors Confusion among operating system engineers leads to Many bugs in compilers bugs in the Linux kernel Bugs in language specifications [McK11, SMO+12] Bugs in operating systems 13

Relaxed behaviour leads to problems Bugs in deployed processors Many bugs in compilers Bugs in language specifications Bugs in operating systems Current engineering practice is severely lacking! 14

Vague specifications are at fault Relaxed behaviours are subtle, difficult to test for and often unexpected, yet allowed for performance Specifications try to define what is allowed, but English prose is untestable, ambiguous, and hides errors 15

A diverse and continuing effort Modelling of hardware and languages Simulation tools and reasoning principles Build mechanised executable formal models of specifications Empirical testing of current hardware Verification of language design goals [AFI+09,BOS+11,BDW16] Test and verify compilers [FGP+16,LDGK08,OSP09] Feedback to industry: specs and test suites 16

A diverse and continuing effort Provide tools to simulate the Modelling of hardware and languages formal models, to explain their Simulation tools and reasoning principles behaviours to non-experts Empirical testing of current hardware Verification of language design goals Provide reasoning principles to help in the verification of code Test and verify compilers Feedback to industry: specs and test suites [BOS+11,SSP+,BDG13] 17

A diverse and continuing effort Modelling of hardware and languages Run a battery of tests to Simulation tools and reasoning principles understand the observable Empirical testing of current hardware behaviour of the system and Verification of language design goals check it against the model Test and verify compilers [AMSS’11] Feedback to industry: specs and test suites 18

A diverse and continuing effort Modelling of hardware and languages Simulation tools and reasoning principles Explicitly stated design goals Empirical testing of current hardware should be proved to hold Verification of language design goals [BMN+15] Test and verify compilers Feedback to industry: specs and test suites 19

A diverse and continuing effort Modelling of hardware and languages Test to find the relaxed Simulation tools and reasoning principles behaviours introduced by Empirical testing of current hardware compilers and verify that Verification of language design goals optimisations are correct Test and verify compilers [MPZN13, CV16] Feedback to industry: specs and test suites 20

A diverse and continuing effort Modelling of hardware and languages Specifications should be fixed Simulation tools and reasoning principles when problems are found Empirical testing of current hardware Test suites can ensure Verification of language design goals conformance to formal models Test and verify compilers Feedback to industry: specs and test suites [B11] 21

A diverse and continuing effort Modelling of hardware and languages Simulation tools and reasoning principles Empirical testing of current hardware Verification of language design goals Test and verify compilers Feedback to industry: specs and test suites I will describe my part: 22

The C and C++ memory model 23

Acknowledgements M. Dodds A. Gotsman K. Memarian K. Nienhuis S. Owens J. Pichon-Pharabod S. Sarkar P . Sewell T. Weber 24

C and C++ The medium for system implementation Defined by WG14 and WG21 of the International Standards Organisation The ’11 and ’14 revisions of the standards define relaxed memory behaviour I worked with WG21, formalising and improving their concurrency design 25

C and C++ The medium for system implementation Defined by WG14 and WG21 of the International Standards Organisation The ’11 and ’14 revisions of the standards define relaxed memory behaviour We worked with the ISO, formalising and improving their concurrency design 26

C++11 concurrency design A contract with the programmer: they must avoid data races , two threads competing for simultaneous access to a single variable data initially zero Thread 1: Thread 2: data = 1; r = data; Beware: Violate the contract and the compiler is free to allow anything: catch fire! 27

C++11 concurrency design A contract with the programmer: they must avoid data races , two threads competing for simultaneous access to a single variable data initially zero Thread 1: Thread 2: data = 1; r = data; Beware: Violate the contract and the compiler is free to allow anything: catch fire! 28

C++11 concurrency design A contract with the programmer: they must avoid data races , two threads competing for simultaneous access to a single variable data initially zero Thread 1: Thread 2: data = 1; r = data; Beware: Violate the contract and the compiler is free to allow anything: catch fire! Atomics are excluded from the requirement, and can order non-atomics, preventing simultaneous access and races 29

C++11 concurrency design A contract with the programmer: they must avoid data races , two threads competing for simultaneous access to a single variable data , r , atomic flag , initially zero Thread 1: Thread 2: data = 1; while (flag==0) flag = 1; {}; r = data; Beware: Violate the contract and the compiler is free to allow anything: catch fire! Atomics are excluded from the requirement, and can order non-atomics, preventing simultaneous access and races 30

Design goals in the standard The design is complex but the standard claims a powerful simplification: C++11/14: §1.10p21 It can be shown that programs that correctly use mutexes and memory_order_seq_cst operations to prevent all data races and use no other synchronization operations behave [according to] “sequential consistency”. This is the central design goal of the model, called DRF-SC 31

Implicit design goals Compilers like GCC, LLVM map C/C++ to pieces of machine code C/C++ Power ARM x86 Load acquire ld; cmp; bc; isync ldr; dmb MOV (from memory) Each mapping should preserve the behaviour of the original program C/C++11 x86 Power ARM 32

Mark Batty University of Kent It is time for mechanised industrial - PowerPoint PPT Presentation

Mechanised industrial concurrency specification: C/C++ and GPUs Mark Batty University of Kent It is time for mechanised industrial standards Specifications are written in English prose: this is insufficient Write mechanised specs instead

The C Language: It Is Not What You Think It Is! Mark Batty and Peter Sewell University of

Automatically Comparing Memory Consistency Models John Wickerson Mark Batty Tyler Sorensen

Synchronising C/C++ and POWER Susmit Sarkar 1 Kayvan Memarian 1 Scott Owens 1 Mark Batty 1 Peter

Multicore Programming: C++0x Mark Batty University of Cambridge in collaboration with Scott

Compositional C11 Program Transformation Mark Batty - Mike Dodds - Alexey Gotsman

Reading in EYFS OCTOBER 2017 What is Batty Phonics? A program for teaching letter sounds.

The team! Miss Higson Mr Kilkenny Mr Batty Mr Harrison Mrs Bargery Mrs

Modelling a New Normal Social Distancings Impact on Land Use Michael Batty Centre for

Holomorphic functions which preserve holomorphic semigroups Charles Batty (University of Oxford)

SCHEME (CJRS) MARK II 1. Summary of CJRS Mark I and CJRS Mark II 2. Key Dates of CJRS 3.

Discrete Differential Geometry Operators for Triangulated 2-Manifolds Mark Meyer, Mathieu

Mark Tucker mark@thinkgenealogy.com 10 Things Genealogy Software Should DO Mark Tucker

and Design Overview II Mark C. Paulk, Ph.D. Mark.Paulk@utdallas.edu, Mark.Paulk@ieee.org

Inventing Abstractions An Academic Perspective on Industrial Memory Models Susmit Sarkar with:

Multicore Programming Java Memory Model Jaroslav ev Peter Sewell ck Tim Harris

Mark Gerling for the WATCHMAN Collaboration Background at Depth Gerling Mark Gerling Mark

Disclaimer Nursing Student Attitudes toward Mental Illness: This presenter has no conflicts of

Green Growth and Behavioral Economics Green Growth and Behavioral Economics Elke U Weber &

Edge Bundling for Visualizing Communication Behavior Ronny Brendel, Michael Heyde, Holger Brunst,

Improving Manufacturing Plants Through Big Data Analytics SDS2019 14 June 2019 Prof. Dr. Kurt

Building Large-Scale Internet Services Jeff Dean jeff@google.com Plan for Today Googles

Graphical Perception Nam Wook Kim Mini-Courses January @ GSAS 2018 What is graphical

rts t t rsss

ECON 626: Applied Microeconomics Lecture 5: Regression Discontinuity Professors: Pamela Jakiela

Mark Batty University of Kent It is time for mechanised industrial - PowerPoint PPT Presentation

Mechanised industrial concurrency specification: C/C++ and GPUs Mark Batty University of Kent It is time for mechanised industrial standards Specifications are written in English prose: this is insufficient Write mechanised specs instead

The C Language: It Is Not What You Think It Is! Mark Batty and Peter Sewell University of

Automatically Comparing Memory Consistency Models John Wickerson Mark Batty Tyler Sorensen

Synchronising C/C++ and POWER Susmit Sarkar 1 Kayvan Memarian 1 Scott Owens 1 Mark Batty 1 Peter

Multicore Programming: C++0x Mark Batty University of Cambridge in collaboration with Scott

Compositional C11 Program Transformation Mark Batty - Mike Dodds - Alexey Gotsman

Reading in EYFS OCTOBER 2017 What is Batty Phonics? A program for teaching letter sounds.

The team! Miss Higson Mr Kilkenny Mr Batty Mr Harrison Mrs Bargery Mrs

Modelling a New Normal Social Distancings Impact on Land Use Michael Batty Centre for

Holomorphic functions which preserve holomorphic semigroups Charles Batty (University of Oxford)

SCHEME (CJRS) MARK II 1. Summary of CJRS Mark I and CJRS Mark II 2. Key Dates of CJRS 3.

Discrete Differential Geometry Operators for Triangulated 2-Manifolds Mark Meyer, Mathieu

Mark Tucker mark@thinkgenealogy.com 10 Things Genealogy Software Should DO Mark Tucker

and Design Overview II Mark C. Paulk, Ph.D. Mark.Paulk@utdallas.edu, Mark.Paulk@ieee.org

Inventing Abstractions An Academic Perspective on Industrial Memory Models Susmit Sarkar with:

Multicore Programming Java Memory Model Jaroslav ev Peter Sewell ck Tim Harris

Mark Gerling for the WATCHMAN Collaboration Background at Depth Gerling Mark Gerling Mark

Disclaimer Nursing Student Attitudes toward Mental Illness: This presenter has no conflicts of

Green Growth and Behavioral Economics Green Growth and Behavioral Economics Elke U Weber &amp;

Edge Bundling for Visualizing Communication Behavior Ronny Brendel, Michael Heyde, Holger Brunst,

Improving Manufacturing Plants Through Big Data Analytics SDS2019 14 June 2019 Prof. Dr. Kurt

Building Large-Scale Internet Services Jeff Dean jeff@google.com Plan for Today Googles

Graphical Perception Nam Wook Kim Mini-Courses January @ GSAS 2018 What is graphical

rts t t rsss

ECON 626: Applied Microeconomics Lecture 5: Regression Discontinuity Professors: Pamela Jakiela

Green Growth and Behavioral Economics Green Growth and Behavioral Economics Elke U Weber &