Designing costless abstractions Bart Verhagen - PowerPoint PPT Presentation

Designing costless abstractions Bart Verhagen bart@verhagenconsultancy.be 16 November 2019

Introduction Definitions What??? xkcd. Is It Worth the Time? url : https://xkcd.com/1205/

Introduction Definitions Costless abstractions Software abstraction The essence of abstractions is preserving information that is relevant in a given context, and forgetting information that is irrelevant in that context. 1 Costless Having no cost 2 Costless abstractions or zero-overhead abstractions An abstraction with no additional runtime cost compared to not using the abstraction 1 John V. Guttag. Introduction to Computation and Programming Using Python, Spring 2013 Edition . MIT Press, 2013. isbn : 9780262519632 2 Wiktionary. costless — Wiktionary, The Free Dictionary . 2019. url : https://en.wiktionary.org/w/index.php?title=costless

Introduction Why do they matter How we typically talk about it Runtime performance Level of abstraction

Introduction Why do they matter Costless abstractions Runtime performance Level of abstraction

Introduction Why do they matter How it actually is (significantly simplified) Runtime performance Typical low level language Typical high level language Security Development time Abstraction Community Generality Usability University of Washington - Paul G. Allen School of Computer Science and Engineering. Evaluating Programming Languages . 2019. url : https://courses.cs.washington.edu/courses/cse341/02sp/concepts/evaluating-languages.html

Introduction Why do they matter Costless abstractions: the holy grail?

Techniques Thin wrappers std::unique_ptr /** * Heavily simplified version of std::unique_ptr */ template < typename T> class unique_ptr { public : explicit unique_ptr(T* resource) : m_resource(resource) {} unique_ptr( const unique_ptr<T>&) = delete ; unique_ptr(unique_ptr<T>&&) noexcept = delete ; ~unique_ptr() { delete m_resource; } unique_ptr& operator =( const unique_ptr<T>&) = delete ; unique_ptr& operator =(unique_ptr<T>&&) noexcept = delete ; // Other operators and functions omitted private : T* m_resource; }; int main() { unique_ptr< int > i( new int ()); doSomethingThatUsesVariable(i); }

Techniques Thin wrappers Reference implementation int main() { ; Compiled with x86-64 gcc 9.2, -O2 auto i = new int (); main: doSomethingThatUsesVariable(i); sub rsp, 8 delete i; mov edi, 4 } call operator new(unsigned long) mov DWORD PTR [rax], 0 mov rdi, rax mov esi, 4 call operator delete(void*, unsigned long) xor eax, eax add rsp, 8 ret ; Compiled with clang 8.0.0, -O2 main: push rax mov edi, 4 call operator new(unsigned long) mov dword ptr [rax], 0 mov qword ptr [rsp], rax mov rdi, rax call operator delete(void*) xor eax, eax pop rcx ret

Techniques Thin wrappers std::unique_ptr /** ; Compiled with x86-64 gcc 9.2, -O2 * Heavily simplified version of std::unique_ptr main: */ sub rsp, 24 template < typename T> mov edi, 4 class unique_ptr { call operator new(unsigned long) public : mov DWORD PTR [rax], 0 explicit unique_ptr(T* resource) : mov rdi, rax m_resource(resource) {} mov QWORD PTR [rsp+8], rax mov esi, 4 unique_ptr( const unique_ptr<T>&) = delete ; call operator delete(void*, unsigned long) unique_ptr(unique_ptr<T>&&) noexcept = delete ; xor eax, eax add rsp, 24 ~unique_ptr() { delete m_resource; } ret unique_ptr& operator =( const unique_ptr<T>&) ; Compiled with clang 8.0.0, -O2 = delete ; main: unique_ptr& operator =(unique_ptr<T>&&) noexcept push rax = delete ; mov edi, 4 // Other operators and functions omitted call operator new(unsigned long) mov dword ptr [rax], 0 private : mov rdi, rax T* m_resource; call operator delete(void*) }; xor eax, eax pop rcx int main() { ret unique_ptr< int > i( new int ()); doSomethingThatUsesVariable(i); }

Techniques Thin wrappers std::unique_ptr manual memory management 1 std::unique_ptr implementation 2 ; Compiled with x86-64 gcc 9.2, -O2 ; Compiled with x86-64 gcc 9.2, -O2 main: main: sub rsp, 8 sub rsp, 24 mov edi, 4 mov edi, 4 call operator new(unsigned long) call operator new(unsigned long) mov DWORD PTR [rax], 0 mov DWORD PTR [rax], 0 mov rdi, rax mov rdi, rax mov esi, 4 mov QWORD PTR [rsp+8], rax call operator delete(void*, unsigned long) mov esi, 4 xor eax, eax call operator delete(void*, unsigned long) add rsp, 8 xor eax, eax ret add rsp, 24 ret ; Compiled with clang 8.0.0, -O2 main: ; Compiled with clang 8.0.0, -O2 push rax main: mov edi, 4 push rax call operator new(unsigned long) mov edi, 4 mov dword ptr [rax], 0 call operator new(unsigned long) mov qword ptr [rsp], rax mov dword ptr [rax], 0 mov rdi, rax mov rdi, rax call operator delete(void*) call operator delete(void*) xor eax, eax xor eax, eax pop rcx pop rcx ret ret 1 Matt Godbolt. Compiler explorer . url : https://godbolt.org/z/-mdgmt 2 Matt Godbolt. Compiler explorer . url : https://godbolt.org/z/QwRwpF

Techniques Thin wrappers Ensuring std::unique_ptr is costless /** * Heavily simplified version of std::unique_ptr */ template < typename T> class unique_ptr { public : explicit unique_ptr(T* resource) : m_resource(resource) {} unique_ptr( const unique_ptr<T>&) = delete ; unique_ptr(unique_ptr<T>&&) noexcept = delete ; ~unique_ptr() { delete m_resource; } unique_ptr& operator =( const unique_ptr<T>&) = delete ; unique_ptr& operator =(unique_ptr<T>&&) noexcept = delete ; // Other operators and functions omitted private : T* m_resource; }; int main() { unique_ptr< int > i( new int ()); doSomethingThatUsesVariable(i); }

Techniques Thin wrappers The costless inlining law A sufficiently smart compiler will always inline a function if - and only if - inlining is faster in all cases w.r.t. not inlining it.

Techniques Thin wrappers CppCon 2019: There are no Zero-cost Abstractions

Techniques Thin wrappers Unique_ptr costless: are we sure? #include <memory> int foo(std::unique_ptr< int > bar) noexcept ; Chandler Carruth. “There Are No Zero-cost Abstractions”. In: CppCon, 2019

Techniques Thin wrappers std::async #include <future> #include <iostream> #include <numeric> #include <vector> template < typename RandomIt> int parallel_sum(RandomIt beg, RandomIt end) { constexpr unsigned int magicalTurningPoint = 100U; auto len = end - beg; if (len < magicalTurningPoint) { return std::accumulate(beg, end, 0); } RandomIt mid = beg + len/2; auto handle = std::async(std::launch::async, parallel_sum<RandomIt>, mid, end); int sum = parallel_sum(beg, mid); return sum + handle.get(); } int main() { constexpr int nbOfValues = 10000; std::vector< int > v(nbOfValues, 1); std::cout « "The sum is " « parallel_sum(v.begin(), v.end()) « std::endl; } The sum is 10000 cppreference.com. std::async . url : https://en.cppreference.com/w/cpp/thread/async

Techniques Thin wrappers std::async (conceptually) #include <thread> template < typename Result> class future { public : template < class Function , class... Args> explicit future(Function&& f, Args&&... args ) { m_thread = std:: thread ([ this , f, args...]() { m_result = f(args...); }); }; future( const future<Result>&) = delete ; ~future() = default ; future<Result>& operator =( const future<Result>&) = delete ; void wait() /*const*/ { m_thread.join(); } Result get() { wait(); return m_result; } private : std:: thread m_thread; Result m_result; }; template < class Function , class... Args> future<std::invoke_result_t<std::decay_t<Function>, std::decay_t<Args>...> > async(Function&& f, Args&&... args) { // std::async implementation of the std::launch::async policy return future<std::invoke_result_t<std::decay_t<Function>, std::decay_t<Args>...»(f, args...); }

Techniques Thin wrappers std::async measurements For 10.000 values and magical turning point of 100 · 10 5 4 CPU cycles/Noop time 3 2 1 0 Using threads manually Using async Quick C++ Benchmark. Compare async vs threads . url : http://quick-bench.com/-j83zmDXSqOhFBEQwkbQRjhp8uA

Techniques Thin wrappers An unbloated, type-safe stack #include <iostream> #include <memory> #include <vector> namespace detail { class Stack { public : Stack() = default ; std::shared_ptr< void > top(); void push(std::shared_ptr< void > element); void pop(); private : std::vector<std::shared_ptr< void > > m_stack; // Choose the backend of your liking }; } // namespace detail template < typename T> class Stack { public : void push(std::shared_ptr<T> element) { m_stack.push(element); } std::shared_ptr<T> top() { return std::static_pointer_cast<T>(m_stack.top()); } void pop() { m_stack.pop(); } private : detail::Stack m_stack; }; Scott Meyers. Effective C++: 55 Specific Ways to Improve Your Programs and Designs . Addison-Wesley Professional, May 2005. isbn : 9780321334879

Techniques Thin wrappers Thin wrappers

Designing costless abstractions Bart Verhagen - PowerPoint PPT Presentation

Designing costless abstractions Bart Verhagen bart@verhagenconsultancy.be 16 November 2019 Introduction Definitions What??? xkcd. Is It Worth the Time? url : https://xkcd.com/1205/ Introduction Definitions Costless abstractions Software

FRONTEND AT SCALE Designing abstractions for big teams @joshduck What front-end abstractions

Designing for Designing for Greenspace Greenspace Greenspace Designing for Designing for

Abstractions for Routing Abstractions for Network Routing Brighten Godfrey Brighten Godfrey

Planning and Optimization D2. Abstractions: Additive Abstractions Gabriele R oger and Thomas

Automatically Deriving Abstraction Heuristics PDB Abstractions Explicit-State Abstractions

Unified L2 Abstractions for L3-Driven Fast Handover draft-irtf-mobopts-l2-abstractions-01 F.

Class 14 Slides SLIDE what is the designing principle how does designing principle

APIs/abstractions Previously iously Abstractions for infrastructure to ease operations (Ops)

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend

Efficient Abstractions for GPGPU Programming . Mathias Bourgoin 10.03.2015 Efficient

Planning and Optimization D3. Abstractions: Additive Abstractions Malte Helmert and Thomas Keller

ABSTRACTIONS OF THE DATA PLANE DIMACS Working Group on Abstractions for Network Services,

Resources, Services, and Interfaces Services: Hardware Abstractions CPU/Memory abstractions

Resources, Services, and Interfaces Services: Hardware Abstractions CPU/Memory abstractions

Planning and Optimization D3. Abstractions: Additive Abstractions Malte Helmert and Gabriele R

Designing Your Fashion Portfolio From Concept To Presentation Designing Your Fashion Portfolio

Dyninst Scalable Tools Workshop Granlibakken Resort Lake Tahoe, California Dyninst Scalable

Electronic Business Documents UBL v2.1 Building on the Foundation 1 (2014-11-21 13:20z) OASIS

JANUARY 2020 TELEPHONE DO NOT USE COMPUTER AUDIO REGIONAL CALLS PARTICIPATE IN DISCUSSION

School of Medicine Jeremy M. Boss, Ph.D. Emory Chair in Basic Sciences Research Professor and

BUFFER OVERFLOW DEFENSES & COUNTERMEASURES CMSC 414 FEB 01 2018 RECALL OUR CHALLENGES

Equity & inclusion as a way of developing diverse, resilient, and more relevant open source

Introduction Synopsis 1 #include <sys/ptrace.h> long int ptrace(enum __ptrace_request

CE Report Workplace Climate and Environment Previously known as EDI Now a user initiative

Designing costless abstractions Bart Verhagen - PowerPoint PPT Presentation

Designing costless abstractions Bart Verhagen bart@verhagenconsultancy.be 16 November 2019 Introduction Definitions What??? xkcd. Is It Worth the Time? url : https://xkcd.com/1205/ Introduction Definitions Costless abstractions Software

FRONTEND AT SCALE Designing abstractions for big teams @joshduck What front-end abstractions

Designing for Designing for Greenspace Greenspace Greenspace Designing for Designing for

Abstractions for Routing Abstractions for Network Routing Brighten Godfrey Brighten Godfrey

Planning and Optimization D2. Abstractions: Additive Abstractions Gabriele R oger and Thomas

Automatically Deriving Abstraction Heuristics PDB Abstractions Explicit-State Abstractions

Unified L2 Abstractions for L3-Driven Fast Handover draft-irtf-mobopts-l2-abstractions-01 F.

Class 14 Slides SLIDE what is the designing principle how does designing principle

APIs/abstractions Previously iously Abstractions for infrastructure to ease operations (Ops)

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend

Efficient Abstractions for GPGPU Programming . Mathias Bourgoin 10.03.2015 Efficient

Planning and Optimization D3. Abstractions: Additive Abstractions Malte Helmert and Thomas Keller

ABSTRACTIONS OF THE DATA PLANE DIMACS Working Group on Abstractions for Network Services,

Resources, Services, and Interfaces Services: Hardware Abstractions CPU/Memory abstractions

Resources, Services, and Interfaces Services: Hardware Abstractions CPU/Memory abstractions

Planning and Optimization D3. Abstractions: Additive Abstractions Malte Helmert and Gabriele R

Designing Your Fashion Portfolio From Concept To Presentation Designing Your Fashion Portfolio

Dyninst Scalable Tools Workshop Granlibakken Resort Lake Tahoe, California Dyninst Scalable

Electronic Business Documents UBL v2.1 Building on the Foundation 1 (2014-11-21 13:20z) OASIS

JANUARY 2020 TELEPHONE DO NOT USE COMPUTER AUDIO REGIONAL CALLS PARTICIPATE IN DISCUSSION

School of Medicine Jeremy M. Boss, Ph.D. Emory Chair in Basic Sciences Research Professor and

BUFFER OVERFLOW DEFENSES &amp; COUNTERMEASURES CMSC 414 FEB 01 2018 RECALL OUR CHALLENGES

Equity &amp; inclusion as a way of developing diverse, resilient, and more relevant open source

Introduction Synopsis 1 #include &lt;sys/ptrace.h&gt; long int ptrace(enum __ptrace_request

CE Report Workplace Climate and Environment Previously known as EDI Now a user initiative

BUFFER OVERFLOW DEFENSES & COUNTERMEASURES CMSC 414 FEB 01 2018 RECALL OUR CHALLENGES

Equity & inclusion as a way of developing diverse, resilient, and more relevant open source

Introduction Synopsis 1 #include <sys/ptrace.h> long int ptrace(enum __ptrace_request