pinned_vector A Contiguous Container without Pointer Invalidation - PowerPoint PPT Presentation

pinned_vector A Contiguous Container without Pointer Invalidation Meeting C++ 2018

std::vector contiguous layout cache locality fastest iteration O(1) lookup random access amortized O(1) growth 2

std::vector contiguous layout cache locality POINTER fastest iteration O(1) lookup INVALIDATION random access amortized O(1) growth 3

std::vector Invalidation capacity=6 4

std::vector Invalidation capacity=6 5

std::vector Invalidation capacity=6 capacity=12 6

std::vector Invalidation may invalidate all always invalidates all may invalidate other push_back clear insert emplace_back assign erase insert emplace reserve resize shrink_to_fit 9

Contiguous Storage Invariant 10

Contiguous Storage Invariant erase( ) 11

Contiguous Storage Invariant insert( ) 14

Alternatives with Truly Stable Pointers https://en.cppreference.com/w/cpp/container 17

Alternatives with Truly Stable Pointers boost::stable_vector<T> ● Not a “vector” ● Not contiguous ● Equivalent to vector<unique_ptr<T>> T* T* T* T* T* T* T* T* T* T* T* T* 18

Alternatives with Truly Stable Pointers plf::colony 19

Alternatives with Truly Stable Pointers plf::colony ● Manages elements in disjoint memory chunks ● Contiguous layout not guaranteed ● Iteration performance comparable to std::deque ● Primary use case is storage, not iteration 20

“A Contiguous Container without Pointer Invalidation” 21

“A Contiguous Container without Pointer Invalidation” not quite… must maintain contiguous layout invariant 22

“A Contiguous Container with Essential Pointer Invalidation” The minimum amount of pointer invalidation absolutely necessary to maintain the contiguous layout invariant. If insertion or erasure occurs only at the end of the container then pointers to all other elements shall remain valid. Idealized std::vector with infinite capacity. 23

std::vector Invalidation may invalidate all always invalidates all may invalidate other push_back clear insert emplace_back assign erase insert emplace reserve resize shrink_to_fit 24

pinned_vector Invalidation may invalidate all always invalidates all may invalidate other push_back clear insert emplace_back assign erase insert emplace reserve resize shrink_to_fit 25

Virtual Memory History ● Introduced in DEC’s VAX-11/780 ( “Virtual Address eXtension”, 1977 ) ● First consumer CPU with integrated MMU Intel 80286 (1982) 26

Virtual Memory ● Illusion of huge memory ● Abstraction of Hardware Storage and Resources ○ Physical Memory ○ Filesystem ○ Memory mapped I/O ○ Inter-Process Communication 27

Virtual Memory vs Physical Memory #include <memory> #include <iostream> int main() { auto foo = std::make_unique( 42) std::cout << foo.get() << std::endl; return 0; } 28

Virtual Memory Virtual Memory Filesystem Main Memory GPU Memory Other Process 29

Virtual Memory ● Process isolation ○ Separate address space ● More space then physical available ○ x86-64 eg. 128TiB 30

Page ● Fixed size block of virtual memory ● Most CPUs have a minimum page size of 4 KiB ○ Memory aligned in page size ● Huge Pages ○ x86-64 has also 2 MiB and 1 GiB pages ○ Performance 31

Memory Management Unit ● Everyone here has seen it in action already terminated by signal SIGSEGV (Address boundary error) ○ Access Violation ○ ● Separate part on the CPU to map virtual memory addresses to physical memory addresses ● Page protection ○ Check Read, Write, Executable Bit 32

Translation Lookaside Buffer ● Part of the MMU ● Stores mapping of physical and virtual addresses ● Hardware accelerated ● Typically has 4096 entries 33

Page Table ● Cache for TLB ● Stored in memory ● Page walk ○ Hardware or Software 34

Swap Space ● File / Partition ● Unused Pages are saved on disk to free physical memory ● Controlled by the OS 36

Page-Faults Virtual Memory Swap file Physical Memory 37

Page-Faults ● Access to pages which are not loaded in physical memory ● Swap of pages into/from swap file ● Super expensive 42

TLB Miss Memory Translate virtual address MMU Page table TLB Return physical address ✅ 43

Thrashing ● Constant swapping of pages ● Unresponsive system ○ Filesystem Access 44

Mapping Memory Reserve Commit ● Prevents other allocations within reserved ● Get physical memory space area ● Consumes memory or swap space ● Does not consume memory or swap space 45

pinned_vector Internals Virtual Memory Address Space 0 fff... VirtualAlloc(..., MEM_RESERVE); mmap(..., PROT_NONE, MAP_ANON | MAP_PRIVATE); auto v = pinned_vector<int>(max_elements(1’000’000’000)); max_pages max_bytes v.max_size(); 46

pinned_vector Internals Virtual Memory Address Space 0 fff... VirtualAlloc(..., MEM_COMMIT); mprotect(..., PROT_READ | PROT_WRITE); auto v = pinned_vector<int>(max_elements(1’000’000’000)); v.push_back(279); v.push_back(188); ... 47

pinned_vector Internals Virtual Memory Address Space 0 fff... VirtualFree(..., MEM_DECOMMIT); mprotect(..., PROT_NONE); madvise(..., MADV_DONTNEED); auto v = pinned_vector<int>(max_elements(1’000’000’000)); v.pop_back(); ... v.shrink_to_fit(); 48

But Is It Any Good? std::vector pinned_vector Round 1: establish a common baseline auto v = Container<T>(); v.reserve(n); ⏱ fill_n(back_inserter(v), n, x); ⏱ 49

Baseline for int 50

Baseline for bigval struct bigval { double data[10]; }; 51

Baseline for std::string 52

Baseline All 53

So Is It Any Good? std::vector pinned_vector Round 2: size not known upfront auto v = Container<T>(); v.reserve(n); ⏱ fill_n(back_inserter(v), n, x); ⏱ 54

Total Time for int 55

Total Time for bigval struct bigval { double data[10]; }; 56

Total Time for std::string 57

Total Time 58

Yes It Is Good std::vector pinned_vector Round 3: so how much faster is it? ● Normalize the runtimes: Treat vector<T> time as 1.0 ● Rescale pinned_vector<T> time based on that ● 59

Total Speedup Windows 10 build 17134 (x64) Intel Core i7-7700HQ @ 2.80 GHz Clang-7.0.0 (VS 15.8.4 stdlib) 60

Total Speedup MacOS 10.14.1 (x64) Intel Core i7-7820HQ @ 2.90 GHz Apple LLVM 10.0.0 (clang-1000.11.45.5) 61

Total Speedup Windows 10 build 17134 (x64) Intel Core i7-7820HQ @ 2.90 GHz Clang-7.0.0 (VS 15.8.4 stdlib) 62

But Why Is It Good? std::vector pinned_vector Round 4: where does a vector’s time go? ≡ total time - allocations auto v = vector<T, bump_alloc >(); ≡ insertion + copying ⏱ ≡ baseline + copying fill_n(back_inserter(v), n, x); ⏱ Times for: insertion + allocation + copying 63

Breakdown of push_back 64

Benchmark Conclusions push_back with preceding reserve() roughly equivalent slower than std::vector for small sizes faster than std::vector after a breaking point achieved by not copying values around exact numbers vary significantly by system and value_type 67

Availability ● Virtual Memory Support ● Desktop ○ Linux ○ macOS ○ Windows ● Mobile ○ Android ○ iOS (reserve limited by physical memory) 68

Use Case ECS ● ECS: Entity Component System - Entity: ID - Component: Data only storage - System: Uses Components to operate on these ● Data Oriented Design ○ Data oriented design in C++ by Mike Acton ○ Data-oriented design in practice by Stoyan Nikolov ● Mostly used in Games 69

ECS with std::vector Storage (std::vector) Handle: Raw Pointer to Component 70

ECS with std::vector Storage (std::vector) Handle: Raw Pointer to Component New Storage (std::vector) 71

ECS with std::vector Entity System Component Index Handle: Logic Data - Index Storage std::vector 72

ECS with std::vector Pro: Con: ● Dynamic Storage ● Use of Handles ○ grow/shrink ○ e.g. index dynamically during ○ Indirection runtime 73

ECS with std::array Pro: Con: ● No Indirection ● Preallocate memory => waste of memory ● Need max size ● No dynamic resizing 74

pinned_vector A Contiguous Container without Pointer Invalidation - PowerPoint PPT Presentation

pinned_vector A Contiguous Container without Pointer Invalidation Meeting C++ 2018 std::vector contiguous layout cache locality fastest iteration O(1) lookup random access amortized O(1) growth 2 std::vector contiguous layout cache

eleven other beams & pinned frames Continental train platform, Grimshaw 1993 Pinned Frames

ten other beams & pinned frames Continental train platform, Grimshaw 1993 Pinned Frames 1

Vector addition: The zero vector The D -vector whose entries are all zero is the zero vector ,

Revisit Euler Buckling problem joint with Atia Afroz Toshizumi Fukui (Saitama University) 15

ten other beams & pinned frames Pinned Frames 2 Elements of Architectural Structures

ten other beams & pinned frames Pinned Frames 2 Elements of Architectural Structures

Matrix and Vector Operations Matrix and Vector Operations 1 / 21 Matrix and Vector Operations

Day 3 Advanced Vector Architectures Session A: Vector Instruction Execution Pipelines Break

PREDICTION OF MECHANICAL PROPERTIES FOR WOVEN Z-PINNED COMPOSITES H. Chun 1 , J. Son 1 , K. Kang 1

Systems with pinned Particles Walter Kob Universit Montpellier France In collaboration with:

Testing Theories: Finding Functional Fixed Points for Pinned Manifolds P. Le Doussal, K. Wiese,

eleven non-rigid rigid other beams & pinned frames Continental train platform,

Relevance Vector Machines Jukka Lankinen LUT February 21, 2011 Jukka Lankinen Relevance Vector

Lecture 11 Vector Linear Network Coding Vector Linear Network Coding Outline Fundamentals for

. Vector Graphics Introduction to Web Design Vector graphics contain geometric objects, such as

Class 7: Vector and scalar, components Vector operations in components Multiplying a vector with a

Announcement (No deadline extension for the rest of quarter) Project 2 final deadline is

Virtual Memory Eric McCreath Introduction Virtual memory is a technique that permits processes

Swapping ! Active processes use more physical memory than system has Address Binding Swap out

CS412 Software Security Security Policies Mathias Payer EPFL, Spring 2019 Mathias Payer CS412

Virtual Memory What if program is bigger than available memory? Split program up and only have

Silberschatz and Galvin Chapter 13 Secondary Storage Structure CPSC 410--Richard Furuta 3/30/99

Module 13: Secondary-Storage Structure Disk Structure Disk Scheduling Disk Management

for Identity, Community, and Partnership Zaira A. Rojas Navarro Politics, Philosophy, &

pinned_vector A Contiguous Container without Pointer Invalidation - PowerPoint PPT Presentation

pinned_vector A Contiguous Container without Pointer Invalidation Meeting C++ 2018 std::vector contiguous layout cache locality fastest iteration O(1) lookup random access amortized O(1) growth 2 std::vector contiguous layout cache

eleven other beams &amp; pinned frames Continental train platform, Grimshaw 1993 Pinned Frames

ten other beams &amp; pinned frames Continental train platform, Grimshaw 1993 Pinned Frames 1

Vector addition: The zero vector The D -vector whose entries are all zero is the zero vector ,

Revisit Euler Buckling problem joint with Atia Afroz Toshizumi Fukui (Saitama University) 15

ten other beams &amp; pinned frames Pinned Frames 2 Elements of Architectural Structures

ten other beams &amp; pinned frames Pinned Frames 2 Elements of Architectural Structures

Matrix and Vector Operations Matrix and Vector Operations 1 / 21 Matrix and Vector Operations

Day 3 Advanced Vector Architectures Session A: Vector Instruction Execution Pipelines Break

PREDICTION OF MECHANICAL PROPERTIES FOR WOVEN Z-PINNED COMPOSITES H. Chun 1 , J. Son 1 , K. Kang 1

Systems with pinned Particles Walter Kob Universit Montpellier France In collaboration with:

Testing Theories: Finding Functional Fixed Points for Pinned Manifolds P. Le Doussal, K. Wiese,

eleven non-rigid rigid other beams &amp; pinned frames Continental train platform,

Relevance Vector Machines Jukka Lankinen LUT February 21, 2011 Jukka Lankinen Relevance Vector

Lecture 11 Vector Linear Network Coding Vector Linear Network Coding Outline Fundamentals for

. Vector Graphics Introduction to Web Design Vector graphics contain geometric objects, such as

Class 7: Vector and scalar, components Vector operations in components Multiplying a vector with a

Announcement (No deadline extension for the rest of quarter) Project 2 final deadline is

Virtual Memory Eric McCreath Introduction Virtual memory is a technique that permits processes

Swapping ! Active processes use more physical memory than system has Address Binding Swap out

CS412 Software Security Security Policies Mathias Payer EPFL, Spring 2019 Mathias Payer CS412

Virtual Memory What if program is bigger than available memory? Split program up and only have

Silberschatz and Galvin Chapter 13 Secondary Storage Structure CPSC 410--Richard Furuta 3/30/99

Module 13: Secondary-Storage Structure Disk Structure Disk Scheduling Disk Management

for Identity, Community, and Partnership Zaira A. Rojas Navarro Politics, Philosophy, &amp;

eleven other beams & pinned frames Continental train platform, Grimshaw 1993 Pinned Frames

ten other beams & pinned frames Continental train platform, Grimshaw 1993 Pinned Frames 1

ten other beams & pinned frames Pinned Frames 2 Elements of Architectural Structures

ten other beams & pinned frames Pinned Frames 2 Elements of Architectural Structures

eleven non-rigid rigid other beams & pinned frames Continental train platform,

for Identity, Community, and Partnership Zaira A. Rojas Navarro Politics, Philosophy, &