Automatic Techniques to Systematically Discover New Heap - PowerPoint PPT Presentation

Automatic Techniques to Systematically Discover New Heap Exploitation Primitives Ins Insu Yu Yun , Dhaval Kapil, and Taesoo Kim Georgia Institute of Technology 1

Heap vulnerabilities are the most common, yet serious security issues. % 𝑝𝑔 ℎ𝑓𝑏𝑞 𝑤𝑣𝑚𝑜𝑓𝑠𝑏𝑐𝑗𝑚𝑗𝑢𝑗𝑓𝑡 = 233 604 = 39% From “Killing Uninitialized Memory: Protecting the OS Without Destroying Performance”, Joe Bialek and Shayne Hiet-Block, CppCon 2019 2

Heap exploitation techniques (HETs) are preferable methods to exploit heap vulnerabilities • Abuse underlying allocator to achieve more powerful primitives (e.g., arbitrary write) for control hijacking • Application-agnostic: rely on only underlying allocators • Powerful: e.g., off-by-one null byte overflow à arbitrary code execution • Used to compromise (in 2019) 3

Example: unlink() in ptmalloc2 fd fd bk bk fd fd bk bk fd fd bk bk Chunk Chunk Chunk Chunk Chunk Chunk unlink(): P->fd->bk = P->bk P->bk->fd = P->fd 4

Example: unlink() in ptmalloc2 fd fd bk bk fd fd bk bk fd fd bk bk Chunk Chunk Chunk Chunk Chunk Chunk unlink(): P->fd->bk = P->bk P->bk->fd = P->fd 5

Example: Unsafe unlink() in the presence of memory corruptions (e.g., overflow) fp fptr tr Obje bject ct fd fd bk bk addr ddr fd fd evi bk bk vil fd fd bk bk Chunk Chunk Chunk Chunk Chunk Chunk unlink(): P->fd->bk = P->bk => fptr = evil 6

Security checks are introduced in the allocator to prevent such exploitations unlink(): assert(P->fd->bk == P); P->fd->bk = P->bk This check is still by bypassable , but it makes HET more co complicated 7

Researchers have been studied reusable HETs to handle such complexities All analyses are manual, ad-hoc, and allocator-specific! 8

Problem 1: Existing analyses are highly biased to certain allocators ptmalloc2 (Linux allocator) tcmalloc DieHarder mimalloc mesh jemalloc scudo Freeguard 9

Problem2: A manual re-analysis is required in the changes of an allocator’s implementation ptmalloc2 (Linux allocator) A new feature: thread-local cache (tcache) Question: How to find HETs automatically? 10

Our key idea: ArcHeap autonomously explore spaces similar to fuzzing! “ ” HET 11

Technical challenges Lack of an efficient way to evaluate HETs Large search space HET 12

Search space consisting of heap actions is enormous size(p) x 2 64 2 64 malloc(sz) free(p) p[i]=v buf[i]=v Allocation Deallocation Heap write Buffer write Legitimate actions Search space can be reduced using model-based search common designs of allocators! based on co p[i overflow ]=v p freed [i]=v free(p freed ) free(p non-heap ) Overflow Write-after-free Double free Arbitrary free Buggy actions 14

Common design 1: Binning • Specially managing chunks in different size groups • Small chunks: Performance is more important • Large chunks: Memory footprint is more important • e.g., ptmalloc • fast bin (< 128 bytes): no merging in free chunks • small bin ( < 1024 bytes): merging is enabled • Sampling a size uniformly in the 2 64 space è P(fast bin) = 2 -57 15

ArcHeap selects an allocation size aware of binning • Sampling in exponentially distant size groups • ArcHeap partitions an allocation size into four groups: (2 0 , 2 5 ], (2 5 , 2 10 ], (2 10 , 2 15 ], and (2 15 , 2 20 ] • Then, it selects a group and then selects a size in the group uniformly • e.g., P(fast bin) > P(selecting a first group) = ¼ 16

Other common designs: Cardinal data and In-place metadata • Cardinal data: Metadata in a chunk are either sizes or pointers, but not other random values • In-place metadata: Allocators place metadata near its chunk’s start or end for locality 17

Cardinal data and In-place metadata reduce search space in data writes Random size Size Other chunk’s size Other chunk p[i]=v Pointer Heap write Buffer 0xdeadbeef Container An array that stores chunks -8 ~ 8 1337 18

Automatically synthesizing full exploits is inappropriate in evaluating HETs • Difficult: e.g., In the DAPRA CGC competition, on only on one e hea eap bug was successfully exploited by the-state-of-the-art systems • Inefficient: Takes a few seconds, minutes, or even hours for one try • Application-dependent: A HET, which is not useful in a certain application, may be useful in general 20

Our idea: Evaluating impacts of exploitations (i.e., detecting broken invariants that have security implications) 1. Allocated memory should not be overlapped with pre-allocated memory Easy to detect: Check • Overlapping chunks: Can corrupt other chunk’s data this at every allocation • Arbitrary chunks: Can corrupt global data 2. An allocator should not modify memory, which is not under its control (i.e., heap) • Arbitrary writes How about this? • Restricted writes (NOTE: should be efficient) 21

Shadow memory can detect arbitrary writes and restricted writes • Maintain external consistency • Check divergence container[i] = malloc(sz) malloc(sz) free(p) Allocation container shadow [i] = malloc(sz) Deallocation Divergence can only happen Allocation p[i]=v in the internal of allocators buf[i]=v Heap write buf[i]=v Buffer write buf shadow [i]=v CHECK: equal(container, container shadow ) Buffer write equal(buf, buf shadow ) 22

ArcHeap provides a minimized PoC code for further analysis • Proof-of-Concept code: Converting actions into C code • Trivial, because they have one-to-one mapping • Minimize the PoC code using delta-debugging • Idea: Eliminate an action, which is not necessary for triggering the impact of exploitations • Details can be found in our paper 23

Evaluation questions 1. How effective is ArcHeap in finding new HETs, compared to the existing tool, HeapHopper? 2. How general is ArcHeap’s approach? 24

ArcHeap discovered five new HETs in ptmalloc2, which cannot be found by HeapHopper • Unsorted bin into stack: Write-after-free à Arbitrary chunk • Requires fewer steps (5 steps vs 9 steps) • House of unsorted einherjar: Off-by-one write à Arbitrary chunk • No require heap address leak • Unaligned double free: Double free à Overlapping chunk cannot be discovered by HeapHopper because of its All HETS ca • First HET targets small bin chunks, which have more checks than fast bin scalability issue (i.e., symbolic execution + model checking) • Overlapping chunks using a small bin : Overflow à Overlapping chunk • Fast bin into other bin: Write-after-free à Arbitrary chunk 25

ArcHeap is generic enough to test various allocators • Tested 10 different allocators • Cannot find HETs in LLVM Scudo, FreeGuard, and Guarder, which are “secure allocators” Even found HETs in Works for ptmalloc2- “secure” allocators unrelated allocators 26

Case study1: Double free à Overlapping chunks in DieHarder and mimalloc-secure // [PRE-CONDITION] // lsz : large size (> 64 KB) // xlsz: more large size (>= lsz + 4KB) // [BUG] double free Double free large chunk è // [POST-CONDITION] Overlapping chunk // p2 == malloc(lsz); void* p0 = malloc(lsz); free(p0); void* p1 = malloc(xlsz); // [BUG] free 'p0' again free(p0); Same thing happens in both void* p2 = malloc(lsz); free(p1); DieHarder and mimalloc assert(p2 == malloc(lsz)); 27

Interestingly, these issues are irrelevant Me: Is mimalloc free(p large ) related to DieHarder? DieHarder unmap(p large ) No No ch check eck! Mimalloc developer: mimalloc Wr Wrong check(p large ) No! ch check eck! 28

Our PoC has been added in a mimalloc’s regression test 29

Case study 2: Overflow à Arbitrary chunk in dlmalloc-2.8.6 • dlmalloc: ancestor of ptmalloc2 but has been diverged after its fork void* p0 = malloc(sz); void* p1 = malloc(xlsz); void* p2 = malloc(lsz); Looks complicated… void* p3 = malloc(sz); // [BUG] overflowing p3 to overwrite top chunk struct malloc_chunk *tc = raw_to_chunk(p3 + chunk_size(sz)); tc->size = 0; void* p4 = malloc(fsz); void* p5 = malloc(dst - p4 - chunk_size(fsz) \ - offsetof( struct malloc_chunk, fd)); assert(dst == malloc(sz)); 30

Its root cause is more complicated! // Make top chunk available void* p0 = malloc(sz); // Set mr.mflags |= USE_NONCONTIGUOUS_BIT void* p1 = malloc(xlsz); // Current top size < lsz (4096) and no available bins, so dlmalloc calls sys_alloc // Instead of using sbrk(), it inserts current top chunk into treebins // and set mmapped area as a new top chunk because of the non-continous bit void* p2 = malloc(lsz); Easy to miss by manual analysis void* p3 = malloc(sz); è Shows benefits of // [BUG] overflowing p3 to overwrite treebins struct malloc_chunk *tc = raw_to_chunk(p3 + chunk_size(sz)); automated methods! tc->size = 0; // dlmalloc believes that treebins (i.e., top chunk) has enough size // However, underflow happens because its size is actually zero void* p4 = malloc(fsz); // Similar to house-of-force, we can allocate an arbitrary chunk void* p5 = malloc(dst - p4 - chunk_size(fsz) \ - offsetof( struct malloc_chunk, fd)); assert(dst == malloc(sz)); 31

Automatic Techniques to Systematically Discover New Heap - PowerPoint PPT Presentation

Automatic Techniques to Systematically Discover New Heap Exploitation Primitives Ins Insu Yu Yun , Dhaval Kapil, and Taesoo Kim Georgia Institute of Technology 1 Heap vulnerabilities are the most common, yet serious security issues. %

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

PROGRAM HEAP 2020-2021 WHAT IS HEAP? HEAP is a federally funded program that assists low

Heapsort Build-Max-Heap Next we build a full heap from an unsorted sequence Build-Max-Heap(A)

Windows 8 Heap Internals Windows 8 Heap Internals Windows 8 Heap Internals INTRODUCTION Windows 8

Fibonacci Heap Group Minus One Second December 6, 2016 Group Minus One Second Fibonacci Heap

Chapter 19: Binomial Heaps We will study another heap structure called, the binomial heap. The

1 Fib- -Heap Heap- -Extract Extract- -Min Min Example: Fib- -Heap Heap- -Extract

Understanding the heap by breaking it A case study of the heap as a persistent data structure

Priority Queue / Heap Stores ( key,data ) pairs (like dictionary) But, different set of

A heap, a stack, a bottle and a rack The Stack Canary Birds The heap Answer: The first three

In the Beginning was the Word Stephen Heap In the Beginning was the Word

When Testing in Production is a Good Idea Dan Robinson CTO, Heap whoami Joined as Heap's

Fibonacci Heap Group Paradox December 21, 2016 Contents 1. Introduction 2. Operations 3. Why

The beautiful binary heap. Weiss has a chapter on the binary heap - chapter 20, pp581-601.

Is the Heap Manager Important to Many Cores? Ye Liu, Shinpei Kato, Masato Edahiro Outline

heaps and heapsort on n elements height of a heap is in (log n ) building a heap bottum-up in O (

TREATMENT FOR PTSD/SUD The Fear Structure A fear structure is a program for escaping danger

The Institute for Advanced Architectures and Algorithms (IAA) David H. Rogers Sudip Dosanjh

Postoperative Cognitive Decline Noise or Signals? Jacqueline M. Leung, MD, MPH Professor

THE DATACENTER NEEDS AN OPERATING SYSTEM MATEI ZAHARIA, BENJAMIN HINDMAN, ANDY KONWINSKI, ALI

0DAF F9H:8AIF

in Virtual Memory Systems Presented by Michael Jantz Contributions from Carl Strickland, Kshitij

st st r tr

Main Memory Prof. Bracy and Van Renesse CS 4410 Cornell University based on slides designed by