MARDU: Efficient and Scalable Code Re-Randomization SYSTOR '20: - PowerPoint PPT Presentation

MARDU: Efficient and Scalable Code Re-Randomization SYSTOR '20: Proceedings of the 13th ACM International Systems and Storage Conference Christopher Jelesnianski (Virginia Tech), Jinwoo Yom (Virginia Tech), Changwoo Min (Virginia Tech), Yeongjin Jang (Oregon State University) 1

The Fight against Return Oriented Programming (ROP) Attack Defense Code Injection What is Return Oriented Programming? Data Execution Prevention (DEP) ● An attack that reuses program code to achieve Return Oriented arbitrary code computation Programming (ROP) Address Space Layout Randomization (ASLR) What are Gadgets? Just-In-Time ROP ● Snippets of code that perform specific actions (JIT-ROP) ○ Arithmetic operations Fine-Grained ASLR & eXecute-only Memory ○ Reading/writing to registers (XoM) ○ Etc. Blind ROP (BROP) (Code Inference) Continuous Randomization 2

Current randomization techniques are good ... Code Randomization ● Address Space Layout Randomization (ASLR) + Light-weight - Static code layout - One leak can compromise entire code base ● Re-Randomization Techniques + Continuous churn makes gadgets hard to find - High overhead - Rely on predictable thresholds such as - Time interval - Syscall invocation - Call history 3 3

But they are not practical. Why? ● Users desire acceptable performance Performance (<10% avg & worst-case) ● Users desire strong defenses Security ● Users desire scalability as more Scalability Guarantees computation is moved to the cloud ○ Have system-wide security coverage including shared libraries ● Achieving all three together is hard 4

Outline ● Introduction ● Challenges ● MARDU Design ● Implementation ● Evaluation ● Conclusion 5

Challenges for making a practical randomization defense ● Security challenges ○ Code disclosure: a single leaked pointer allows attacker to obtain code layout of a victim process ● Performance challenges ○ Avoiding useless overwork: Active randomization wastes CPU cycles in case of “what-if” ● Scalability challenges ○ Code Tracking: to support runtime re-randomization tracking and updating of pc-relative code is a necessary and expensive evil ○ Stop-the-world: Updating shared code on-the-fly is challenging especially in concurrent access 6

Outline ● Introduction ● Challenges ● MARDU Design ○ Security: Leveraging code trampolines ○ Scalability: Enabling code sharing ○ Performance: Re-randomization without stopping the world ● Implementation ● Evaluation ● Conclusion 7

Example: Code Control Flow Source Code Traditional Control Flow /* … */ void foo(){ call foo() /* … */ 1 bar(); /* … */ foo: } /* … */ call bar() void bar(){ /* … */ /* … */ 2 4 } ret bar: /* … */ 3 ret 8

MARDU is secure Code ● Code and Trampoline regions protect forward edge Trampoline ○ Trampolines are immutable code targets ○ Protects against code disclosure ● Shadow stack protects backward edge ● Randomization occurs at: ○ Process startup AND ○ Whenever an attack is detected ( on-demand ) ■ Process crash ■ Execute-only memory violation Stack Stack + Shadow Stack ... ... ... local local ret_addr5 local local ret_addr6 ret_addr 9

Example: Securing MARDU Code Source Code Using Code Trampolines Control Flow Intel MPK XoM Coverage void foo(){ /* … */ Code Region Trampoline Region Shadow Stack bar(); foo_body: bar_trampoline: /* … */ ... /* … */ jmp bar_body 3 2 jmp bar_trampoline() } foo_ret0_tr foo_trampoline: foo_ret0: jmp foo_body 1 void bar(){ /* … */ /* … */ foo_ret0_trampoline: jmp ShadowStack_top } jmp foo_ret0 5 bar_body: /* … */ jmp ShadowStack_top 4 10

MARDU is scalable ● MARDU is capable of code sharing (e.g., shared libraries) ○ No previous randomization scheme is capable of runtime re-randomization AND code sharing ● MARDU leverages position independent code ( -fPIC ) for easy fixups of PC-relative code. ● MARDU supports mixed instrumented and non-instrumented libraries 12

Example: Sharing MARDU code MARDU Patch .text Section Info Section Code Trampoline Fixups MARDU-compiled Region (C) Region (T) Binary/Library Place Trampoline Region Perform patching 2 Place Code Region 4 Random Offset 3 C T In-Kernel Randomized code cache 1 0xffffffff81171000 Map Kernel Memory 13

Example: Sharing MARDU code webserver libc.so dbserver libc.so MARDU MARDU C T C T C T C T Process 1 Process 2 Userspace Userspace 5 0x7fa67811a000 6 0x7fb67811b000 libc.so C T In-Kernel Randomized 0xffffffff81171000 code cache 14

Re-Randomization without stopping the world webserver libc.so dbserver libc.so MARDU MARDU C T C T C T C T Process 1 Process 2 Userspace Userspace C v1 T v1 In-Kernel Randomized code cache 0xffffffff81171000 16

Re-Randomization without stopping the world ● Gadgets previously deduced are webserver libc.so now stale MARDU C T C v1 C v2 T v2 T v1 ● Randomization is repeated Process 1 whenever another attack event is Userspace detected 3 2 ● Randomization is replicated for ALL ASSOCIATED shared code of a Map Code v2 to Map Trampoline v2 victim process userspace to userspace C v1 T v1 C v2 T v2 In-Kernel 4 Unmap old region Randomized code cache 0xffffffff81171000 1 0xffffffff2245d000 Map new region 17

MARDU is performant ● Trampolines ○ No Runtime Instrumentation Tracking ● Trampolines leverage immutable code ○ No stop-the-world mechanisms ● Re-active re-randomization ○ Only when attack detected ( on-demand ) ○ Responsibility of exiting (crashed) process/thread 18

Outline ● Introduction ● Challenges ● MARDU Design ● Implementation ● Evaluation ● Conclusion 19

MARDU Implementation ● Working Prototype ● Compiler ○ LLVM/Clang 6.0.0 ○ 3.5K LOC LLVM ● Kernel Compiler Infrastructure ○ X86-64 linux 4.17.0 ○ 4K LOC ● musl LibC ○ General C library 20

Outline ● Introduction ● Challenges ● MARDU Design ● Implementation ● Evaluation ○ How to evaluate MARDU? ○ Security: MARDU against popular ROP attacks ○ Performance: Compute Bound -> minimal runtime overhead ○ Scalability: Concurrent Web server -> negligible runtime overhead and scalability ● Conclusion 21

How to evaluate MARDU? 1) How secure is MARDU, against current known and popular attacks on randomization? 2) How much performance overhead does MARDU impose? 3) How scalable is MARDU in terms of load time, memory savings, and re-randomization, particularly for concurrent processes (such as a real-world web server)? 22

How MARDU defends against popular ROP ● Blind ROP (BROP) & Code Inference Attacks ○ MARDU: XoM protected code triggers a permission violation and re-randomization of code ○ MARDU: Re-randomization makes all previous collected layout information stale ○ MARDU: Usage of trampolines & function granularity randomization makes correlation prediction challenging for attackers ● JIT-ROP Attacks ● Low Profile Attacks ● Code Pointer Offsetting Attacks 24

Experimental Setup and Applications ● Experimental Setup ○ All programs compiled with MARDU LLVM compiler and -O2 -fPIC optimization flags ○ Platform: ■ 24-core (48-Hardware thread) machine with two Intel Xeon Silver 4116 CPUs (2.10 GHz) ■ 128 GB DRAM ● Applications ○ SPEC CPU 2006 (All C applications) ○ NGINX web server 26

How MARDU performs Web server (NGINX) 5.5% NGINX AVG Degradation: 4.4% 27 27

MARDU randomization with scalability ● Re-randomization latency scales approximately linearly with number of fixups required ● Cold start randomization latency for any number of workers for NGINX is 61ms ● Re-randomization latency plateau’s even when under attack 28

MARDU: Efficient and Scalable Code Re-Randomization SYSTOR '20: - PowerPoint PPT Presentation

MARDU: Efficient and Scalable Code Re-Randomization SYSTOR '20: Proceedings of the 13th ACM International Systems and Storage Conference Christopher Jelesnianski (Virginia Tech), Jinwoo Yom (Virginia Tech), Changwoo Min (Virginia Tech), Yeongjin

Randomization Algorithm Theory WS 2012/13 Fabian Kuhn Randomization Randomized Algorithm: An

Beyond Domain Randomization Josh Tobin 6/23/19 Goals for this talk Understand domain

Stage III of Social Subprojects Selection, Youth Corps Project Randomization (computer-based

Experience with MAC Address Randomization in Windows 10 Christian Huitema Huitema@microsoft.com

Cache Coherence in Scalable Machines Scalable Cache Coherent Systems Scalable, distributed

Scalable String Matching on the Scalable String Matching on the Scalable String Matching on the

Test statistics and randomization distributions Applied Statistics and Experimental Design

What About Randomization Tests? Strengths Gail et al. (1996) reported nominal Type I and II

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where

Br Breaking Kern rnel Ad Address ss Space La Layout Randomization (KASLR LR) wi with th

Topic III.1: Swap Randomization Discrete Topics in Data Mining Universitt des Saarlandes,

Address Space Randomization A n E f f e c t i v e I m p l e m e n t a t i o n Michael Cloppert

Optimal Randomization in Quantizer Design with Marginal Constraint Naci Saldi Queens

Lecture 4: Permutation Methods Applied Statistics 2014 1 / 21 Randomization Model Population

Randomization DS-GA 1013 / MATH-GA 2824 Mathematical Tools for Data Science

Improving the Randomization Step in Feasibility Pump using WalkSAT Santanu S. Dey Joint work

Detecting Network Effects Randomizing Over Randomized Experiments Martin Saveski (@msaveski)

Shuffler: Fast and Deployable Continuous Code Re-Randomization David Williams-King, Graham

Graphical Representation of Causal Effects November 10, 2016 Lords Paradox: Observed Data

CS573 Data Privacy and Security Differential Privacy Real World Deployments Li Xiong

1 Take a broad view of what constitutes therapies: changing intensity, switching medication,

CS 147: Computer Systems Performance Analysis Higher Designs and Other Considerations 1 / 25