Symbolic execution for binary-level security / 50 3 A number of - PowerPoint PPT Presentation

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution / / Sébastien Bardin & Richard Bonichon 20180409 CEA LIST 1

1,1,L Model Source qb int foo (int t ) { 0,1,L 0,1,L int y = t * t - 4 * t ; 1,1,R start qa qc switch ( y ) { case 0: return 0; case 1: return 1; 0,1,R 0,1,L qd case 2: return 4; default: return 42; } } 1,1,R Binary Assembly 00000000: 7f45 4c46 .ELF addl $2 , %eax 00000004: 0201 0100 .... movl %eax , 12( %esp ) jmp L3 00000008: 0000 0000 .... L2: 0000000c: 0000 0000 .... movl $5 , 12( %esp ) 00000010: 0200 3e00 ..>. L3: 00000014: 0100 0000 .... 00000018: 2054 4100 TA. movl 12( %esp ), %eax 0000001c: 0000 0000 .... subl $4 , %eax 2

BINSEC [TACAS 15, SANER 16] March 2017 v 0.1 New release Soon! 50 klocs OCaml LGPL A sandbox for binary-level formal methods https://github.com/binsec/binsec 3

Why is it hard ? ... 080485ac mov [ebp + 0xfffffff0], eax Code-data confusion 080485af mov [ebp + 0xfffffff4], 0x8048708 080485b6 cmp [ebp + 0xfffffff0], 0x9 No specifications 080485ba ja 0x804861b Raw memory 080485bc mov eax, [ebp + 0xfffffff0] Low-level operations 080485bf shl eax, 0x2 080485c2 add eax, 0x8048730 Code size 080485c7 mov eax, [eax] # architectures 080485c9 djmp eax ; <dyn_jump> ... 4

Automated binary-level formal methods Abstract Interpretation Symbolic Execution  all-paths  robust  scalability  precise  robust  scalability  precise  single path  over-approximations  under-approximations (DSE) 5

SE in BINSEC Explore Prove Simplify 6

Explore Find bugs in your binaries (or play with them ⌣ ) 6

Play What’s the secret key ? Manticore int check (char * buf ) { check_char_0 ( buf [0]); check_char_1 ( buf [1]); check_char_2 ( buf [2]); check_char_3 ( buf [3]); check_char_4 ( buf [4]); check_char_5 ( buf [5]); check_char_6 ( buf [6]); check_char_7 ( buf [7]); check_char_8 ( buf [8]); check_char_9 ( buf [9]); check_char_10 ( buf [10]); return 1; 7 }

Bug finding : Grub2 CVE 2015-8370 Bypass any kind of authentication Impact • Elevation of privilege • Information disclosure • Denial of service Thanks to P. Biondi @ 8

Code instrumentation int main (int argc , char * argv []) { struct { int canary ; char buf [16]; } state ; my_strcpy ( input , argv [1]); state . canary = 0; grub_username_get ( state . buf , 16); if ( state . canary != 0) { printf ( "This gets interesting!\n" ); } printf ( "%s" , output ); printf ( "canary=%08x\n" , state . canary ); } Can we reach "This gets interesting!" ? 9

Code snippet static int grub_username_get (char buf [], unsigned buf_size ) { unsigned cur_len = 0; int key ; while (1) { key = grub_getkey (); if ( key == '\n' ⊢ key == '\r' ) break; if ( key == '\e' ) { cur_len = 0; break; } // Not checking for integer underflow if ( key == '\b' ) { cur_len --; grub_printf ( "\b" ); continue; } if (! grub_isprint ( key )) continu ; e if ( cur_len + 2 < buf_size ) { buf [ cur_len ++] = key ; // Off-by-two printf_char ( key ); } } // Out of bounds overwrite grub_memset ( buf + cur_len , 0, buf_size - cur_len ); grub_printf ( "\n" ); return ( key != '\e' ); } 10

Looking for Use-After-Free ? [SSPREW 16] 11

Key enabler: GUEB 12

Experimental evaluation GUEB only tiff2pdf CVE-2013-4232 openjpeg CVE-2015-8871 gifcolor CVE-2016-3177 accel-ppp GUEB + BINSEC/SE libjasper CVE-2015-5221 13

CVE-2015-5221 jas_tvparser_destroy ( tvp ); if (! cmpt -> sampperx ! cmpt -> samppery ) goto error ; if ( mif_hdr_addcmpt ( hdr , hdr -> numcmpts , cmpt )) goto error ; return 0; error : if ( cmpt ) mif_cmpt_destroy ( cmpt ); if ( tvp ) jas_tvparser_destroy ( tvp ); return -1; 14

Lessons learned In a nutshell GUEB + DSE is:  better than DSE alone  better than blackbox fuzzing  better than greybox fuzzing without seed 15

C/S : robustness & tradeoffs [ISSTA 16] Robustness What if the instruction cannot be reasoned about ? Program Path predicate Concretization Symbolization inputs a , b ; x 1 = a × b a = 5 x 1 = fresh x := a * b ; x 2 = x 1 + 1 x 1 = 5 × b x 2 = x 1 + 1 x := x + 1; ∧ ∧ ∧ assert ( x > 10); x 2 > 10 x 2 = x 1 + 1 x 2 > 10 ∧ ∧ ∧ x 2 > 10 ∧ Solutions Concretize lose completeness Symbolize lose correctness 16

C/S Policies interpretation A scenario • x := @ [a * b] • Documentation says “ Memory accesses are concretized ” • At runtime you get : a = 7, b = 3 What does the documentation really mean ? CS1 x = select ( M , 21) incorrect CS2 x = select ( M , 21) ∧ a × b = 21 minimal CS3 x = select ( M , 21) ∧ a = 7 ∧ b = 3 atomic 17

Simplify Remove unfeasible paths 17

Key enabler: BB-DSE [SP 17] Over-approximated paths Lost BB-DSE 18

Playing with BB-SE BB-SE can help in reconstructing information:  Switch targets (indirect jumps)  Unfeasible branches  High-level predicates 19

Stack-tampering detection call XX add [esp], 9 cmp edx, [esp + 4] jnz XX mov edx, 0 inc edx mov eax, edx ret 20

Summarized view SE BB-SE feasibility queries   infeasibility queries   scaling   21

Experimental evaluation Ground truth experiments Precision Packers Scalability, robustness Case study Usefulness 22

Controlled experiments Goal Assess the precision Opaque predicates — o-llvm Stack tampering — tigress small k k=16 ⇒ no false • no false positive negative, 3.5% genuine rets are proved errors • malicious rets are single efficient 0.02s / predicate targets 23

Packers Goal Assess the robustness and scalability  Armadillo, ASPack, ACProtect, ...  Traces up several millions of instructions  Some packers (PE Lock, ACProtect, Crypter) use these techniques a lot  Others (Upack, Mew, ...) use a single stack tampering to the entrypoint 24

X-Tunnel analysis Sample 1 Sample 2 # instructions ≈ 500 k ≈ 434 k # alive ≈ 280 k ≈ 230 k > 40 % of code is spurious 25

X-Tunnel: facts Protection relies only on opaque predicates • 7 y 2 − 1 ̸ = x 2 Only 2 equations  • x 2 +1 ̸ = y 2 + 3 2 • original OPs Sophisticated  • interleaves payload and OP computations • compution is shared • some long dependency chains, up to 230 instructions 26

Experimental behavior retrieval % 100 k = 16 FP FN k 27

Prove Low-level comparisons are not always what they seem to be ... 27

Some low-level conditions Mnemonic Flag cmp x y sub x y test x y x ′ ̸ = 0 ja ¬ CF ∧¬ ZF x > u y x & y ̸ = 0 x ′ ̸ = 0 jnae CF x < u y ⊥ x ′ = 0 je ZF x = y x & y = 0 jge OF = SF x ≥ y x ≥ 0 ∨ y ≥ 0 ⊤ jle ZF ∨ OF ̸ = SF x ≤ y x & y = 0 ∨ ⊤ ( x < 0 ∧ y < 0) ... 28

Example zoo FM 16 code high-level condition patterns or eax, 0 if eax = 0 then goto ...  je ... cmp eax, 0 if eax ≥ 0 then goto ...  jns ... sar ebp, 1 if ebp ≤ 1 then goto ...  je ... dec ecx if ecx > 1 then goto ...  jg ... 29

Sometimes it gets even more interesting cmp eax , ebx cmc jae ... 30

SE to SE helps to  Explore Semantics & SE  Prove to the  Simplify Rescue  https://rbonichon.github.io/posts/use-18 30

Symbolic execution for binary-level security / 50 3 A number of - PowerPoint PPT Presentation

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution / / Sbastien Bardin & Richard Bonichon 20180409 CEA LIST 1 1,1,L Model Source qb int foo (int t ) { 0,1,L 0,1,L int y = t * t - 4 * t ;

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

Binary Numbers Binary numbers look like this Binary Numbers or Binary Code Binary numbers or

A Quick Review Decimal to binary Binary to decimal Binary to hexadecimal

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Binary Trees, Heaps Binary Trees, Heaps Binary trees Binary trees A binary tree (

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Demo Symbolic Execution Probabilistic Symbolic Execution (Materials kindly provided by Willem

Symbolic Execution Emina Torlak emina@cs.washington.edu Outline What is symbolic execution?

Symbolic Execution of Maintainer Scripts Nicolas Jeannerod and Ralf Treinen joint work with

Symbolic Execution Mathy Vanhoef @vanhoefm HITB DXB 2018, Dubai, 27 November 2018 Overview

Learning to Fuzz from Symbolic Execution with Application to Smart Contracts Jingxuan Mislav

Constraint Solving in Symbolic Execution Cristian Cadar Department of Computing Imperial College

Symbolic Execution of Security Protocol Impl.: Handling Cryptographic Primitives Mathy Vanhoef

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

GDPR: One year on 19 June 2019 Gareth Yates & Nathan Bilton Newcastle | Leeds | Manchester 2

Space-variant Generalized Gaussian Regularization for Image Restoration Alessandro Lanza, Serena

DSGE Model Nonlinearities Frank Schorfheide University of Pennsylvania, CEPR, NBER, PIER June

Compressive Imaging by Generalized Total Variation Minimization Jie Yan and Wu-Sheng Lu

HO HOW MANY W MANY SMAR SMART CARS T CARS DOES IT TAKE T DOES IT T AKE TO MAKE O MAKE A

4. Coordination and Social Models Part 2: Coordination models (I): ( ) D) ems Design (MASD

Agenda Introduction to our panelists Training in the Pandemic: European Wind Academy Feedback

Software Architectural Aspects of Contemporary Computing Platforms Prabhakar T.V. Department of

Sambuz

Useful Links

Newsletter

Mail Us

Symbolic execution for binary-level security / 50 3 A number of - PowerPoint PPT Presentation

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution / / Sbastien Bardin & Richard Bonichon 20180409 CEA LIST 1 1,1,L Model Source qb int foo (int t ) { 0,1,L 0,1,L int y = t * t - 4 * t ;

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

Binary Numbers Binary numbers look like this Binary Numbers or Binary Code Binary numbers or

A Quick Review Decimal to binary Binary to decimal Binary to hexadecimal

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Binary Trees, Heaps Binary Trees, Heaps Binary trees Binary trees A binary tree (

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Demo Symbolic Execution Probabilistic Symbolic Execution (Materials kindly provided by Willem

Symbolic Execution Emina Torlak emina@cs.washington.edu Outline What is symbolic execution?

Symbolic Execution of Maintainer Scripts Nicolas Jeannerod and Ralf Treinen joint work with

Symbolic Execution Mathy Vanhoef @vanhoefm HITB DXB 2018, Dubai, 27 November 2018 Overview

Learning to Fuzz from Symbolic Execution with Application to Smart Contracts Jingxuan Mislav

Constraint Solving in Symbolic Execution Cristian Cadar Department of Computing Imperial College

Symbolic Execution of Security Protocol Impl.: Handling Cryptographic Primitives Mathy Vanhoef

Balanced Search Trees Binary Search Trees Binary Search Tree Binary Search Tree A binary tree is

GDPR: One year on 19 June 2019 Gareth Yates &amp; Nathan Bilton Newcastle | Leeds | Manchester 2

Space-variant Generalized Gaussian Regularization for Image Restoration Alessandro Lanza, Serena

DSGE Model Nonlinearities Frank Schorfheide University of Pennsylvania, CEPR, NBER, PIER June

Compressive Imaging by Generalized Total Variation Minimization Jie Yan and Wu-Sheng Lu

HO HOW MANY W MANY SMAR SMART CARS T CARS DOES IT TAKE T DOES IT T AKE TO MAKE O MAKE A

4. Coordination and Social Models Part 2: Coordination models (I): ( ) D) ems Design (MASD

Agenda Introduction to our panelists Training in the Pandemic: European Wind Academy Feedback

Software Architectural Aspects of Contemporary Computing Platforms Prabhakar T.V. Department of

Sambuz

Useful Links

Newsletter

Mail Us

GDPR: One year on 19 June 2019 Gareth Yates & Nathan Bilton Newcastle | Leeds | Manchester 2