CMPSC 497: Symbolic Execution Trent Jaeger Systems and Internet - PowerPoint PPT Presentation

CMPSC 497: � Symbolic Execution Trent Jaeger Systems and Internet Infrastructure Security (SIIS) Lab Computer Science and Engineering Department Pennsylvania State University Systems and Internet Infrastructure Security Laboratory (SIIS) Page 1

Our Goal • In this course, we want to develop techniques to detect vulnerabilities before they are exploited automatically ‣ What ’ s a vulnerability? ‣ How to find them? Systems and Internet Infrastructure Security Laboratory (SIIS) Page 2

Static vs. Dynamic • Dynamic ‣ Depends on concrete inputs ‣ Must run the program ‣ Impractical to run all possible executions in most cases • Static ‣ Overapproximates possible input values (sound) ‣ Assesses all possible runs of the program at once ‣ Setting up static analysis is somewhat of an art form • Is there something that combines best of both? Systems and Internet Infrastructure Security Laboratory (SIIS) Page 3

Best of Both? • What would be the best of both? Systems and Internet Infrastructure Security Laboratory (SIIS) Page 4

Best of Both? • What would be the best of both? ‣ Run over lots of inputs at once (static) ‣ Easy to setup (dynamic) ‣ Run all paths (static) ‣ Identify concrete values that lead to problems (dynamic) • Can’t quite achieve all these, but can come closer Systems and Internet Infrastructure Security Laboratory (SIIS) Page 5

Symbolic Execution • Symbolic execution is a method for emulating the execution of a program to learn constraints ‣ Assign variables to symbolic values instead of concrete values ‣ Symbolic execution tells you what values are possible for symbolic variables at any particular point in your program • Like dynamic analysis (fuzzing) in that the program is executed in a way – albeit on symbolic inputs • Like static analysis in that one start of the program tells you what values may reach a particular state Systems and Internet Infrastructure Security Laboratory (SIIS) Page 6

Symbolic Execution • What’s a symbolic value? • Remember in AFL fuzzing, you provide a candidate concrete input to identify the format ‣ And the fuzzer produces lots of variants of this input • In symbolic execution, you don’t provide a concrete input, but rather identify which value(s) you want to assess – just say an input is “symbolic” ‣ Then the symbolic execution tells you the possible values of that input to reach particular points in the program Systems and Internet Infrastructure Security Laboratory (SIIS) Page 7

Automatic Generation of EXE & KLEE Inputs of Death   and High-Coverage Tests Slides by Yoni Leibowitz Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Example int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; if (i >= 4) exit(0); char *p = (char *)a + i * 4; *p = *p − 1 t = a[*p]; t = t / a[i]; if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Marking Symbolic Data int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); Marks the 4 bytes associated with if (i >= 4) 32-bit variable ‘i’ exit(0); as symbolic char *p = (char *)a + i * 4; *p = *p − 1 t = a[*p]; t = t / a[i]; if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts checks around every assignment , expression & branch , to determine if its operands are concrete or symbolic unsigned int a[4] = {1,3,5,2} if (i >= 4) Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts checks around every assignment , expression & branch , to determine if its operands are concrete or symbolic If any operand is symbolic , the operation is not performed, but is added as a constraint for the current path Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts code to fork program execution when it reaches a symbolic branch point , so that it can explore each possibility if (i >= 4) (i ≥ 4) (i < 4) Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts code to fork program execution when it reaches a symbolic branch point , so that it can explore each possibility For each branch constraint , queries constraint solver for existence of at least one solution for the current path . If not – stops executing path Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts code for checking if a symbolic expression could have any possible value that could cause errors t = t / a[i] Division by Zero? Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Compiling... example.c int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); EXE compiler if (i >= 4) exit(0); char *p = (char *)a + i * 4; example.out *p = *p − 1 t = a[*p]; t = t / a[i]; Executable if (t == 2) assert(i == 1); else assert(i == 3); return 0; } Inserts code for checking if a symbolic expression could have any possible value that could cause errors If the check passes – the path has been verified as safe under all possible input values (relative to those checks) Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Running... int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); if (i >= 4) exit(0); 4 ≤ i char *p = (char *)a + i * 4; *p = *p − 1 e.g. i = 8 t = a[*p]; t = t / a[i]; EXE generates a if (t == 2) test case assert(i == 1); else assert(i == 3); return 0; } Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Running... int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); 0 ≤ i ≤ 4 if (i >= 4) e.g. i = 2 exit(0); char *p = (char *)a + i * 4; p → a[2] = 5 *p = *p − 1 t = a[*p]; a[2] = 5 – 1 = 4 t = t / a[i]; if (t == 2) t = a[4] assert(i == 1); else assert(i == 3); Out of bounds return 0; } EXE generates a test case Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Running... int main(void) { 0≤ i ≤ 4 , i ≠ 2 unsigned int i, t, a[4] = { 1, 3, 5, 2 }; make_symbolic(&i); e.g. i = 0 if (i >= 4) p → a[0] = 1 exit(0); char *p = (char *)a + i * 4; a[0] = 1 – 1 = 0 *p = *p − 1 t = a[*p]; t = a[0] t = t / a[i]; if (t == 2) t = t / 0 assert(i == 1); else Division by 0 assert(i == 3); return 0; } EXE generates a test case Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Running... 0≤ i ≤ 4 , i ≠ 2 , i ≠ 0 int main(void) { unsigned int i, t, a[4] = { 1, 3, 5, 2 }; i = 3 i = 1 make_symbolic(&i); p → a[3] p → a[1] if (i >= 4) exit(0); a[3] = 1 a[1] = 2 char *p = (char *)a + i * 4; *p = *p − 1 t = a[1] t = a[2] t = a[*p]; t = t / a[i]; t ≠ 2 t = 2 if (t == 2) assert(i == 1); else EXE determines assert(i == 3); neither ‘assert’ fails return 0; } 2 valid test cases Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Output test3.err ERROR: simple.c:16 Division/modulo by zero! # concrete byte values: test3.out i = 0 0 # i[0], 0 # i[1], 0 # i[2], 0 # i[3] # take these choices to follow path 0 # false branch (line 5) test3.forks 0 # false (implicit: pointer overflow check on line 9) 1 # true (implicit: div − by − 0 check on line 16) Systems and Internet Infrastructure Security Laboratory (SIIS) Page

CMPSC 497: Symbolic Execution Trent Jaeger Systems and Internet - PowerPoint PPT Presentation

CMPSC 497: Symbolic Execution Trent Jaeger Systems and Internet Infrastructure Security (SIIS) Lab Computer Science and Engineering Department Pennsylvania State University Systems and Internet Infrastructure Security Laboratory (SIIS) Page

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

CMPSC 497 Execution Integrity Trent Jaeger Systems and Internet Infrastructure Security

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Demo Symbolic Execution Probabilistic Symbolic Execution (Materials kindly provided by Willem

Symbolic Execution Emina Torlak emina@cs.washington.edu Outline What is symbolic execution?

Symbolic Execution of Maintainer Scripts Nicolas Jeannerod and Ralf Treinen joint work with

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution /

Symbolic Execution Mathy Vanhoef @vanhoefm HITB DXB 2018, Dubai, 27 November 2018 Overview

Learning to Fuzz from Symbolic Execution with Application to Smart Contracts Jingxuan Mislav

Constraint Solving in Symbolic Execution Cristian Cadar Department of Computing Imperial College

CMPSC 497: Static Analysis Trent Jaeger Systems and Internet Infrastructure Security (SIIS)

CMPSC 497: Software Fault Isolation Trent Jaeger Systems and Internet Infrastructure

CMPSC 497: Java Security Trent Jaeger Systems and Internet Infrastructure Security (SIIS)

CMPSC 497 Other Memory Vulnerabilities Trent Jaeger Systems and Internet Infrastructure

Non-uniform Concrete security: an example cracks in the concrete: What is the best NIST P-256

1 How far can a Contract Serve as a Justification for Permanent Storage on a Blockchain?

Better Concrete Security for Half-Gates Garbling (in the Multi-Instance Setting) Chun Guo

On the use of semantic features for the semantic indexing task Bahjat Safadi, Nadia Derbas,

Information Technology Is Creating A New Social Fabric Definition

Math 8001: Groupwork (School of Mathematics) Math 8001 UMN 1 / 12 Any current issues in your

Two principles of dynamic constructivism Giovanni Sambin Logic Colloquium, special session on

Is Constructive Logic relevant for Computer Science? Thorsten Altenkirch University of

Sambuz

Useful Links

Newsletter

Mail Us