Coverage-guided Fuzzing of Individual Functions Without Source Code - PowerPoint PPT Presentation

Coverage-guided Fuzzing of Individual Functions Without Source Code Alessandro Di Federico Politecnico di Milano October 25, 2018 1

Index Coverage-guided fuzzing An overview of rev.ng Experimental results 2

Fuzzing 3

Fuzzing 1 Generate a lot of different inputs 2 Feed them to a program 3 Wait for it to reach an invalid state 4 Collect a report for the analyst 4

Features Pros: • Easy to setup • It can find subtle bugs Cons: • It might require large amount of resources • Semi-decidable 5

A huge leap forward Coverage-guided fuzzing 6

A huge leap forward Coverage-guided fuzzing Privilege inputs leading to cover new code paths 7

A huge leap forward int main() { if (A && B) { crash (); } else { all_good (); } } 8

The Control-flow Graph A B 9

First run Input: 0000 0000 0000 0000 A B 10

Second run Input: 0000 0000 0000 0001 A B 14

This input is not interesting! 18

Third run Input: 0001 0000 0000 0000 A B 19

This input is interesting! It led us to discover a new basic block 24

Fourth run Input: 0011 0000 0000 0000 A B 25

american fuzzy lop • It made coverage-guided fuzzing popular • Developed by lcamtuf • Performs instrumentation to detect executed basic blocks • Two key modes of operation: • Source mode • Binary mode 30

Source mode Instrumentation is performed at compiler-level 31

Source mode Instrumentation is performed at compiler-level int main() { record (1); if (A && B) { record (2); crash (); } else { record (3); all_good (); } record (4); } 32

Binary mode An emulator is employed to detect executed basic blocks 33

Binary mode An emulator is employed to detect executed basic blocks • QEMU is the chosen emulator • It incurs in a sensible slowdown 34

libfuzzer • Alternative to afl • It requires the source code to be available • Based on LLVM 35

What’s LLVM? LLVM is a compiler framework Famous for its C/C++ frontend (clang) and its intermediate representation (the LLVM IR) 36

libfuzzer can be a lot faster It doesn’t fork int main() { while (true) { char *new_input = random_input (); target(new_input ); } } 37

What is rev.ng ? rev.ng is a unified framework for binary analysis based on QEMU and LLVM 39

What is rev.ng ? rev.ng is a unified framework for binary analysis based on QEMU and LLVM Everything you’ll see here is architecture-agnostic 40

How does QEMU work? 41

A dynamic binary translator AArch64 AArch64 ARM Alpha CRIS Unicore AArch64 SPARC ARM SPARC64 x86 SuperH x86-64 SystemZ PowerPC QEMU IR QEMU IR MIPS PowerPC64 PowerPC XCore SystemZ MIPS SPARC MIPS64 OpenRISC TCI MicroBlaze x86-64 x86 RISC V 42

The frontend is a lifter AArch64 AArch64 ARM Alpha CRIS Unicore AArch64 SPARC ARM SPARC64 x86 SuperH x86-64 SystemZ PowerPC QEMU IR QEMU IR MIPS PowerPC64 PowerPC XCore SystemZ MIPS SPARC MIPS64 OpenRISC TCI MicroBlaze x86-64 x86 RISC V 43

QEMU translates at run-time 44

QEMU translates at run-time rev.ng translates offline 45

rev.ng : a static binary translator Collect md5sum.arm entry points Lift to QEMU IR Collect new Translate entry points to LLVM IR Link runtime md5sum.x86-64 functions 46

Alpha ARM CRIS AArch64 Unicore RISC V SPARC64 Hexagon SPARC SuperH x86 QEMU IR SystemZ x86-64 MicroBlaze PowerPC OpenRISC PowerPC64 MIPS64 MIPS XCore 47

Alpha ARM CRIS AArch64 Unicore RISC V SPARC64 Hexagon SPARC SuperH x86 LLVM IR SystemZ x86-64 MicroBlaze PowerPC OpenRISC PowerPC64 MIPS64 MIPS XCore 48

Alpha ARM CRIS AArch64 Unicore RISC V SPARC64 Hexagon SPARC rev.ng SuperH x86 SystemZ x86-64 MicroBlaze PowerPC OpenRISC PowerPC64 MIPS64 MIPS XCore 49

Alpha ARM CRIS AArch64 Unicore RISC V SPARC64 Hexagon SPARC rev.ng SuperH x86 SystemZ x86-64 MicroBlaze PowerPC OpenRISC PowerPC64 MIPS64 MIPS XCore 50

We produce LLVM IR 51

We produce LLVM IR We can employ libfuzzer directly 52

Steps 1 Lift the program to LLVM IR 2 Identify all the functions 3 Identify a function to fuzz 4 Create the fuzzing function 5 Compile fuzzing function 6 Instrument using libfuzzer 7 Launch the fuzzer 53

Steps 1 Lift the program to LLVM IR 2 Identify all the functions 3 Identify a function to fuzz MANUAL 4 Create the fuzzing function MANUAL 5 Compile fuzzing function 6 Instrument using libfuzzer 7 Launch the fuzzer 54

We are sensibly faster than QEMU 56

We are sensibly faster than QEMU 1 The LLVM optimizer has a wider view on the code 2 The translation is performed offline 57

Runtime (seconds) 4000 rev.ng Native QEMU 3000 2000 1000 458.sjeng 464.h264ref 400.perlbench 471.omnetpp 462.libquantum 473.astar 2000 1500 1000 500 401.bzip2 483.xalancbmk 429.mcf 403.gcc 445.gobmk 456.hmmer 58

On average, 68% faster than QEMU 59

A practical case study We want to fuzz the PCRE library 60

A practical case study We want to fuzz the PCRE library Not directly, but embedded in another program ( less ) 61

Steps (again) 1 Lift the program to LLVM IR 2 Identify all the functions 3 Identify a function to fuzz 4 Create the fuzzing function 5 Compile fuzzing function 6 Instrument using libfuzzer 7 Launch the fuzzer 62

Steps (again) 1 Lift the program to LLVM IR 2 Identify all the functions 3 Identify a function to fuzz 4 Create the fuzzing function 5 Compile fuzzing function 6 Instrument using libfuzzer 7 Launch the fuzzer 63

Fuzzing function (simplified) int LLVMFuzzerTestOneInput(uint8_t *data , size_t size) { char input_string [] = "Test␣string!"; void *compiled_re; compiled_re = pcre_compile(data); pcre_exec(compiled_re , input_string , strlen(input_string )); pcre_free(compiled_re ); return 0; } 64

We were able to find a known vulnerability in PCRE 65

Comparing with afl Are we faster than afl? • afl fuzzing worked directly on PCRE (without less ) • Used black-box mode 66

Performances Execs per second Total execs 1 min 10 min 60 min 60 min afl 3 582 3 495 3 682 13 187 295 150 617 79 701 78 306 271 217 728 rev.ng 67

Summary • We do not require the source code • We can fuzz any entry point • We are sensibly faster than existing techniques 68

Future works • Improve performances • Perform symbolic execution (through KLEE) 69

Future works Backup slides 70

Very effective! 71

License This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-sa/3.0/ or send a letter to Creative Commons, 444 Castro Street, Suite 900, Mountain View, California, 94041, USA. 72

Coverage-guided Fuzzing of Individual Functions Without Source Code - PowerPoint PPT Presentation

Coverage-guided Fuzzing of Individual Functions Without Source Code Alessandro Di Federico Politecnico di Milano October 25, 2018 1 Index Coverage-guided fuzzing An overview of rev.ng Experimental results 2 Fuzzing 3 Fuzzing 1 Generate

Modern Fuzzing of Media-processing projects Max Moroz, FOSDEM 2017 Agenda Fuzzing

through Coverage-guided Tracing Stefan Nagy Matthew Hicks snagy2@vt.edu mdhicks2@vt.edu

2000 2010 2015 2005 Blackbox Fuzzing Verification Whitebox Fuzzing Patrice Godefroid

Not All Coverage Measurements Are Equal Fuzzing by Coverage Accounting for Input Prioritization

FFR Guided Functional FFR Guided Functional FFR Guided Functional FFR Guided Functional

Coverage-Guided Fuzzing Dynamic Static Smart Coverage Structure Algorithms Security

ParmeSan: Sanitizer-guided Greybox Fuzzing USENIX 2020 *some pages borrowed from Zheyu Ma

Wi-Fi Advanced Fuzzing Wi-Fi Advanced Fuzzing Laurent BUTTI France Tlcom / Orange

Fuzzing Kamailio Security testing the Kamailio SIP server with fuzzing Agenda About me

Fuzzing for CyberSecurity Abe Cohen 2019-11-13 Fuzzing for CyberSecurity What is

FUZZIFICATION : Anti-Fuzzing Techniques Jinho Jung , Hong Hu, David Solodukhin, Daniel Pagan, Kyu

Structure-aware fuzzing for Clang and LLVM with libprotobuf-mutator Kostya Serebryany, Vitaly

File format fuzzing in Android: Giving Stagefright to the Android installer Alexandru Blanda

Fuzzing the Media Framework in Android Alexandru Blanda OTC Security QA 1 Agenda Introduction

Virtualised USB Fuzzing using QEMU and Scapy Breaking USB for Fun and Profit Tobias Mueller (c)

The Fuzzing Project https://fuzzing-project.org/ Hanno B ock 1 / 18 Introduction Motivation

RIAA 2016: Corporate Engagement A closer look at the impact of engagement v Engagement can

Presentation at Credit Suisse 12 th Financial Services Forum Brady W. Dougan, Chief Executive

The Classical Decision Problem 1 Validity/satisfiability of arbitrary first-order formulas is

Outline Part 1 (uniqueness) special case general case Part 2 (existence) notations and some

Q2 2020 Earnings April 28, 2020 Forward-Looking Statements and Non-GAAP Financial Measures

#BIAScapstone Sign up for the Behavioral Buzz ! 2 BIAS Capstone Convening | April 27, 2016 |

Applying Implementation Science to Your Prevention Work Mindy Anderson-Knott, PFS Evaluator Paul

Results on a Scalable Byzantine Agreement. Olumuyiwa Oluwasanmi, University of New Mexico , Joint