Scheduling Black-box Muta5onal Fuzzing ACM CCS 2013 Maverick Woo - PowerPoint PPT Presentation

Scheduling Black-box Muta5onal Fuzzing ACM CCS 2013 Maverick Woo Carnegie Mellon University pooh@cmu.edu

Our Crew Maverick Woo Sang Kil Cha Samantha Gottlieb David Brumley 2

The Story 3

Typical Exploit Genera5on Bug Finding Exploit Fuzzing Bug Triage Generation crashes bugs 4

Scheduling is Equally Important Ordering Time Allocation 5

Scheduling Black-box Muta5onal Fuzzing 6

Scheduling Black-box Muta5onal Fuzzing Termi- nation Test Fuzzer Program Input Crash A common program testing technique popularized by Miller et al. in late 1980s [18] • Use a fuzzer to generate test inputs to program-under-test • At its simplest, look for crashes —memory corruption, uncaught exceptions, failed assertions, etc. 7

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash A black-box fuzzer observes a program’s I/O behavior only • cf. Whitebox Fuzzing by Godefroid et al. 2012 [11] • SimpliSication: only distinguish termination vs. crash Detect anomaly by mutating a valid input (= seed) 8

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash A black-box fuzzer observes a program’s I/O behavior only • cf. Whitebox Fuzzing by Godefroid et al. 2012 [11] • SimpliSication: only distinguish termination vs. crash Given a seed input s and a mutation ratio r : 1. Select d = r × | s | bits in s uniformly at random 2. Flip each selected bit with probability ½ 9

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Key Observations: 1. We can reproduce a program crash by storing (a) the seed input and (b) the PRNG seed 2. Mutation = uniform sampling from the Hamming cube of radius d centered at s 10

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Key Observations: 1. We can reproduce a program crash by storing (a) the seed input and (b) the PRNG seed “Fuzz ConBiguration” 2. Mutation = uniform sampling from the Hamming cube of (i) program p radius d centered at s (ii) seed input s (iii) mutation ratio r 11

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Key Observations: 1. We can reproduce a program crash by storing (a) the seed input and (b) the PRNG seed “Fuzz ConBiguration” 2. Mutation = uniform sampling from the Hamming cube of (i) program p radius d centered at s (ii) seed input s (iii) 0.04% 12

Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Key Observations: 1. We can reproduce a program crash by storing (a) the seed input and (b) the PRNG seed “Fuzz ConBiguration” 2. Mutation = uniform sampling from the Hamming cube of = radius d centered at s “(program, seed) pair” in this talk 13

Scheduling Black-box Muta5onal Fuzzing A fuzz campaign comprises a sequence of epochs : 1. takes a list of (program, seed) pairs as input 2. at the beginning of each epoch, picks one (program, seed) pair to fuzz based on data collected from previous epochs We investigate two epoch types : • Fixed-run : Sixed number of fuzz runs in each epoch – implemented in CMU CERT BFF v2.6 [14] • Fixed-time : Sixed amount of time in each epoch – proposed in this paper – slightly harder to implement 14

Problem Statement Given a list of K fuzz conSigurations {( p 1 , s 1 ), . . . , ( p K , s K )}, the Fuzz Con?iguration Scheduling (FCS) problem seeks to maximize the number of unique bugs discovered in a fuzz campaign that runs for a duration of length T . Important Assumptions: 1. Only one conSiguration can be fuzzed within an epoch 2. Separate program analysis of ( p i , s i ) is not allowed 3. Bugs from different ( p i , s i ) are disjoint See paper for discussions 15

How to Solve the FCS Problem? Two competing goals during a fuzz campaign: Explore each ( p i , s i ) Exploit knowledge of sufSiciently often so ( p i , s i ) that are likely vs. as to identify pairs to yield new bugs that can yield new bugs by fuzzing them more Good News: • Clearly a Multi-Armed Bandit (MAB) problem! 16

Mul5-Armed Bandits 17

MAB in Berlin 18

How to Solve the FCS Problem? Two competing goals during a fuzz campaign: Explore each ( p i , s i ) Exploit knowledge of sufSiciently often so ( p i , s i ) that are likely vs. as to identify pairs to yield new bugs that can yield new bugs by fuzzing them more Good News: • Clearly a Multi-Armed Bandit (MAB) problem! • Lots of published MAB algorithms – provably optimal algorithms for many settings, e.g., Auer et al. 2002 [2] handles certain adversarial cases 19

How to Solve the FCS Problem? Bad News: recognizing “FCS ∈ MAB” is not enough Given a list of K fuzz conSigurations {( p 1 , s 1 ), . . . , ( p K , s K )}, the Fuzz Con?iguration Scheduling (FCS) problem seeks to maximize the number of unique bugs discovered in a fuzz campaign that runs for a duration of length T . 1. Classic MAB: once you identify a good beer, it stays good ⇒ drink it often to accumulate rewards J 2. Our Setting: each program has a ?inite number of bugs ⇒ bug exhaustion gives a diminish of return L We are not aware of MAB algorithms that cater to our case… ⇒ We need our own algorithms! 20

How to Solve the FCS Problem? Bad News: recognizing “FCS ∈ MAB” is not enough Given a list of K fuzz conSigurations {( p 1 , s 1 ), . . . , ( p K , s K )}, the Fuzz Con?iguration Scheduling (FCS) problem seeks to maximize the number of unique bugs discovered in a fuzz campaign that runs for a duration of length T . 1. Classic MAB: once you identify a good beer, it stays good ⇒ drink it often to accumulate rewards J 2. Our Setting: each program has a ?inite number of bugs ⇒ bug exhaustion gives a diminish of return L We are not aware of MAB algorithms that cater to our case… ⇒ We need our own algorithms! 21

Previously Scheduling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Key Observations: 1. We can reproduce a program crash by storing (a) the seed input and (b) the PRNG seed 2. Mutation = uniform sampling from the Hamming cube of radius d centered at s 22

Modeling Black-box Muta5onal Fuzzing Seed Termi- Input nation Mutated Fuzzer Program Input PRNG(j) Crash Consider the repeated fuzzings of a ?ixed ( p i , s i ) and let outcome i ( j ) denote the j -th outcome in the sequence: • Termination ⇒ ID 0 • Crash ⇒ bug ID obtained from bug triage Key Observation: BMF is memoryless , i.e., outcome i ( j ) are i.i.d. RVs for a Sixed i 23

Coupon Collector’s Problem (CCP) Suppose every box of breakfast cereal comes with a coupon that is randomly chosen among M different coupon types • How many boxes do you expect to buy before you have collected at least one coupon of each type? Traditional Setting • Coupon types are uniformly distributed ⇒ Θ ( M log M ) Our Setting • Bugs do not occur uniformly at random ⇒ Weighted CCP • Prevalence of different bugs is unknown ahead of time 24

Coupon Collector’s Problem (CCP) Suppose every box of breakfast cereal comes with a coupon that is randomly chosen among M different coupon types • How many boxes do you expect to buy before you have collected at least one coupon of each type? Traditional Setting • Coupon types are uniformly distributed ⇒ Θ ( M log M ) Our Setting • Bugs do not occur uniformly at random ⇒ Weighted CCP Also observed by • Prevalence of different bugs is unknown ahead of time Arcuri 2010 [1] 25

WCCP w/ Unknown is Intractable No Free Lunch Theorem … vs . … (you did pay the registration, right? ) 26

WCCP w/ Unknown is Intractable No Free Lunch Theorem … vs . … Wolpert and Macready 2005 on [22] • “Any two optimization algorithms are equivalent when their performance is averaged across all possible problems” 27

“Bring Your Own Prior” No Free Lunch Theorem … vs . … Wolpert and Macready 2005 on [22] • “Any two optimization algorithms are equivalent when their performance is averaged across all possible problems” Circumvention may be possible! • NFL Theorem does not apply if we focus on distributions that are more likely to occur in practice • More accurate model ⇒ More accurate predictions ⇒ More bugs 28

Rule of Three aaaa Q: Suppose we have Slipped a biased H-T coin n times and every time it comes up H . Does Pr[ T ] have to be small? A: No, so long as Pr[ T ] < 1, our observation is always possible See discussion ConBidence Intervals: in Jovanovic Pr[ T ] < 3/ n in 95% of all “parallel universes” 1997 [15] Usage: 1. Suppose ( p i , s i ) has yielded n different outcomes so far 2. Collectively call all n outcome types H 3. With 95% conSidence, Pr[ T (i.e., new outcome)] < 3/n 29

Algorithm Design Space We explore 3 dimensions in algorithm design and present: • 2 Epoch Types – Sixed-run 2 * (3 + 2 * 5) = – Sixed-time 26 Scheduling • 5 MAB Algorithms Algorithms – Round-Robin – Uniform-Random – EXP3.S.1 from Auer et al. 2002 [2] – Weighted-Random – ε -Greedy w.r.t. belief metrics • 5 Belief Metrics 30

Scheduling Black-box Muta5onal Fuzzing ACM CCS 2013 Maverick Woo - PowerPoint PPT Presentation

Scheduling Black-box Muta5onal Fuzzing ACM CCS 2013 Maverick Woo Carnegie Mellon University pooh@cmu.edu Our Crew Maverick Woo Sang Kil Cha Samantha Gottlieb David Brumley 2 The Story 3 Typical Exploit Genera5on Bug Finding Exploit

CCS 2 | CCS Classified as Business 3 | CCS Classified as Business 4 | CCS Classified as Business

Modern Fuzzing of Media-processing projects Max Moroz, FOSDEM 2017 Agenda Fuzzing

Semantics and Verification 2005 Lecture 2 informal introduction to CCS syntax of CCS semantics

2000 2010 2015 2005 Blackbox Fuzzing Verification Whitebox Fuzzing Patrice Godefroid

Paradoxes in Probability How probability continues to amuse me! Let's play a game! Box A Box B

Hawkeye: Towards a Desired Directed Grey-box Fuzzing Hongxu Chen, Yinxing Xue, Yuekang Li,

Wi-Fi Advanced Fuzzing Wi-Fi Advanced Fuzzing Laurent BUTTI France Tlcom / Orange

Fuzzing Kamailio Security testing the Kamailio SIP server with fuzzing Agenda About me

Fuzzing for CyberSecurity Abe Cohen 2019-11-13 Fuzzing for CyberSecurity What is

FUZZIFICATION : Anti-Fuzzing Techniques Jinho Jung , Hong Hu, David Solodukhin, Daniel Pagan, Kyu

Structure-aware fuzzing for Clang and LLVM with libprotobuf-mutator Kostya Serebryany, Vitaly

File format fuzzing in Android: Giving Stagefright to the Android installer Alexandru Blanda

Fuzzing the Media Framework in Android Alexandru Blanda OTC Security QA 1 Agenda Introduction

Virtualised USB Fuzzing using QEMU and Scapy Breaking USB for Fun and Profit Tobias Mueller (c)

The Fuzzing Project https://fuzzing-project.org/ Hanno B ock 1 / 18 Introduction Motivation

Coverage-guided Fuzzing of Individual Functions Without Source Code Alessandro Di Federico

The Exploration-Exploitation Dilemma A. LAZARIC ( SequeL Team @INRIA-Lille ) ENS Cachan - Master 2

MAB Learning in IoT Networks Learning helps even in non-stationary settings! Rmi Bonnefoi

An Optimal Private Stochastic-MAB Algorithm Based on an Optimal Private Stopping Rule Touqir Sajed

Beaufort Pediatrics QTIP team Jill Aiken, MD Nan Krueger, BSN Sydney Lubkin, RN Nikki Self A

A Multi-Armed Bandit Framework for Recommendations at Netflix Jaya Kawale Elliot Chow

The Alternative Block Nondeterministially choose and execute any fragment whose guard is true

Noncommutative OSp ( 4 | 2 ) SUGRA canin 1 Dragoljub Go 1Faculty of Physics, University of

A Gang of Bandits Will Knospe, Paul Reich, Bryce Bern, Dawson dAlmeida The Problem Trying