Finding Vulnerabilities with Fuzzing, Chao Zhang, Tsinghua University

Exploring Methods for Software Vulnerability Discovery: Finding Vulnerabilities with Fuzzing
Chao Zhang, Tsinghua University
http://netsec.ccert.edu.cn/chaoz/

About Me
2004-2008-2013 → 2013-2016 → 2016-present
• Hack for fun; software and system security
• Automated vuln. discovery: Tencent CSS TSec 2nd place, 300+ CVEs
• Automated exploit mitigation: Microsoft BlueHat Prize (Special Recognition Award)
• Automated exploit generation: Tencent CSS TSec Breakthrough Prize (1st place)
• Automated attack & defense: DARPA CGC (1st in defense 2015, 2nd in offense 2016)
• Manual hacking: DEFCON CTF (2nd in 2016, 5th in 2015 and 2017)
• Goal: AlphaGo for software security
"To better defend yourself, know your enemy first." -- Sun Tzu
2020/8/22

Research Interests

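Cyberspace Security Lab (http://netsec.ccert.edu.cn/)
• Prof. Haixin Duan, Assoc. Prof. Chao Zhang, Assoc. Prof. Qi Li, Assoc. Researcher Jianwei Zhuge, and others
• Academic research
  • Research areas: network, system, and application security (AI, IoT, blockchain)
  • Academic results: among the leaders in number of papers at the four top international security conferences
  • Practical impact: prompted Google, Microsoft, the IETF, and others to improve the security of their products and protocol standards on multiple occasions
• Organizer and initiator of
  • InForSec, an international academic forum on network security research
  • XCTF, an international cybersecurity competition league
  • The "Blue-Lotus" and "Redbud" teams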

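Redbud and Blue-Lotus
"Nothing can stop your yearning for freedom ... so clear and lofty, blooming and never withering, the blue lotus."
Students who love security research are welcome to join Blue-Lotus! (Open to students of any school.)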

Vulnerability: Ghost in Cyberspace
• Valuable assets; root causes of most security incidents

Hacking Practice: DEFCON CTF

Blue-Lotus (coach)
• 2013: first time in DEFCON
• 2014: 5th place
• 2015: 5th place
• 2016: 2nd place (human vs. machine)
• 2017: 5th place
• 2018: 6th place
• 2019: 3rd place

Global
• 2013: ppp, men in black hats, raon_ASRT
• 2014: ppp, hitcon, dragonsector, blue-lotus
• 2015: defkor, ppp, 0daysober, hitcon, blue-lotus
• 2016: ppp, b1o0p, defkor, hitcon
• 2017: ppp, hitcon, a*0*e, defkor, tea-deliverers
• 2018: defkoroot, ppp, hitcon, a*0*e, sauercloud, tea-deliverers
• 2019: ppp, hitcon, tea-deliverers

DARPA Cyber Grand Challenge (Automated Offense and Defense)
CodeJitsu team captain; CQE Defense #1, CFE Offense #2

Vulnerability Discovery
• Code Review (10%?)
• Static Analysis
• Dynamic Analysis
• Taint Analysis
• Symbolic Execution
• Model Checking
• Fuzzing (80%?)

Fuzzing
• Goal: find PoC samples that prove vulnerabilities
• Solution: testing
[Figure: a generator/mutator produces inputs (how?) for the target program; a security monitor checks each run for violations and reports bugs]
• Like finding a needle in a haystack

A better strategy: Genetic Algorithm
[Figure: the fuzzing loop. Initial inputs are filtered into a seed pool; the fuzzer selects a seed, mutates it into testcases, and tests the target application; seed tracking feeds testcases back into the pool, and security tracking reports crashes as potential vulnerabilities]
• Iterative testing: keep GOOD seeds, report bugs

A better strategy: Genetic Algorithm
[Figure: the same fuzzing loop, with the target application instrumented for coverage tracking (coverage algorithm) and for security tracking (sanitizers)]
• GOOD: a seed is good if coverage increases
• Bugs: detected by sanitizers
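The loop on this slide can be sketched in a few lines. This is a minimal illustration, not AFL or any real fuzzer: `toy_target`, `mutate`, and `fuzz` are hypothetical names, the "instrumentation" is a hand-written set of branch IDs, and a boolean flag stands in for a sanitizer report. Seeds are assumed to be non-empty byte strings.

```python
import random


def toy_target(data: bytes):
    """Hypothetical instrumented target: returns (covered branch IDs, crashed?)."""
    covered = {0}
    crashed = False
    if data[:1] == b"F":
        covered.add(1)
        if data[1:2] == b"U":
            covered.add(2)
            if data[2:3] == b"Z":
                covered.add(3)
                crashed = True  # stands in for a sanitizer report
    return covered, crashed


def mutate(seed: bytes, rng: random.Random) -> bytes:
    """Overwrite one randomly chosen byte with a random value."""
    buf = bytearray(seed)
    buf[rng.randrange(len(buf))] = rng.randrange(256)
    return bytes(buf)


def fuzz(initial_seeds, iterations, rng):
    """Select -> mutate -> test -> track, keeping only coverage-increasing seeds."""
    pool = list(initial_seeds)
    global_cov = set()
    for seed in pool:
        global_cov |= toy_target(seed)[0]
    crashes = []
    for _ in range(iterations):
        seed = rng.choice(pool)            # select a seed
        child = mutate(seed, rng)          # mutate it into a testcase
        cov, crashed = toy_target(child)   # test the target, track coverage
        if crashed:
            crashes.append(child)          # report the crash
        if not cov <= global_cov:          # GOOD: coverage increased
            global_cov |= cov
            pool.append(child)
    return pool, global_cov, crashes


pool, coverage, crashes = fuzz([b"AAAA"], 200_000, random.Random(0))
print(len(pool), sorted(coverage), len(crashes))
```

Because each coverage-increasing testcase joins the pool, the nested checks are solved one byte at a time rather than all at once, which is the point of keeping GOOD seeds.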

A pioneer tool: AFL
[Figure: the same fuzzing loop, annotated with its design dimensions: seed selection policies, seed mutation policies, testing optimizations, testing environment, coverage algorithm, security sanitizers, filtering policies, and seed generation]
• Evolving: keeps only GOOD samples that contribute to code coverage
• Scalable: mutation-based, little knowledge of the target required
• Fast: fork server, persistent mode, parallel fuzzing
• Sensitive: supports different sanitizers to catch security violations

Our works
[Figure: the same fuzzing loop, annotated with the group's papers]
• MOpt (Sec19)
• GreyOne (Sec20)
• CollAFL (Oakland18)
• FANS (Sec20)
• HOTracer (Sec17)
• Vul Dist (ICSE20)

Improvement 1: Coverage & Seed Selection

IEEE S&P 2018

Observations (1)
• Collision in coverage tracking
  • "The size of the map is chosen so that collisions are sporadic with almost all of the intended targets, which usually sport between 2k and 10k …" -- from AFL's description
• AFL uses a 64KB bitmap to track edge coverage
    ; key: prev
    code in BB1
    ; key: cur
    hash = cur ⊕ (prev ≫ 1)
    bitmap[hash]++
    code in BB2
• Two edges may have the same hash, which leads to
  • Discarding GOOD seeds
  • Discarding unique crashes
  • Providing inaccurate coverage info to fuzzing policies (e.g., seed selection)
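A small sketch of the hash on this slide shows how two different edges can land in the same bitmap slot. The block keys below are hypothetical values chosen by hand to force a collision; in AFL the keys are random numbers picked at compile time. `afl_edge_hash` and `collided_edges` are illustrative names, not AFL functions.

```python
from collections import Counter

MAP_SIZE = 1 << 16  # 64KB bitmap, one byte per slot


def afl_edge_hash(prev: int, cur: int) -> int:
    """Slot index for the edge prev -> cur, using the formula on the slide."""
    return (cur ^ (prev >> 1)) % MAP_SIZE


def collided_edges(edges, key):
    """Count edges that share their bitmap slot with at least one other edge."""
    slots = Counter(afl_edge_hash(key[p], key[c]) for p, c in edges)
    return sum(n for n in slots.values() if n > 1)


# Hypothetical block keys, hand-picked so that two edges collide.
key = {"A": 0x0002, "B": 0x0010, "C": 0x0004, "D": 0x0013}
edges = [("A", "B"), ("C", "D"), ("A", "D")]

for p, c in edges:
    print(f"{p}->{c}: slot {afl_edge_hash(key[p], key[c]):#06x}")
print("collided edges:", collided_edges(edges, key))
```

Here A->B and C->D share one slot, so a testcase that newly exercises C->D after A->B has been seen looks like it adds no coverage and would be discarded.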

Observations (2)
• Few seed selection policies aim at increasing code coverage directly
  • E.g., AFLFast, VUzzer, AFLGo, QTEP, SlowFuzz
• Coverage-first seed selection policies could reach higher code coverage faster

Our Solution: CollAFL
[Figure: the same fuzzing loop, with coverage tracking and seed selection as the components CollAFL changes]
• Mitigate collisions in coverage tracking
• Apply a coverage-first seed selection policy

RQ1: Eliminate hash collisions
• AFL uses a 64KB bitmap to track edge coverage
    ; key: prev
    code in BB1
    ; key: cur
    hash = cur ⊕ (prev ≫ 1)
    bitmap[hash]++
    code in BB2

Naïve solution: increase bitmap size
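To get a feel for how much a larger bitmap helps, the following back-of-the-envelope estimate assumes that every edge hashes uniformly and independently into the bitmap. That is a simplifying assumption made here, not a measurement from the talk, and `expected_collision_ratio` is a hypothetical helper.

```python
def expected_collision_ratio(num_edges: int, map_size: int) -> float:
    """Expected fraction of edges that do not get a slot of their own.

    Assumes each edge is hashed uniformly at random into map_size slots,
    so the expected number of occupied slots is m * (1 - (1 - 1/m)^n).
    """
    if num_edges == 0:
        return 0.0
    occupied = map_size * (1.0 - (1.0 - 1.0 / map_size) ** num_edges)
    return 1.0 - occupied / num_edges


for bits in (16, 17, 18, 20):
    ratio = expected_collision_ratio(50_000, 1 << bits)
    print(f"bitmap 2^{bits}: {ratio:.2%} of 50,000 edges lost to collisions")
```

Under this model the ratio drops as the bitmap grows but never reaches zero, and a larger bitmap also has to be reset and scanned on every execution, which is the usual argument against relying on size alone.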

Our solution: intuition
• Replace the hash algorithm, without much performance loss
    ; key: prev
    code
    ; key: cur; paras: x, y, z
    old: hash = cur ⊕ (prev ≫ 1)
    new: hash = (cur ≫ x) ⊕ (prev ≫ y) + z
    bitmap[hash]++
    code
• Each block can have a different combination of parameters x, y, z
• Search parameters x, y, z for all blocks one by one, to avoid collisions
• It gets harder and harder to find parameters for the remaining blocks

Our solution: in a nutshell
• Search parameters x, y, z for multi-precedent blocks
• Construct a hash table for unsolvable multi-precedent blocks
• Assign unused hashes to single-precedent blocks
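The three steps above can be sketched as a greedy, compile-time assignment. This is an illustration under stated assumptions, not the CollAFL implementation: `assign_edge_ids`, the search ranges `shifts` and `offsets`, the operator precedence `((cur ≫ x) ⊕ (prev ≫ y)) + z`, and the toy control-flow graph are all choices made here. Each block's predecessor list is assumed to contain no duplicates.

```python
from itertools import product


def assign_edge_ids(preds, key, map_size, shifts=range(4), offsets=range(8)):
    """Give every edge (pred -> block) a unique bitmap index.

    preds: block -> list of predecessor blocks; key: block -> integer key.
    Returns (edge ids, per-block (x, y, z), blocks that need a hash table).
    """
    num_edges = sum(len(ps) for ps in preds.values())
    if num_edges > map_size:
        raise ValueError("more edges than bitmap slots: enlarge the bitmap")

    used, edge_id, params, unsolved = set(), {}, {}, []

    # Step 1: multi-precedent blocks, search per-block parameters (x, y, z).
    for blk, ps in preds.items():
        if len(ps) < 2:
            continue
        for x, y, z in product(shifts, shifts, offsets):
            ids = [(((key[blk] >> x) ^ (key[p] >> y)) + z) % map_size for p in ps]
            if len(set(ids)) == len(ids) and used.isdisjoint(ids):
                used.update(ids)
                edge_id.update({(p, blk): i for p, i in zip(ps, ids)})
                params[blk] = (x, y, z)
                break
        else:
            unsolved.append(blk)  # no parameters found in the search range

    free = (i for i in range(map_size) if i not in used)

    # Step 2: unsolvable multi-precedent blocks get an explicit lookup table.
    for blk in unsolved:
        for p in preds[blk]:
            edge_id[(p, blk)] = next(free)

    # Step 3: single-precedent blocks get one unused constant each.
    for blk, ps in preds.items():
        if len(ps) == 1:
            edge_id[(ps[0], blk)] = next(free)

    return edge_id, params, unsolved


# Toy CFG: C and F share a key, so block E cannot be solved by any (x, y, z).
preds = {"B": ["A"], "C": ["A"], "F": ["A"], "D": ["B", "C"], "E": ["C", "F"]}
key = {"A": 1, "B": 2, "C": 3, "F": 3, "D": 0x10, "E": 0x20}
edge_id, params, unsolved = assign_edge_ids(preds, key, map_size=64)
print(edge_id, params, unsolved)
```

At run time only the blocks in `params` would compute a hash; the table and the constants are lookups, which matches the slide's point that the scheme costs little.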

Performance of Collision Mitigation
• The bitmap is enlarged when the edge count exceeds the bitmap size; otherwise collisions are inevitable.
• Most basic blocks (BBs) have only one precedent, so their edges use pre-assigned hashes; this saves hash computation and improves runtime performance.

RQ2: Coverage-first seed selection
• Prioritize seeds with more untouched branches
[Figure: the path explored by a seed, with the code branching off that path marked as touched or untouched]
• Mutations on these seeds are more likely to exercise those untouched branches, contributing to coverage
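One way to read this policy is as a per-seed score: count the branches that leave the seed's path and that no seed has exercised yet. The sketch below assumes each seed's path is known as a list of basic blocks and that the control-flow graph is available as a successor map; `untouched_branch_count`, `prioritize`, and the toy graph are hypothetical.

```python
def untouched_branch_count(path, succ, touched):
    """Number of edges leaving blocks on this path that no seed has touched."""
    return sum(
        1
        for block in set(path)
        for nxt in succ.get(block, ())
        if (block, nxt) not in touched
    )


def prioritize(seed_paths, succ):
    """Rank seeds so that those with more untouched neighbor branches come first."""
    touched = set()
    for path in seed_paths.values():
        touched.update(zip(path, path[1:]))  # edges exercised by any seed
    score = {
        name: untouched_branch_count(path, succ, touched)
        for name, path in seed_paths.items()
    }
    ranking = sorted(score, key=score.get, reverse=True)
    return ranking, score


# Toy CFG (successor map) and the block paths of two seeds.
succ = {"A": ["B", "C"], "B": ["D", "E"], "C": ["F"], "D": ["G", "H"]}
seed_paths = {"s1": ["A", "C", "F"], "s2": ["A", "B", "D", "G"]}

ranking, score = prioritize(seed_paths, succ)
print(ranking, score)
```

Seed s2 runs past two branches nobody has taken (B->E and D->H), so it is fuzzed first; s1 has no untouched neighbors left.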

Evaluation: Code Coverage
• 20% more paths than AFL
[Figure: code coverage results, shown for collision mitigation only and for collision mitigation with the extra untouched-branch seed selection policy]

Evaluation: Crashes
• On average, 320% more unique crashes than AFL (CollAFL-br)

Evaluation: Vulnerabilities
• 134 new bugs, 23 collided bugs, 95 CVEs, 9 ACE

Improvement 2: Seed Mutation & Tracking

USENIX Security 2020

• Where to mutate?
  • input[0:8]
• How to mutate?
  • MAGICHDR
• Seed prioritization
  • 1 byte match vs. 7 bytes match
Data flow information is useful for fuzzing

What types of data-flow features?
• Taint attributes
  • Dependency between inputs and variables
• Branch value conformance
  • Distance between branch condition operands
  • The higher the conformance, the closer the distance
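The MAGICHDR example from the previous slide makes the conformance idea concrete. The sketch below assumes the branch compares input[0:8] against the constant, and measures conformance once per byte and once per bit; both measures and the names `byte_matches`, `bit_conformance`, and `prioritize` are illustrative choices made here, and operands are assumed to have equal length.

```python
MAGIC = b"MAGICHDR"  # constant operand of the branch, from the slide


def byte_matches(a: bytes, b: bytes) -> int:
    """Number of positions where the two operands hold the same byte."""
    return sum(x == y for x, y in zip(a, b))


def bit_conformance(a: bytes, b: bytes) -> int:
    """Number of equal bits between the two operands (higher means closer)."""
    return sum(8 - bin(x ^ y).count("1") for x, y in zip(a, b))


def prioritize(seeds, expected=MAGIC):
    """Order seeds by how closely input[0:len(expected)] conforms to the constant."""
    n = len(expected)
    return sorted(seeds, key=lambda s: bit_conformance(s[:n], expected), reverse=True)


seeds = [b"MXXXXXXX-body", b"MAGICHDX-body"]
for seed in seeds:
    head = seed[:8]
    print(head, byte_matches(head, MAGIC), bit_conformance(head, MAGIC))
print(prioritize(seeds)[0])
```

A seed with 7 matching bytes is only a couple of bit flips away from passing the check, so it is worth more mutation effort than a seed with 1 matching byte even though both currently fail the branch.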
