When Malware is Packin' Heat; Limits of Machine Learning Classifiers - PowerPoint PPT Presentation



slide-1
SLIDE 1

When Malware is Packin’ Heat;

Limits of Machine Learning Classifiers Based on Static Analysis Features

Hojjat Aghakhani, Fabio Gritti, Francesco Mecca, Martina Lindorfer, Stefano Ortolani, Davide Balzarotti, Giovanni Vigna, Christopher Kruegel

slide-2
SLIDE 2

Packing

2

[Diagram: the packing process transforms the original file (PE header, .text, .data, .rsrc, .rdata, …) into a packed file (PE header, packed section(s), decompression stub)]

slide-3
SLIDE 3

Packing

3

[Diagram: at runtime, the unpacking routine of the packed file (PE header, packed section(s), decompression stub) restores the original program (PE header, .text, .data, .rsrc, .rdata, …) in memory]

slide-4
SLIDE 4

Packing Employed By Malware Authors

4


slide-10
SLIDE 10

Packing Evolution

  • Most packers are not this simple anymore...
  • Different methods of obfuscation or encryption are being used
  • Packing happens at multiple layers
  • Unpacking routines are not necessarily executed in a straight line
  • Only a single fragment of the original code at any given time
  • Usually anti-debugging or anti-reverse-engineering techniques are employed

10
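Because packers compress or encrypt the payload (the obfuscation and encryption mentioned above), packed sections tend to look like random bytes. A minimal sketch of the Shannon-entropy heuristic commonly used to flag packed sections — hypothetical helper code, not part of the talk:

```python
import math
from collections import Counter

def shannon_entropy(data: bytes) -> float:
    """Shannon entropy in bits per byte; ~8.0 for compressed/encrypted data."""
    if not data:
        return 0.0
    counts = Counter(data)
    n = len(data)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

# Plain text has low entropy; a uniform byte distribution hits the 8.0 maximum.
text = b"the quick brown fox jumps over the lazy dog " * 100
uniform = bytes(range(256)) * 16

print(shannon_entropy(text))     # low (below 5 bits per byte)
print(shannon_entropy(uniform))  # 8.0
```

Tools often treat sections above roughly 7 bits per byte as likely packed, though packers can deliberately lower entropy to dodge this check.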


slide-13
SLIDE 13

Why Does Packing Matter?

  • It hampers the analysis of the code
  • Makes malware classification more challenging!
  • Especially, when using only static analysis

13

slide-14
SLIDE 14

Malware Classification Using Static Analysis

16

[Diagram: anti-malware companies combine static analysis + machine learning with dynamic analysis]

slide-15
SLIDE 15
Malware Classification Using Static Analysis

  • What happens if the program is packed, i.e., the features are obfuscated?

17

[Diagram: anti-malware companies combine static analysis + machine learning with dynamic analysis]

slide-16
SLIDE 16

Do Benign Software Programs Use Packing?

18

[Diagram: programs split along two axes — packed vs. not packed, malicious vs. benign]

slide-17
SLIDE 17

Packing Is Common in Benign Programs

19

slide-18
SLIDE 18

Packing Is Common in Benign Programs

  • Rahbarinia et al. [84], who studied 3 million web-based software downloads over 7 months in 2014, found that both malicious and benign files use known packers (58% and 54%, respectively)

20

  • B. Rahbarinia, M. Balduzzi, and R. Perdisci, “Exploring the Long Tail of (Malicious) Software Downloads,” in Proc. of the International Conference on Dependable Systems and Networks (DSN), 2017.

slide-19
SLIDE 19

21

“Packing == Malicious”

slide-20
SLIDE 20

“Packing == Malicious” on VirusTotal?

22

[Experiment: take 613 Windows 10 binaries located in C:\Windows\System32, pack them with Themida, and submit them to VirusTotal]

slide-21
SLIDE 21

Dataset Pollution

24

slide-22
SLIDE 22

Does static analysis on packed binaries provide rich enough features to a malware classifier?

25


slide-24
SLIDE 24

Datasets

  • 1. Wild Dataset (50,724 executables):
  • 4,396 unpacked benign
  • 12,647 packed benign
  • 33,681 packed malicious
  • 2. Lab Dataset:

27

[Diagram: samples from the Wild Dataset packed with 9 packers (including Themida, PECompact, UPX, …), yielding 91,987 benign and 198,734 malicious samples]

slide-25
SLIDE 25

Nine Feature Categories

28

Category          # Features
PE headers        28
PE sections       570
DLL imports       4,305
API imports       19,168
Rich Header       66
Byte n-grams      13,000
Opcode n-grams    2,500
Strings           16,900
File generic      2
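To make one of these categories concrete, here is a sketch of how byte n-gram features can be extracted from a binary. This is hypothetical illustration code, not the authors' feature pipeline:

```python
from collections import Counter

def byte_ngrams(data: bytes, n: int = 2) -> Counter:
    """Count overlapping n-byte sequences; the most frequent n-grams
    across the corpus typically become the feature columns."""
    return Counter(data[i:i + n] for i in range(len(data) - n + 1))

# A toy buffer starting with the PE 'MZ' magic bytes.
sample = b"\x4d\x5a\x90\x00\x4d\x5a"
grams = byte_ngrams(sample, n=2)
print(grams[b"\x4d\x5a"])  # 2 — the 'MZ' bigram occurs twice
```

In practice the counts are computed over many samples and only the top-k n-grams are kept (13,000 byte n-grams in the table above), since the full 2-gram space already has 65,536 dimensions.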

slide-26
SLIDE 26

Our Research Questions

  • 1. Do packers preserve static analysis features that are useful for malware classification?

29


slide-28
SLIDE 28

Experiment “Different Packed Ratios (lab)”

  • 1. We exclude packed benign samples from the training set.
  • 2. Then, we keep adding more packed benign samples to the training set.

  • Surprisingly, the classifier is doing OK!

34
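The setup above can be sketched as building a sequence of training sets with an increasing fraction of packed benign samples while the set size stays fixed. All names here are hypothetical, not the authors' code:

```python
import random

def make_training_set(unpacked_benign, packed_benign, malicious,
                      packed_ratio, seed=0):
    """Build a training set whose benign half contains `packed_ratio`
    packed samples, the rest unpacked; the malicious half is unchanged."""
    rng = random.Random(seed)
    n_benign = len(unpacked_benign)
    n_packed = int(packed_ratio * n_benign)
    benign = (rng.sample(packed_benign, n_packed) +
              rng.sample(unpacked_benign, n_benign - n_packed))
    return benign + list(malicious)

# Toy corpus: 100 unpacked benign, 200 packed benign, 100 malicious samples.
train = make_training_set(list(range(100)), list(range(100, 300)),
                          list(range(300, 400)), packed_ratio=0.5)
print(len(train))  # 200 — 100 benign (half packed) + 100 malicious
```

Sweeping `packed_ratio` from 0 upward reproduces the experiment's shape: the ratio of packed benign samples changes while everything else is held constant.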

slide-29
SLIDE 29

But, How??

  • We focused on one packer at a time to identify useful features for each packer!

  • 1. Some packers (e.g., Themida) often keep the Rich Header.
  • 2. Packers often keep .CAB file headers in the resource sections of the executables.
  • 3. UPX keeps one API for each DLL.

35

slide-30
SLIDE 30

Our Research Questions

  • 1. Do packers preserve static analysis features that are useful for malware classification?

  • 2. Can a classifier that is carefully trained and not biased towards specific packing routines perform well in real-world scenarios?

36

Packers preserve some information when packing programs that may be “useful” for malware classification; however, such information does not necessarily represent the real nature of the samples.


slide-32
SLIDE 32

Our Research Questions

  • 1. Do packers preserve static analysis features that are useful for malware classification?

  • 2. Can such a classifier perform well in real-world scenarios?

38

  • Generalization to unseen packers
  • Adversarial examples

slide-33
SLIDE 33

Generalization To Unseen Packers

  • Runtime packers are evolving, and malware authors often tend to use their own custom packers

39

slide-34
SLIDE 34

Generalization To Unseen Packers

  • 1. Experiment “withheld packer”

42

[Diagram: “withheld packer” split — eight of the nine packers (Themida, tElock, UPX, kkrunchy, Obsidium, Petite, MPRESS, PELock, PECompact) in the training set, the withheld packer only in the test set]

slide-35
SLIDE 35

Generalization To Unseen Packers

  • 1. Experiment “withheld packer”

43

Withheld Packer    FPR (%)    FNR (%)
PELock             7.30       3.74
PECompact          47.49      2.14
Obsidium           17.42      3.32
Petite             5.16       4.47
tElock             43.65      2.02
Themida            6.21       3.29
MPRESS             5.43       4.53
kkrunchy           83.06      2.50
UPX                11.21      4.34
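The “withheld packer” experiment is a leave-one-group-out split, grouping by packer. A sketch of the split logic in plain Python — hypothetical field names, not the authors' code:

```python
PACKERS = ["Themida", "tElock", "UPX", "kkrunchy", "Obsidium",
           "Petite", "MPRESS", "PELock", "PECompact"]

def withheld_packer_split(samples, withheld):
    """Train on samples packed with every packer except `withheld`;
    test only on samples packed with `withheld`."""
    train = [s for s in samples if s["packer"] != withheld]
    test = [s for s in samples if s["packer"] == withheld]
    return train, test

# Toy corpus: three samples per packer, alternating labels.
samples = [{"packer": p, "label": i % 2} for i, p in enumerate(PACKERS * 3)]
train, test = withheld_packer_split(samples, "UPX")
print(len(train), len(test))  # 24 3
```

Running this once per packer yields one (FPR, FNR) row per withheld packer, matching the shape of the table above.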


slide-37
SLIDE 37

Generalization To Unseen Packers

  • 2. Experiment “lab against wild”
  • We train the classifier on the Lab Dataset
  • And evaluate it on packed executables in the Wild Dataset
  • We observed a false negative rate of 41.84% and a false positive rate of 7.27%

45

slide-38
SLIDE 38

Poor Generalization To Unseen Packers

46


slide-41
SLIDE 41

Adversarial Examples

  • Machine-learning-based malware detectors have been shown to be vulnerable to adversarial samples

  • Packing produces features not directly deriving from the actual (unpacked) program

  • Generating such adversarial samples would be easier for an adversary

49

slide-42
SLIDE 42

Adversarial Examples

50

[Diagram, training: a random forest model is trained on packed malicious, packed benign, and unpacked benign samples]

slide-43
SLIDE 43

Adversarial Examples

51

[Diagram, evasion: the trained model predicts a packed malicious test sample as malicious]

slide-44
SLIDE 44

Adversarial Examples

52

[Diagram, evasion: after benign strings are appended to the packed malicious test sample, the trained model predicts it as benign]


slide-46
SLIDE 46

Machine Learning Static Evasion Competition

54

[Diagram: appending benign strings to 150 malicious samples achieved 50% evasion]

  • Recently, a group of researchers found a very similar way to subvert an AI-based anti-malware engine

  • By simply taking strings from an online gaming program and appending them to known malware, like WannaCry
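This trick works because bytes appended after a PE's declared sections (the “overlay”) are ignored by the Windows loader but still visible to a static feature extractor. A sketch of the idea — hypothetical code, not the competition entries:

```python
def append_overlay(pe_bytes: bytes, strings: list) -> bytes:
    """Append NUL-separated benign-looking strings as overlay data.

    The program's mapped sections and behavior are untouched, but
    string-based static features now include the appended content.
    """
    return pe_bytes + b"\x00" + b"\x00".join(strings) + b"\x00"

# Toy stand-in for a packed malicious binary (not a real PE).
payload = b"MZ...original packed malware bytes..."
evasive = append_overlay(payload, [b"Microsoft Word", b"Copyright (c) 2019"])

assert evasive.startswith(payload)  # original bytes are untouched
print(len(evasive) - len(payload))  # 35 — size of the appended overlay
```

Checksums and signatures change, of course; the point is that string and byte-distribution features shift toward “benign” while the executable code does not change at all.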

slide-47
SLIDE 47

Vulnerable To Trivial Adversarial Examples

56

slide-48
SLIDE 48

Conclusion

57

[Diagram: to obtain an unbiased model, the training set must include packed malicious, packed benign, and unpacked benign samples]

slide-49
SLIDE 49

Conclusion

58

[Diagram: features preserved by packers — Rich Header, .CAB headers, API imports, manifest strings]

slide-50
SLIDE 50

Reproducibility

  • The source code and our datasets of 392,168 executables are available at https://github.com/ucsb-seclab/packware

  • All experiments can be simply executed in the provided Docker image

59

slide-51
SLIDE 51

Any Questions?

60

slide-52
SLIDE 52

Experiment “Good-Bad Packers”

61

[Diagram: “good-bad packers” training set — benign samples packed with one subset of the nine packers (Themida, tElock, UPX, kkrunchy), malicious samples with the rest (Obsidium, Petite, MPRESS, PELock, PECompact)]

slide-53
SLIDE 53

Experiment “Good-Bad Packers”

62

[Diagram: the test set swaps the packer assignment — benign samples packed with the packers used for malicious training samples, and vice versa]

Accuracy varied from 0.01% to 12.57% across all splits