Low-level Features for Multinomial Malware Classification Sergii - PowerPoint PPT Presentation

Low-level Features for Multinomial Malware Classification Sergii Banin 10.05.2017 Sergii Banin, COINS Finse Winter School, 1 10.05.2017

Agenda • Introduction (problem description). • Previous research. • Malware classification approaches. • Related studies. • Applying of low-level features for malware classification. Sergii Banin, COINS Finse Winter School, 2 10.05.2017

Introduction (problem description) • Signature-based malware detection is not robust against simple obfuscation techniques. • Static properties can be easily changed. • Dynamic analysis can aid and outperform static analysis. • Malware developers try to conceal malware’s functionality. • It is impossible to avoid execution on the hardware. • Thus we propose to analyze hardware or low-level activity produced by malware. Sergii Banin, COINS Finse Winter School, 3 10.05.2017

Previous Research • Memory access patterns were used for malware detection. • Record sequence of memory read and write accesses ( memtraces ) with a help of dynamic binary instrumentation tool Intel Pin. • Extract n-grams from memtrace sequence and use them as features. • Select best features . • Train Machine Learning models. • Assess classification accuracy achieved by ML models. • kNN and ANN achieved classification accuracy of 98.92% for 1,000,000 memtraces, 800 features and 96-grams. Sergii Banin, COINS Finse Winter School, 4 10.05.2017

Next Step • Apply low-level features for malware classification. BUT • Why do we need malware classification? • How it is usually performed? • Is it possible to apply low-level features for malware classification? Sergii Banin, COINS Finse Winter School, 5 10.05.2017

(Low-level Features for) Multinomial Malware Classification Sergii Banin 10.05.2017 Sergii Banin, COINS Finse Winter School, 6 10.05.2017

Agenda • Problem description. • Malware classification. • Malware families and types. • Reasons for separating malware by families and types. • Related studies. Sergii Banin, COINS Finse Winter School, 7 10.05.2017

Problem description. • Inconsistent terminology (family/type). • Dozens of malware naming systems. • CARO (Computer AntiVirus Researcher’s Organization ) naming system. [http://www.caro.org/articles/naming.html] • Common Malware Enumeration (CME) initiative by MITRE. [https://cme.mitre.org/about/] • Naming is usually made to describe malware’s target platform, functionality, variation of a certain sample, etc. [http://security.di.unimi.it/~roberto/teaching/vigorelli/0607/malware/material/caro.pdf, https://zeltser.com/malware-naming-approaches/, https://www.microsoft.com/en- us/security/portal/mmpc/shared/malwarenaming.aspx ] Sergii Banin, COINS Finse Winter School, 8 10.05.2017

Problem description (2) Sergii Banin, COINS Finse Winter School, 9 10.05.2017

Malware classification With classification we can: • assign threat level • assess possible harm • apply countermeasures • perform post-attack actions Sergii Banin, COINS Finse Winter School, 10 10.05.2017

Malware classification (types) • Malware types: Trojan, Virus, Hoax, Ransomware, Adware, Spyware. • Malware type is assigned by general functionality.  E.g. Viruses are self-replicating malware, and Ransomware encrypts user data while asking for ransom to be paid. • Certain type can have different subtypes assigned by actions performed on the victim system.  Trojan-Bankers are designed to steal account data from online banking.  Backdoor Trojans give malicious users remote control over the infected computer. Sergii Banin, COINS Finse Winter School, 11 10.05.2017

Malware classification (families) • Malware families are the malware categorization based on the particular functionality. • For describing malware families the following functionality could be used:  Which files are created/modified by a malware.  Which registry entries are affected by it.  Affected drivers.  Commands run by malware.  E.g. Win32/Ursnif (Gozi) (according to Microsoft Malware Protection Center) can run from PDF, MSI or EXE file. Create files in system directories, change registry entries related to software protection, capture screenshots, steal cookies, download and install new executables etc. Sergii Banin, COINS Finse Winter School, 12 10.05.2017

Reasons for separating malware by families and types. • Classification allows to assign threat level and assess possible harm. • Knowledge about the type of malware allows to apply particular counter-measures. • Knowledge about the family of malware allows to perform exact actions for restoring and cleaning the system. Sergii Banin, COINS Finse Winter School, 13 10.05.2017

Related studies • SANS in their paper (Malware 101 - Viruses) suggest the following virus classification strategy:  How malware stay in memory: resident, temporary resident, user process, kernel process.  Spreading methods: compiled, interpreted, multipartite.  Obfuscation techniques: no obfuscation, encryption, metamorphism, polymorphism, stealth.  By payload type: no payload, non-destructive, destructive, droppers. Sergii Banin, COINS Finse Winter School, 14 10.05.2017

Related studies SANS Sergii Banin, COINS Finse Winter School, 15 10.05.2017

Ways to perform multinomial malware detection • In the literature there are different ways of performing multinomial malware classification:  API calls, Byte and opcode n-grams, opcode frequencies.  API call sequences, control flow, autostart exstensibility points.  System state changes (through VM slices), call graph analysis, clustering via filesystem/registry/network activity. [Malware Analysis and Classification: A Survey Ekta Gandotra , Divya Bansal , Sanjeev Sofat] Sergii Banin, COINS Finse Winter School, 16 10.05.2017

Application of low-level features for multinomial classification. (Based on SANS taxonomy) • Memory activity can be analyzed on the level of single operations. (ongoing research) • Malware obfuscation-related activity could be traced within memory and CPU. • Interpreted viruses can be analyzed while interpreter is active. • Boot sector infection can be traced via HDD operations. Sergii Banin, COINS Finse Winter School, 17 10.05.2017

Conclusions and Suggestions. • Different ways of classification. • Properly described clustering can work better than well-known taxonomies. • Application of low-level activity can improve stealthy detection and new classification methods. Sergii Banin, COINS Finse Winter School, 18 10.05.2017

Thank you. Sergii Banin, COINS Finse Winter School, 19 10.05.2017

Low-level Features for Multinomial Malware Classification Sergii - PowerPoint PPT Presentation

Low-level Features for Multinomial Malware Classification Sergii Banin 10.05.2017 Sergii Banin, COINS Finse Winter School, 1 10.05.2017 Agenda Introduction (problem description). Previous research. Malware classification

Malware Obfuscation Techniques: Packing November 18, 2014 Malware and packing Not packed (20%)

Linux malware presentation @r00tbsd Paul Rascagnres Malware.lu July 2013 @r00tbsd

Low Level Low Level Low Level Low Level Detection of Detection of Detection of Detection of

CS7038 - Malware Analysis - Wk03.1 Malware Taxonomy and Terminology Coleman Kane

GOODWARE DRUGS FOR MALWARE: ON-THE-FLY MALWARE ANALYSIS AND CONTAINMENT DAMIANO BOLZONI

Entrapment: Tricking Malware with Transparent, Scalable Malware Analysis Paul Royal

Malware Halting 1. Malware 2. Software diversity Part I: Method Development 3. Computer

Android Malware Analysis on Attacks and Defense Android malware Android malware With the

Malware What is malware? Malware: malicious software worm ransomware adware

On Static Malware Detection Tayssir Touili LIPN, CNRS & Univ. Paris 13 Motivation: Malware

Android Malware Adventures Mert Can Cokuner Krat Ouzhan Aknc Android Malware

Malware What is malware? Malware: malicious software worm ransomware adware

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

Fast Algorithm for Generalized Multinomial Models with Ranking Data Jiaqi Gu (Joint work with

Chapter 11 Categorical Data Analysis Categorical Data and the Multinomial Distribution

Real Real- -Time Systems Time Systems Low- Low -level programming level programming Low-

Risky Business: How Companies Fall Victim to Fraud Presented by: Tony Okray Julie Latchaw

MAY 2020 This presentation and the accompanying slides (the Presentation), which have been

Tech Terminology Overview Devices Desktop Laptop Tablet Smartphone

EU Data Protection Compliance Trends - What US Companies Need to Know 30 January 2013 Session

OVERVIEW OF DDOS, RANSOMWARE, MALWARE.& ALL THINGS GENERALLY UNPLEASANT (HOPE YOU ENJOY

Insurance Service Providers and the Communications Circle March 10 th , 2015 Presented by: Sean

Security Partners or Security Police? Presented by: Janna Loeffler

Brilliant presentation richard hall pdf Mirror Link #1 2007-12-26 16 52 -- d-w C Documents and

Low-level Features for Multinomial Malware Classification Sergii - PowerPoint PPT Presentation

Low-level Features for Multinomial Malware Classification Sergii Banin 10.05.2017 Sergii Banin, COINS Finse Winter School, 1 10.05.2017 Agenda Introduction (problem description). Previous research. Malware classification

Malware Obfuscation Techniques: Packing November 18, 2014 Malware and packing Not packed (20%)

Linux malware presentation @r00tbsd Paul Rascagnres Malware.lu July 2013 @r00tbsd

Low Level Low Level Low Level Low Level Detection of Detection of Detection of Detection of

CS7038 - Malware Analysis - Wk03.1 Malware Taxonomy and Terminology Coleman Kane

GOODWARE DRUGS FOR MALWARE: ON-THE-FLY MALWARE ANALYSIS AND CONTAINMENT DAMIANO BOLZONI

Entrapment: Tricking Malware with Transparent, Scalable Malware Analysis Paul Royal

Malware Halting 1. Malware 2. Software diversity Part I: Method Development 3. Computer

Android Malware Analysis on Attacks and Defense Android malware Android malware With the

Malware What is malware? Malware: malicious software worm ransomware adware

On Static Malware Detection Tayssir Touili LIPN, CNRS &amp; Univ. Paris 13 Motivation: Malware

Android Malware Adventures Mert Can Cokuner Krat Ouzhan Aknc Android Malware

Malware What is malware? Malware: malicious software worm ransomware adware

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

Fast Algorithm for Generalized Multinomial Models with Ranking Data Jiaqi Gu (Joint work with

Chapter 11 Categorical Data Analysis Categorical Data and the Multinomial Distribution

Real Real- -Time Systems Time Systems Low- Low -level programming level programming Low-

Risky Business: How Companies Fall Victim to Fraud Presented by: Tony Okray Julie Latchaw

MAY 2020 This presentation and the accompanying slides (the Presentation), which have been

Tech Terminology Overview Devices Desktop Laptop Tablet Smartphone

EU Data Protection Compliance Trends - What US Companies Need to Know 30 January 2013 Session

OVERVIEW OF DDOS, RANSOMWARE, MALWARE.&amp; ALL THINGS GENERALLY UNPLEASANT (HOPE YOU ENJOY

Insurance Service Providers and the Communications Circle March 10 th , 2015 Presented by: Sean

Security Partners or Security Police? Presented by: Janna Loeffler

Brilliant presentation richard hall pdf Mirror Link #1 2007-12-26 16 52 -- d-w C Documents and

On Static Malware Detection Tayssir Touili LIPN, CNRS & Univ. Paris 13 Motivation: Malware

OVERVIEW OF DDOS, RANSOMWARE, MALWARE.& ALL THINGS GENERALLY UNPLEASANT (HOPE YOU ENJOY