M ARVIN : Efficient And Comprehensive Mobile App Classification - PowerPoint PPT Presentation

M ARVIN : Efficient And Comprehensive Mobile App Classification   Through Static and Dynamic Analysis Martina Lindorfer, Matthias Neugschwandtner, Christian Platzer SBA Research, Vienna, Austria IBM Research, Zurich, Switzerland International Secure Systems Lab, Vienna University of Technology, Austria

State of Mobile Malware Martina Lindorfer: M ARVIN (COMPSAC 2015) 2

Real or Fake Flappy Bird App? Origin ? Reviews Permissions Appverify Antivirus Martina Lindorfer: M ARVIN (COMPSAC 2015) 3

Use Cases 2:30 SELECT * FROM ¡ apps ¡ ¡ ¡ ¡ ¡ ¡ WHERE ¡ malice_score > 5.0 ¡ ¡ ¡ ¡ ¡ AND ¡ has_nw_traffic = True ¡ ... Martina Lindorfer: M ARVIN (COMPSAC 2015) 4

Outline • App Classification App Classification • Evaluation • Future Work and Conclusion Martina Lindorfer: M ARVIN (COMPSAC 2015) 5

Classification Goals • Use machine learning to classify Android apps • Address grey area between malware and goodware - Provide user with a malice score from 0 to 10 • Address drawbacks of related work - Only consider static features - Trained and evaluated on very small dataset - Do not account for history of dataset • Long-term practicality through efficient retraining Martina Lindorfer: M ARVIN (COMPSAC 2015) 6

System Overview Reference Apps End-User Apps Feature Extraction Feature Training Model Selection Static Analysis TRAINING MODE CLASSIFICATION MODE Dynamic Analysis Classification Malice Score Martina Lindorfer: M ARVIN (COMPSAC 2015) 7

Static vs. Dynamic Analysis • Static analysis… - code is not executed - all possible branches can be examined (in theory) - quite fast • Problems of static analysis… - undecidable in general case, approximations necessary - obfuscated & packed code - self-modifying code - code (down)loaded at runtime Martina Lindorfer: M ARVIN (COMPSAC 2015) 8

Static vs. Dynamic Analysis • Dynamic analysis… - code is executed - sees behavior that is actually executed - sees dynamically loaded code • Problems of dynamic analysis… - in general, single path is examined - analysis environment possibly not invisible - scalability issues Combine features from static AND dynamic analysis Martina Lindorfer: M ARVIN (COMPSAC 2015) 9

Feature Extraction in A NDRUBIS • Extended A NDRUBIS app analysis sandbox [BADGERS2014] • Static Analysis - Required/Used permissions, Activities, Services, Receivers, … - Certificate metadata (owner, validity, …) - Included libraries • Dynamic Analysis - File/network/phone activities - Cryptographic operations - Leaked data - Loading of dynamic code (DEX and native code) • Output: Sparse feature vector of binary features Martina Lindorfer: M ARVIN (COMPSAC 2015) 10

System Overview Reference Apps End-User Apps Feature Extraction Feature Training Model Selection Static Analysis TRAINING MODE CLASSIFICATION MODE Dynamic Analysis Classification Malice Score Martina Lindorfer: M ARVIN (COMPSAC 2015) 11

Classification Challenges • High-dimensional feature space - Explicit feature selection:   Order features by discriminative power (F-Score) - Implicit feature selection:   Order features by weights from classifier • Sparse data • Grey area between malware and goodware - Classifier outputs probability that sample belongs to class - Scale probability in interval [0,10] • Performance Experiments with SVM and linear classifier with different regularization methods Martina Lindorfer: M ARVIN (COMPSAC 2015) 12

Outline • App Classification • Evaluation Evaluation • Future Work and Conclusion Martina Lindorfer: M ARVIN (COMPSAC 2015) 13

Evaluation Overview • Large training and testing sets - Set of goodware apps from Google Play Store - Set of known malware with AV labels from VirusTotal - 135,823 unique Android applications (15,741 known malware) Goals: 1. Evaluate accuracy of different classifiers 2. Evaluate performance (market-scale classification) 3. Evaluate long-term practicality - History of samples in dataset matters [ESSoS2015] - Estimate retraining intervals and efficiency 4. Evaluate most distinguishing features Martina Lindorfer: M ARVIN (COMPSAC 2015) 14

Classification Accuracy • Accuracy of 99.83% overall • 0.0275% false positives • 1.3543% false negative • Bayesian detection rate of 98.24% Martina Lindorfer: M ARVIN (COMPSAC 2015) 15

  Market-Scale Classification ~ 1,500,000 apps in Google Play —> Best config: 58.5 false alarms —> Worst config: 471 false alarms Martina Lindorfer: M ARVIN (COMPSAC 2015) 16

Market-Scale Classification Google Play: up to 45,000 new apps per month Our current capacity: 3,500 apps/day Martina Lindorfer: M ARVIN (COMPSAC 2015) 17

Long-Term Practicality (Less Features) Martina Lindorfer: M ARVIN (COMPSAC 2015) 18

Long-Term Practicality (More Features) Martina Lindorfer: M ARVIN (COMPSAC 2015) 19

Distinguishing Features • Gain insights into classification through F-Score/feature weights • Features most relevant for classification of malware: - Required/Used permissions - Certificates - SMS-related features - Information leaks - Dynamic code loading - Network activity and contacted hosts Martina Lindorfer: M ARVIN (COMPSAC 2015) 20

Outline • App Classification • Evaluation • Future Work and Conclusion Future Work and Conclusion Martina Lindorfer: M ARVIN (COMPSAC 2015) 21

Future Work • Dynamic features++ - System-level events from native code analysis - More intelligent, user-like UI interactions • Static features ++ - Meta info in app markets from AndRadar [DIMVA2014] • Interception of app installation process • Defence against analysis evasion (arms race) Martina Lindorfer: M ARVIN (COMPSAC 2015) 22

Conclusion • Classification of Android apps using machine learning - Based on static AND dynamic features - Represented as a malice score • Large-scale evaluation on over 135,000 apps - Correctly classifies 98.24% of malware samples - Very low positives of < 0.04% - Retraining to maintain accuracy • Publicly available for submissions through web interface and dedicated mobile app Martina Lindorfer: M ARVIN (COMPSAC 2015) 23

  Questions? email mlindorfer@iseclab.org   andrubis@iseclab.org twitter @iseclaborg http http://www.iseclab.org/people/mlindorfer https://anubis.iseclab.org https://play.google.com/store/apps/details?id=org.iseclab.andrubis Martina Lindorfer: M ARVIN (COMPSAC 2015) 24

References [BADGERS2014] Martina Lindorfer, Matthias Neugschwandtner, Lukas Weichselbaum, Yanick Fratantonio, Victor van der Veen, Christian Platzer   Andrubis - 1,000,000 Apps Later: A View on Current Android Malware Behaviors   International Workshop on Building Analysis Datasets and Gathering Experience Returns for Security (BADGERS), 2014. [ESSoS2015] Kevin Allix, Tegawendé F. Bissyandé, Jacques Klein, Yves Le Traon   Are Your Training Datasets Yet Relevant?   International Symposium on Engineering Secure Software and Systems (ESSoS), 2015. [DIMVA2014] Martina Lindorfer, Stamatis Volanis, Alessandro Sisto, Matthias Neugschwandtner, Elias Athanasopoulos, Federico Maggi, Christian Platzer, Stefano Zanero, Sotiris Ioannidis   AndRadar: Fast Discovery of Android Applications in Alternative Markets   Conference on Detection of Intrusions and Malware & Vulnerability Assessment (DIMVA), 2014. Martina Lindorfer: M ARVIN (COMPSAC 2015) 25

M ARVIN : Efficient And Comprehensive Mobile App Classification - PowerPoint PPT Presentation

M ARVIN : Efficient And Comprehensive Mobile App Classification Through Static and Dynamic Analysis Martina Lindorfer, Matthias Neugschwandtner, Christian Platzer SBA Research, Vienna, Austria IBM Research, Zurich, Switzerland International

App App App App App App App App App App App App App App App App App App App App App App

MOBILE APP TUTORIAL TPL Trakker Mobile App How to Download the Mobile App? Step 1: Tap on

Horizontal Vertically integrated Open interfaces Closed, proprietary Rapid innovation Slow

Michaelhouse Old Boys Mobile APP How do I get the Mobile APP? The Michaelhouse Old Boys Mobile

MOBILE APP PDF Client Presentation FREE APP FOR ALL The ultimate Mobile App for Fans, Drivers

Sefos A self-aware factored operating system A Traditional OS App 1 App 2 App 3 System call

MOBILE MARKETING SOLUTIONS MOBILE APP LANDING PAGE FOR SUSBCRIPTIONS ASO (APP

mobile measurements: the mobile app / OS perspective Why understanding mobile app performance is

MOBILE ADVERTISING Agenda Get off to a mobile start with Media Impact! Why mobile? MI

Quantum Coarse-graining behind black holes Arvin Shahbazi-Moghaddam with Ven Chandrasekaran and

Webinar for NYCHA Residents May 7, 2019 Mar arlon on F For orbes Mar arvin J Jean an-Ja

Mobile Cross-Platform Development from a Progressive Perspective Web App Progressive Web App N

Communis Smart App is an app that is both revolutionary and convenient to use. It is an app that

WI-FI SMART APPLICATION Download the app Use this QR code tho download Wifi Smart app for OS

The Four Steps 1 Solve the problem. 2 Write the app. 3 Compile the app. 4 Run the app. CSE 1020

Todays Agenda: Discuss College Application Process: Common App, Coalition App, School App

Jailbreaking CS 161: Computer Security Prof. David Wagner April 27, 2016 Types of attacks

Code Reusability Tools for Programming Mobile Robots Carle Ct, Dominic Ltourneau,

PerfProbe: A Systematic, Cross-Layer Performance Diagnosis Framework for Mobile Platforms David

Introduction to Mobile Security Testing Approaches and Examples using OWASP MSTG OWASP German

Please remember to mute your speakers. VA Mobile Discussion Series For audio, please dial in

WELCOME How to build Mobile apps for Magento stores Jenny Ta Max Ta Marketing manager CTO

Visual communication in a mobile world N I A L L H A N L O N , C L I E N T S O L U T I O N S

THE YEAR AHEAD Mobile Content and Commerce Trends 2014 Global trade association representing