What's behind this model? Fernando Martínez-Plumed, Raül Fabra, Cèsar Ferri, José Hernández-Orallo, Mª José Ramírez Quintana - PowerPoint PPT Presentation


SLIDE 1

What's behind this model?

Fernando Martínez-Plumed, Raül Fabra, Cèsar Ferri, José Hernández-Orallo, Mª José Ramírez Quintana

SLIDE 2

Context: Security Issues and Machine Learning

  • Machine learning is being increasingly used in confidential and security-sensitive applications (such as spam filtering, fraud detection, malware classification, network anomaly detection):
    • models are being deployed with publicly accessible query interfaces.
    • it is assumed that data can be actively manipulated by an intelligent, adaptive adversary.

An adversary that can learn the model can also often evade detection.

SLIDE 3

Adversarial Learning

  • The adversary knows the model type (logistic regression, decision tree, etc.):
  • Model extraction: The adversary's goal is to extract an equivalent or near-equivalent ML model. For instance, if f(x) is just a class label:
    • Traditional learning-theory settings with membership queries

SLIDE 4

Adversarial Learning

Membership queries (to find points close to f ’s decision boundary)

Black-box oracle access with membership queries that return just the predicted class label.

Idea: sample m points, query the oracle, and train a model f' on these labelled samples.
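The sample-query-retrain idea above can be sketched as follows. The `oracle` here is a hypothetical stand-in for the deployed target model (a fixed linear rule, invisible to the attacker), and the surrogate choice (logistic regression) is an assumption for illustration, not the slides' prescription:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def oracle(X):
    # Hypothetical target model behind the query interface.
    # A real adversary only sees the returned labels, not this rule.
    return (X[:, 0] + 2 * X[:, 1] > 1).astype(int)

# 1. Sample m query points from the input space.
m = 500
X_query = rng.uniform(-3, 3, size=(m, 2))

# 2. Query the black-box oracle: only class labels come back.
y_query = oracle(X_query)

# 3. Train the surrogate f' on the (query, label) pairs.
f_prime = LogisticRegression().fit(X_query, y_query)

# 4. Measure agreement between f' and the oracle on fresh points.
X_test = rng.uniform(-3, 3, size=(1000, 2))
agreement = (f_prime.predict(X_test) == oracle(X_test)).mean()
print(f"agreement: {agreement:.2f}")
```

With only label access and enough uniform queries, the surrogate closely matches the target on most of the input space, which is exactly why deployed query interfaces leak the model.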

SLIDE 5

Adversarial Learning

  • Examples of attack techniques based on different families of learning techniques:
  • SVMs:
    • Biggio, B., Corona, I., Nelson, B., Rubinstein, B. I., Maiorca, D., Fumera, G., ... & Roli, F. (2014). "Security evaluation of support vector machines in adversarial environments." In Support Vector Machines Applications (pp. 105-153). Springer International Publishing.
  • DTs and ensembles of DTs:
    • Cui, Zhicheng, et al. (2015). "Optimal action extraction for random forests and boosted trees." Proceedings of the 21th ACM SIGKDD International Conf. on Knowledge Discovery and Data Mining. ACM.
  • Deep neural networks:
    • Wang, Qinglong, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, and C. Lee Giles. "Learning Adversary-Resistant Deep Neural Networks." arXiv:1612.01401.
    • Radford, Alec, Luke Metz, and Soumith Chintala (2015). "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv preprint arXiv:1511.06434.

SLIDE 6

Detecting the ML family

The adversary does NOT KNOW the ML model type.

  • Model characteristics extraction: The adversary's goal is to extract the type of ML model used, as well as its intrinsic characteristics, so that they can evade it or exploit its weaknesses, vulnerabilities or gaps.

SLIDE 7

Detecting the ML family

Model Characteristics extraction:

  • Machine learning family -> decision boundary layouts
  • Feature space significance -> which input attributes are more important:
    • varying the output (more discriminating)
    • requiring more magnitude or range (difficulty)
  • Attribute transformations -> and their effect on the boundaries and the model family.

We plan to start with a small set of ML families (decision trees, sets of rules, linear discriminants).
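A minimal sketch of how a decision-boundary-layout meta-feature could tell two of these families apart (the specific meta-feature and all names here are our own illustrative assumptions, not the authors' method): tree boundaries are axis-parallel, so scanning a probe grid row by row, the column where the predicted class flips takes only a few distinct values, while a tilted linear-discriminant boundary shifts that column on almost every row:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(400, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)  # tilted ground-truth boundary

# Two black-box oracles from different ML families, same task.
tree_oracle = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
linear_oracle = LogisticRegression().fit(X, y)

def distinct_flip_columns(model, n=50):
    """Meta-feature: number of distinct grid columns at which the
    predicted class first flips, scanning left-to-right along each
    grid row. Axis-parallel (tree-like) boundaries reuse the same
    few columns; a tilted linear boundary moves the flip every row."""
    xs = np.linspace(-1, 1, n)
    flips = set()
    for yv in xs:
        row = np.column_stack([xs, np.full(n, yv)])
        labels = model.predict(row)
        change = np.flatnonzero(np.diff(labels))
        if change.size:
            flips.add(int(change[0]))
    return len(flips)

meta_tree = distinct_flip_columns(tree_oracle)
meta_linear = distinct_flip_columns(linear_oracle)
print(meta_tree, meta_linear)
```

Feeding such boundary-layout meta-features to a classifier over families is one concrete way the "machine learning family -> decision boundary layouts" arrow could be realised.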

SLIDE 8

Detecting the ML family

[Pipeline diagram] The original dataset trains one of several ML algorithms (decision stump, decision tree, logistic regression, ...) to act as oracle models; query strategies (uniform, optimum size, Papernot) generate artificial mimetic datasets D1 Mimetic, D2 Mimetic, ..., DN Mimetic; mimetic classifiers (decision trees) M1, M2, ..., MN are learned on them; meta-features are extracted from the mimetic datasets (ß1, ß2, …, ßY) and from the mimetic models (λ1, λ2, …, λX), and fed to meta-learning for algorithm identification, followed by oracle/mimetic comparison, algorithm recommendation and evaluation.
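One pass of the pipeline above can be sketched as follows; the choice of oracle, the uniform query strategy's range, and the particular λ meta-features (tree depth, leaf count, fidelity) are illustrative assumptions standing in for the diagram's unspecified details:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(2)

# Oracle model: stand-in for the deployed classifier under inspection.
X_train = rng.normal(size=(300, 4))
y_train = (X_train @ np.array([1.0, -2.0, 0.5, 0.0]) > 0).astype(int)
oracle_model = LogisticRegression().fit(X_train, y_train)

# Uniform query strategy -> artificial mimetic dataset labelled by the oracle.
X_mimetic = rng.uniform(-3, 3, size=(2000, 4))
y_mimetic = oracle_model.predict(X_mimetic)

# Mimetic classifier: a decision tree trained to imitate the oracle.
mimetic = DecisionTreeClassifier(max_depth=8, random_state=0).fit(
    X_mimetic, y_mimetic
)

# λ meta-features extracted from the mimetic model, as input to the
# meta-learner that identifies the oracle's ML family.
lambdas = {
    "depth": mimetic.get_depth(),
    "n_leaves": mimetic.get_n_leaves(),
    "fidelity": (mimetic.predict(X_mimetic) == y_mimetic).mean(),
}
print(lambdas)
```

Repeating this for each oracle algorithm and each query strategy yields the table of (ß, λ) meta-feature vectors on which the algorithm-identification meta-learner is trained.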

SLIDE 9

Mimetic Models

SLIDE 10

Mimetic Models

SLIDE 11

Mimetic Models

SLIDE 12

Mimetic Models

SLIDE 13

Inspecting IP models

Model Characteristics extraction:

  • Are there relational patterns? -> X1 == X2
  • Is the model recursive? -> Exploiting recursive patterns can be a source of security issues
  • Attribute transformations -> Are complex features addressed by propositionalisation?

SLIDE 14

Any idea, collaboration, … will be welcome.