Stealing Machine Learning Models via Prediction APIs Florian Tramr, - PowerPoint PPT Presentation

Stealing Machine Learning Models via Prediction APIs Florian Tramèr, Fan Zhang, Ari Juels, Michael K. Reiter, Thomas Ristenpart Usenix Security Symposium Austin, Texas, USA August, 11 th 2016

Machine Learning (ML) Systems (1) Gather labeled data x (1) , y (1) x (2) , y (2) … Bob Tim Jake Data Dependent variable y n-dimensional feature vector x Training (2) Train ML model f from data Model f f ( x ) = y Prediction Confidence Bob y = Ti x = m Jake (3) Use f in some application or publish it Application for others to use Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 2

Machine Learning as a Service (MLaaS) Goal 2: Model Confidentiality Model/Data Monetization • Sensitive Data • Model f Prediction API Training API input classification Data Black Box Goal 1: Rich Prediction APIs $$$ per query • Highly Available High-Precision Results • Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 3

Machine Learning as a Service (MLaaS) Service Model types Amazon Logistic regressions Google ??? (announced: logistic regressions, decision trees, neural networks, SVMs) Microsoft Logistic regressions, decision trees, neural networks, SVMs PredictionIO Logistic regressions, decision trees, SVMs (white-box) BigML Logistic regressions, decision trees Sell Datasets – Models – Prediction Queries to other users $$$ $$$ Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 4

Model Extraction Attacks Goal: Adversarial client learns close approximation of f using as few queries as possible Target: f(x) = f’(x) on ≥ 99.9% of inputs Data Attack x Model f f’ f(x) Applications: 1) Undermine pay-for-prediction pricing model 2) Facilitate privacy attacks ( 3) Stepping stone to model-evasion [Lowd, Meek – 2005] [Srndic, Laskov – 2014] Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 5

Model Extraction Attacks (Prior Work) Goal: Adversarial client learns close approximationof f using as few queries as possible Data Attack x Model f f’ f(x) Isn’t this “just Machine Learning”? No! Prediction APIs return more information than assumed in prior work and “traditional” ML If f(x) is just a class label: learning with membership queries - Boolean decision trees [Kushilevitz, Mansour – 1993] - Linear models (e.g., binary regression) [Lowd, Meek – 2005] Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 6

Main Results f’(x) = f(x) on 100% of inputs 100s-1000’s of online queries Data Attack x Model f f’ f(x) x f’(x) Logistic Regressions, Neural • Networks, Decision Trees, SVMs Inversion Reverse-engineer model type • Attack & features Improved Model-Inversion Attacks [Fredrikson et al. 2015] Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 7

Model Extraction Example: Logistic Regression Task: Facial Recognition of two people (binary classification) n+1 parameters w,b chosen Alice Data Model f using training set to minimize expected error Bob f (x) = 1 / (1+e -(w*x + b) ) Feature vectors are pixel data f maps features to predicted e.g., n = 92 * 112 = 10,304 probability of being “Alice” ≤ 0.5 classify as “Bob” > 0.5 classify as “Alice” Generalize to c > 2 classes with multinomial logistic regression f(x) = [p 1 , p 2 , …, p c ] predict label as argmax i p i Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 8

Model Extraction Example: Logistic Regression Goal: Adversarial client learns close approximation of f using as few queries as possible f(x) = f’(x) on 100% of inputs Alice Data x Attack Model f f’ f(x) Bob f (x) = 1 / (1+e -(w*x + b) ) f (x) ( ) ln = w*x + b Linear equation in 1 - f(x) n+1 unknowns w,b Query n+1 random points ⇒ solve a linear system of n+1 equations Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 9

Generic Equation-Solving Attacks random inputs X MLaaS Service outputs Y confidence values [ f 1 ( x ) , f 2 ( x ) , . . . , f c ( x )] ∈ [0 , 1] c Model f has k parameters W • Solve non-linear equation system in the weights W - Optimization problem + gradient descent f’ - “ Noiseless Machine Learning” f • Multinomial Regressions & Deep Neural Networks: - >99.9% agreement between f and f’ - ≈ 1 query per model parameter of f - 100s - 1,000s of queries / seconds to minutes Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 10

MLaaS: A Closer Look Feature Extraction: (automated and partially documented) Model f Prediction API Training API x f(x) Data - Class labels and confidence scores ML Model Type Selection: - Support for partial inputs logisticor linear regression Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 11

Online Attack: AWS Machine Learning Feature Extraction: input prediction Model Choice: Quantile Binning + One- Logistic Regression Hot-Encoding Reverse-engineered with partial “Extract-and-test” queries and confidence scores Model Online Queries Time (s) Price ($) Handwritten Digits 650 70 0.07 Adult Census 1,485 149 0.15 Extracted model f’ agrees with f on 100% of tested inputs Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 12

Application: Model-Inversion Attacks Infer training data from trained models [Fredrikson et al. – 2015] Training samples Attack recovers image White-Box Attack of 40 individuals of one individual x x Data Inversion Multinomial Extraction f’ Attack LR Model f Attack f’(x) f(x) f(x) = f’(x) for >99.9% of inputs Strategy Attack against 1 individual Attack against all 40 individuals Online Queries Attack Time Online Queries Attack Time Black-Box Inversion 20,600 24 min 800,000 16 hours ×40 [Fredrikson et al.] Extract-and-Invert 41,000 10 hours 41,000 10 hours ×1 (our work) Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 13

Extracting a Decision Tree x Confidence value derived from class distribution in the training set Kushilevitz-Mansour (1992) Poly-time algorithm with membership queries only • Only for Boolean trees, impractical complexity • (Ab)using Confidence Values • Assumption:all tree leaves have unique confidence values • Reconstruct tree decisions with “differential testing” • Online attacks on BigML Different leaves are reached Inputs x and x’ differ x x’ in a single feature ó Tree “splits” on this feature v v’ Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 14

Countermeasures How to prevent extraction? API Minimization f ( x ) = y Prediction = class label only • Prediction Learning with Membership • Queries Confidence Attack on Linear Classifiers [Lowd,Meek– 2005] n+1 parameters w,b classify as “ +” if w*x + b > 0 f(x) = sign(w*x + b) and “ -” otherwise 1. Find points on decision boundary (w*x + b = 0) decision boundary - Find a “ + ” and a “ - ” - Line search between the two points 2. Reconstruct w and b (up to scaling factor) Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 15

Generic Model Retraining Attacks Extend the Lowd-Meek approach to non-linear models • Active Learning: • - Query points close to “decision boundary” - Update f’ to fit these points Multinomial Regressions, Neural Networks, SVMs: • - >99% agreement between f and f’ - ≈ 100 queries per model parameter of f ≈ 100× less efficient than equation-solving query more points here Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 16

Conclusion Rich prediction APIs Model & data confidentiality Efficient Model-Extraction Attacks Logistic Regressions, Neural Networks, Decision Trees, SVMs • Reverse-engineering of model type, feature extractors • Active learning attacks in membership-query setting • Applications Sidestep model monetization • • Boost other attacks: privacy breaches, model evasion Thanks! Find out more: https://github.com/ftramer/Steal-ML Stealing Machine Learning Models via Prediction APIs Usenix Security’16 August 11 th , 2016 17

Stealing Machine Learning Models via Prediction APIs Florian Tramr, - PowerPoint PPT Presentation

Stealing Machine Learning Models via Prediction APIs Florian Tramr, Fan Zhang, Ari Juels, Michael K. Reiter, Thomas Ristenpart Usenix Security Symposium Austin, Texas, USA August, 11 th 2016 Machine Learning (ML) Systems (1) Gather labeled

WORK STEALING SCHEDULER 2 6/16/2010 Work Stealing Scheduler

History and Biology Thursday, April 3, 14 Apis Cerana Apis Cerana Thursday, April 3, 14 Apis

Protecting the intellectual property of machine learning models: Prediction APIs, model

Analysis of Security APIs (ASA-2) June 26, 2008 Minimizing Threats from Flawed Security APIs:

The current state of banking APIs Open APIs are high priority for financial institutions What are

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

The medium to enable AllJoyn applications to communicate via published APIs

CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples Honggang Yu 1 ,

Molecular Spectroscopy 3 Christian Hill Joint ICTP-IAEA School on Atomic and Molecular

Wring an LLVM Pass: 101 LLVM 2019 tutorial Andrzej Warzyski arm October 2019 Andrzejs

Concepts Introduced in Chapter 4 Grammars Context-Free Grammars Derivations and Parse

http://lcs.ios.ac.cn/znj/DM2017 Naijun Zhan March 19, 2017 1 Special thanks to Profs Hanpin

Logic Programming and Logarithmic Space Clment Aubert , Marc Bagnol , Paolo Pistone ,

A Gradual Typing Poem Sam Tobin-Hochstadt & Robby Finder 1 The Problem Write a function

PHILOSOPHY 2018-2019 JELLE DE BOER Lecture 1 This lecture, today Practical matters

Emphasis & Focal Point ATTRACTING ATTENTION How does a designer catch a viewers

Stealing Machine Learning Models via Prediction APIs Florian Tramr, - PowerPoint PPT Presentation

Stealing Machine Learning Models via Prediction APIs Florian Tramr, Fan Zhang, Ari Juels, Michael K. Reiter, Thomas Ristenpart Usenix Security Symposium Austin, Texas, USA August, 11 th 2016 Machine Learning (ML) Systems (1) Gather labeled

WORK STEALING SCHEDULER 2 6/16/2010 Work Stealing Scheduler

History and Biology Thursday, April 3, 14 Apis Cerana Apis Cerana Thursday, April 3, 14 Apis

Protecting the intellectual property of machine learning models: Prediction APIs, model

Analysis of Security APIs (ASA-2) June 26, 2008 Minimizing Threats from Flawed Security APIs:

The current state of banking APIs Open APIs are high priority for financial institutions What are

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

The medium to enable AllJoyn applications to communicate via published APIs

CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples Honggang Yu 1 ,

Molecular Spectroscopy 3 Christian Hill Joint ICTP-IAEA School on Atomic and Molecular

Wring an LLVM Pass: 101 LLVM 2019 tutorial Andrzej Warzyski arm October 2019 Andrzejs

Concepts Introduced in Chapter 4 Grammars Context-Free Grammars Derivations and Parse

http://lcs.ios.ac.cn/znj/DM2017 Naijun Zhan March 19, 2017 1 Special thanks to Profs Hanpin

Logic Programming and Logarithmic Space Clment Aubert , Marc Bagnol , Paolo Pistone ,

A Gradual Typing Poem Sam Tobin-Hochstadt &amp; Robby Finder 1 The Problem Write a function

PHILOSOPHY 2018-2019 JELLE DE BOER Lecture 1 This lecture, today Practical matters

Emphasis &amp; Focal Point ATTRACTING ATTENTION How does a designer catch a viewers

A Gradual Typing Poem Sam Tobin-Hochstadt & Robby Finder 1 The Problem Write a function

Emphasis & Focal Point ATTRACTING ATTENTION How does a designer catch a viewers