#RSAC SESSION ID: MLAI-W03
Attacking Machine Learning: On the Security and Privacy of Neural Networks
Nicholas Carlini
Research Scientist, Google Brain
#RSAC
Act I:
On the Security of Neural Networks
#RSAC
Let's play a game
67% it is a Great Dane
#RSAC
83% it is an Old English Sheepdog
#RSAC
78% it is a Greater Swiss Mountain Dog
#RSAC
99.99% it is Guacamole
#RSAC
99.99% it is a Golden Retriever
#RSAC
99.99% it is Guacamole
#RSAC
K Eykholt, I Evtimov, E Fernandes, B Li, A Rahmati, C Xiao, A Prakash, T Kohno, D Song. Robust Physical-World Attacks on Deep Learning Visual Classification. 2017
76% it is a 45 MPH Sign
#RSAC
Adversarial Examples
#RSAC
What do you think this transcribes as?
N Carlini, D Wagner. Audio Adversarial Examples: Targeted Attacks on Speech-to-Text. 2018
#RSAC
"It was the best of times, it was the worst of times, it was the age of wisdom, it was the age of foolishness, it was the epoch of belief, it was the epoch of incredulity"
N Carlini, D Wagner. Audio Adversarial Examples: Targeted Attacks on Speech-to-Text. 2018
#RSAC
N Carlini, P Mishra, T Vaidya, Y Zhang, M Sherr, C Shields, D Wagner, W Zhou. Hidden Voice Commands. 2016
Constructing Adversarial Examples
#RSAC
Classifier confidence as we nudge the input, one random perturbation at a time:
[0.9, 0.1] → [0.9, 0.1] → [0.89, 0.11] → [0.89, 0.11] → [0.89, 0.11] → [0.91, 0.09] → [0.89, 0.11] → [0.48, 0.52]
#RSAC
This does work ... but we have calculus! Rather than guessing random perturbations, we can follow the gradient of the loss with respect to the input.
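Concretely, here is a minimal sketch of the gradient-based construction (the fast gradient sign method of Goodfellow et al. 2015, listed in the references). The toy linear model, shapes, and constants are illustrative stand-ins, not the talk's actual models:

```python
# A minimal FGSM sketch (Goodfellow et al. 2015). The "model" is a toy
# linear classifier; everything here is an illustrative stand-in.
import jax
import jax.numpy as jnp

def loss(params, x, y):
    # Cross-entropy of a single example under a linear model.
    W, b = params
    return -jax.nn.log_softmax(x @ W + b)[y]

key = jax.random.PRNGKey(0)
params = (jax.random.normal(key, (4, 2)), jnp.zeros(2))
x = jnp.array([0.5, -0.2, 0.1, 0.3])   # the input to perturb
y = 0                                   # its true label

# One gradient step on the *input*, not the weights: move uphill on the loss.
eps = 0.1
grad_x = jax.grad(loss, argnums=1)(params, x, y)
x_adv = x + eps * jnp.sign(grad_x)

print(jax.nn.softmax(x @ params[0] + params[1]))      # confidence on the clean input
print(jax.nn.softmax(x_adv @ params[0] + params[1]))  # confidence shifts toward the wrong class
```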
#RSAC
[Figure: the adversarial perturbation added to the input]
#RSAC
What if we don't have direct access to the model?
#RSAC
A Ilyas, L Engstrom, A Athalye, J Lin. Black-box Adversarial Attacks with Limited Queries and Information. 2018
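A sketch of the query-only idea behind that work: estimate the gradient from score queries alone, in the spirit of Ilyas et al.'s NES estimator. Here query_score is a hypothetical stand-in for "send an input to the remote model, read back a confidence score":

```python
# Estimating a gradient from score queries alone (in the spirit of NES,
# Ilyas et al. 2018). No access to the model's internals is assumed.
import numpy as np

def query_score(x):
    # Hypothetical stand-in for a remote model; its true gradient is cos(x).
    return float(np.sum(np.sin(x)))

def estimate_gradient(x, sigma=0.01, n_samples=100):
    # Probe the model at antithetic pairs x +/- sigma*u of random directions u.
    g = np.zeros_like(x)
    for _ in range(n_samples):
        u = np.random.randn(*x.shape)
        g += (query_score(x + sigma * u) - query_score(x - sigma * u)) * u
    return g / (2 * sigma * n_samples)

x = np.zeros(8)
print(estimate_gradient(x))  # approaches cos(0) = 1 in every coordinate
```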
#RSAC
Generating adversarial examples is simple and practical
Defending against Adversarial Examples
#RSAC
Case Study: ICLR 2018 Defenses
A Athalye, N Carlini, D Wagner. Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples. 2018
#RSAC
[Figure: the ICLR 2018 defenses, categorized as out of scope, broken, or correct]
#RSAC
The Last Hope: Adversarial Training
A Madry, A Makelov, L Schmidt, D Tsipras, A Vladu. Towards Deep Learning Models Resistant to Adversarial Attacks. 2018
#RSAC
Caveats
Requires small images (32x32)
Only effective for tiny perturbations
Training is 10-50x slower
And even then, it only works about half of the time
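A minimal sketch of the min-max training loop Madry et al. propose, assuming a toy JAX model; the hyperparameters are illustrative:

```python
# Adversarial training sketch (Madry et al. 2018): the inner maximization finds
# a worst-case perturbation with PGD; the outer minimization trains on it.
# The linear "model" and all hyperparameters are illustrative stand-ins.
import jax
import jax.numpy as jnp

def loss(params, x, y):
    W, b = params
    return -jax.nn.log_softmax(x @ W + b)[y]

def pgd_attack(params, x, y, eps=0.1, step=0.02, iters=10):
    # Ascend the loss on the input, projecting back into the L-infinity ball.
    x_adv = x
    for _ in range(iters):
        g = jax.grad(loss, argnums=1)(params, x_adv, y)
        x_adv = jnp.clip(x_adv + step * jnp.sign(g), x - eps, x + eps)
    return x_adv

def train_step(params, x, y, lr=0.05):
    # Descend the loss at the adversarial point rather than the clean one.
    grads = jax.grad(loss)(params, pgd_attack(params, x, y), y)
    return jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)

key = jax.random.PRNGKey(0)
params = (jax.random.normal(key, (4, 2)), jnp.zeros(2))
params = train_step(params, jnp.array([0.5, -0.2, 0.1, 0.3]), 0)
```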
#RSAC
Current neural networks appear consistently vulnerable to evasion attacks
#RSAC
First reason to not use machine learning: Lack of robustness
Act II:
On the Privacy of Neural Networks
#RSAC
What are the privacy problems? Privacy of what? The training data.
#RSAC
[Image: Obama]
#RSAC
M Fredrikson, S Jha, T Ristenpart. Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures. 2015.
[Figure: the face that model inversion reconstructs for "Person 7"]
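The underlying trick (in the spirit of Fredrikson et al.'s attack on face-recognition models) is gradient ascent on the input: start from a blank image and push it toward maximum confidence for the target identity. A toy sketch; the model, shapes, and step size are stand-ins:

```python
# Model inversion sketch (in the spirit of Fredrikson et al. 2015):
# gradient-ascend an input to maximize the model's confidence in one class,
# recovering a prototypical, training-like example. Toy linear model.
import jax
import jax.numpy as jnp

def confidence(params, x, target):
    W, b = params
    return jax.nn.log_softmax(x @ W + b)[target]

key = jax.random.PRNGKey(0)
params = (jax.random.normal(key, (16, 4)), jnp.zeros(4))
x = jnp.zeros(16)   # start from a blank "image"
target = 2          # the identity to reconstruct (a "Person 7" analogue)

for _ in range(100):
    g = jax.grad(confidence, argnums=1)(params, x, target)
    x = jnp.clip(x + 0.1 * g, 0.0, 1.0)  # stay in a valid pixel range
```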
#RSAC
N Carlini, C Liu, J Kos, Ú Erlingsson, D Song. The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks. 2018
Prompt: "What are you" → Completion: "doing"
#RSAC
N Carlini, C Liu, J Kos, Ú Erlingsson, D Song. The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks. 2018
Prompt: "Nicholas's SSN is" → Completion: "123-45-6789"
#RSAC
Extracting Training Data From Neural Networks
#RSAC
Query the model with every possible candidate and record how likely each one is:
My SSN is 000-00-0000
My SSN is 000-00-0001
My SSN is 000-00-0002
...
My SSN is 123-45-6788
My SSN is 123-45-6789
My SSN is 123-45-6790
...
My SSN is 999-99-9998
My SSN is 999-99-9999
The answer (probably) is: My SSN is 123-45-6789
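A toy sketch of this enumerate-and-rank attack; the character-bigram "model" stands in for a trained neural language model, and the secret, candidate space, and smoothing constant are all illustrative:

```python
# Toy sketch of extraction by enumeration: score every candidate with the
# model's likelihood and keep the best. A character-bigram model stands in
# for a neural language model; the secret and constants are illustrative.
import math
from collections import Counter

training_text = "the quick brown fox. my ssn is 123-45-6789. jumped over"
bigrams = Counter(zip(training_text, training_text[1:]))
unigrams = Counter(training_text)

def log_prob(text):
    # Log-likelihood under the bigram model, add-one smoothed
    # (27 is an arbitrary stand-in for the vocabulary size).
    return sum(math.log((bigrams[(a, b)] + 1) / (unigrams[a] + 27))
               for a, b in zip(text, text[1:]))

# Enumerate the candidate space and rank by likelihood.
candidates = [f"my ssn is 123-45-67{d:02d}" for d in range(100)]
print(max(candidates, key=log_prob))  # the memorized secret scores highest
```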
#RSAC
Testing with Exposure
#RSAC
Choose between ...
Model A: accuracy 96%, high memorization
Model B: accuracy 92%, no memorization
#RSAC
Exposure-based Testing Methodology
N Carlini, C Liu, J Kos, Ú Erlingsson, D Song. The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks. 2018
#RSAC
If a model memorizes completely random canaries, it is probably also memorizing rare secrets in its real training data
#RSAC
= "correct horse battery staple"
#RSAC
= "correct horse battery staple"
#RSAC
Probability that the canary is more likely than another (similar) candidate
#RSAC
[Figure: likelihood of the inserted canary vs. other candidates]
#RSAC
(compare likelihood to other candidates)
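For reference, the Secret Sharer paper makes this precise: with R the space of possible fill-in values, and the rank computed by the model's log-perplexity over all candidates,

```latex
% Exposure of an inserted canary s[r] (The Secret Sharer, 2018).
% High exposure means the model ranks the true canary far above random candidates.
\[
  \mathbf{exposure}_{\theta}(s[r]) \;=\; \log_2 \lvert \mathcal{R} \rvert \;-\; \log_2 \operatorname{rank}_{\theta}(s[r])
\]
```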
#RSAC
Provable Defenses with Differential Privacy
#RSAC
M Abadi, A Chu, I Goodfellow, H B McMahan, I Mironov, K Talwar, L Zhang. Deep Learning with Differential Privacy. 2016
#RSAC
The math may be scary ... applying differential privacy is easy:

# Wrap a standard TensorFlow optimizer so that each training step clips
# per-example gradients and adds noise.
# (Import path for the tensorflow/privacy library; it may vary by version.)
import tensorflow as tf
from tensorflow_privacy.privacy.optimizers import dp_optimizer

dp_optimizer_class = dp_optimizer.make_optimizer_class(
    tf.train.GradientDescentOptimizer)

https://github.com/tensorflow/privacy
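A usage sketch, under the assumption that the library's Gaussian wrapper (make_gaussian_optimizer_class) and its parameter names match the tensorflow/privacy tutorials; the values are illustrative, not recommendations:

```python
# Hedged usage sketch for the tensorflow/privacy DP optimizer wrapper.
import tensorflow as tf
from tensorflow_privacy.privacy.optimizers import dp_optimizer

DPGradientDescent = dp_optimizer.make_gaussian_optimizer_class(
    tf.train.GradientDescentOptimizer)

optimizer = DPGradientDescent(
    l2_norm_clip=1.0,       # clip each per-example gradient to this L2 norm
    noise_multiplier=1.1,   # Gaussian noise scale, relative to the clip norm
    num_microbatches=256,   # per-example clipping granularity within a batch
    learning_rate=0.15)
```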
#RSAC
Exposure confirms differential privacy is effective
#RSAC
Second reason to not use machine learning: Training Data Privacy
Act III:
Conclusions
#RSAC
First reason to not use machine learning: Lack of robustness
#RSAC
Second reason to not use machine learning: Training Data Privacy
#RSAC
When using ML, always investigate potential concerns for both Security and Privacy
#RSAC
Next Steps
On the privacy side ...
Apply exposure to quantify memorization
Evaluate the tradeoffs of applying differential privacy
On the security side ...
Identify where models are assumed to be secure
Generate adversarial examples on these models
Add second factors where necessary
#RSAC
References
I Goodfellow, J Shlens, C Szegedy. Explaining and Harnessing Adversarial Examples. 2015.
N Carlini, D Wagner. Audio Adversarial Examples: Targeted Attacks on Speech-to-Text. 2018.
N Carlini, P Mishra, T Vaidya, Y Zhang, M Sherr, C Shields, D Wagner, W Zhou. Hidden Voice Commands. 2016.
N Carlini, C Liu, J Kos, Ú Erlingsson, D Song. The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks. 2018.
K Eykholt, I Evtimov, E Fernandes, B Li, A Rahmati, C Xiao, A Prakash, T Kohno, D Song. Robust Physical-World Attacks on Deep Learning Visual Classification. 2017.
A Madry, A Makelov, L Schmidt, D Tsipras, A Vladu. Towards Deep Learning Models Resistant to Adversarial Attacks. 2018.
A Ilyas, L Engstrom, A Athalye, J Lin. Black-box Adversarial Attacks with Limited Queries and Information. 2018.
A Athalye, N Carlini, D Wagner. Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples. 2018.
M Fredrikson, S Jha, T Ristenpart. Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures. 2015.
M Abadi, A Chu, I Goodfellow, H B McMahan, I Mironov, K Talwar, L Zhang. Deep Learning with Differential Privacy. 2016.
G Andrew, S Chien, N Papernot. TensorFlow Privacy. https://github.com/tensorflow/privacy. 2018.
#RSAC
Questions?
nicholas@carlini.com https://nicholas.carlini.com/