SLIDE 1

Reinforcing Adversarial Robustness using Model Confidence Induced by Adversarial Training

Xi Wu

xiwu@cs.wisc.edu

Joint work with Uyeong Jang, Jiefeng Chen, Lingjiao Chen, and Somesh Jha

July 19, 2018


SLIDE 2

Entirely wrong behavior of confidence

  • Small perturbations can cause highly confident but wrong predictions.
  • An example from (Goodfellow, Shlens, and Szegedy, ICLR 2015) on a naturally trained neural network: an imperceptibly perturbed panda image is classified as a gibbon with high confidence (a minimal sketch of the attack follows).
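
The perturbation in that example comes from the fast gradient sign method (FGSM) introduced in the same paper. Below is a minimal PyTorch sketch, not the slide's exact setup: the classifier model, the step size eps, and the [0, 1] pixel range are illustrative assumptions.

    import torch
    import torch.nn.functional as F

    def fgsm(model, x, y, eps=0.007):
        # One signed-gradient step that increases the loss (an L_inf attack).
        # `model`, `eps`, and the [0, 1] pixel range are assumptions here.
        x = x.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(x), y)
        loss.backward()
        x_adv = x + eps * x.grad.sign()
        return x_adv.clamp(0.0, 1.0).detach()  # stay in the valid pixel range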


SLIDE 3

A better behavior

  • Low confidence if the model “does not learn/know it.”
  • An intuitively good model for classifying pandas and gibbons (disks give natural data manifolds).


SLIDE 4

Main contributions of this work

  • In a precise formal sense, adversarial training by (Madry et al., ICLR 2018) gives better behavior of model confidence for points near the data distribution.
  • The better behavior of model confidence induced by adversarial training can be used to improve adversarial robustness.


SLIDE 5

Defining good behaviors of confidence (1/2)

Intuition: confident predictions of different classes should be well separated. The slide illustrates a bad (x, y) ∼ D with poor confidence separation.


SLIDE 6

Defining good behaviors of confidence (2/2)

  • D: data-generating distribution; d(·, ·): a distance metric; p, q ∈ [0, 1], δ ≥ 0. Write N(x, δ) = {x′ : d(x, x′) ≤ δ} for the δ-neighborhood of x.
  • Bad event (the neighborhood contains p-confident wrong predictions):

    B = {∃ y′ ≠ y, x′ ∈ N(x, δ) : Fθ(x′)y′ ≥ p}

  • F is said to have (p, q, δ)-separation if Pr_(x,y)∼D[B] ≤ q (an operational sketch follows this list).
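
One way to read the definition operationally: for each test pair (x, y), search N(x, δ) for a point that gets a wrong class at confidence at least p; F has (p, q, δ)-separation if the fraction of pairs where such a point exists is at most q. The sketch below only lower-bounds that fraction via a PGD-style search, and it assumes N(x, δ) is an L_inf ball and a PyTorch model returning logits; all names and hyperparameters are illustrative.

    import torch
    import torch.nn.functional as F

    def bad_event_fires(model, x, y, delta=0.1, p=0.9, steps=40, lr=0.01):
        # Search N(x, delta) (assumed: an L_inf ball) for x' whose softmax
        # confidence on some wrong class y' is >= p, i.e. the bad event B.
        # x: shape (1, ...); y: integer true label. A gradient search can
        # miss bad points, so this only lower-bounds Pr[B].
        x_adv = x.clone().detach()
        for _ in range(steps):
            x_adv.requires_grad_(True)
            probs = F.softmax(model(x_adv), dim=1)
            wrong = probs.clone()
            wrong[0, y] = -1.0                  # mask out the true class
            loss = wrong.max()                  # most confident wrong class
            grad, = torch.autograd.grad(loss, x_adv)
            with torch.no_grad():
                x_adv = x_adv + lr * grad.sign()
                x_adv = x + (x_adv - x).clamp(-delta, delta)  # project back
            x_adv = x_adv.detach()
        with torch.no_grad():
            probs = F.softmax(model(x_adv), dim=1)
            probs[0, y] = -1.0
            return bool(probs.max().item() >= p)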


SLIDE 7

Adversarial Training by Madry et al.

Adversarial training formulation of Madry et al.: minimize ρ(θ), where

    ρ(θ) = E_(x,y)∼D [ max_{∆ ∈ S} L(θ, x + ∆, y) ]

Theorem (Informal, this work)

For a large family of loss functions L, models trained as above achieve good (p, q, δ)-separation, where as p → 1, q → 0.
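
A hedged sketch of the corresponding training loop: the inner maximization over ∆ ∈ S is approximated with projected gradient descent (PGD), with S taken to be an L_inf ball as in Madry et al.'s experiments; the hyperparameters below are illustrative, not the paper's.

    import torch
    import torch.nn.functional as F

    def inner_max(model, x, y, eps=0.3, step=0.01, iters=40):
        # PGD approximation of max_{Delta in S} L(theta, x + Delta, y),
        # with S = {Delta : ||Delta||_inf <= eps} (an assumed instantiation).
        delta = torch.zeros_like(x).uniform_(-eps, eps)  # random start
        for _ in range(iters):
            delta.requires_grad_(True)
            loss = F.cross_entropy(model(x + delta), y)
            grad, = torch.autograd.grad(loss, delta)
            delta = (delta + step * grad.sign()).clamp(-eps, eps).detach()
        return delta

    def adversarial_training_step(model, optimizer, x, y):
        # Outer minimization: one optimizer step on the worst-case loss rho(theta).
        delta = inner_max(model, x, y)
        optimizer.zero_grad()
        loss = F.cross_entropy(model(x + delta), y)
        loss.backward()
        optimizer.step()
        return loss.item()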


SLIDE 8

Empirical results (summary)

  • We generate high-confidence attacks in order to bypass confidence-based defenses (as well as the gradient-masking effect).
  • Finding 1: Confidence of models trained with Madry et al.'s objective behaves much better than that of their naturally trained counterparts.
  • Finding 2: A simple “nearest neighbor search” based on confidence corrects 20%–25% of targeted adversarial examples that fool the baseline model of Madry et al. (see the sketch after this list).
  • Finding 3: For > 98% of test instances, the correct label can be found among the two neighbors with the highest confidence.
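
The talk does not spell out the nearest-neighbor procedure, so the following is only one plausible reading, stated as an assumption: for each candidate class, search a small neighborhood of the input for the point the adversarially trained model is most confident about for that class, then rank classes by the confidence found. Every name and hyperparameter here is hypothetical.

    import torch
    import torch.nn.functional as F

    def best_confidence_nearby(model, x, c, radius=0.1, step=0.01, iters=20):
        # Highest softmax confidence for class c reachable in an L_inf ball
        # around x (the ball is an assumption; the talk leaves d abstract).
        delta = torch.zeros_like(x)
        for _ in range(iters):
            delta.requires_grad_(True)
            conf = F.softmax(model(x + delta), dim=1)[0, c]
            grad, = torch.autograd.grad(conf, delta)
            delta = (delta + step * grad.sign()).clamp(-radius, radius).detach()
        with torch.no_grad():
            return F.softmax(model(x + delta), dim=1)[0, c].item()

    def rank_classes(model, x, num_classes):
        # Rank classes by the best confidence found near x; per Finding 3,
        # the correct label should usually appear among the top two.
        scores = [best_confidence_nearby(model, x, c) for c in range(num_classes)]
        return sorted(range(num_classes), key=lambda c: -scores[c])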


SLIDE 9

Questions?

Please come to our poster session for more details!
