Making and Measuring Progress in Adversarial Machine Learning
Nicholas Carlini
Google Research
Act I: Background. Why should we care about adversarial examples?
Act II: An Apparent Problem
Make ML robust. Make ML better.
[Figures: state-of-the-art adversarial examples, 2013, 2014, 2017]
Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness (Jacobsen et al., 2019)
A Brief History of Defenses
Going through the motions is not enough to do a proper security evaluation.
An all too common paper:
Defenses that are broken by existing attacks.
Defenses that are broken by new attacks.
Exciting new directions
Advice for performing evaluations
Perform Adaptive Attacks
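An adaptive attack targets the full defended system, not just the undefended classifier. The toy sketch below is illustrative only, not from the talk: the linear model `w`, the norm-threshold `detector_flags` defense, and the attack loop are all hypothetical stand-ins. The attacker's update both flips the prediction and projects back inside the detector's accept region, so the resulting example evades the defense too.

```python
import numpy as np

# Hypothetical toy linear classifier: predicts sign(w @ x).
w = np.array([1.0, -2.0, 0.5])

def classify(x):
    return 1 if w @ x > 0 else -1

# Hypothetical "defense": flag inputs whose norm exceeds a threshold.
def detector_flags(x, threshold=3.0):
    return float(np.linalg.norm(x)) > threshold

def adaptive_attack(x, steps=50, lr=0.2, threshold=3.0):
    x = x.astype(float).copy()
    target = -classify(x)              # try to flip the original prediction
    for _ in range(steps):
        x += lr * target * w           # gradient of (w @ x) w.r.t. x is w
        n = float(np.linalg.norm(x))
        if n > threshold:              # stay inside the detector's accept region
            x *= threshold / n
        if classify(x) == target and not detector_flags(x, threshold):
            break
    return x

x0 = np.array([1.0, 0.0, 0.0])         # classified +1 and accepted
adv = adaptive_attack(x0)
```

A non-adaptive attack would omit the projection step and could be trivially caught by the detector; folding the defense into the attack loop is the whole point.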
Ensure correct implementations
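A common implementation failure is an attack built on an incorrect analytic gradient, which silently underperforms and makes a defense look stronger than it is. One standard check (a generic sketch, not a method from the talk; `loss` and `buggy_grad` are invented examples) is to compare the analytic gradient against a finite-difference estimate:

```python
import numpy as np

def loss(x):
    return float(np.sum(x ** 2))

def buggy_grad(x):
    return x                 # bug: the correct gradient of sum(x^2) is 2*x

# Central finite-difference check: a large discrepancy between the
# numerical and analytic gradients exposes the broken implementation.
def grad_check(f, g, x, eps=1e-5):
    num = np.zeros_like(x)
    for i in range(x.size):
        d = np.zeros_like(x)
        d[i] = eps
        num[i] = (f(x + d) - f(x - d)) / (2 * eps)
    return float(np.max(np.abs(num - g(x))))

x = np.array([1.0, -2.0, 3.0])
err = grad_check(loss, buggy_grad, x)   # large error flags the bug
```

Running the same check with the correct gradient (`lambda x: 2 * x`) drives the error to near machine precision, which is the pass condition.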
Use meaningful threat models
Compute Worst-Case Robustness
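Robustness is a worst-case quantity: report the strongest attack found over many attempts, not the average. A minimal sketch of that idea, using a hypothetical linear model and projected gradient descent with random restarts (all names and parameters here are illustrative, not from the talk):

```python
import numpy as np

rng = np.random.default_rng(0)
w = np.array([0.5, -1.0, 2.0])

def margin(x):
    return float(w @ x)      # the example stays "robust" while margin > 0

# PGD inside an L-infinity ball of radius eps, with random restarts.
# Robustness is the WORST margin over all restarts, never the mean.
def pgd_worst_case(x0, eps=0.5, steps=20, lr=0.1, restarts=5):
    worst = margin(x0)
    for _ in range(restarts):
        x = x0 + rng.uniform(-eps, eps, size=x0.shape)   # random start
        for _ in range(steps):
            x -= lr * w                            # descend the margin
            x = np.clip(x, x0 - eps, x0 + eps)     # project into the ball
        worst = min(worst, margin(x))
    return worst

x0 = np.array([1.0, 1.0, 1.0])   # margin(x0) = 1.5, robust without attack
worst = pgd_worst_case(x0)       # worst-case margin turns negative
```

For this linear toy the attainable minimum is `margin(x0) - eps * ||w||_1`; a single restart that stops early would overstate robustness, which is exactly the reporting error the advice warns against.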
Compare to Prior Work
Sanity-Check Conclusions
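One cheap sanity check: with an unbounded perturbation budget, any classifier should be fully broken, so accuracy under attack must fall to zero as the budget grows. The sketch below (a toy setup invented for illustration; the two data points, `w`, and the closed-form attack are all hypothetical) encodes that check:

```python
import numpy as np

w = np.array([1.0, -1.0])

def correct(x, y):                       # toy classifier, labels y in {-1, +1}
    return (1 if w @ x > 0 else -1) == y

def attack(x, y, eps):                   # optimal L-inf attack on a linear model
    return x - y * eps * np.sign(w)

points = [(np.array([2.0, -1.0]), 1), (np.array([-3.0, 0.5]), -1)]

def robust_accuracy(eps):
    return float(np.mean([correct(attack(x, y, eps), y) for x, y in points]))

# Sanity check: accuracy under attack should be non-increasing in eps
# and must reach zero once the budget is effectively unbounded.
accs = [robust_accuracy(e) for e in (0.0, 1.0, 100.0)]
```

If the curve plateaus above zero at huge budgets, the attack (or its implementation) is broken, and any robustness conclusion drawn from it is suspect.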
Making errors in defense evaluations is okay. Making errors in attack evaluations is not.
Breaking a defense is useful... teaching a lesson is better.
Exciting new directions
Research new topics. Do good science. Progress is learning.
nicholas@carlini.com https://nicholas.carlini.com
Biggio et al. Evasion Attacks against Machine Learning at Test Time. https://arxiv.org/abs/1708.06131
Jacobsen et al. Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness. https://arxiv.org/abs/1903.10484
Carlini et al. On Evaluating Adversarial Robustness. https://arxiv.org/abs/1902.06705
Chou et al. SentiNet: Detecting Physical Attacks Against Deep Learning Systems. https://arxiv.org/abs/1812.00292
Shumailov et al. Sitatapatra: Blocking the Transfer of Adversarial Samples. https://arxiv.org/abs/1901.08121
Ilyas et al. Adversarial Examples Are Not Bugs, They Are Features. https://arxiv.org/abs/1905.02175
Brendel et al. Decision-Based Adversarial Attacks: Reliable Attacks Against Black-Box Machine Learning. https://arxiv.org/abs/1712.04248
Wong et al. Wasserstein Adversarial Examples via Projected Sinkhorn Iterations. https://arxiv.org/abs/1902.07906