Adversarial Examples are a Natural Consequence of Test Error in Noise

Aug 14, 2022

SLIDE 1

Confidential + Proprietary

Adversarial Examples are a Natural Consequence of Test Error in Noise

Nic Ford*, Justin Gilmer*, Nicholas Carlini, Dogus Cubuk

*equal contribution

SLIDE 2

Robust (out of distribution) Generalization

Train on p(x) Test on q(x)

SLIDE 3

Gaussian noise

50% top-1 acc 14% top-1 acc
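
A minimal sketch of how accuracy under additive Gaussian noise can be measured. Everything named here (`accuracy_under_noise`, `toy_classifier`, the toy data, and the noise level) is a hypothetical stand-in and not the model or dataset behind the numbers on this slide.

```python
import random

def accuracy_under_noise(classify, inputs, labels, sigma, seed=0):
    """Fraction of inputs still classified correctly after adding
    i.i.d. N(0, sigma^2) noise to every coordinate."""
    rng = random.Random(seed)
    correct = 0
    for x, y in zip(inputs, labels):
        noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
        correct += int(classify(noisy) == y)
    return correct / len(inputs)

# Hypothetical toy model: predicts class 1 when the first coordinate is positive.
def toy_classifier(x):
    return 1 if x[0] > 0 else 0

# Toy data: the first coordinate is +1 or -1 and the label is its sign.
inputs = [[1.0 if i % 2 == 0 else -1.0, 0.0] for i in range(200)]
labels = [1 if x[0] > 0 else 0 for x in inputs]

clean_acc = accuracy_under_noise(toy_classifier, inputs, labels, sigma=0.0)
noisy_acc = accuracy_under_noise(toy_classifier, inputs, labels, sigma=2.0)
print(clean_acc, noisy_acc)
```

The same loop applies to an image classifier if `classify` wraps the model's top-1 prediction and `sigma` is expressed in the model's pixel scale.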

SLIDE 4

Corruption Robustness

[Hendrycks et al.] https://arxiv.org/pdf/1807.01697.pdf

Goal: Measure and improve model robustness to distributional shift.

See also:
[Mu, Gilmer] "MNIST-C" https://arxiv.org/abs/1906.02337
[Pei et al.] https://arxiv.org/pdf/1712.01785.pdf

SLIDE 5

Adversarial Examples - The "Surprising" Phenomenon

x x_adv

In 2013 it was discovered that neural networks have “adversarial examples”.
2000+ papers written on this topic.

[Goodfellow et al.]

SLIDE 6

Adversarial Examples - The Phenomenon

Why do our models have adversarial examples?

SLIDE 7

Adversarial Examples - The Phenomenon

Why do our models have adversarial examples?
A: ???

SLIDE 8

Adversarial Examples - The Phenomenon

Why do our models have adversarial examples?
A: ???

What are adversarial examples?

SLIDE 9

Adversarial Examples - The Phenomenon

Why do our models have adversarial examples?
A: ???

What are adversarial examples?
A: The nearest error

SLIDE 10

Adversarial Examples - The Phenomenon

Why do our models have adversarial examples?
A: ???

What are adversarial examples?
A: The nearest error

SLIDE 11

Adversarial Examples - The Phenomenon

Why do our models have (o.o.d) test error?
A: ???

What are adversarial examples?
A: The nearest error

SLIDE 12

Adversarial Examples - The Phenomenon

Why do our models have (o.o.d) test error?
A: ???

What are adversarial examples?
A: The nearest error

Test error > 0 (iid, ood) -> errors exist -> there is a nearest error
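
One way to write this chain out, using notation assumed here and not taken from the slides: let $E$ be the set of inputs the model misclassifies, $q$ the test distribution (i.i.d. or o.o.d.), and $d(x)$ the distance from a clean input $x$ to $E$ in some chosen norm.

```latex
\[
  \Pr_{x' \sim q}\bigl[x' \in E\bigr] > 0
  \;\Longrightarrow\;
  E \neq \emptyset
  \;\Longrightarrow\;
  d(x) = \inf_{e \in E} \lVert x - e \rVert < \infty .
\]
```

Read this way, an adversarial example for $x$ is an error at or near distance $d(x)$. The remaining question is how small $d(x)$ is relative to the measured error rate, not whether it exists.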

SLIDE 13

Linear Assumption

1% error rate on random perturbations of norm 79 => adversarial example at norm 0.5
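
The arithmetic behind this claim can be sketched as follows. It assumes the error set is a half-space (the linear assumption), that the random perturbations are i.i.d. Gaussian with per-coordinate standard deviation sigma, and that the input dimension is 224 x 224 x 3. The slide does not state the dimension, so that value is an assumption. Under these assumptions the error rate is Phi(-d / sigma), which gives d = -sigma * Phi^{-1}(error rate), and a Gaussian perturbation has norm close to sigma * sqrt(dim). The function name is hypothetical.

```python
import math
from statistics import NormalDist

def nearest_error_distance(noise_norm, error_rate, dim):
    """Distance to the nearest error for a half-space error set, given the
    error rate under Gaussian noise whose typical L2 norm is noise_norm."""
    # Typical norm of N(0, sigma^2 I) noise in `dim` dimensions is sigma * sqrt(dim).
    sigma = noise_norm / math.sqrt(dim)
    # For a half-space at distance d: error_rate = Phi(-d / sigma).
    return -sigma * NormalDist().inv_cdf(error_rate)

dim = 224 * 224 * 3  # assumed image size, not stated on the slide
d = nearest_error_distance(noise_norm=79.0, error_rate=0.01, dim=dim)
print(round(d, 2))  # about 0.47, in line with the slide's rounded 0.5
```

The gap between 79 and roughly 0.5 comes from the high dimension: random noise spreads its norm over all coordinates, while the nearest error only needs to move along the single direction normal to the decision boundary.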

SLIDE 14

Adversarial Defenses

SLIDE 15

Adversarial Defenses

Not a useful measure of robustness

SLIDE 16

Conclusion

It is not surprising that models have a nearest error.

The nearest error is not unusually close given measured o.o.d robustness.

The robustness problem is much broader than tiny perturbations.

If a method doesn't improve o.o.d robustness, is it more secure?