adversarial training and provable defenses bridging the

Adversarial Training and Provable Defenses: Bridging the Gap S 0 - PowerPoint PPT Presentation

Adversarial Training and Provable Defenses: Bridging the Gap S 0 1 1 = 1 2 3 Conv + ReLU Conv + ReLU Linear =


  1. Adversarial Training and Provable Defenses: Bridging the Gap

  2. 𝑀 ∞

  3. 𝑦 S 0 𝑦 𝑙 ∘ β„Ž πœ„ π‘™βˆ’1 ∘ β‹― ∘ β„Ž πœ„ 1 β„Ž πœ„ = β„Ž πœ„ 1 2 3 β„Ž πœ„ β„Ž πœ„ β„Ž πœ„ Conv + ReLU Conv + ReLU Linear β€² = β„Ž πœ„ (𝑦′) 𝑦 β€² ∈ 𝑇 0 (𝑦) β€² β€² 𝑦 1 𝑦 3 𝑦 2

  4. 𝑑 π‘ˆ β„Ž πœ„ 𝑦 β€² + 𝑒 < 0, βˆ€π‘¦ β€² ∈ 𝑇 0 (𝑦) 1 2 3 β„Ž πœ„ β„Ž πœ„ β„Ž πœ„ Conv + ReLU Conv + ReLU Linear β€² = β„Ž πœ„ (𝑦′) 𝑦 β€² ∈ 𝑇 0 (𝑦) β€² β€² 𝑦 1 𝑦 3 𝑦 2

  5. 1 2 3 β„Ž πœ„ β„Ž πœ„ β„Ž πœ„ Conv + ReLU Conv + ReLU Check output condition: Linear β€² + 𝑒 < 0, βˆ€π‘¦ 3 β€² ∈ 𝐷 3 𝑦 𝑑 π‘ˆ 𝑦 3 𝐷 0 𝑦 = 𝑇 0 (𝑦) 𝐷 1 𝑦 𝐷 2 𝑦 𝐷 3 𝑦 Guarantees: 𝑑 π‘ˆ β„Ž πœ„ 𝑦 β€² + 𝑒 < 0, βˆ€π‘¦ β€² ∈ 𝑇 0 (𝑦)

  6. β„’ 𝑦 β€² βˆˆπ‘‡ 0 (𝑦) β„’(β„Ž πœ„ 𝑦 β€² , 𝑧) min πœ„ 𝐹 𝑦,𝑧 ~𝐸 max lower upper

  7. upper β€’ β€’ lower β€’ β€’ β€’ β€’

  8. 1 2 3 β„Ž πœ„ β„Ž πœ„ β„Ž πœ„ β€² β€² β€² 𝑦 1 𝑦 2 𝑦 3 β€² 𝑦 2 β€² 𝑦 3 β€² 𝑦 1 𝐷 0 𝑦 = 𝑇 0 (𝑦) 𝐷 1 𝑦 𝐷 2 𝑦 𝐷 3 𝑦 β€² + 𝑒 < 0 β†’ certification fails 𝑑 π‘ˆ 𝑦 3

  9. 𝑇 0 (𝑦) 𝐷 1 𝑦 , 𝐷 2 𝑦 , 𝐷 3 (𝑦)

  10. 2 1 3 β„Ž πœ„ β„Ž πœ„ β„Ž πœ„ Conv + ReLU Conv + ReLU β€² β€² 𝑦 2 𝑦 1 Linear β€² , 𝑧) β„’(𝑦 3 β€² , 𝑧) 𝛼 πœ„ β„’(𝑦 3 β€² 𝑦 2 β€² 𝑦 3 β€² 𝑦 1 𝐷 0 𝑦 = 𝑇 0 (𝑦) 𝐷 1 𝑦 𝐷 2 𝑦 𝐷 3 𝑦

  11. projection

  12. 𝐷 π‘š 𝑦 = 𝑏 π‘š + 𝐡 π‘š 𝑓 𝑓 ∈ βˆ’1, 1 𝑛 π‘š 𝑏 π‘š 𝐡 π‘š 𝑀 ∞ πœ— 𝑏 0 = 𝑦 𝐡 0 = πœ—π½

  13. Key idea 𝑦 β€² = 𝑏 π‘š + 𝐡 π‘š 𝑓 β€² 𝑦 1 𝑓 1 β€² 𝑓 2 𝑦 2 β€² ≔ 2𝑓 1 βˆ’ 𝑓 2 𝑦 1 β€² ≔ 𝑓 1 + 𝑓 2 𝑦 2

  14. Method Accuracy (%) Certified Robustness (%)

  15. Method Accuracy (%) Certified Robustness (%)

Recommend


More recommend