does data augmentation lead to positive margin
play

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling - PowerPoint PPT Presentation

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling Loh Shashank Rajput* Zhili Feng* Zachary Charles Papailiopoulos * Equal Contribution Data Augmentation (DA) DA means increasing the training set artificially. Used to


  1. Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling Loh Shashank Rajput* Zhili Feng* Zachary Charles Papailiopoulos * Equal Contribution

  2. Data Augmentation (DA) • DA means increasing the training set artificially. • Used to train state of the art deep models. Rotations, crops Noise

  3. Why use Data Augmentation (DA)? Aim: Build a model that is robust to slight perturbations of input Idea: Train on perturbed versions of the inputs! Works in practice! But can we prove it?

  4. Setup Learning DA S' w' S • What margin does w’ achieve with Augmented Model Training respect to S ? Dataset Set

  5. Setup Learning DA S' w' S • What margin does w’ achieve? Augmented Model Training Dataset Set Blackbox learner – Outputs ANY classifier that fits the training set No DA • Enforces no margin è Not robust

  6. Setup Learning DA S' w' S • What margin does w’ achieve? Augmented Model Training Dataset Set Blackbox learner – Outputs ANY classifier that fits the training set No DA With DA • Enforces no margin è Not robust • Enforces some margin è Robust

  7. Can we use DA to enforce margin?

  8. Can we use DA to enforce margin? Idea: Create an ε-net of DA points. Problem: ε-net requires exponentially many points

  9. What is the minimum number of points we need? Class 1 Class 2 Theorem : d+1 points necessary and sufficient to get max - margin .

  10. What is the minimum number of points we need? Class 1 Class 2 Theorem : d+1 points necessary and sufficient to get max - margin . Caveat: You need to know the max margin classifier – Beats the purpose!

  11. Random DA: Points on the sphere δ δ • What should the radius δ be? • How many DA points?

  12. Random DA: Points on the sphere Max margin = ! * Margin Achieved δ = " ( ! *) " ( 2 % ) #DA Points

  13. Random DA: Points on the sphere Max margin = ! * $ ( ! * √# ) Margin Achieved δ = " ( ! *) δ = " " ( 2 ( ) " ( poly ( # )) #DA Points

  14. Beyond Linear Classifiers • Similar results for classifiers which “respect” local convex hulls of training points. • Example: Nearest neighbor classifier. Future Work: More structured augmentation • How much robustness do cropping, rotation etc. add? Adaptive augmentation • What margin does Adaptive Data Augmentation (Adversarial Training) achieve?

  15. Thank you • Poster #155 • 6:30 – 9:00 PM, Today • Pacific Ballroom • Emails: rajput3@wisc.edu, zfeng49@cs.wisc.edu

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend