revisiting class activation mapping for learning from
play

Revisiting Class Activation Mapping for Learning from Imperfect Data - PowerPoint PPT Presentation

The 2nd Learning from Imperfect Data (LID) Workshop Revisiting Class Activation Mapping for Learning from Imperfect Data Wonho Bae *, Junhyug Noh*, Jinhwan Seo, and Gunhee Kim Challenge Results 1 st place Track 3: Weakly Supervised Object


  1. The 2nd Learning from Imperfect Data (LID) Workshop Revisiting Class Activation Mapping for Learning from Imperfect Data Wonho Bae *, Junhyug Noh*, Jinhwan Seo, and Gunhee Kim

  2. Challenge Results 1 st place Track 3: Weakly Supervised Object Localization 2 nd place Track 1: Weakly Supervised Semantic Segmentation 2

  3. Weakly-Supervised Object Localization monkey Input Output 3

  4. Class Activation Mapping (CAM) 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP CNN … … ! *,$ 6 789 , CAM (Class Activation Maps) ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( + $ , - resize > @ ABC > + $ D $ localization result 4

  5. Class Activation Mapping (CAM) 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP CNN … … ! *,$ 6 789 , CAM (Class Activation Maps) ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( + $ , - resize > @ ABC > + $ D $ localization result 5

  6. Class Activation Mapping (CAM) for Track 3 1 2 3 ⋯ $ ! !,# *: ,-./01 ! $,# GAP CNN … … ! %,# # 234 " CAM (Class Activation Maps) ! !,# ∗ + ! $,# ∗ + ! %,# ∗ + ⋯ + ! &,# ∗ = " & " ' " ( ! % " ) resize > # &'( 5 ! % $ % localization result 6

  7. How to Grasp Whole Object Region? [HaS] Singh, et al. ICCV 2017 [AE] Wei, et al. CVPR 2017 7 [ACoL] Zhang, et al. CVPR 2018 [ADL] Choe, et al. CVPR 2019

  8. Our Approach Motivation • • Information to capture the whole area of the object already exists in feature maps Problem • • Three modules (M1 – M3) of CAM do not take phenomena (P1 – P3) into account • It results in the localization being limited to small discriminative regions of an object Solution • • Correctly utilize the information by simply modifying the three modules Phenomena observed in the feature map ( , ) M1: Global Average Pooling ( GAP) P1: 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP P2: CNN … … ! *,$ P3: 6 789 , M2: Class Activation Maps ( CAM) M3: Thresholding resize > @ ABC ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( , - + $ > + $ D $ localization result

  9. Our Approach (1) Thresholded Average Pooling Problem : Global Average Pooling (GAP) under P1 • Phenomena observed in the feature map ( , ) M1: Global Average Pooling ( GAP) P1: 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP CNN … … ! *,$ 6 789 , M2: Class Activation Maps ( CAM) M3: Thresholding resize > @ ABC ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( + $ , - > + $ D $ localization result 9

  10. Our Approach (1) Thresholded Average Pooling Problem : Global Average Pooling (GAP) under P1 • $ ! # ! " 10

  11. Our Approach (1) Thresholded Average Pooling Problem : Global Average Pooling (GAP) under P1 • # $,& GAP # $,& ∗ = ⋮ 2.5 (= 0.04 ) " ( $ (max: 64.7 ) ⋮ = 0.100 + 0.099 + ⋯ GAP # # ',& ∗ ',& = ⋮ 9.9 (= 0.01 ) * +,- ( ' (max: 59.2 ) Classification phase Localization phase 11

  12. Our Approach (1) Thresholded Average Pooling Problem : Global Average Pooling (GAP) under P1 • Solution: Thresholded Average Pooling (TAP) • 12

  13. Our Approach (2) Negative Weight Clamping Problem : Class Activation Maps (CAM) under P2 • Phenomena observed in the feature map ( , ) M1: Global Average Pooling ( GAP) 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP P2: CNN … … ! *,$ 6 789 , M2: Class Activation Maps ( CAM) M3: Thresholding resize > @ ABC ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( + $ , - > + $ D $ localization result

  14. Our Approach (2) Negative Weight Clamping Problem : Class Activation Maps (CAM) under P2 • − = Both Positive only Negative only 14

  15. Our Approach (2) Negative Weight Clamping Problem : Class Activation Maps (CAM) under P2 • IoA between the ground truth boxes and the CAMs Positive weights Negative weights 15

  16. Our Approach (2) Negative Weight Clamping Problem : Class Activation Maps (CAM) under P2 • Solution : Negative Weight Clamping (NWC) • 16

  17. Our Approach (3) Percentile as a Thresholding Standard Problem: Maximum as a Standard (MaS) under P3 • Phenomena observed in the feature map ( , ) M1: Global Average Pooling ( GAP) 1 2 3 ⋯ = ! ",$ .: 012345 ! ',$ GAP CNN … … ! *,$ P3: 6 789 , M2: Class Activation Maps ( CAM) M3: Thresholding resize > @ ABC ! ",$ ∗ + ! ',$ ∗ + ! (,$ ∗ + ⋯ + ! *,$ ∗ = , " , ' , ( + $ , - > + $ D $ localization result 17

  18. Our Approach (3) Percentile as a Thresholding Standard Problem: Maximum as a Standard (MaS) under P3 • threshold ( ! !"# ) 100 − percentile (%) threshold ( ! !"# ) 100 − percentile (%) Num of channels Result with CAM CAM values (descending order) (activation > " !.# ) 18

  19. Our Approach (3) Percentile as a Thresholding Standard Problem: Maximum as a Standard (MaS) under P3 • Solution: Percentile as a Standard (PaS) • 19

  20. Experimental Setting Backbone: ResNet50-SE • Batch size: 210 • Input size: 384 × 384 • Random crop size: 336 × 336 • TAP threshold ( 𝜐 !"# ): 0.05 • PaS percentile ( 𝑗 ): 98 • 20

  21. Results on Validation Set Results with different components • To preserve the details of masks, we also applied a fully connected CRF. • • The performance gradually improves as each component is added. 21

  22. Leaderboard Track 3: Weakly Supervised Object Localization • 22

  23. Qualitative Results CAM + Ours CAM + Ours CAM + Ours CAM + Ours

  24. Expansion to Track 1 24

  25. Expansion to Track 1 Our target! 25

  26. Class Activation Mapping (CAM) for Track 1 1 2 3 ⋯ $ ! !,# *: ,-./01 ! $,# GAP CNN … … ! %,# # 234 " CAM (Class Activation Maps) ! !,# ∗ + ! $,# ∗ + ! %,# ∗ + ⋯ + ! &,# ∗ = " & " ' " ( ! % " ) resize > # &'( 5 ! % $ % localization result 26

  27. Leaderboard Track 1: Weakly Supervised Semantic Segmentation • 27

  28. Thank You!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend