Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao - - PowerPoint PPT Presentation

localization and instance segmentation
SMART_READER_LITE
LIVE PREVIEW

Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao - - PowerPoint PPT Presentation

Weakly Supervised Object Detection, Localization, and instance segmentation Fang Wan, Yi Zhu, Yanzhao Zhou, Qixian ang Ye Ye www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye Problem Supervised object ect detection ection and inst


slide-1
SLIDE 1

Weakly Supervised Object Detection, Localization, and instance segmentation

Fang Wan, Yi Zhu, Yanzhao Zhou, Qixian ang Ye Ye

www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye

slide-2
SLIDE 2

Problem

Supervised object ect detection ection and inst stance ance segme egmentat ation ion pipeline

Human annotation Machine learning

Classes, Boxes, Masks,

Object box and/or mask

slide-3
SLIDE 3

Problem

Supervised object ect detection ection and inst stance ance segme egmentat ation ion pipeline

Human annotation Machine learning

Classes, Boxes, Masks,

Object box and/or mask Bounding box annotation Mask annotation

slide-4
SLIDE 4

Problem

20k+ class 100/c 2M instances 5min/I 19 years /perso rson

Copy from UIUC Yunchao’s Slides

slide-5
SLIDE 5

Problem

Imagery databases Training sets

slide-6
SLIDE 6

Solutions

Data annotation is exp xpensive ensive

Imagery databases Training sets

slide-7
SLIDE 7

Solutions

Data annotation is efficie cient nt and d low-cos cost

Imagery databases Training sets Weakly supervi ervised ed data a annotat

  • tation
  • n

Weakly supervi ervised ed learni ning ng

slide-8
SLIDE 8

Solutions

Weakly Supervised Annotations

Scribes bes Copy from UIUC Yunchao’s Slides

slide-9
SLIDE 9

Solutions

Weakly Supervised Annotations

Scribes bes Point nt Copy from UIUC Yunchao’s Slides

slide-10
SLIDE 10

Solutions

Weakly Supervised Annotations

Person

  • n, Sheep, Dog

Scribes bes Point nt Image-lev evel el label els The most t efficient ent one Copy from UIUC Yunchao’s Slides

slide-11
SLIDE 11

Solutions

Weakly labeled imagery is widely available on the Web

slide-12
SLIDE 12

Solutions

Weakly labeled imagery is widely available on the Web

slide-13
SLIDE 13

Solutions

Data annotation is efficie cient nt and d low-cos cost

Imagery databases Training sets Weakly supervi ervised ed data a annotat

  • tation
  • n

Weakly supervi ervised ed learni ning ng

slide-14
SLIDE 14

Our works

MELM LM CVPR18: 8: Min-en entrop

  • py

y Laten ent Mode del (WSOD)

CMIL:

: Continuati

tion Multiple Insta tance Learning CVPR19 (WSOD)

PRM PRM CVPR CVPR18 18: Peak Respo ponse nse Mapp pping ng (WSIS) MELM+ LM+Recu ecurren ent Learni arning ng PAMI20 2019 19: Recurr urren ent Learni ning ng (WSO WSOD) SPN SPN ICCV17 17: Soft Propo posal sal Network

  • rk (WSOL)

IAM CVPR CVPR19 19: Inst stan ance ce Activat vation

  • n Map (WSIS)
slide-15
SLIDE 15

Latent variable h Latent variable learning Latent variable Latent variable

Our works-Challenge analysis

Multiple Instance learning

𝑀 Ɵ = 1 2 ∥ Ɵ ∥2 +𝜇

𝑗

max(0, 1 − 𝑧𝑗𝑔(𝑦𝑗, ℎ𝑗)) 𝑔(𝑦, ℎ) = max

Ɵ ∙ Φ(𝑦, ℎ)

slide-16
SLIDE 16

Our works-Challenge analysis

(0)

(1)

(*)

Local minimum global minimum

𝑀 Ɵ = 1 2 ∥ Ɵ ∥2 +𝜇

𝑗

max(0, 1 − 𝑧𝑗𝑔(𝑦𝑗, ℎ𝑗)) 𝑔(𝑦, ℎ) = max

Ɵ ∙ Φ(𝑦, ℎ)

slide-17
SLIDE 17

Our works-Methodology

(0)

(1)

(*)

Local minimum stronger minimum

(2)

Conv nvex Regular gulariz izatio ion Cont ntin inuat uation ion Optim imiz izat ation ion

Objecti tive functi tion Epoch t Epoch 0

slide-18
SLIDE 18

Our works-Min-entropy latent model

  • F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min-entropy Latent Model for Weakly Supervised object Detection,” IEEE CVPR2018

Object ect discov

  • very

ery Object ect localizat zation

  • n
slide-19
SLIDE 19

Our works-Min-entropy latent model

(1) Instance (object and object part) are collected with a clique e partitio tition n module le; (2) Object clique discovery with a global min-entropy model; (3) Object localization with a local min-entropy model

slide-20
SLIDE 20

Our works-Min-entropy latent model

Clique e partit tition: ion:

slide-21
SLIDE 21

Our works-Methodology

(0)

(1)

(*)

Local minimum stronger minimum

(2)

Conv nvex Regular gulariz izatio ion Cont ntin inuat uation ion Optim imiz izat ation ion

Objecti tive functi tion Epoch t Epoch 0

slide-22
SLIDE 22

Our works-Continuation Multiple Instance Learning

  • F. Wan, C. Liu, J. Jiao, Q. Ye, “CMIL:

L: Continuation Multiple Instance Learning for Weakly Supervised object Detection (CVPR2019)

slide-23
SLIDE 23

Our works-Min-entropy latent model

slide-24
SLIDE 24

Our works-Recurrent Learning

  • F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min-entropy Latent Model for Weakly Supervised object Detection,” IEEE

Transactions on Pattern Analysis and Machine Intelligence (PAMI), DOI:10.1109/TPAMI.2019.2898858.

Recur urren rent Learnin earning Accumula ulated d Recur urren rent Learnin earning

slide-25
SLIDE 25

Our works-Results

slide-26
SLIDE 26

Our works-Results

SSER: Semant antic ic Stable able Extre remal al Regio ion

slide-27
SLIDE 27

Our works-Soft Proposal Network

  • Y. Zhu, Y. Zhou, Q. Ye, Q. Qiu, and J. Jiao, "Soft Proposal Network for Weakly Supervised Object Localization," in
  • Proc. of IEEE Int. Conf. on Computer Vision (ICCV), 201
slide-28
SLIDE 28

Our works-Soft Proposal Network

slide-29
SLIDE 29

Our works-Peak Response Mapping

  • Y. Zhou, Y. Zhu, Q. Ye, Q. Qiu, J. Jiao, “Weakly Supervised Instance Segmentation using Class Peak Response, IEEE CVPR

2018 (Spotlig tlight).

slide-30
SLIDE 30

Our works-Peak Response Mapping

slide-31
SLIDE 31

Our works-learning Instance Activation Maps

  • Y. Zhu, Y. Zhou, H. Xu, Q. Ye., D. Doermann, J. Jiao, “Learning Instance Activation Maps for Weakly

Supervised Instance Segmentation,” IEEE CVPR 2019.

slide-32
SLIDE 32

Our works-learning Instance Activation Maps

slide-33
SLIDE 33

The future

Beyond regularization and continuation optimization

(0)

(1)

(*)

Local minimum strong nger minimum

(2)

Conv nvex Regular gulariz izatio ion Contin ntinuat uation ion Optim imiz izat ation ion

Objecti tive functi tion Epoch t Epoch 0

slide-34
SLIDE 34

The future

Beyond weakly supervised detection and segmentation

slide-35
SLIDE 35

The future

Fill the gap of supervised and weakly supervised methods

0.3 .348 0.412 .412 0.42 .428 0.44 .443 0.453 .453 0.47 .473 0.50 .505 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65

WSDDN (2016) OICR (2017) WCCN (2017) TSC (2018) WeakRPN 2018 MELM (2018) CMIL (2019)

mAP on

  • n PascalVOC 20

2007 with Fast-RCNN framework

15%

slide-36
SLIDE 36

The future

Weakly supervised detection meets X

X= Few-shot Active Learning | Online Feedback | Temporal

slide-37
SLIDE 37

The future

X= Few-shot Active Learning | Online Feedback | Temporal

  • Q. Ye, Z. Zhang, Q. Qiu, B. Zhang, J. Chen, and G. Sapiro, "Self-learning Scene-specific Pedestrian Detectors using a

Progressive Latent Model," IEEE CVPR, 2017

slide-38
SLIDE 38

Ref.

[1] F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min-entropy Latent Model for Weakly Supervised object Detection,” IEEE

  • Trans. PAMI, DOI:10.1109/TPAMI.2019.2898858. (MELM+Recu

Recurr rrent Learn rnin ing) [2] F. Wan, C. Liu, J. Jiao, Q. Ye, “CMIL: Continuation Multiple Instance Learning for Weakly Supervised object Detection (CVPR2019) (C (C-MIL IL) [3] Y. Zhu, Y. Zhou, H. Xu, Q. Ye., D. Doermann, J. Jiao, “Learning Instance Activation Maps for Weakly Supervised Instance Segmentation,” IEEE CVPR 2019. (IAM) [4] P. Tang, X. Wang, S. Bai, W. Shen, X. Bai, W. Liu, and A. L. Yuille, “Pcl: Proposal cluster learning for weakly supervised object detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018. (PCL) CL) [5] Y. Zhou, Y. Zhu, Q. Ye, Q. Qiu, J. Jiao, “Weakly Supervised Instance Segmentation using Class Peak Response,” in Proc. of IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2018 (Spotlight). (PRM) [6] F. Wan, P. Wei, Z. Han, J. Jiao, Q. Ye, “Min-entropy Latent Model for Weakly Supervised object Detection,” in

  • Proc. of IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2018: 1297-1306. (MELM)

[7] Y. Zhu, Y. Zhou, Q. Ye, Q. Qiu, and J. Jiao, "Soft Proposal Network for Weakly Supervised Object Localization," in

  • Proc. of IEEE Int. Conf. on Computer Vision (ICCV), 2017. (SPN)

N) [8] Q. Ye, Z. Zhang, Q. Qiu, B. Zhang, J. Chen, and G. Sapiro, "Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model," IEEE CVPR, 2017 (Self lf-Learn rnin ing) [9] B. Hakan and V. Andrea, “Weakly supervised deep detection networks,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 2846–2854. (WSDD DDN) N) [10] B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba, “Learning deep features for discriminative localization,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp.2921–2929. (CAM)

slide-39
SLIDE 39

Thank!

www.ucassdl.cn qxye@ucas.ac.cn people.ucas.ac.cn/~qxye