Branch&Rank: Efficient, Non-Linear Object Detection Alain D. - - PowerPoint PPT Presentation

branch rank efficient non linear object detection
SMART_READER_LITE
LIVE PREVIEW

Branch&Rank: Efficient, Non-Linear Object Detection Alain D. - - PowerPoint PPT Presentation

Image Classification Task "BBox" of Hypotheses Set Ranking Condition (Multiple) Inter mediate Tasks + f( ) < f( ) some elements of Object Detection Task T4 T3 T2 + T1 "BBox" of


slide-1
SLIDE 1

Λ

+

"BBox" of Hypotheses Set Λ

"BBox" of Hypotheses Set Λ

f( )

+

f( ) Λ

< Condition Ranking

some elements ofΛ− T1 T3 T4 T2 Image Classification Task Object Detection Task mediate Tasks (Multiple) Inter

Branch&Rank: Efficient, Non-Linear Object Detection

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool

Computer Vision Laboratory, ETH Zurich, Switzerland

30 August, 2011 Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-2
SLIDE 2

Detection means to localise and categorise objects

input annotated output car car

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-3
SLIDE 3

Appearance variations make it difficult

intra-class variations different views/poses illumination changes

  • cclusions, etc.

sophisticated expensive models

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-4
SLIDE 4

Localise objects among thousands of hypotheses

  • ⇓⇓⇓⇓⇓⇓

Search space size >10′000 locations >1′000 classes avoid exhaustive enumeration

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-5
SLIDE 5

Efficient detection by ranking sub-images

Runtime = (classifier cost) × (#calls)

◮ reduce cost: cascades [Viola et al. 04, Vedaldi et al. 09]

exhaustive search →not scalable

◮ reduce calls: branch&bound [Lampert et al. 08, Lehmann et al. 09]

bounds not tight enough →not effective

Ranking: ”learn the bound”

◮ branch, but not bound ◮ often <100 classifier calls

→non-linear SVMs

◮ classification for detection

T1 T3 T4 T2 Image Classification Task Object Detection Task mediate Tasks (Multiple) Inter

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-6
SLIDE 6

Outline

1 Detection: best-first search 2 Training: ranking hypothesis sets 3 Multi-tasks aspects 4 Results and conclusion Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-7
SLIDE 7

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-8
SLIDE 8

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-9
SLIDE 9

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-10
SLIDE 10

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-11
SLIDE 11

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-12
SLIDE 12

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-13
SLIDE 13

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-14
SLIDE 14

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-15
SLIDE 15

Efficiency by means of adaptive partitioning

Sets of hypothesis exploit correlations split promising sets

slide-16
SLIDE 16

Efficiency by means of adaptive partitioning

? ? ? ? ?

Sets of hypothesis exploit correlations split promising sets

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-17
SLIDE 17

Efficiency by means of adaptive partitioning

? ? ? ? ? ? ? ? ? ?

Sets of hypothesis exploit correlations split promising sets correspond to subimages Ranking function f prioritises f  

  • contains obj.

 >f  

  • no object

  supersedes upper bounds

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-18
SLIDE 18

Training with sets for increased efficiency

Λ

+

"BBox" of Hypotheses Set Λ

"BBox" of Hypotheses Set Λ

f( )

+

f( ) Λ

< Condition Ranking

some elements ofΛ−

Structured SVM ranking [Tsochantaridis et al. 04, Blaschko et al. 08] min

w,ξi≥0 w2 + C

  • i

ξi f (Λ+

i ) − f (Λ−) ≥ ∆(Λ−) − ξi with f (Λ) = w, φ(Λ) ◮ bag-of-words descriptor φ(Λ) ◮ kernelize with RBF-χ2 kernel ◮ Λ+: generate with oracle ◮ Λ−: delayed constraint generation

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-19
SLIDE 19

From image classification to object categorisation

T1 T3 T4 T2 Image Classification Task Object Detection Task mediate Tasks (Multiple) Inter

Large sets

◮ object somewhere ◮ image classification

Small sets

◮ object centred ◮ object categorisation

Task-adapted ranking f (Λ) = wq(Λ), φ(Λ)

◮ task mapping q(Λ) ◮ leverage set information ◮ exploit context ◮ improved AP by ≈ 10%

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-20
SLIDE 20

Branch&rank detects in often <50 iterations

0.0 0.5 1.0

prec recall

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 detection score 50 100 150 200 iterations till detection 'Horse' detections heat map

true positive

  • avg. #iterations

prec=recal

more efficient more confident detections Dataset: PASCAL VOC 2007 (Horses) [Everingham et al., 2007] non-linear RBF-χ2 SVMs no cascade approximations costly classifier feasible

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-21
SLIDE 21

More results

Horse Cow Cat

0.0 0.5 1.0

prec recall

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 detection score 50 100 150 200 iterations till detection 'Horse' detections heat map

true positive

  • avg. #iterations

prec=recal

0.0 0.5 1.0

prec recall

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 detection score 50 100 150 200 iterations till detection 'Cow' detections heat map

true positive

  • avg. #iterations

prec=recal

0.0 0.5 1.0

prec recall

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 detection score 50 100 150 200 iterations till detection 'Cat' detections heat map

true positive

  • avg. #iterations

prec=recal

branch&rank part-based detector best in challenge

[Lehmann et al. 2011] [Felzenszwalb et al. 2008] [Everingham et al. 2007]

Horse 36.8% 30.1% 33.5% better Cow 10.8% 16.5% 14.0% worse Cat 17.6% 11.0% 24.0% in-between

Future work

combine multiple features use task-adapted features

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection

slide-22
SLIDE 22

Conclusion

Λ

+

"BBox" of Hypotheses Set Λ

"BBox" of Hypotheses Set Λ

f( )

+

f( ) Λ

< Condition Ranking

some elements ofΛ−

Branch&rank is efficient

less than 100 classifier calls non-linear SVMs feasible

Process hypothesis sets

during detection and training “learn the bound”

T1 T3 T4 T2 Image Classification Task Object Detection Task mediate Tasks (Multiple) Inter

Multiple task

combine classification and detection

Alain D. Lehmann, Peter V. Gehler, and Luc Van Gool Branch&Rank: Efficient, Non-Linear Object Detection