HARDNET: CONVOLUTIONAL NETWORK FOR LOCAL IMAGE DESCRIPTION A n a - - PowerPoint PPT Presentation

hardnet convolutional network for local image description
SMART_READER_LITE
LIVE PREVIEW

HARDNET: CONVOLUTIONAL NETWORK FOR LOCAL IMAGE DESCRIPTION A n a - - PowerPoint PPT Presentation

HARDNET: CONVOLUTIONAL NETWORK FOR LOCAL IMAGE DESCRIPTION A n a s t a s i i a Mi s h c h u k , D m y t r o Mi s h k i n , F i l i p R a d e n o v i c J i r i Ma t a s OUTLINE Short review


slide-1
SLIDE 1

HARDNET: CONVOLUTIONAL NETWORK FOR LOCAL IMAGE DESCRIPTION

A n a s t a s i i a Mi s h c h u k , D m y t r

  • Mi

s h k i n , F i l i p R a d e n

  • v

i c J i r i Ma t a s

slide-2
SLIDE 2

Short review of methods for learning

  • f local descriptors

The L2-Net HardNet loss and architecture Benchmarks

2

OUTLINE

slide-3
SLIDE 3

3

TRAINING DATA

D i s c r i mi n a n t L e a r n i n g

  • f

L

  • c

a l I ma g e D e s c r i p t

  • r

s B r

  • w

n e t a l , P A MI 2 1

3 s e t s , 4 k p a t c h e s e a c h :

  • L

i b e r t y ( s h

  • w

n )

  • N
  • t

r e d a me

  • Y
  • s

e mi t e S i z e : 6 4 x 6 4 , g r a y s c a l e . O b t a i n e d f r

  • m

S f M mo d e l , 3 D p

  • i

n t → D

  • G

k e y p

  • i

n t s U s e d i n a l l l e a r n e d d e s c r i p t

  • r

s me a n t i

  • n

e d i n t h i s p r e s e n t a t i

  • n
slide-4
SLIDE 4

4

CONVEXOPT (SIMONYAN ET AL, 2012)

Global margin loss

S i mo n y a n e t a l , E C C V 2 1 2

C

  • n

v e x

  • p

t i mi z a t i

  • n

p r

  • b

l e m

slide-5
SLIDE 5

5

MATCHNET

H a n e t a l , C V P R 2 1 5 . Wo r k s w e l l , b u t r e l y

  • n

me t r i c n e t w

  • r

k . A p p r

  • x

i ma t e k N N me t h

  • d

s , e . g . F L A N N c a n n

  • t

b e a p p l i e d d i r e c t l y

7x7 Conv pad 1

64 24

ReLU

1 24

5x5 Conv pad 2

64

ReLU

64

3x3 Conv pad 1 ReLU

64

2x2 MP/2

32 32

2x2 MP/2

16 96 16

3x3 Conv pad 1 ReLU

96 16

3x3 Conv pad 1 ReLU

64 16

3x3 MP/2

64 8

8x8 Conv ReLU

1 128

1x1 Conv ReLU

1 256

1x1 Conv ReLU

1 256

1x1 Conv

Sofumax

1 2

slide-6
SLIDE 6

6

DEEPCOMPARE

Z a g

  • r

u y k

  • a

n d K

  • mo

d a k i s , C V P R 2 1 5 Wo r k s w e l l , b u t r e l y

  • n

me t r i c n e t w

  • r

k . A p p r

  • x

i ma t e k N N me t h

  • d

s , e . g . F L A N N c a n n

  • t

b e a p p l i e d d i r e c t l y

7x7 Conv pad 3

64 96

ReLU

1 96

5x5 Conv pad 2

192

ReLU

192

3x3 Conv pad 1 ReLU

64

2x2 MP/2

32 32

2x2 MP/2

16 256 16 8

8x8 Conv ReLU

1 1 256

1x1 Conv ReLU 256 1x1 Conv

Sigmoid 1

2x2 MP/2

256

slide-7
SLIDE 7

9 S i mo

  • S

e r r a e t a l , I C C V 2 1 5 . B a l n t a s e t a l , B MV C 2 1 6 32 32

7x7 Conv

26

TanH

1

2x2 MP/2

13

6x6 Conv TanH

8 64

8x8 Conv TanH

1 128 32

T F e a t

( B a l n t a s e t a l , 2 1 6 )

  • E

v e n s h a l l

  • w

e r a n d f a s t e r C N N ,

  • h

a r d

  • n

e g a t i v e mi n i n g : b y a n c h

  • r

s w a p i n t r i p l e t .

  • t

r i p l e t ma r g i n l

  • s

s

  • n

L 2 d i s t a n c e 1 64

7x7 Conv

58 32

TanH 2x2 L2pool/2

29

6x6 Conv TanH

23 64

5x5 Conv TanH

4 128 32

3x3 L2Pool/3 8

64

4x4 L2Pool/4 1

128

D e e p D e s c

( S i mo

  • S

e r r a e t a l , 2 1 5 ) R e l a t i v e l y s h a l l

  • w

a n d f a s t C N N . H a r d n e g a t i v e mi n i n g : C

  • n

t r a s t i v e l

  • s

s

  • n

L 2 d i s t a n c e

slide-8
SLIDE 8

1

DESCRIPTOR COMPARISON

D e s c r . # l a y e r s w/ p a r a ms L

  • s

s Ha r d mi n i n g K d

  • t

r e e r e a d y C

  • n

v e x O p t 1 G l

  • b

a l ma r g i n

  • +

D e e p D e s c 3 C

  • n

t r a s t i v e + + T F e a t 3 T r i p l e t ma r g i n + /

  • +

Ma t c h N e t 8 C r

  • s

s e n t r

  • p

y

  • D

e e p C

  • mp

5 H i n g e

  • B

a l n t a s e t a l , B MV C 2 1 6

slide-9
SLIDE 9

1 1

  • L2NET. TIAN ET AL (CVPR 2017)

32 32 16 16

3x3 Conv pad 1

32 32

BN + ReLU

1

3x3 Conv pad 1

32

BN + ReLU 3x3 Conv pad 1 /2

64

BN + ReLU 3x3 Conv pad 1

64

BN + ReLU 3x3 Conv pad 1 /2 BN + ReLU

8 128

3x3 Conv pad 1 BN + ReLU

8 128

8x8 Conv BN+ L2Norm

1 128

slide-10
SLIDE 10

1 3

L2NET: LOSS TERMS

S

  • f

t ma x

  • v

e r r

  • w

/ c

  • l

u mn

  • f

d i s t a n c e ma t r i x

slide-11
SLIDE 11

1 4

L2NET: LOSS TERMS

S

  • f

t ma x

  • v

e r r

  • w

/ c

  • l

u mn

  • f

d i s t a n c e ma t r i x P e n a l t y

  • n

d e s c r i p t

  • r

c

  • mp
  • n

e n t s c

  • r

r e l a t i

  • n
slide-12
SLIDE 12

1 5

L2NET: LOSS TERMS

S

  • f

t ma x

  • v

e r r

  • w

/ c

  • l

u mn

  • f

d i s t a n c e ma t r i x S

  • f

t ma x

  • v

e r r

  • w

/ c

  • l

u mn

  • f

d i s t a n c e ma t r i x

  • f

i n t e r me d i a t e f e a t u r e s P e n a l t y

  • n

d e s c r i p t

  • r

c

  • mp
  • n

e n t s c

  • r

r e l a t i

  • n
slide-13
SLIDE 13

1 6

HARDNET

Triplet margin loss for hard negative P e n a l t y

  • n

d e s c r i p t

  • r

c h a n n e l s c

  • r

r e l a t i

  • n

S

  • f

t ma x

  • v

e r r

  • w

/ c

  • l

u mn

  • f

d i s t a n c e ma t r i x

  • f

i n t e r me d i a t e f e a t u r e s

slide-14
SLIDE 14

1 7

HARDNET (OURS)

3x3 Conv pad 1

32 32

BN + ReLU

1

3x3 Conv pad 1

32

BN + ReLU 3x3 Conv pad 1 /2

64

BN + ReLU 3x3 Conv pad 1

64

BN + ReLU 3x3 Conv pad 1 /2 BN + ReLU

8 128

3x3 Conv pad 1 BN + ReLU

8 128

8x8 Conv BN+ L2Norm

1 128

slide-15
SLIDE 15

1 8

BATCH SIZE INFLUENCE

slide-16
SLIDE 16

1 9

DESCRIPTOR COMPARISON

D e s c r . # l a y e r s w/ p a r a ms L

  • s

s Ha r d mi n i n g K d

  • t

r e e r e a d y C

  • n

v e x O p t 1 G l

  • b

a l ma r g i n

  • +

D e e p D e s c 3 C

  • n

t r a s t i v e + + T F e a t 3 T r i p l e t ma r g i n + /

  • +

Ma t c h N e t 8 C r

  • s

s e n t r

  • p

y

  • D

e e p C

  • mp

5 H i n g e

  • L

2 N e t 7 S

  • f

t Ma x + + Ha r d N e t 7 T r i p l e t ma r g i n + +

slide-17
SLIDE 17

Loss comparison on patch triplets

2

slide-18
SLIDE 18

2 1

LOSSES COMPARISON, DERIVATIVES

slide-19
SLIDE 19

2 2

LOSSES COMPARISON, DERIVATIVES

N

  • g

r a d i e n t f r

  • m

n e g a t i v e e x a mp l e S ma l l g r a d i e n t s

slide-20
SLIDE 20

2 3

LOSSES COMPARISON

Contrastive Softmax (L2Net) Triplet margin FPR, Brown Yos 0.009 0.009 0.006 mAUC, W1BS 0.072 0.083 0.083 mAUC, HP-T 0.153 0.157 0.164

slide-21
SLIDE 21

Results

2 4

slide-22
SLIDE 22

2 5

RESULTS: BROWN DATASET

slide-23
SLIDE 23

2 6

RESULTS: W1BS DATASET

Mi s h k i n e t a l , B MV C 2 1 5 N u i s a n c e f a c t

  • r

: A p p e a r a n c e G e

  • me

t r y L i g h t i n g S e n s

  • r
slide-24
SLIDE 24

2 7

HPATCHES DATASET

D

  • G

, H e s s i a n , H a r r i s – i n r e f . i ma g e ~ 1 3 p a t c h e s p e r i ma g e k e p t . R e p r

  • j

e c t e d t

  • t

h e r i ma g e s w i t h 3 l e v e l s

  • f

“ a ffjn e f r a me n

  • i

s e ” a d d e d V : 5 7 i ma g e s i x p l e t s – p h

  • t
  • me

t r i c c h a n g e s I : 5 9 i ma g e s i x p l e t s – g e

  • me

t r i c c h a n g e s B a l n t a s e t a l , C V P R 2 1 7

slide-25
SLIDE 25

2 8

RESULTS: HPATCHES

slide-26
SLIDE 26

2 9

RESULTS: MATCHING WITH VIEW SYNTH

D a t a s e t s a r e a l r e a d y s a t u r a t e d

O n p a r wi t h R

  • t

S I F T S t i l l c h a l l e n g i n g d u e t

  • mu

l t i p l e n u i s a n c e f a c t

  • r

s

Z i t n i c k a n d R a mn a t h , 2 1 1 , Mi s h k i n e t a l 2 1 5 , Mi k

  • l

a j c z y k e t a l . 2 1 3 , H a u a g g e a n d S n a v e l y , 2 1 2 , K e l ma n e t a l , 2 7 , F e r n a n d

  • e

t a l . 2 1 4

slide-27
SLIDE 27

3

RESULTS: BOW OXFORD5K & PARIS 6K

P h i l b i n e t a l 2 7 , P h i l b i n e t a l 2 8

slide-28
SLIDE 28

3 1

RESULTS: HQE OXFORD5K & PARIS 6K

slide-29
SLIDE 29

Thank you for attention

PDF: https://arxiv.org/abs/1705.10872 Source and models: https://github.com/DagnyT/hardnet 3 2