Virtual Class Enhanced Discriminative Embedding Learning Binghui - - PowerPoint PPT Presentation

virtual class enhanced discriminative
SMART_READER_LITE
LIVE PREVIEW

Virtual Class Enhanced Discriminative Embedding Learning Binghui - - PowerPoint PPT Presentation

Virtual Class Enhanced Discriminative Embedding Learning Binghui Chen, Weihong Deng, Haifeng Shen BUPT & DiDi 32nd Conference on Neural Information Processing Systems (NeurIPS), 2018, Montral, Canada. Observation & Motivation


slide-1
SLIDE 1

Virtual Class Enhanced Discriminative Embedding Learning

Binghui Chen, Weihong Deng, Haifeng Shen BUPT & DiDi

32nd Conference on Neural Information Processing Systems (NeurIPS), 2018, Montréal, Canada.

slide-2
SLIDE 2

Observation & Motivation

  • For d-dimensional feature space under Softmax classifier, the feature

region of each class is inversely proportional to the number of class.

Increase class number

slide-3
SLIDE 3

Virtual Softmax:

Learning towards discriminative image features

  • Formulation: inject a dynamic virtual negative class

where

  • Optimization goal:

Virtual class

slide-4
SLIDE 4
  • Optimization goal of Virtual Softmax:
  • The conventional Softmax learns towards a weaker goal:

Objective comparison

slide-5
SLIDE 5

Discussion:

  • Interpretation from Coupling Decay:

perform the first order Taylor Expansion for the second log-term in Eq.2, a term of shows up. Therefore, minimizing the above equation is to minimize to some extend, and this can be viewed as a coupling decay term, i.e. Data-Dependent Weight Decay and Weight-Dependent Data Decay.

(2) (1)

slide-6
SLIDE 6
  • Interpretation from Feature Update:

For a linear neural layer, the Feature Update by Softmax and our Virtual Softmax is like:

Softmax: Virtual Softmax:

slide-7
SLIDE 7

Experiments:

  • Similar convergence and higher accuracy on CIFAR100:
slide-8
SLIDE 8
  • Visualization of Feature Compactness and Separability on MNIST :

Experiments:

slide-9
SLIDE 9

Experiments:

  • Visualization of intra-class and inter-class similarities on CIFAR10,

CIFAR100:

slide-10
SLIDE 10

Experiments:

  • Performances on small-scale object classification datasets:
  • Performances on large-scale object classification and face verification

datasets:

slide-11
SLIDE 11

Thanks!

http://www.bhchen.cn