Distance Metric Learning with Joint Representation Diversification (PowerPoint Presentation)

SLIDE 1

Distance Metric Learning with Joint Representation Diversification

Xu Chu (1,2), Yang Lin (1,2), Yasha Wang (2,3), Xiting Wang (4), Hailong Yu (1,2), Xin Gao (1,2), Qi Tong (2,5)

1 School of Electronics Engineering and Computer Science, Peking University
2 Key Laboratory of High Confidence Software Technologies, Ministry of Education
3 National Engineering Research Center of Software Engineering, Peking University
4 Microsoft Research Asia
5 School of Software and Microelectronics, Peking University

July 14, 2020

SLIDE 2

The goal of distance metric learning (DML)

Learn a mapping $f_\theta$ from the original feature space to a representation space in which similar examples lie closer together than dissimilar examples.

SLIDE 3

The training objectives of deep DML methods encourage intra-class compactness and inter-class separability.

Embedding Loss

Contrastive loss [Chopra et al., 2005]: $\ell_{\mathrm{contrastive}} = [d(x_a, x_p) - m_{\mathrm{pos}}]_+ + [m_{\mathrm{neg}} - d(x_a, x_n)]_+$
Triplet loss [Schroff et al., 2015]: $\ell_{\mathrm{triplet}} = [d(x_a, x_p) - d(x_a, x_n) + m]_+$
...

Classification Loss

AMSoftmax loss [Wang et al., 2018]: $\ell_{\mathrm{AM}} = -\log \dfrac{e^{s(\mathrm{Sim}(x_i, w_{y_i}) - m)}}{e^{s(\mathrm{Sim}(x_i, w_{y_i}) - m)} + \sum_{j \neq y_i}^{C} e^{s\,\mathrm{Sim}(x_i, w_j)}}$
...
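
For concreteness, the minimal PyTorch sketch below implements a triplet loss and an AMSoftmax loss following the formulas above; the margin and scale values, and all variable names, are illustrative assumptions rather than the settings used in the paper.

import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, margin=0.2):
    # [d(x_a, x_p) - d(x_a, x_n) + m]_+ with Euclidean distance, averaged over the batch
    d_ap = (anchor - positive).pow(2).sum(dim=1).sqrt()
    d_an = (anchor - negative).pow(2).sum(dim=1).sqrt()
    return torch.clamp(d_ap - d_an + margin, min=0.0).mean()

def am_softmax_loss(embeddings, class_weights, labels, scale=30.0, margin=0.35):
    # cosine similarity Sim(x_i, w_j) between L2-normalized embeddings and class weights
    cos = F.normalize(embeddings, dim=1) @ F.normalize(class_weights, dim=1).t()
    # subtract the additive margin m from the true-class similarity only, then scale by s
    true_cos = cos.gather(1, labels.view(-1, 1)) - margin
    logits = (scale * cos).scatter(1, labels.view(-1, 1), scale * true_cos)
    return F.cross_entropy(logits, labels)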

SLIDE 7

Trade-off between intra-class compactness and inter-class separability.

• Intra-class compactness: risk of filtering out useful factors (for open-set classification)
• Inter-class separability: risk of introducing nuisance factors

[Figure: seen classes (Florida Jay, Blue Jay) vs. unseen classes (Hooded Warbler?, Yellow Warbler?, Wilson Warbler?, Orange Crowned Warbler?)]

SLIDE 13

Motivation

• Is it possible to find a better balance between intra-class compactness and inter-class separability?
• How can we leverage the hierarchical representations of DNNs to improve the DML representation?

Results

1. Additional explicit penalization of intra-class distances of representations is risky for classification-loss methods (AMSoftmax).
2. Encouraging inter-class separability by penalizing distributional similarities of joint representations is beneficial for classification-loss methods (AMSoftmax).
3. We propose a framework: distance metric learning with joint representation diversification (JRD).

SLIDE 14

Challenge

How to measure the similarities of joint distributions of representations across multiple layers?

Solution

Representers of probability measures in the reproducing kernel Hilbert space (RKHS)

Definition 1 (kernel mean embedding).
Let $\mathcal{M}^1_+(\mathcal{X})$ be the space of all probability measures $P$ on a measurable space $(\mathcal{X}, \Sigma)$, and let $\mathcal{H}$ be a reproducing kernel Hilbert space (RKHS) with reproducing kernel $k$. The kernel mean embedding is defined by the mapping
$$\mu : \mathcal{M}^1_+(\mathcal{X}) \to \mathcal{H}, \qquad P \mapsto \int k(\cdot, x)\, dP(x) =: \mu_P.$$

Definition 2 (cross-covariance operator).
Let $\mathcal{M}^1_+(\times_{l=1}^L \mathcal{X}^l)$ be the space of all probability measures $P$ on $\times_{l=1}^L \mathcal{X}^l$, and let $\otimes_{l=1}^L \mathcal{H}_l = \mathcal{H}_1 \otimes \cdots \otimes \mathcal{H}_L$ be the tensor product space with reproducing kernels $\{k_l\}_{l=1}^L$. The cross-covariance operator is defined by the mapping
$$\mathcal{C}_{X^{1:L}} : \mathcal{M}^1_+(\times_{l=1}^L \mathcal{X}^l) \to \otimes_{l=1}^L \mathcal{H}_l, \qquad P \mapsto \int_{\times_{l=1}^L \mathcal{X}^l} \big( \otimes_{l=1}^L k_l(\cdot, x^l) \big)\, dP(x^1, \ldots, x^L) =: \mathcal{C}_{X^{1:L}}(P).$$
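
As a quick numerical illustration of Definition 1 (not from the paper): with finite samples, the kernel mean embeddings become averages of kernel feature maps, and their RKHS inner product reduces to the mean of a cross Gram matrix. A Gaussian kernel and bandwidth are assumed below.

import torch

def gaussian_gram(X, Y, sigma=1.0):
    # k(x, y) = exp(-||x - y||^2 / sigma^2) for every pair of rows of X and Y
    return torch.exp(-torch.cdist(X, Y) ** 2 / sigma ** 2)

def mean_embedding_inner(X_p, X_q, sigma=1.0):
    # <mu_P, mu_Q>_H estimated from samples of P and Q
    return gaussian_gram(X_p, X_q, sigma).mean()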


SLIDE 17

Definition 3 (joint representation similarity)

Suppose that $P(X^1, \ldots, X^L)$ and $Q(X'^1, \ldots, X'^L)$ are probability measures on $\times_{l=1}^L \mathcal{X}^l$. Given $L$ reproducing kernels $\{k_l\}_{l=1}^L$, the joint representation similarity between $P$ and $Q$ is defined as the inner product of $\mathcal{C}_{X^{1:L}}(P)$ and $\mathcal{C}_{X'^{1:L}}(Q)$ in $\otimes_{l=1}^L \mathcal{H}_l$, i.e.,
$$S_{\mathrm{JRS}}(P, Q) := \big\langle \mathcal{C}_{X^{1:L}}(P),\, \mathcal{C}_{X'^{1:L}}(Q) \big\rangle_{\otimes_{l=1}^L \mathcal{H}_l}. \qquad (1)$$

Proposition 1 (interpretation for translation invariant kernels)

Suppose that $\{k_l(x, x') = \psi_l(x - x')\}_{l=1}^L$ on $\mathbb{R}^d$ are bounded, continuous reproducing kernels. Let $P_l := P(X^l \mid X^{1:l-1})$ for $l = 1, \ldots, L$, with $P_1 = P(X^1)$. Then for any $P(X^1, \ldots, X^L), Q(X'^1, \ldots, X'^L) \in \mathcal{M}^1_+(\times_{l=1}^L \mathcal{X}^l)$,
$$S_{\mathrm{JRS}}(P, Q) = \prod_{l=1}^{L} \big\langle \varphi_{P_l}(\omega),\, \varphi_{Q_l}(\omega) \big\rangle_{L^2(\mathbb{R}^d, \Lambda_l)}, \qquad (2)$$
where $\varphi_{P_l}(\omega)$ and $\varphi_{Q_l}(\omega)$ are the characteristic functions of the distributions $P_l$ and $Q_l$, and $\Lambda_l$ is a (normalized) non-negative Borel measure characterized by $\psi_l(x - x')$.
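
A hedged sketch of how the empirical counterpart of Definition 3 can be computed from two sets of layer-wise representations: the inner product of the cross-covariance operators becomes an average, over sample pairs, of the product of per-layer kernel values. Gaussian kernels and the bandwidth list are assumptions for illustration.

import torch

def gaussian_gram(X, Y, sigma):
    return torch.exp(-torch.cdist(X, Y) ** 2 / sigma ** 2)

def joint_representation_similarity(reps_p, reps_q, sigmas):
    # reps_p, reps_q: lists of (n, d_l) tensors, one per layer l; sigmas: per-layer bandwidths
    prod = None
    for Xp, Xq, sigma in zip(reps_p, reps_q, sigmas):
        K = gaussian_gram(Xp, Xq, sigma)        # layer-l kernel matrix, shape (n_p, n_q)
        prod = K if prod is None else prod * K  # elementwise product across layers
    return prod.mean()                          # average over all cross-sample pairs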


SLIDE 21

Definition 4 (joint representation similarity regularizer).
Considering $P(X^-, X, X^+)$, the joint representation similarity regularizer $\mathcal{L}_{\mathrm{JRS}}$ penalizes the empirical joint representation similarities for all class pairs; specifically,
$$\mathcal{L}_{\mathrm{JRS}} := \sum_{I \neq J} n_I n_J\, S_{\mathrm{JRS}}(P^I, P^J) = \sum_{I \neq J} \sum_{i=1}^{n_I} \sum_{j=1}^{n_J} k^-(x^{I-}_i, x^{J-}_j)\, k(x^{I}_i, x^{J}_j)\, k^+(x^{I+}_i, x^{J+}_j), \qquad (3)$$
where $k^-$, $k$ and $k^+$ are reproducing kernels, $I, J$ are class indexes, and $n_I n_J$ re-weights class pair $(I, J)$ according to its credibility.

Training Objective:

$$\mathcal{L}_{\mathrm{JRD}} = \mathcal{L}_{\mathrm{AMSoft}} + \alpha\, \frac{1}{N_{\mathrm{pairs}}}\, \mathcal{L}_{\mathrm{JRS}}, \qquad (4)$$
where $N_{\mathrm{pairs}}$ denotes the number of pairs of instances from different classes in a mini-batch.
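
To make Eqs. (3)-(4) concrete, here is a hedged mini-batch sketch: for every ordered pair of distinct classes, the elementwise product of the three layer-wise kernel matrices is summed, and the accumulated regularizer is added to the AMSoftmax loss with weight α after dividing by the number of cross-class instance pairs. Single Gaussian kernels (rather than the kernel mixtures used in the experiments) and all names are simplifying assumptions.

import torch

def gaussian_gram(X, Y, sigma):
    return torch.exp(-torch.cdist(X, Y) ** 2 / sigma ** 2)

def jrs_regularizer(reps_minus, reps_mid, reps_plus, labels, sigmas):
    # Eq. (3): sum over class pairs I != J of n_I * n_J * S_JRS(P^I, P^J)
    total, n_pairs = 0.0, 0
    classes = labels.unique()
    for I in classes:
        for J in classes:
            if I == J:
                continue
            sel_i, sel_j = labels == I, labels == J
            K = (gaussian_gram(reps_minus[sel_i], reps_minus[sel_j], sigmas[0])
                 * gaussian_gram(reps_mid[sel_i], reps_mid[sel_j], sigmas[1])
                 * gaussian_gram(reps_plus[sel_i], reps_plus[sel_j], sigmas[2]))
            total = total + K.sum()   # n_I * n_J * S_JRS is the plain sum of kernel products
            n_pairs += K.numel()      # count cross-class instance pairs for N_pairs
    return total, n_pairs

def jrd_objective(am_softmax, reps_minus, reps_mid, reps_plus, labels, sigmas, alpha=1.0):
    # Eq. (4): L_JRD = L_AMSoft + alpha * (1 / N_pairs) * L_JRS
    l_jrs, n_pairs = jrs_regularizer(reps_minus, reps_mid, reps_plus, labels, sigmas)
    return am_softmax + alpha * l_jrs / max(n_pairs, 1)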

SLIDE 22

Experimental Settings

Datasets

1. CUB-200-2011 (CUB)
2. Cars196 (CARS)
3. Stanford Online Products (SOP)

Kernel design

Mixture of $K$ Gaussian kernels: $k(x, x') = \frac{1}{K} \sum_{k=1}^{K} \exp\!\left(\frac{-(x - x')^2}{\sigma_k^2}\right)$; $K = 3$ for $X^-$ and $X$, $K' = 1$ for $X^+$.
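
A small sketch of the mixture-of-Gaussians kernel above; the bandwidth values are placeholders, not the ones used in the experiments.

import torch

def mixture_gaussian_gram(X, Y, sigmas=(1.0, 2.0, 4.0)):
    # k(x, x') = (1/K) * sum_k exp(-(x - x')^2 / sigma_k^2), with K = len(sigmas)
    sq = torch.cdist(X, Y) ** 2
    return sum(torch.exp(-sq / s ** 2) for s in sigmas) / len(sigmas)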

Evaluation Metric

Recall@K
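
For reference, a standard way to compute Recall@K for retrieval (a query counts as a hit if any of its K nearest neighbors in the embedding space shares its label); this sketch is generic rather than taken from the authors' evaluation code.

import torch

def recall_at_k(embeddings, labels, k=1):
    d = torch.cdist(embeddings, embeddings)   # pairwise distances in the embedding space
    d.fill_diagonal_(float("inf"))            # a query must not retrieve itself
    knn = d.topk(k, largest=False).indices    # indices of the K nearest neighbors
    hits = (labels[knn] == labels.unsqueeze(1)).any(dim=1)
    return hits.float().mean().item()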

Implementation details

• Backbone: Inception-BN
• Embedding size: 512
• Data augmentation: random crop, random horizontal mirroring
• Optimizer: Adam
• Epochs: 50 for CUB and CARS, 80 for SOP
• Learning rate decay: divided by 10 every 20 (40) epochs for CUB and CARS (SOP)
• Mini-batch sampling: random sampling
• ...


SLIDE 27

Comparing JRD with 2019 DML baselines

CUB, Recall@K (%) at K = 1 / 2 / 4 / 8:
DE-DSP [Duan et al., 2019]           53.6 / 65.5 / 76.9 / -
HDML [Zheng et al., 2019]            53.7 / 65.7 / 76.7 / 85.7
DAMLRRM [Xu et al., 2019]            55.1 / 66.5 / 76.8 / 85.3
ECAML [Chen and Deng, 2019a]         55.7 / 66.5 / 76.7 / 85.1
DeML [Chen and Deng, 2019b]          65.4 / 75.3 / 83.7 / 89.5
SoftTriple Loss [Qian et al., 2019]  65.4 / 76.4 / 84.5 / 90.4
MS [Wang et al., 2019]               65.7 / 77.0 / 86.3 / 91.2
JRD                                  67.9 / 78.7 / 86.2 / 91.3

CARS, Recall@K (%) at K = 1 / 2 / 4 / 8:
DE-DSP [Duan et al., 2019]           72.9 / 81.6 / 88.8 / -
HDML [Zheng et al., 2019]            79.1 / 87.1 / 92.1 / 95.5
DAMLRRM [Xu et al., 2019]            73.5 / 82.6 / 89.1 / 93.5
ECAML [Chen and Deng, 2019a]         84.5 / 90.4 / 93.8 / 96.6
DeML [Chen and Deng, 2019b]          86.3 / 91.2 / 94.3 / 97.0
SoftTriple Loss [Qian et al., 2019]  84.5 / 90.7 / 94.5 / 96.9
MS [Wang et al., 2019]               84.1 / 90.4 / 94.0 / 96.5
JRD                                  84.7 / 90.7 / 94.4 / 97.2

SOP, Recall@K (%) at K = 1 / 10 / 100:
DE-DSP [Duan et al., 2019]           68.9 / 84.0 / 92.6
HDML [Zheng et al., 2019]            68.7 / 83.2 / 92.4
DAMLRRM [Xu et al., 2019]            69.7 / 85.2 / 93.2
ECAML [Chen and Deng, 2019a]         71.3 / 85.6 / 93.6
DeML [Chen and Deng, 2019b]          76.1 / 88.4 / 94.9
SoftTriple Loss [Qian et al., 2019]  78.3 / 90.3 / 95.9
MS [Wang et al., 2019]               78.2 / 90.5 / 96.0
JRD                                  79.2 / 90.5 / 96.0

Sensitivity of α

[Plot: Recall@1 (%) on CUB, CARS, and SOP as α varies over {0, 0.05, 0.1, 0.2, 0.5, 1, 2}]

SLIDE 28

Effects of modeling the joint representation

CUB, Recall@K (%) at K = 1 / 2 / 4 / 8:
JRD          50.7 (1.1) / 63.7 (1.1) / 74.8 (1.2) / 84.1 (1.2)
MRD          49.4 (1.1) / 62.3 (1.1) / 74.5 (1.2) / 83.6 (1.2)
JRD-C        48.6 (1.5) / 61.4 (1.4) / 73.4 (1.5) / 83.0 (1.4)
JRD-Pooling  49.4 (1.2) / 62.2 (1.0) / 74.1 (1.2) / 83.3 (1.0)

CARS, Recall@K (%) at K = 1 / 2 / 4 / 8:
JRD          61.2 (1.3) / 72.6 (0.9) / 82.2 (0.6) / 89.2 (0.7)
MRD          59.8 (1.3) / 71.5 (1.2) / 80.6 (0.9) / 88.0 (0.9)
JRD-C        58.5 (1.5) / 69.6 (1.3) / 79.1 (0.7) / 86.6 (0.9)
JRD-Pooling  59.1 (1.5) / 70.7 (1.2) / 80.3 (0.5) / 87.7 (0.6)

SOP, Recall@K (%) at K = 1 / 10 / 100:
JRD          79.2 / 90.5 / 96.0
MRD          78.8 / 90.4 / 95.9
JRD-C        77.7 / 89.8 / 95.6
JRD-Pooling  79.0 / 90.4 / 95.9


SLIDE 31

Explicit penalization on intra-class distances

[Figure: seen classes (Florida Jay, Blue Jay) vs. unseen classes (Hooded Warbler?, Yellow Warbler?, Wilson Warbler?, Orange Crowned Warbler?)]

$$\mathcal{L}_{\mathrm{AMSoft}} - \alpha \sum_{I} \frac{1}{N^{I}_{\mathrm{pairs}}} \sum_{x^I_i,\, x^I_j \in T^I} e^{-\frac{1}{2}(x^I_i - x^I_j)^2} \qquad (5)$$

[Plot: H divergences (y-axis, 0.6-1.6) versus penalty weights 0.0, 0.01, 0.1, 0.2, 0.4, 0.6, 0.8, 1.0]
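
A hedged sketch of the explicit intra-class penalty in Eq. (5): a Gaussian similarity averaged over same-class pairs within the mini-batch, to be subtracted from the AMSoftmax loss with weight α. The names are illustrative, and self-pairs are kept for simplicity.

import torch

def intra_class_penalty(embeddings, labels):
    # Eq. (5) regularizer: sum_I (1 / N^I_pairs) * sum_{x_i, x_j in T^I} exp(-0.5 * (x_i - x_j)^2)
    total = 0.0
    for c in labels.unique():
        Xc = embeddings[labels == c]
        sq = torch.cdist(Xc, Xc) ** 2                 # squared distances within class c
        total = total + torch.exp(-0.5 * sq).mean()   # mean over pairs of class c
    return total

# objective of Eq. (5): loss = am_softmax_loss(...) - alpha * intra_class_penalty(embeddings, labels)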

Theorem 1 [Ben-David et al., 2010]

Let $\mathcal{H}$ be a hypothesis space. Denote by $\epsilon_s$ and $\epsilon_u$ the generalization errors on $\mathcal{D}_s$ and $\mathcal{D}_u$; then for every $h \in \mathcal{H}$:
$$\epsilon_u(h) \leq \epsilon_s(h) + \hat{d}_{\mathcal{H}}(\mathcal{D}_s, \mathcal{D}_u) + \lambda. \qquad (6)$$


SLIDE 34

JRS versus MMD

$$\mathrm{MMD}^2(P, Q) = \|\mu_P - \mu_Q\|^2_{\mathcal{H}} = \|\mu_P\|^2_{\mathcal{H}} + \|\mu_Q\|^2_{\mathcal{H}} - 2\langle \mu_P, \mu_Q \rangle_{\mathcal{H}} \qquad (7)$$
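
For comparison, a minimal (biased) empirical estimate of the MMD² in Eq. (7), expanding the three RKHS terms as means of Gram matrices; the Gaussian kernel is assumed.

import torch

def gaussian_gram(X, Y, sigma=1.0):
    return torch.exp(-torch.cdist(X, Y) ** 2 / sigma ** 2)

def mmd2(X_p, X_q, sigma=1.0):
    # ||mu_P||^2 + ||mu_Q||^2 - 2 <mu_P, mu_Q>, each term estimated by a Gram-matrix mean
    return (gaussian_gram(X_p, X_p, sigma).mean()
            + gaussian_gram(X_q, X_q, sigma).mean()
            - 2.0 * gaussian_gram(X_p, X_q, sigma).mean())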

[Plot: Recall@1 versus regularization weight {0.1, 0.2, 0.4, 0.6, 0.8, 1} for JIntra, JMMD, and JRD]

$$\mathcal{L}_{\mathrm{AMSoft}} + \alpha \cdot \mathrm{Regularizer} \qquad (8)$$

Regularizer      Recall@1        λ_NN            d̂_H,NN
JMMD (α@0.1)     0.486 (0.015)   0.321 (0.006)   0.9275 (0.003)
JRD (α@1)        0.506 (0.013)   0.310 (0.006)   0.934 (0.004)


SLIDE 37

Kernel Choice

Kernel                               k(x, x')
Gaussian                             exp(-(x - x')^2 / σ^2)
Laplace                              exp(-||x - x'||_1 / σ)
Degree-p inhomogeneous polynomial    (x · x' + 1)^p
Kernel inducing the MGF              exp(x · x')

Recall@K (%) at K = 1 / 2 / 4 / 8:
exp(-(x - x')^2 / σ^2) (α@1)      67.9 / 78.5 / 86.1 / 91.2
exp(-||x - x'||_1 / σ) (α@1)      68.1 / 78.2 / 86.4 / 91.8
(x · x' + 1)^2 (α@1e-3)           66.1 / 77.0 / 85.3 / 90.9
(x · x' + 1)^5 (α@1e-3)           65.2 / 76.2 / 86.4 / 90.7
exp(x · x') (α@1e-3)              66.1 / 76.7 / 85.4 / 91.1
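
The four kernel families compared above can be written directly as functions; this sketch is illustrative, with σ and p left as free parameters.

import torch

def gaussian_kernel(x, y, sigma=1.0):   # exp(-(x - y)^2 / sigma^2)
    return torch.exp(-((x - y) ** 2).sum(-1) / sigma ** 2)

def laplace_kernel(x, y, sigma=1.0):    # exp(-||x - y||_1 / sigma)
    return torch.exp(-(x - y).abs().sum(-1) / sigma)

def poly_kernel(x, y, p=2):             # degree-p inhomogeneous polynomial: (x . y + 1)^p
    return ((x * y).sum(-1) + 1.0) ** p

def mgf_kernel(x, y):                   # kernel inducing the moment generating function: exp(x . y)
    return torch.exp((x * y).sum(-1))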

Source Code: https://github.com/YangLin122/JRD
Contact Email: chu_xu@pku.edu.cn

SLIDE 38

Reference I

[Ben-David et al., 2010] Ben-David, S., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., and Vaughan, J. W. (2010). A theory of learning from different domains. Machine Learning, 79(1-2):151–175.
[Chen and Deng, 2019a] Chen, B. and Deng, W. (2019a). Energy confused adversarial metric learning for zero-shot image retrieval and clustering. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 8134–8141.
[Chen and Deng, 2019b] Chen, B. and Deng, W. (2019b). Hybrid-attention based decoupled metric learning for zero-shot image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2750–2759.
[Chopra et al., 2005] Chopra, S., Hadsell, R., and LeCun, Y. (2005). Learning a similarity metric discriminatively, with application to face verification. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), volume 1, pages 539–546. IEEE.
[Duan et al., 2019] Duan, Y., Chen, L., Lu, J., and Zhou, J. (2019). Deep embedding learning with discriminative sampling policy. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4964–4973.
[Qian et al., 2019] Qian, Q., Shang, L., Sun, B., Hu, J., Li, H., and Jin, R. (2019). SoftTriple loss: Deep metric learning without triplet sampling. In Proceedings of the IEEE International Conference on Computer Vision, pages 6450–6458.
[Schroff et al., 2015] Schroff, F., Kalenichenko, D., and Philbin, J. (2015). FaceNet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 815–823.

SLIDE 39

Reference II

[Wang et al., 2018] Wang, H., Wang, Y., Zhou, Z., Ji, X., Gong, D., Zhou, J., Li, Z., and Liu, W. (2018). CosFace: Large margin cosine loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5265–5274.
[Wang et al., 2019] Wang, X., Han, X., Huang, W., Dong, D., and Scott, M. R. (2019). Multi-similarity loss with general pair weighting for deep metric learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5022–5030.
[Xu et al., 2019] Xu, X., Yang, Y., Deng, C., and Zheng, F. (2019). Deep asymmetric metric learning via rich relationship mining. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4076–4085.
[Zheng et al., 2019] Zheng, W., Chen, Z., Lu, J., and Zhou, J. (2019). Hardness-aware deep metric learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 72–81.