Label Embedding Based on Multi-Scale Locality Preservation
Cheng-Lun Peng, An Tao, Xin Geng
Reporter: Cheng-Lun Peng Date: July 17, 2018
Outline
1. Background
2. Proposed Method: MSLP
3. Experiment
4. Conclusion
1 Background: LE & LDL
- Label Embedding (LE): A Learning Strategy
  - Usual steps: encoding process (encoder) -> learning process (predictor) -> decoding process (decoder)
(Figure: example label vectors for one instance under each paradigm. Single-label learning: exactly one label equals 1. Multi-label learning: several labels equal 1, e.g. (1, 1, 0). Label distribution learning: description degrees, e.g. (0.05, 0.3, 0.65), non-negative and summing to 1.)
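The contrast between the three label forms can be sketched in a few lines, using the example values above:

```python
import numpy as np

# Single-label learning: exactly one relevant label (one-hot vector).
sll = np.array([0.0, 0.0, 1.0])

# Multi-label learning: several labels may be relevant (binary indicator).
mll = np.array([1.0, 1.0, 0.0])

# Label distribution learning: each label carries a description degree;
# the degrees are non-negative and sum to 1 (values from the example above).
ldl = np.array([0.05, 0.30, 0.65])

assert ldl.min() >= 0 and np.isclose(ldl.sum(), 1.0)
```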
- Label Distribution Learning (LDL): A Learning Paradigm
- Our Work: propose a specially designed LE method named MSLP for LDL, which is the first attempt at applying LE to LDL

- Why Apply LE in LDL
  - The labels in LDL may encounter problems (e.g., redundancy, noise, ...)
  - Effective exploitation of the label correlations is crucial for the success of LDL.
  - LE has advantages in addressing problematic labels and capturing latent correlations between labels.
1 Background: The Meaning of Our Work
- What Are the Challenges of Applying LE in LDL
  - No LE method has been proposed for LDL yet. Most existing LE methods are designed for SLL and MLL, i.e., they focus on binary labels (0/1).
  - Two main issues:
    a) How to exploit the information in label distributions efficiently.
    b) How to design a decoder that restricts the recovered label vector to satisfy the constraints of a label distribution.
1 Background: Symbol Definition
- Symbol Definition
  - D: the dataset
  - x_i: the i-th instance
  - y_i: the i-th label vector
  - z_i: the i-th embedded label vector
2 MSLP: Motivation
- Motivation
  - Locality-preserving embedding for the label space: inspired by Laplacian Eigenmaps [Belkin and Niyogi, 2002], MSLP aims to make data points with similar label distributions close to each other in the embedding space.
  - Find the k nearest neighbors of each data point x_i in the label space among the given point set.
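The neighbor search in the label space can be sketched as follows; the Euclidean distance between label distributions and the function name are illustrative choices for this sketch, not necessarily those of the paper:

```python
import numpy as np

def knn_label_space(Y, k):
    """Return, for each row of Y (a label distribution), the indices of
    its k nearest neighbors under Euclidean distance."""
    sq = np.sum(Y ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (Y @ Y.T)  # pairwise squared distances
    np.fill_diagonal(d2, np.inf)                      # a point is not its own neighbor
    return np.argsort(d2, axis=1)[:, :k]
```

For example, with three label distributions where the first two are nearly identical, each of the first two points picks the other as its nearest neighbor.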
- Explicit Assumption
  - Assume an explicit mapping from the features to the embedded labels.
  - Advantages:
    - Makes the process of label embedding feature-aware.
    - Omits the additional learning process from features to embedded labels after the embedding is completed.
  - An L2 regularization term is placed on the mapping.
2 MSLP: Explicit Assumption
- Problem of the Explicit Linear Assumption
  The solution for V tends to be dominated by the large feature distances of data pairs that are very close in the label space but far apart in the feature space.
- Multi-Scale Locality Preservation
2 MSLP: Restriction
Restriction: the k nearest neighbors of a data point in the label space should be found within its αk nearest neighbors in the feature space.
That is, by using different locality granularities in the label space and the feature space, the locality information of data points in both spaces is integrated.
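A minimal sketch of this restriction, assuming Euclidean distances in both spaces; the function names are made up here, and the paper's actual graph construction may differ in detail:

```python
import numpy as np

def pairwise_knn(M, k):
    """Indices of the k nearest rows for each row of M (Euclidean)."""
    sq = np.sum(M ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (M @ M.T)
    np.fill_diagonal(d2, np.inf)
    return np.argsort(d2, axis=1)[:, :k]

def multiscale_neighbors(X, Y, k, alpha):
    """Point j counts as a label-space neighbor of point i only if j is
    also among i's alpha*k nearest neighbors in feature space."""
    n = X.shape[0]
    label_order = pairwise_knn(Y, n - 1)         # full label-space ranking
    feat_cand = pairwise_knn(X, int(alpha * k))  # feature-space candidate sets
    neighbors = []
    for i in range(n):
        allowed = set(feat_cand[i])
        # keep the k closest label-space neighbors that are also candidates
        neighbors.append([j for j in label_order[i] if j in allowed][:k])
    return neighbors
```

A point may end up with fewer than k neighbors when too few of its label-space neighbors pass the feature-space filter; that is exactly how the restriction discounts pairs that are close in one space but far in the other.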
- Smoothness Assumption [Chapelle et al., 2006]
  Neighboring data points in the feature space are more likely to share similar labels.
- Hetero-neighbors
  Data pairs that are very close in the feature space but far apart in the label space.
2 MSLP: Robust to Noise
- The objective of MSLP
2 MSLP: Objective
2 MSLP: Solution
Applying the Lagrangian method, the problem can be transformed into a generalized eigen-decomposition problem. The optimal V consists of the d normalized eigenvectors corresponding to the d smallest eigenvalues.
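The solution step can be illustrated with a generic LPP-style generalized eigenproblem; the concrete L and D matrices that MSLP assembles from its multi-scale neighbor graph are not reproduced here, so this is only a sketch of the final eigen-decomposition:

```python
import numpy as np
from scipy.linalg import eigh

def solve_projection(X, L, D, d):
    """Solve the generalized eigenproblem (X^T L X) v = lam (X^T D X) v
    and return, as columns of V, the eigenvectors belonging to the d
    smallest eigenvalues. L stands for a graph Laplacian and D for its
    degree matrix."""
    A = X.T @ L @ X
    B = X.T @ D @ X + 1e-8 * np.eye(X.shape[1])  # small ridge keeps B positive definite
    eigvals, eigvecs = eigh(A, B)                # eigenvalues in ascending order
    return eigvecs[:, :d]
```

`scipy.linalg.eigh(A, B)` returns eigenvalues in ascending order, so taking the first d columns directly yields the eigenvectors of the d smallest eigenvalues.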
2 MSLP: Decoder
- Testing Phase
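One simple way to make a recovered label vector satisfy the constraints of a label distribution (non-negative degrees summing to 1) is to clip and renormalize; this is an illustrative decoder, not necessarily the one MSLP uses:

```python
import numpy as np

def decode_to_distribution(y_hat, eps=1e-12):
    """Map a recovered real-valued label vector onto the probability
    simplex by clipping negative entries and renormalizing."""
    y = np.clip(y_hat, 0.0, None)   # description degrees must be non-negative
    total = y.sum()
    if total < eps:                 # degenerate recovery: fall back to uniform
        return np.full(y_hat.shape, 1.0 / y_hat.size)
    return y / total                # degrees must sum to 1
```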
3 Experiment: Configuration
- Compared Methods
  - Eight popular LDL methods: IIS-LDL, CPNN, BFGS-LDL, LDSVR, AA-BP, AA-KNN, PT-SVM, PT-Bayes
  - Four typical feature embedding (FE) methods: CCA, NPE, PCA, LPP (the linear version of Laplacian Eigenmaps). The compared FE methods are allowed to be extended to their kernel versions with the RBF kernel, which gives them every chance to beat MSLP.
- Widely Used Metrics in LDL
  - Four distance metrics: Chebyshev, Clark, Kullback-Leibler, Canberra
  - Two similarity metrics: Cosine and Intersection
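The six metrics can be implemented directly from their standard definitions in the LDL literature:

```python
import numpy as np

def ldl_metrics(p, q, eps=1e-12):
    """Compute the six LDL evaluation metrics between a ground-truth
    distribution p and a predicted distribution q (eps guards against
    division by zero and log of zero)."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    return {
        "chebyshev": np.max(np.abs(p - q)),                    # distance, lower is better
        "clark": np.sqrt(np.sum((p - q) ** 2 / (p + q) ** 2)), # distance, lower is better
        "canberra": np.sum(np.abs(p - q) / (p + q)),           # distance, lower is better
        "kl": np.sum(p * np.log(p / q)),                       # distance, lower is better
        "cosine": float(p @ q / (np.linalg.norm(p) * np.linalg.norm(q))),  # similarity, higher is better
        "intersection": np.sum(np.minimum(p, q)),              # similarity, higher is better
    }
```

On identical distributions the distance metrics go to zero while cosine and intersection reach their maximum of 1, which is a quick sanity check for any implementation.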
- Other Settings
  - The embedding dimensionality ratio ranges over {10%, 20%, ..., 100%}.
  - Each method is run with its best-tuned parameters.
  - 10-fold cross-validation
  - Pairwise t-tests at the 90% significance level
3 Experiment: Datasets
Dataset       #Samples  #Labels  #Features  Domain
s-JAFFE       213       6        243        facial expression recognition
s-BU-3DFE     250       6        243        facial expression recognition
SCUT-FBP      150       5        300        facial beauty sense
M*B           124       5        250        facial beauty sense
Nature_Scene  200       9        294        natural scene annotation
3 Experiment: Visualization
- Different colors are used to display images according to the highest description degree among the basic emotions.
3 Experiment: Quantitative Results
3 Experiment
Across all metrics, MSLP ranks 1st in 93.3% of the cases.
3 Experiment: Quantitative Results
4 Conclusion
- Conclusion
  - The first attempt at applying LE to LDL.
  - MSLP is insensitive to the presence of hetero-neighbors and integrates the locality structure of points in both spaces at different granularities.
  - Experiments reveal the effectiveness of MSLP in gathering points with similar label distributions in the embedding space.
- Future Work
  - Explore whether there are better ways to utilize the structural information described by the label distributions.
  - Extend MSLP to other learning paradigms (e.g., multi-output regression) that have numerical labels.