SLIDE 1

Discrete Hashing

Fast, scalable retrieval and classification

Fumin Shen

Center for Future Media, University of Electronic Science and Technology of China

SLIDE 2

Outline

  • Introduction to Hashing
  • Discrete optimization for Hashing
  • Applications of Discrete Hashing
  • Classification by Hamming Retrieval
SLIDE 3

Background: Hashing

Extremely fast!

Hamming distance
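The Hamming distance between two codes is the number of bit positions in which they differ; with packed codes it reduces to an XOR followed by a popcount. A minimal sketch (illustrative, not from the slides):

```python
# Hamming distance between two binary codes packed into Python integers.
def hamming_distance(a: int, b: int) -> int:
    # XOR leaves a 1 exactly where the codes differ; counting set bits gives the distance.
    return bin(a ^ b).count("1")

print(hamming_distance(0b101, 0b111))  # -> 1 (the codes differ in one bit)
```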

SLIDE 4
  • Locality-Sensitive Hashing (LSH): [Gionis, Indyk, and Motwani 1999], [Datar et al. 2004], etc.

Illustration: a random hash function maps each data vector and the query to a short binary code (e.g., 101).
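A minimal sketch of the random-hyperplane LSH family illustrated above (hypothetical code; the function name is ours): each bit is the sign of a projection onto a random direction, so nearby vectors tend to collide on the same code.

```python
import numpy as np

def random_hyperplane_lsh(dim: int, n_bits: int, seed: int = 0):
    """Build a random LSH function mapping a vector in R^dim to an n_bits binary code."""
    rng = np.random.default_rng(seed)
    R = rng.standard_normal((n_bits, dim))        # one random hyperplane per bit
    return lambda x: (R @ x > 0).astype(np.uint8)

h = random_hyperplane_lsh(dim=128, n_bits=3)
print(h(np.ones(128)))                            # e.g. [1 0 1]: the 3-bit code of the input
```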

Background: Hashing

Recent trend: learning to hash

Learned from data

SLIDE 5

Background: Hashing

The main application: Approximate Nearest Neighbor Search (ANNS)

Illustration: a query image retrieves visually relevant results from an image database.

SLIDE 6

LSH

  • LSH has many variants, covering ℓp distances, cosine similarity, Gaussian kernels, and general kernels (KLSH), etc.
  • Good: sublinear search time (query cost O(n^ρ) with ρ < 1).
  • Bad: needs long hash codes and hundreds of hash tables (large memory footprint).

SLIDE 7

Learning based Hashing

SLIDE 8

Unsupervised Hashing

Learning binary codes that preserve data similarities

  • PCAH: generates the projection matrix W by principal component analysis (PCA)
  • SH: (Weiss et al., 2008) introduces unsupervised graph hashing
  • ITQ: (Gong and Lazebnik, 2011) learns an orthogonal rotation matrix to refine the initial PCA projection
  • AGH: (Liu et al., 2011) solves SH with anchor graphs
  • IMH: (Shen et al., 2013) generates binary codes from general data manifolds
  • DGH: (Liu et al., 2014) solves SH by discrete optimization
  • AIBC: (Shen et al., 2015) asymmetric hashing
SLIDE 9

Supervised Hashing

Learning binary codes supervised by pointwise or pairwise/ranking labels

  • SSH: (Wang et al., 2010) exploits both labeled and unlabeled data for hashing
  • MLH: (Norouzi and Fleet, 2011) based on structural SVMs
  • KSH: (Liu et al., 2012) kernel-based supervised hashing
  • FastH: (Lin et al., 2014) solves hashing by graph cuts
  • SDH: (Shen et al., 2015) generates binary codes by discrete optimization
  • COSDISH: (Kang et al., 2016) column-sampling-based discrete supervised hashing
  • DSeRH: (Liu et al., 2017) deep ranking hashing
SLIDE 10

Deep learning based Hashing

  • Lots of supervised methods:
      • DAPH (Shen et al., MM’17)
      • DSeRH (Liu et al., CVPR’17)
      • DPSH (Li et al., IJCAI’16)
      • VDSH (Zhang et al., CVPR’16)
      • DSH (Liu et al., CVPR’16)
      • CNNH (Xia et al., AAAI’15)
  • Very few unsupervised ones:
      • DH (Liong et al., CVPR’15)
      • Deepbit (Lin et al., CVPR’16)
      • UH-BDNN (Do et al., ECCV’16)
SLIDE 11

Deep vs. Shallow

Deep learning boosts supervised hashing; there is still a long way to go for unsupervised deep hashing.

Method:   ITQ     IMH     CNN+ITQ   DH      UH-BDNN
MAP:      17.76   18.38   0.255     16.62   18.35

SLIDE 12

Manifold learning vs. Hashing

Optimal hash codes

Spectral Hashing

  • Very similar formulation
  • Key difference: discrete constraint
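For reference, a standard way to write the Spectral Hashing problem (Weiss et al., 2008), with B the code matrix, L the graph Laplacian of the data similarity graph, n points and r bits:

```latex
\min_{B}\; \operatorname{tr}\!\left(B^{\top} L B\right)
\quad \text{s.t.}\quad
B \in \{-1,1\}^{n \times r},\;\;
B^{\top}\mathbf{1} = 0,\;\;
B^{\top} B = n I .
```

Dropping the constraint B ∈ {−1,1}^{n×r} gives the spectral relaxation solved by the eigenvectors of L, which is exactly the manifold-learning (Laplacian eigenmap) problem; keeping it is what makes hashing hard.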
SLIDE 13

The hashing problem

  • A mixed-integer program; NP-hard in general
  • Difficult to optimize due to the discrete variables
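In its generic form, learning-to-hash is an optimization over the code matrix with a binary constraint (notation assumed here, with an arbitrary loss):

```latex
\min_{B}\; \mathcal{L}(B)
\quad \text{s.t.}\quad B \in \{-1, 1\}^{n \times r}
```

and the 2^{nr} discrete feasible points are what make the problem hard.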
SLIDE 14

Solution in literature

  • Step 1: Relaxation – discard the discrete constraints
      • Mimic the sign function by a continuous sigmoid
      • Hard to achieve a good (local) optimum
  • Step 2: Rounding – thresholding after learning
      • Quantization techniques: ITQ (Gong and Lazebnik, 2011)
      • Quantization distortion increases with longer hash codes
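A minimal sketch of this relax-then-round recipe (illustrative only; here step 1 is a PCA projection and step 2 is sign thresholding):

```python
import numpy as np

def relax_and_round(X: np.ndarray, n_bits: int) -> np.ndarray:
    """Step 1: solve a relaxed (continuous) problem, here a PCA projection.
    Step 2: round to binary codes by thresholding with the sign."""
    Xc = X - X.mean(axis=0)                        # center the data
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    Y = Xc @ Vt[:n_bits].T                         # continuous relaxed embedding
    return np.where(Y >= 0, 1, -1)                 # rounding step: distortion grows with code length

codes = relax_and_round(np.random.randn(1000, 64), n_bits=32)
```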
SLIDE 15

Our solution

SLIDE 16

(I) Supervised Discrete Hashing

  • F. Shen, C. Shen, W. Liu, H. T. Shen, “Supervised Discrete Hashing”, CVPR’15.

Formulation: joint learning of the binary codes B, the feature representation F(x), and the linear classifier W, where Y is the ground-truth label matrix.
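A sketch of this joint objective, reconstructed from the formulation in the cited CVPR’15 paper (not copied verbatim from the slide):

```latex
\min_{B, W, F}\; \|Y - B W\|_F^{2} + \lambda \|W\|_F^{2} + \nu \|B - F(X)\|_F^{2}
\quad \text{s.t.}\quad B \in \{-1, 1\}^{n \times r}
```

where each row of B is the code of one training sample and F(X) is the (nonlinear) embedding being fit to the codes.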

Algorithm: Alternating minimization until convergence

  • solve the W-subproblem (multi-class classification);
  • solve the F-subproblem (feature learning);
  • solve the B-subproblem (hash learning) – the key problem
SLIDE 17

(I) Supervised Discrete Hashing

The key binary code optimization problem: the B-subproblem

Algorithm: Discrete Cyclic Coordinate descent (DCC) – learn the code bit by bit

Optimal, closed-form solution in each iteration!
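A sketch of the per-bit update behind DCC, with notation reconstructed from the cited paper rather than the slide: write Q = Y W^T + ν F(X), let z be the column of B for the bit being updated, q the corresponding column of Q, w the corresponding row of W (as a column vector), and B', W' the remaining columns of B and rows of W. Then

```latex
z = \operatorname{sgn}\!\left( q - B' W' w \right)
```

so each bit vector has a closed-form optimum given all the other bits, and the bits are cycled until they stop changing.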

SLIDE 18

(I) Supervised Discrete Hashing

Discrete optimization vs. relaxed optimization on the CIFAR-10 dataset

Results

Discrete optimization is important for hashing!

SLIDE 19

(I) Supervised Discrete Hashing

  • SDH supports other losses such as the hinge loss. Then the B-subproblem still has a closed-form update, while the W-subproblem is a multi-class SVM.
  • SDH scales linearly with the number of labeled examples, so it can incorporate massive labeled data into training.

SLIDE 20

Binary optimization

How to solve the general binary code learning problem?

  • Design a new algorithm for every different loss?
  • The loss can be too complex to design a feasible discrete optimization algorithm.
SLIDE 21

(II) Discrete Proximal Linearized Minimization

Motivation: minimize an equivalent smooth + non-smooth loss
Algorithm: Discrete Proximal Linearized Minimization
Each iteration has a closed-form, optimal solution!

  • F. Shen, X. Zhou, Y. Yang, J. Song, H. T. Shen and D. Tao, “A Fast Optimization Method for General Binary Code Learning”, IEEE Transactions on Image Processing (TIP), 2016.
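A minimal sketch of a proximal linearized iteration over binary codes in the spirit of this method (illustrative; the exact step rule and stopping test are assumed, not taken from the paper): linearize the smooth part of the loss at the current codes, take a gradient step, and apply the proximal map of the binary constraint, which is an elementwise sign.

```python
import numpy as np

def dplm_sketch(grad_f, B0: np.ndarray, step: float = 0.5, n_iter: int = 50) -> np.ndarray:
    """Approximately minimize a smooth loss f over codes B in {-1,+1}^(n x r)
    via proximal linearized iterations: B <- sign(B - step * grad_f(B))."""
    B = np.where(B0 >= 0, 1.0, -1.0)              # start from a feasible binary point
    for _ in range(n_iter):
        G = B - step * grad_f(B)                  # gradient step on the linearized smooth loss
        B_new = np.where(G >= 0, 1.0, -1.0)       # proximal map onto {-1,+1}: elementwise sign (ties -> +1)
        if np.array_equal(B_new, B):              # stop once the codes no longer change
            break
        B = B_new
    return B

# Toy usage: fit codes to a real-valued target Z under the loss ||B - Z||_F^2.
Z = np.random.randn(5, 8)
B = dplm_sketch(lambda B: 2.0 * (B - Z), np.zeros_like(Z))
```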

SLIDE 22
  • Theoretical: guaranteed to converge!
  • Practical:
      • Very fast, even faster than DCC in SDH
      • Successfully applied to supervised and unsupervised hashing

(II) Discrete Proximal Linearized Minimization

SLIDE 23

SLIDE 24

(III) Asymmetric Inner-product Binary Coding

Hashing for Maximum Inner Product Search (MIPS):

Retrieve the datum having the largest inner product with query q from database A

Algorithm: Inner product fitting by asymmetric hash functions

  • F. Shen, W. Liu, S. Zhang, Y. Yang, and H. T. Shen, “Learning Binary Codes for Maximum Inner Product Search”, ICCV 2015

The hard problem is decomposed into two sub-problems, each solved by DCC as in SDH; the fitting target is the matrix of inner products between the database items and the queries.
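A sketch of the inner-product-fitting idea with asymmetric hash functions (the notation below is assumed, not taken from the slide): learn two different binary mappings, g for queries and h for database items, so that their binary inner product approximates the real-valued one,

```latex
\min_{g,\,h}\; \sum_{i,j} \left( q_j^{\top} a_i \;-\; g(q_j)^{\top} h(a_i) \right)^{2}
\quad \text{s.t.}\quad g(\cdot),\, h(\cdot) \in \{-1, 1\}^{r}
```

after which MIPS over the database reduces to maximizing a binary inner product that can be evaluated with bitwise operations.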

SLIDE 25

Results: unsupervised hashing

Asymmetric Inner-product Binary Coding (AIBC)

SLIDE 26

(IV) Discrete Collaborative Filtering

Collaborative Filtering
Our proposal: Discrete Collaborative Filtering

  • H. Zhang, F. Shen, L. Liu, W. Liu, X. He, H. Luan and T.‐S. Chua, “Discrete Collaborative Filtering”, SIGIR 2016.

Best Paper Award Honorable Mention

SLIDE 27

(IV) Discrete Collaborative Filtering

SLIDE 28

(V) Classification by Hamming Retrieval

Motivation:
  • Very little learn-to-hash work targets classification!
  • Existing classification methods treat hash codes as real-valued features
  • Goal: boost even linear classification by hashing

SLIDE 29

Idea: Classify binary data with binary weights

Replace floating-point multiplications with XNOR operations

(V) Classification by Hamming Retrieval

  • F. Shen, Y. Mu, Y. Yang, W. Liu, L. Liu, J. Song, H. T. Shen, “Classification by Retrieval: Binarizing Data and Classifier”, SIGIR 2017. Best Paper Award Honorable Mention

SLIDE 30

Framework

Classifying an image reduces to retrieving its nearest class codes in the Hamming space.
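A minimal sketch of this retrieval-style prediction step (hypothetical code, not the paper's implementation): binarize the sample and return the class whose binary weight vector, i.e. its class code, is closest in Hamming distance; with packed codes this is an XNOR plus popcount, so no floating-point multiplications are needed at test time.

```python
import numpy as np

def predict_by_hamming(b: np.ndarray, class_codes: np.ndarray) -> int:
    """b: binary code of a sample in {-1,+1}^r.
    class_codes: (num_classes, r) binary class weight vectors in {-1,+1}.
    Returns the class whose code is nearest to b in Hamming distance."""
    hamming = np.sum(class_codes != b, axis=1)   # number of differing bits per class
    return int(np.argmin(hamming))

# Toy example with 3 classes and 4-bit codes.
class_codes = np.array([[ 1,  1, -1, -1],
                        [-1,  1,  1, -1],
                        [ 1, -1,  1,  1]])
print(predict_by_hamming(np.array([1, 1, -1, -1]), class_codes))  # -> 0 (exact match with class 0)
```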

(V) Classification by Hamming Retrieval

SLIDE 31

Formulation: joint learning of binary codes and binary weights

  • The loss can be any proper empirical loss. We particularly study the exponential loss and the linear loss.

Inter-class margin

(V) Classification by Hamming Retrieval

SLIDE 32
  • W-subproblem: a Binary Quadratic Program (BQP), solved bit by bit with a sequential bit-flipping algorithm (locally optimal)
  • B-subproblem: solved bit by bit; solution given for the exponential loss
  • P-subproblem
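A minimal sketch of a sequential bit-flipping local search for a small BQP of the form min_w w^T A w + b^T w over w in {-1,+1}^n (a generic heuristic, assumed here rather than taken from the paper):

```python
import numpy as np

def bit_flip_bqp(A: np.ndarray, b: np.ndarray, w0: np.ndarray, max_passes: int = 100) -> np.ndarray:
    """Locally minimize f(w) = w^T A w + b^T w over w in {-1,+1}^n by
    sequentially flipping single bits whenever a flip decreases the objective."""
    w = w0.astype(float).copy()
    f = lambda v: float(v @ A @ v + b @ v)
    for _ in range(max_passes):
        improved = False
        for i in range(len(w)):
            before = f(w)
            w[i] = -w[i]            # tentatively flip bit i
            if f(w) < before:
                improved = True     # keep the flip: the objective decreased
            else:
                w[i] = -w[i]        # revert: the flip did not help
        if not improved:
            break                    # no single flip improves: locally optimal
    return w

# Toy usage on a random 8-bit BQP.
rng = np.random.default_rng(0)
A = rng.standard_normal((8, 8)); A = (A + A.T) / 2
w = bit_flip_bqp(A, rng.standard_normal(8), np.ones(8))
```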

(V) Classification by Hamming Retrieval

SLIDE 33

Results:

LibLinear vs. our method on SUN 397

(V) Classification by Hamming Retrieval

SLIDE 34

Results:

Comparison in accuracy (%), training and testing time (seconds).

(V) Classification by Hamming Retrieval

SLIDE 35

Results:

Accuracy (%) with increasing binary code length

(V) Classification by Hamming Retrieval

SLIDE 36
Conclusions:
  • Convert linear classification to Hamming retrieval
  • Binarize both data and classifier in a joint problem
  • Support many empirical loss functions
  • Significant reduction in storage, training, and testing computation

(V) Classification by Hamming Retrieval

SLIDE 37

(VI) Deep Sketch Hashing

Sketch-based image retrieval

Existing methods:

  • Hand-crafted feature engineering (e.g., SIFT, HOG, HELO[1], LKS[2])
  • Deep learning based feature extraction

Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao, “Deep Sketch Hashing: Fast Free-hand Sketch-Based Image Retrieval”, CVPR 2017

SLIDE 38

Framework of DSH

We integrate a convolutional neural network and discrete binary code learning into a unified framework.

(VI) Deep Sketch Hashing

Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao, “Deep Sketch Hashing: Fast Free-hand Sketch-Based Image Retrieval”, CVPR 2017

SLIDE 39

Objective Formulation of DSH

The objective is non-convex and non-smooth.

(VI) Deep Sketch Hashing

SLIDE 40

Alternating Optimization

(VI) Deep Sketch Hashing

SLIDE 41

Comparison with previous SBIR methods

(VI) Deep Sketch Hashing

SLIDE 42

Comparison with cross-modality methods

Experimental results of DSH

(VI) Deep Sketch Hashing

SLIDE 43

Successful Cases of DSH:

(VI) Deep Sketch Hashing

SLIDE 44

(VI) Hashing for Partial Action Recognition

Motivation:

  • Most action recognition approaches analyze after-the-fact actions. However, capturing complete actions is often difficult due to occlusions, interruptions, etc.
  • Partial action recognition (PAR) has a wide range of applications in intelligent surveillance, smart homes, retrieval systems, etc.

Illustration: traditional action recognition vs. action prediction vs. partial action recognition (ours)

SLIDE 45

The flowchart of Partial Reconstructive Binary Coding (PRBC): preserving similarity, feature reconstruction, learning the coding matrix

Objective: optimized by discrete alternating optimization

(VI) Hashing for Partial Action Recognition

SLIDE 46
  • Quantitative results on three tasks:

1) Action prediction 2) Partial action retrieval 3) Partial action recognition

  • J. Qin, L. Liu, L. Shao, B. Ni, C. Chen, F. Shen and Y. Wang, “Binary Coding for Partial Action Analysis with Limited Observation Ratios”, CVPR 2017.

(VI) Hashing for Partial Action Recognition

SLIDE 47

Our work on discrete hashing

  • Deep Asymmetric Pairwise Hashing (ACM MM’17)
  • DSeRH for deep ranking hashing (CVPR’17)
  • Asymmetric Binary Coding (TMM 2016)
  • Discrete Cross-modal Hashing (TIP 2016)
  • ZSECOC for Action Recognition (CVPR’17)
  • Compressed K-means (AAAI’17)
  • Discrete Spectral Clustering (IJCAI’16)
  • Attribute Hashing (ICME’17) Best Paper Award – Platinum Award
  • Zero-shot Hashing (MM’16)
  • AIBC for Medical Image Retrieval (ISBI’16)
SLIDE 48

Thank you! Any questions?