SLIDE 1

Stable and Efficient Representation Learning with Nonnegativity Constraints

Tsung-Han Lin and H.T. Kung

SLIDE 2

Unsupervised Representation Learning

[Diagram: input → Layer 1 Representation → Layer 2 Representation → Layer 3 Representation]

Each layer encodes its input against a large dictionary using a sparse encoder (e.g., l1-regularized sparse coding); a minimal sketch of the stacking follows below.
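A minimal sketch of this stacked pipeline, assuming a generic `sparse_encode(signal, dictionary)` function (hypothetical here; later slides instantiate it with OMP/NOMP) and one dictionary per layer:

```python
import numpy as np

def encode_layers(x, dictionaries, sparse_encode):
    """Run an input through a stack of (dictionary, sparse encoder) layers.

    Each layer's sparse code becomes the next layer's input, yielding the
    Layer 1, Layer 2, ... representations of the diagram.
    """
    representations = []
    rep = np.asarray(x, dtype=float)
    for D in dictionaries:
        rep = sparse_encode(rep, D)  # e.g., l1 sparse coding, OMP, or NOMP
        representations.append(rep)
    return representations
```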

SLIDE 3

Why Sparse Representations?

  • Prior knowledge is better encoded into sparse representations
    – Data is explained by only a few underlying factors
    – Representations are more linearly separable

[Illustration: two classes plotted against Feature A and Feature B]

  • Simplifies supervised classifier training: sparse representations work well even when labeled samples are few

SLIDE 4

Computing Sparse Representations

Sparse approximation: represent an input x as a weighted sum of a few dictionary atoms, x ≈ Dz with z sparse.

[Illustration: an input patch decomposed as 0.5 × (atom 1) + 0.3 × (atom 2)]
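A tiny numeric version of the decomposition the illustration shows (the atoms and coefficients here are made up for illustration):

```python
import numpy as np

# Hypothetical dictionary: three unit-norm atoms as columns.
D = np.array([[1.0, 0.0, 0.6],
              [0.0, 1.0, 0.8]])

# A sparse code: only two of the three coefficients are nonzero.
z = np.array([0.5, 0.3, 0.0])

x = D @ z   # the input is rebuilt from just two atoms
print(x)    # [0.5 0.3]
```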

SLIDE 5

Computing Sparse Representations

Sparse approximation:

  • L1 relaxation approach: good classification accuracy, but computation is expensive
  • Greedy approach (e.g., orthogonal matching pursuit): fast, but yields suboptimal classification accuracy

CIFAR-10 classification with a single-layer architecture [Coates 2011]:

| Encoder        | Classification accuracy (%) |
|----------------|-----------------------------|
| L1-regularized | 78.7                        |
| OMP            | 76.0                        |
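A hedged sketch of the two encoder families using scikit-learn; the estimator settings (alpha, n_nonzero_coefs) are illustrative choices, not values from the paper:

```python
import numpy as np
from sklearn.linear_model import Lasso, OrthogonalMatchingPursuit

rng = np.random.default_rng(0)
D = rng.standard_normal((64, 256))   # dictionary: 64-dim signals, 256 atoms
D /= np.linalg.norm(D, axis=0)       # unit-norm atoms
x = rng.standard_normal(64)

# L1 relaxation: solve an l1-regularized least-squares problem (iterative, slow).
z_l1 = Lasso(alpha=0.1, fit_intercept=False, max_iter=10000).fit(D, x).coef_

# Greedy: pick a fixed number of atoms one at a time (fast).
z_omp = OrthogonalMatchingPursuit(n_nonzero_coefs=10,
                                  fit_intercept=False).fit(D, x).coef_

print(np.count_nonzero(z_l1), np.count_nonzero(z_omp))
```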

SLIDE 6

Major Findings

  • Weak stability is the key to OMP’s suboptimal performance
  • By allowing only additive features (via nonnegativity constraints), classification with OMP delivers higher accuracy by large margins
  • Competitive classification accuracy with deep neural networks

SLIDE 7

Stability of Representations

[Diagram: an input x and a noise-perturbed input x + n both pass through the encoder; do the two representations stay close?]

SLIDE 8

Orthogonal Matching Pursuit (OMP)

Goal: select k atoms from a dictionary D that minimize |x − Dz|.

Iteration 1: select the atom that has the largest correlation with the residual and add it to the support set.

[Diagram: input x among atoms d1, d2, d3; d1 enters the support set]

SLIDE 9

Orthogonal Matching Pursuit (OMP)

Goal: select k atoms from a dictionary D that minimize |x − Dz|. Each iteration:

1. Select the atom that has the largest correlation with the residual.
2. Estimate the coefficients of the selected atoms by least squares, giving the current estimate Dz(1).
3. Update the residual r(1) = x − Dz(1).

[Diagram: support set {d1}; residual r(1) among atoms d1, d2, d3]

SLIDE 10

Orthogonal Matching Pursuit (OMP)

Goal: select k atoms from a dictionary D that minimize |x − Dz|.

Iteration 2 repeats the same three steps and the support set grows to {d1, d3}; a from-scratch sketch of the full loop follows below.

[Diagram: support set {d1, d3}; the residual shrinks toward x]
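A minimal from-scratch sketch of the loop these three slides animate, assuming unit-norm dictionary columns (not the authors' code):

```python
import numpy as np

def omp(x, D, k):
    """Greedy OMP: select k atoms of D (unit-norm columns) to approximate x."""
    x = np.asarray(x, dtype=float)
    residual = x.copy()
    support = []
    coeffs = np.zeros(0)
    for _ in range(k):
        # 1. Atom with the largest (absolute) correlation with the residual.
        correlations = D.T @ residual
        support.append(int(np.argmax(np.abs(correlations))))
        # 2. Least-squares coefficients over the selected atoms.
        coeffs, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        # 3. Residual update from the current estimate Dz.
        residual = x - D[:, support] @ coeffs
    z = np.zeros(D.shape[1])
    z[support] = coeffs
    return z
```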

SLIDE 11

OMP vs. Nonnegative OMP

Nonnegative OMP: use only additive features by constraining the atoms and coefficients to be nonnegative.

  • 1. Larger region for noise tolerance
  • 2. Terminates without overfitting

[Diagram: residual vectors near atoms d1 and d2 with noise n; OMP’s signed selection regions (“+d1”, “−d2”) sit close together, while NOMP’s nonnegative selection region leaves a margin δ for the same noise]
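A matching sketch of nonnegative OMP: the selection rule keeps only positively correlated atoms, and the coefficient fit swaps least squares for scipy's nonnegative least squares; a sketch under those assumptions, not the paper's reference implementation:

```python
import numpy as np
from scipy.optimize import nnls

def nomp(x, D, k):
    """Nonnegative OMP: like OMP, but atoms only add (coefficients >= 0)."""
    x = np.asarray(x, dtype=float)
    residual = x.copy()
    support = []
    coeffs = np.zeros(0)
    for _ in range(k):
        correlations = D.T @ residual
        best = int(np.argmax(correlations))   # only positive correlations help
        if correlations[best] < 1e-12:
            break                             # no additive atom left: stop early
        support.append(best)
        # Nonnegative least squares over the selected atoms.
        coeffs, _ = nnls(D[:, support], x)
        residual = x - D[:, support] @ coeffs
    z = np.zeros(D.shape[1])
    if support:
        z[support] = coeffs
    return z
```

The early break mirrors the “terminate without overfitting” point above: once no atom can contribute constructively, the encoder stops instead of fitting noise.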

SLIDE 12

Allowing Only Additive Features

[Illustration: an input written as a signed combination of two atoms whose contributions partially cancel]

With signed coefficients, selected atoms can cancel each other out; the sketch below shows such a cancellation.
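A tiny numeric illustration of cancellation, reusing the `omp`/`nomp` sketches above; the two-atom dictionary and the input are made up:

```python
import numpy as np

D = np.column_stack([[0.6, 0.8], [0.8, 0.6]])  # two correlated unit-norm atoms
x = np.array([-0.2, 0.25])                     # a small, awkwardly placed input

print(omp(x, D, 2))   # ~[ 1.14, -1.11]: large opposite-signed terms that cancel
print(nomp(x, D, 2))  # ~[ 0.08,  0.  ]: the additive-only code stops early
```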

SLIDE 13

Allowing Only Additive Features

Enforce nonnegativity at three points to eliminate cancellation (a sketch follows below):

  • On input: sign splitting (each signed dimension becomes two nonnegative ones, a “+” channel value and a “−” channel value)
  • On dictionary: any nonnegative sparse coding algorithm works; we use spherical K-means
  • On representation: encode with nonnegative OMP (NOMP)

[Diagram: a signed 3-dimensional input split into nonnegative “+” and “−” channels]
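A sketch of the input and dictionary steps; `sign_split` is the transform just described, while `spherical_kmeans` is a minimal version (fixed iterations, random init), not the authors' training pipeline:

```python
import numpy as np

def sign_split(x):
    """Split a signed vector into nonnegative '+' and '-' channels."""
    x = np.asarray(x, dtype=float)
    return np.concatenate([np.maximum(x, 0.0), np.maximum(-x, 0.0)])

def spherical_kmeans(X, n_atoms, n_iter=20, seed=0):
    """Learn unit-norm atoms from nonnegative data X of shape (n_samples, dim)."""
    rng = np.random.default_rng(seed)
    D = X[rng.choice(len(X), n_atoms, replace=False)].astype(float)
    D /= np.linalg.norm(D, axis=1, keepdims=True) + 1e-12
    for _ in range(n_iter):
        assign = np.argmax(X @ D.T, axis=1)       # nearest atom by cosine
        for j in range(n_atoms):
            members = X[assign == j]
            if len(members):
                centroid = members.sum(axis=0)
                D[j] = centroid / (np.linalg.norm(centroid) + 1e-12)
    return D.T  # columns are atoms, ready for nomp(x, D, k)
```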

SLIDE 14

Evaluate the Stability of Representations

Procedure:

1. Take grating A and rotate it by some small angle δ to obtain grating B.
2. Encode both with OMP/NOMP against a feature dictionary learned from image datasets.
3. Measure the change as the correlation between representation A and representation B.

Correlation between representations A and B (higher is more stable):

| Encoder | δ = 0 | 0.01π | 0.02π | 0.03π | 0.04π |
|---------|-------|-------|-------|-------|-------|
| OMP     | 1     | 0.71  | 0.54  | 0.43  | 0.34  |
| NOMP    | 1     | 0.92  | 0.80  | 0.68  | 0.57  |
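A sketch of this measurement reusing the `omp`/`nomp` sketches above; the grating generator, the scipy rotation, and the dictionary placeholder `D` are assumptions of this sketch:

```python
import numpy as np
from scipy.ndimage import rotate

def grating(size=32, freq=4):
    """A simple sinusoidal grating image with values in [0, 1]."""
    row = 0.5 + 0.5 * np.sin(np.linspace(0, 2 * np.pi * freq, size))
    return np.tile(row, (size, 1))

def stability(encode, D, delta_deg, k=10):
    a = grating()
    b = rotate(a, delta_deg, reshape=False, mode='nearest')  # grating B
    z_a = encode(a.ravel(), D, k)
    z_b = encode(b.ravel(), D, k)
    return np.corrcoef(z_a, z_b)[0, 1]  # change measured by correlation

# With a learned dictionary D (columns = atoms over flattened 32x32 images):
# print(stability(omp, D, 1.8), stability(nomp, D, 1.8))   # 0.01π rad = 1.8°
```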

SLIDE 15

Classification: NOMP vs OMP

[Plot: classification accuracy on CIFAR-10; NOMP shows a ~3% improvement over OMP]

SLIDE 16

NOMP Outperforms When Fewer Labeled Samples Are Available

[Plot: classification accuracy on CIFAR-10 with fewer labeled training samples]

SLIDE 17

STL-10: 10 classes, 100 labeled samples/class, 96x96 images (airplane, bird, car, cat, deer, dog, horse, monkey, ship, truck)

| Method                               | Accuracy |
|--------------------------------------|----------|
| Hierarchical matching pursuit (2012) | 64.5%    |
| This work                            | 67.9%    |

CIFAR-100: 100 classes, 500 labeled samples/class, 32x32 images (superclasses: aquatic mammals, fish, flowers, food containers, fruit and vegetables, household electrical devices, household furniture, insects, large carnivores, large man-made outdoor things, large natural outdoor scenes, large omnivores and herbivores, medium-sized mammals, non-insect invertebrates, people, reptiles, small mammals, trees, vehicles)

| Method                | Accuracy |
|-----------------------|----------|
| Maxout network (2013) | 61.4%    |
| This work             | 60.1%    |

SLIDE 18

Conclusion

  • A greedy sparse encoder is useful: it gives a scalable unsupervised representation learning pipeline that attains state-of-the-art classification performance
  • The proper choice of encoder is critical: the stability of the encoder is key to the quality of the representations