Visual Feature Learning and Representation Qingshan Liu Nanjing - PowerPoint PPT Presentation

Visual Feature Learning and Representation Qingshan Liu Nanjing University of Information Science & Technology 11. 5. 2016

What can we read from this story?

What Can We Read From Face Images?

Visual Recognition = Feature + Classifier

Global feature vs. Local feature ? From Shiguang Shan

Challenges  There have 100 million survellience cameras distributed in the word, which will produce 2.3 ZB (10 21 ) video data Volume Variety  Youtube will increase over 72 hours Volume Variety video data in each minute Velocity Value  Face book has over 300 billion images Velocity Value  …… 6

Visual Feature Representation SIFT DBN Nested Network (Scale-invariant feature transform) Gabor Filter Bank 1990s 2000s 2004 2005 2010~ 2006 HOG Stacked Encoder Recurrent Network color histogram Low level feature Hand-crafted feature Deep feature Complicated Data driven

High dimension issue 2000 2005~2013 2013~ ten million pixels Hundreds of thousands pixels Million pixels High dimension Sensor driven

David L. Donoho, High-dimensional data analysis: The curses and blessings of dimensionality. Aide-Memoire of a Lecture at (2000) How to learn the low dimensional feature representation

Subspace Learning  Learn a low dimensional subspace projection to handle the high-dimensional data  T y A x y  d    x  D d D . D d , A R , R ,

Subspace Learning  Linear subspace : A is a linear transformation for example: PCA, LDA, …  Kernel based nonlinear subspace : combining the nonlinear kernel trick with linear subspace for example: KPCA, KLDA, …  Manifold subspace for example: LLE, ISOMap, …

Sparse feature representation Simple 50 100 150 200 250 300 350 400 450 500 Sample Reliable  0.8 * + 0.3 * + 0.5 *     2 min X A Z 2 1 A X ,

Sparse Representation  Learning Discriminative Dictionary for Group Sparse Representation ( IEEE T-IP 2014 )  Newton Greedy Pursuit: a Quadratic Approximation Method for Sparsity-Constrained Optimization, (CVPR 2014).  Decentralized Robust Subspace Clustering (AAAI 2016)  Efficient k-Support-Norm Regularized Minimization via Fully Corrective Frank-Wolfe Method (IJCAI 2016)  Efficient λ 2 Kernel Linearization via Random Feature Maps (IEEE T- NNLS 2016)  Blessing of Dimensionality: Recovering Mixture Data via Dictionary Pursuit, (IEEE T-PAMI 2016)

Learning Discriminative Dictionary for Group Sparse Representation ( IEEE T-IP 2014 )

Dual sparse constrained cascade regression model ( IEEE T-IP 2015 ) CSR: D. Piotr, W. Peter, and P. Pietro. Cascaded pose regression. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), 2010.

Dual sparse constrained cascade regression model ( IEEE T-IP 2015 )

Face Alignment

Results COFW LFW BioID LFPW Common Challenge FULL MVFW OCFW

Results COFW AFLW Helen

M 3 CSR model ( IVC 2016 )  Multi-view, multi-scale and multi-component

http://ibug.doc.ic.ac.uk/resources/300-W_IMAVIS /

Spatio-temporal CSR ( ICCVW 2015 )  CSR + Pose tracking  Adaptive compressive sensing tracker ( CVIU / IEEE T-CYB 2016)

Video demo

Live demo

Why is hypergraph? How to build the complicated relationship of multiple features?

Why is hypergraph?  It is not complete to represent the relations among vertices only by pairwise simple graphs.  It may be helpful to take account of the relationship not only between two vertices, but also among three or more vertices containing local grouping information.

Why is Hypergraph?

Hypergraph-based feature representation  Unsupervised hypergraph learning  Video objects clustering (CVPR 2009)  Image categorization (TPAMI 2011)  Semi-supervised hypergraph learning  Content-based image retrieval (CVPR 2010, PR 2011)  Sparse hypergraph learning • Elastic hypergraph (TIP 2016) • Application in hyperspectral image classification (TGRS submitted)

Video Object Segmentation (ICCV 2009)

Results-Squirrel Simple Graph + Optical Flow Simple Graph + Motion Profile Ground Truth Hypergraph Cut Simple Graph + Both Motion Cues

Results-Walking with Rotation Simple Graph + Optical Flow Simple Graph + Motion Profile Ground Truth Hypergraph Cut Simple Graph + Both Motion Cues

Videos

Elastic Net Hypergraph Learning (IEEE T-IP 2016) Robust Elastic Net Representation KNN-Graph Hypergraph Learning Elastic Net Hypergraph

Elastic Net Hypergraph Learning (IEEE T-IP 2016) Robust Elastic net Model

Breakthrough 2011 年 Deep Learning Speech recogniton 2006 2012 年 Image classification

No. 1 in 10 breakthrough tech 2013 selected by MIT tech review 2015 年 5 月 Nature 杂志以综述的形式对深度学习进行了总结和评价，指出深度学习最大的优点是能自动学习和抽象数据特征

Object detection from Video Object detection on each frame Tracking from the high score frame (temporal smooth) Class-wise box regression and NMS on each frame

Cascade Region Regression Multi-scale Conv Feature Multi-layer Conv Feature (object + around (region size specific) context) Cascade region regression 根据 region 的大小选择不同层的 region regressor 对 bounding box 进行调整，较大的 region 使用后面的 feature map ，较小的 feature map 使用前面的 feature map 。

Model Ensemble Res-net Model ensemble Google-net is always effective.

Demo Video

Does Cartoonist use deep features?

Qingshan Liu Email: qsliu@nuist.edu.cn Cell: 13585199482

Visual Feature Learning and Representation Qingshan Liu Nanjing - PowerPoint PPT Presentation

Visual Feature Learning and Representation Qingshan Liu Nanjing University of Information Science & Technology 11. 5. 2016 What can we read from this story? What Can We Read From Face Images? Visual Recognition = Feature + Classifier

Decision Tree Prof. Seungchul Lee Industrial AI Lab. Feature Test Feature 1 Feature 2 Feature

A Distinctive Feature of A Distinctive Feature of A Distinctive Feature of A Distinctive Feature

Outline Reducing Dimensionality Feature Selection 1 Steven J Zeil Feature Extraction 2

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Feature Representation Vision BoWs and Beyond Praveen Krishnan Feature Representation in

Neural representation of linguistic feature Neural representation of linguistic feature hierarchy

Intro to Feature Representation in Virtual Screening Shengchao Liu, Gitter Group Feature

Earth: The Feature Presentation - feature, landscape, topography Earth: The Feature Presentation

Reducing Dimensionality Steven J Zeil Old Dominion Univ. Fall 2010 1 Feature Selection

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Recap by Milo Davies, SAS NZ POWERFUL ADAPTIVE OPEN UNIFIED SAS Visual Analytics SAS Visual

Local Feature Extraction and Learning for Computer Vision Part 3: Binary Feature Learning for

Feature Point Feature-based approach: Detect and match feature Detec.on and Matching points

Using Data Fusion and Web Mining to Support Feature Location in Software SEMERU Feature: a

APE(X): Authenticated Permutation-Based Encryption with Extended Misuse Resistance Atul Luykx

What What can can we we learn learn from from present present- -day day HVCs HVCs about

Problem for Problems for Discrete Transistor Amplifiers ECE 65, Winter2013, F. Najmabadi

How to maintain the star formation of the Milky Way Galaxy? On the value of single dish

Deep Neural Network Based Frame Reconstruction For Optimized Video Coding - An AV2 Approach

1 Mapping the Evolution of Co-Authorship Networks Ke, Visvanath & Brner, (2004) Won 1st

AV1 Coding Tools Ryan Lei Video Codec Architect, Intel Corp. ryan.lei@intel.com Agenda

Fertility and Childlessness in the US Thomas Baudin 1 David de la Croix 1 Paula Gobbi 3 1

Sambuz

Useful Links

Newsletter

Mail Us

Visual Feature Learning and Representation Qingshan Liu Nanjing - PowerPoint PPT Presentation

Visual Feature Learning and Representation Qingshan Liu Nanjing University of Information Science & Technology 11. 5. 2016 What can we read from this story? What Can We Read From Face Images? Visual Recognition = Feature + Classifier

Decision Tree Prof. Seungchul Lee Industrial AI Lab. Feature Test Feature 1 Feature 2 Feature

A Distinctive Feature of A Distinctive Feature of A Distinctive Feature of A Distinctive Feature

Outline Reducing Dimensionality Feature Selection 1 Steven J Zeil Feature Extraction 2

Biovision team 2 Retina Visual cortex 3 Retina Visual cortex 3 Retina Visual cortex 3

Feature Representation Vision BoWs and Beyond Praveen Krishnan Feature Representation in

Neural representation of linguistic feature Neural representation of linguistic feature hierarchy

Intro to Feature Representation in Virtual Screening Shengchao Liu, Gitter Group Feature

Earth: The Feature Presentation - feature, landscape, topography Earth: The Feature Presentation

Reducing Dimensionality Steven J Zeil Old Dominion Univ. Fall 2010 1 Feature Selection

Overview Overview Visual displays Visual displays Visual and tactile displays Visual and

CHRONIC CHRONIC VISUAL LOSS VISUAL LOSS Wasu Supakornthanasarn, MD. Visual loss Sensory

A Model of Visual Imagery A Model of Visual Imagery John Abbondanza, OD, FCOVD John Abbondanza,

Recap by Milo Davies, SAS NZ POWERFUL ADAPTIVE OPEN UNIFIED SAS Visual Analytics SAS Visual

Local Feature Extraction and Learning for Computer Vision Part 3: Binary Feature Learning for

Feature Point Feature-based approach: Detect and match feature Detec.on and Matching points

Using Data Fusion and Web Mining to Support Feature Location in Software SEMERU Feature: a

APE(X): Authenticated Permutation-Based Encryption with Extended Misuse Resistance Atul Luykx

What What can can we we learn learn from from present present- -day day HVCs HVCs about

Problem for Problems for Discrete Transistor Amplifiers ECE 65, Winter2013, F. Najmabadi

How to maintain the star formation of the Milky Way Galaxy? On the value of single dish

Deep Neural Network Based Frame Reconstruction For Optimized Video Coding - An AV2 Approach

1 Mapping the Evolution of Co-Authorship Networks Ke, Visvanath &amp; Brner, (2004) Won 1st

AV1 Coding Tools Ryan Lei Video Codec Architect, Intel Corp. ryan.lei@intel.com Agenda

Fertility and Childlessness in the US Thomas Baudin 1 David de la Croix 1 Paula Gobbi 3 1

Sambuz

Useful Links

Newsletter

Mail Us

1 Mapping the Evolution of Co-Authorship Networks Ke, Visvanath & Brner, (2004) Won 1st