Extended Bag-of-Words Formalism for Image Classification Sandra - PowerPoint PPT Presentation

Extended Bag-of-Words Formalism for Image Classification Sandra Avila 1 , 2 (Cotutelle PhD Candidate), ujo 1 (Advisor), Matthieu Cord 2 (Advisor), Arnaldo de A. Ara´ Nicolas Thome 2 (Co-Advisor), Eduardo Valle 3 (Collaborator) 1 Federal University of Minas Gerais, NPDI Lab – UFMG, Belo Horizonte, Brazil 2 Pierre and Marie Curie University, UPMC-Sorbonne Universities, LIP6, Paris, France 3 State University of Campinas, RECOD Lab, FEEC – UNICAMP, Campinas, Brazil Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 1 / 56

Image Classification: Why do we care? Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 2 / 56

Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 3 / 56

Huge amount of image is available Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 4 / 56

Why image classification is a hard problem? Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 5 / 56

Many classes and concepts Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 6 / 56

Viewpoint changes Illumination variations Occlusion Background clutter Inter-class similarity Intra-class diversity Much diversity in the data Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 6 / 56

How do we classify images? Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 7 / 56

Problem Statement Given an image dataset, how to represent their visual content information for a classification task? Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 9 / 56

night scenes sunset scenes young people old people Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 11 / 56

Bag-of-Visual-Words ( BoW ) [Sivic and Zisserman, 2003; Csurka et al., 2004] Slide credit: Ken Chatfield Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 12 / 56

Low-level Visual Feature Extraction patch 1  l 1 , 1 . . . l 1 ,N  l 2 , 1 . . . l 2 ,N    . .  . .   . .   l M, 1 . . . l M,N patch M Local feature extraction Patch detection : interest points, dense sampling, . . . Feature extraction : SIFT [Lowe, 2004], SURF [Bay et al., 2008], . . . Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 13 / 56

Visual Codebook Coding step Visual codebook learning : random, unsupervised (e.g., k -means, GMM), supervised [Perronnin et al., 2006; Goh et al., 2012], . . . Coding : hard-assignment, soft-assignment [van Gemert et al., 2008, 2010], sparse coding [Yang et al., 2009; Boureau et al., 2010], . . . Feature coding based on the vector difference : VLAD [J´ egou et al., 2010], SVC [Zhou et al., 2010], VLAT [Picard et al., 2011], . . . Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 14 / 56

Pooling step Pooling : sum/average-pooling, max-pooling [Yang et al., 2009], . . . Spatial pooling : spatial pyramid matching [Lazebnik et al., 2006], [Jia et al., 2012], . . . Spatial Pyramid Matching Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 15 / 56

Other Approaches Biologically-inspired Models Deep Learning Models [Fukushima and Miyake, 1982; LeCun et al., [Hinton and Salakhutdinov, 2006; 1990; Riesenhuber and Poggio, 1999; Serre Ranzato et al., 2007; Bengio, 2009] et al., 2007; Th´ eriault et al., 2012] Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 16 / 56

BossaNova Representation Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 17 / 56

Coding & Pooling Matrix Representation ... ... x 1 x j x N   α 1 , 1 . . . α 1 ,j . . . α 1 ,N c 1 . . . . . . . . . . . .     H = α m, 1 . . . α m,j . . . α m,N   c m   . . . . . . . .   . . . .   α M, 1 . . . α M,j . . . α M,N c M Notations : X = { x j } , j ∈ { 1 , . . . , N } : set of local descriptors (e.g., SIFT) C = { c m } , m ∈ { 1 , . . . , M } : visual codebook Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 18 / 56

Coding & Pooling Matrix Representation ... ... x 1 x j x N   α 1 , 1 . . . α 1 ,j . . . α 1 ,N c 1 . . . . . . . . . . . .     H = α m, 1 . . . α m,j . . . α m,N   c m   . . . . . . . .   . . . .   α M, 1 . . . α M,j . . . α M,N c M ⇓ f : Coding � x j − c k � 2 Coding : x j → f ( x j ) = { α m,j } , α m,j = 1 iff m = arg min 2 k ∈{ 1 ,...,M } Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 18 / 56

Coding & Pooling Matrix Representation ... ... x 1 x j x N   α 1 , 1 . . . α 1 ,j . . . α 1 ,N c 1 . . . . . . . . . . . .     H = α m, 1 . . . α m,j . . . α m,N ⇒ g : Pooling   c m   . . . . . . . .   . . . .   α M, 1 . . . α M,j . . . α M,N c M � x j − c k � 2 Coding : x j → f ( x j ) = { α m,j } , α m,j = 1 iff m = arg min 2 k ∈{ 1 ,...,M } N � Pooling : g ( { α j } ) = z : ∀ m, z m = α m,j j =1 Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 18 / 56

Coding & Pooling Matrix Representation ... ... x 1 x j x N     z 1 α 1 , 1 . . . α 1 ,j . . . α 1 ,N c 1 . . . . . . . . . . . . . . .         z = H = α m, 1 . . . α m,j . . . α m,N z m     c m     . . . . . . . . . .     . . . . .     z M α M, 1 . . . α M,j . . . α M,N c M � x j − c k � 2 Coding : x j → f ( x j ) = { α m,j } , α m,j = 1 iff m = arg min 2 k ∈{ 1 ,...,M } N � Pooling : g ( { α j } ) = z : ∀ m, z m = α m,j j =1 BoW representation : z = [ z 1 , z 2 , · · · , z M ] T Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 18 / 56

Early Ideas We pointed out the weakness in the standard pooling operation used in the BoW signature generation. Instead of averaging all the values from one row in the H matrix, we proposed to describe their distribution. BOSSA representation ( B ag O f S tatistical S ampling A nalysis) introduces our density function-based pooling strategy . Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 19 / 56

Our Pooling Illustration Our Pooling BoW Pooling Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 20 / 56

Our Pooling Formalism g : ❘ N ❘ B − → α m − → g ( α m ) = z m � b B ; b + 1 � �� z m,b = card x j | α m,j ∈ B b and b + 1 B ≥ α min ≤ α max m m B B denotes the number of bins of each histogram z m , and [ α min m ; α max ] limits the range of distances m Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 21 / 56

BossaNova Representation ... ... x 1 x j x N   α 1 , 1 . . . α 1 ,j . . . α 1 ,N c 1 . . . . . . . . . . . .   exp − β m d 2 ( x j , c m )   α m,j = α m, 1 . . . α m,j . . . α m,N   c m � K   m ′ =1 exp − β m d 2 ( x j , c m ′ ) . . . . . . . .   . . . .   α M, 1 . . . α M,j . . . α M,N c M Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 22 / 56

BossaNova Representation   z 1 , st 1 . . .     z m , st m     . .   .   z M , st M Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 22 / 56

BossaNova Scheme Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 23 / 56

BossaNova Scheme • SIFT descriptors on a dense spatial grid at multiple scales • Dimensionality reduction by applying PCA (128 → 64) Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 23 / 56

BossaNova Scheme • k -means algorithm Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 23 / 56

BossaNova Scheme Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 23 / 56

BossaNova Scheme • SVM classifiers are applied by using a nonlinear Gauss– ℓ 2 kernel Sandra Avila (UFMG/UPMC) sandra@dcc.ufmg.br June 2013 23 / 56

Extended Bag-of-Words Formalism for Image Classification Sandra - PowerPoint PPT Presentation

Extended Bag-of-Words Formalism for Image Classification Sandra Avila 1 , 2 (Cotutelle PhD Candidate), ujo 1 (Advisor), Matthieu Cord 2 (Advisor), Arnaldo de A. Ara Nicolas Thome 2 (Co-Advisor), Eduardo Valle 3 (Collaborator) 1 Federal

Bag of Words Model Overview of todays lecture Bag-of-words. K-means clustering.

Bag-of-features models for category classification for category classification Cordelia Schmid

Bag-of-features for category classification for category classification Cordelia Schmid

WINE BOTTLE AIRBAG SINGLE WINE BOTTLE AIRBAG SINGLE BOTTLE AIR BAG PROTECT ALL BOTTLED PRODUCT

Red-Bag Engineers Consultants Software User Day April 2017 Red-Bag 2017 1 Ves Online

Pathway Red Bag Scheme October 2018 The Red Bag concept The Red Bag scheme was first implemented

The Plastic Bag Free world in action Surfriders Ban the Bag Campaign Plastic bag free

Classification Image Classification Set of predefined categories [eg: table, apple, dog, giraffe]

Extended Project Qualification Introduction What is an Extended Project? What does an

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Lecture: Visual Bag of Words Juan Carlos Niebles and Ranjay Krishna Stanford Vision and Learning

Text Representation Bag-of-Words and Word Embeddings count vector unordered bag over

DC Bag Law Presented by Jeffrey Seltzer Associate Director Stormwater Management Division District

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Consorzio COMETA FESR Visualization Element: towards the definition of a new Grid service

New developments on rational RBF Stefano De Marchi Department of Mathematics Tullio

SMOOTH SOLUTIONS IN VASILIEV THEORY Andrea Campoleoni Universit Libre de Bruxelles &

TRANSITIONING TO A CULTURE OF EVIDENCE: SUSTAINABLE ASSESSMENT PRACTICES Roundt dtable le D

Table of contents 1. Introduction: You are already an experimentalist 2. Conditions 3. Items

Registrant List Page 1 of 5 UCSF OCME Name City, State Ahmad Borhaan MD Redlands, CA 1 2

Lecture 1: Introduction to Pattern Recognition Dr. Chengjiang Long Computer Vision Researcher at

Traffic Predictions Supporting General Aviation Carlo Lancia, D.

Extended Bag-of-Words Formalism for Image Classification Sandra - PowerPoint PPT Presentation

Extended Bag-of-Words Formalism for Image Classification Sandra Avila 1 , 2 (Cotutelle PhD Candidate), ujo 1 (Advisor), Matthieu Cord 2 (Advisor), Arnaldo de A. Ara Nicolas Thome 2 (Co-Advisor), Eduardo Valle 3 (Collaborator) 1 Federal

Bag of Words Model Overview of todays lecture Bag-of-words. K-means clustering.

Bag-of-features models for category classification for category classification Cordelia Schmid

Bag-of-features for category classification for category classification Cordelia Schmid

WINE BOTTLE AIRBAG SINGLE WINE BOTTLE AIRBAG SINGLE BOTTLE AIR BAG PROTECT ALL BOTTLED PRODUCT

Red-Bag Engineers Consultants Software User Day April 2017 Red-Bag 2017 1 Ves Online

Pathway Red Bag Scheme October 2018 The Red Bag concept The Red Bag scheme was first implemented

The Plastic Bag Free world in action Surfriders Ban the Bag Campaign Plastic bag free

Classification Image Classification Set of predefined categories [eg: table, apple, dog, giraffe]

Extended Project Qualification Introduction What is an Extended Project? What does an

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Lecture: Visual Bag of Words Juan Carlos Niebles and Ranjay Krishna Stanford Vision and Learning

Text Representation Bag-of-Words and Word Embeddings count vector unordered bag over

DC Bag Law Presented by Jeffrey Seltzer Associate Director Stormwater Management Division District

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Bag-of-features for category classification Cordelia Schmid Category recognition Image

Consorzio COMETA FESR Visualization Element: towards the definition of a new Grid service

New developments on rational RBF Stefano De Marchi Department of Mathematics Tullio

SMOOTH SOLUTIONS IN VASILIEV THEORY Andrea Campoleoni Universit Libre de Bruxelles &amp;

TRANSITIONING TO A CULTURE OF EVIDENCE: SUSTAINABLE ASSESSMENT PRACTICES Roundt dtable le D

Table of contents 1. Introduction: You are already an experimentalist 2. Conditions 3. Items

Registrant List Page 1 of 5 UCSF OCME Name City, State Ahmad Borhaan MD Redlands, CA 1 2

Lecture 1: Introduction to Pattern Recognition Dr. Chengjiang Long Computer Vision Researcher at

Traffic Predictions Supporting General Aviation Carlo Lancia, D.

SMOOTH SOLUTIONS IN VASILIEV THEORY Andrea Campoleoni Universit Libre de Bruxelles &