Words & Pictures Clustering and Bag of Words Many - PowerPoint PPT Presentation

Words ¡& ¡Pictures ¡ ¡ Clustering ¡and ¡Bag ¡of ¡Words ¡ Many ¡slides ¡adapted ¡from ¡Svetlana ¡Lazebnik, ¡Fei-‑Fei ¡Li, ¡Rob ¡Fergus, ¡and ¡Antonio ¡Torralba ¡

Document ¡Vectors ¡ • Represent ¡document ¡as ¡a ¡“bag ¡of ¡words” ¡

Bag-‑of-‑features ¡models ¡ Many ¡slides ¡adapted ¡from ¡Fei-‑Fei ¡Li, ¡Rob ¡Fergus, ¡and ¡Antonio ¡Torralba ¡

Origin: ¡Bag-‑of-‑words ¡models ¡ • Orderless ¡document ¡representaJon: ¡frequencies ¡ of ¡words ¡from ¡a ¡dicJonary ¡ ¡ Salton ¡& ¡McGill ¡(1983) ¡

Origin: ¡Bag-‑of-‑words ¡models ¡ • Orderless ¡document ¡representaJon: ¡frequencies ¡ of ¡words ¡from ¡a ¡dicJonary ¡ ¡ Salton ¡& ¡McGill ¡(1983) ¡ US ¡PresidenJal ¡Speeches ¡Tag ¡Cloud ¡ http://chir.ag/phernalia/preztags/

Bags ¡of ¡features ¡for ¡image ¡ classificaJon ¡ 1. Extract ¡features ¡

Bags ¡of ¡features ¡for ¡image ¡ classificaJon ¡ 1. Extract ¡features ¡ 2. Learn ¡“visual ¡vocabulary” ¡

Bags ¡of ¡features ¡for ¡image ¡ classificaJon ¡ 1. Extract ¡features ¡ 2. Learn ¡“visual ¡vocabulary” ¡ 3. QuanJze ¡features ¡using ¡visual ¡vocabulary ¡ ¡

Bags ¡of ¡features ¡for ¡image ¡ classificaJon ¡ 1. Extract ¡features ¡ 2. Learn ¡“visual ¡vocabulary” ¡ 3. QuanJze ¡features ¡using ¡visual ¡vocabulary ¡ ¡ 4. Represent ¡images ¡by ¡frequencies ¡of ¡ ¡ “visual ¡words” ¡ ¡

1. ¡Feature ¡extracJon ¡ • Regular ¡grid ¡ – Vogel ¡& ¡Schiele, ¡2003 ¡ – Fei-‑Fei ¡& ¡Perona, ¡2005 ¡

1. ¡Feature ¡extracJon ¡ • Regular ¡grid ¡ – Vogel ¡& ¡Schiele, ¡2003 ¡ – Fei-‑Fei ¡& ¡Perona, ¡2005 ¡ • Interest ¡point ¡detector ¡ – Csurka ¡et ¡al. ¡2004 ¡ – Fei-‑Fei ¡& ¡Perona, ¡2005 ¡ – Sivic ¡et ¡al. ¡2005 ¡

1. ¡Feature ¡extracJon ¡ • Regular ¡grid ¡ – Vogel ¡& ¡Schiele, ¡2003 ¡ – Fei-‑Fei ¡& ¡Perona, ¡2005 ¡ • Interest ¡point ¡detector ¡ – Csurka ¡et ¡al. ¡2004 ¡ – Fei-‑Fei ¡& ¡Perona, ¡2005 ¡ – Sivic ¡et ¡al. ¡2005 ¡ • Other ¡methods ¡ – Random ¡sampling ¡(Vidal-‑Naquet ¡& ¡Ullman, ¡2002) ¡ – SegmentaJon-‑based ¡patches ¡(Barnard ¡et ¡al. ¡2003) ¡

Compute ¡SIFT ¡ Normalize ¡patch ¡ descriptor ¡ ¡ ¡ ¡ ¡ ¡ ¡[Lowe’99] ¡ Detect ¡patches ¡ [Mikojaczyk ¡and ¡Schmid ¡’02] ¡ [Mata, ¡Chum, ¡Urban ¡& ¡Pajdla, ¡’02] ¡ ¡ [Sivic ¡& ¡Zisserman, ¡’03] ¡ Slide ¡credit: ¡Josef ¡Sivic ¡

… ¡

… ¡ Clustering ¡ Slide ¡credit: ¡Josef ¡Sivic ¡

Visual ¡vocabulary ¡ … ¡ Clustering ¡ Slide ¡credit: ¡Josef ¡Sivic ¡

Clustering ¡ – The ¡assignment ¡of ¡objects ¡into ¡groups ¡(called ¡clusters) ¡ so ¡that ¡objects ¡from ¡the ¡same ¡cluster ¡are ¡more ¡similar ¡ to ¡each ¡other ¡than ¡objects ¡from ¡different ¡clusters. ¡ ¡ – Ojen ¡similarity ¡is ¡assessed ¡according ¡to ¡a ¡distance ¡ measure. ¡ ¡ – Clustering ¡is ¡a ¡common ¡technique ¡for ¡staJsJcal ¡data ¡ analysis, ¡which ¡is ¡used ¡in ¡many ¡fields, ¡including ¡ machine ¡learning, ¡data ¡mining, ¡pakern ¡recogniJon, ¡ image ¡analysis ¡and ¡bioinformaJcs. ¡

Any ¡of ¡the ¡similarity ¡metrics ¡we ¡talked ¡about ¡before ¡(SSD, ¡angle ¡between ¡ vectors) ¡

Feature ¡Clustering ¡ Clustering ¡is ¡the ¡process ¡of ¡grouping ¡a ¡set ¡of ¡ features ¡into ¡clusters ¡of ¡similar ¡features. ¡ Features ¡within ¡a ¡cluster ¡should ¡be ¡similar. ¡ Features ¡from ¡different ¡clusters ¡should ¡be ¡ dissimilar. ¡

source: ¡Dan ¡Klein ¡ ¡

K-‑means ¡clustering ¡ • Want ¡to ¡minimize ¡sum ¡of ¡ squared ¡Euclidean ¡distances ¡ between ¡points ¡ x i ¡and ¡their ¡ nearest ¡cluster ¡centers ¡ m k ¡ source: ¡Svetlana ¡Lazebnik ¡ ¡

Source: ¡Hinrich ¡Schutze ¡

Hierarchical ¡clustering ¡strategies ¡ • AgglomeraJve ¡clustering ¡ • Start ¡with ¡each ¡point ¡in ¡a ¡separate ¡cluster ¡ • At ¡each ¡iteraJon, ¡merge ¡two ¡of ¡the ¡“closest” ¡clusters ¡ • Divisive ¡clustering ¡ • Start ¡with ¡all ¡points ¡grouped ¡into ¡a ¡single ¡cluster ¡ • At ¡each ¡iteraJon, ¡split ¡the ¡“largest” ¡cluster ¡ source: ¡Svetlana ¡Lazebnik ¡ ¡

Divisive ¡Clustering ¡ • Top-‑down ¡(instead ¡of ¡bokom-‑up ¡as ¡in ¡ AgglomeraJve ¡Clustering) ¡ • Start ¡with ¡all ¡docs ¡in ¡one ¡big ¡cluster ¡ • Then ¡recursively ¡split ¡clusters ¡ • Eventually ¡each ¡node ¡forms ¡a ¡cluster ¡on ¡its ¡ own. ¡ Source: ¡Hinrich ¡Schutze ¡

Flat ¡or ¡hierarchical ¡clustering? ¡ • For ¡high ¡efficiency, ¡use ¡flat ¡clustering ¡(e.g. ¡k ¡ means) ¡ • For ¡determinisJc ¡results: ¡hierarchical ¡ clustering ¡ • When ¡a ¡hierarchical ¡structure ¡is ¡desired: ¡ hierarchical ¡algorithm ¡ • Hierarchical ¡clustering ¡can ¡also ¡be ¡applied ¡if ¡K ¡ cannot ¡be ¡predetermined ¡(can ¡start ¡without ¡ knowing ¡K) ¡ Source: ¡Hinrich ¡Schutze ¡

… ¡ Clustering ¡ Slide ¡credit: ¡Josef ¡Sivic ¡

Visual ¡vocabulary ¡ … ¡ Clustering ¡ Slide ¡credit: ¡Josef ¡Sivic ¡

From ¡clustering ¡to ¡vector ¡quanJzaJon ¡ • Clustering ¡is ¡a ¡common ¡method ¡for ¡learning ¡a ¡visual ¡ vocabulary ¡or ¡codebook ¡ – Unsupervised ¡learning ¡process ¡ – Each ¡cluster ¡center ¡produced ¡by ¡k-‑means ¡becomes ¡a ¡ codevector ¡ – Codebook ¡can ¡be ¡learned ¡on ¡separate ¡training ¡set ¡ – Provided ¡the ¡training ¡set ¡is ¡sufficiently ¡representaJve, ¡ the ¡codebook ¡will ¡be ¡“universal” ¡ • The ¡codebook ¡is ¡used ¡for ¡quanJzing ¡features ¡ – A ¡ vector ¡quan0zer ¡takes ¡a ¡feature ¡vector ¡and ¡maps ¡it ¡ to ¡the ¡index ¡of ¡the ¡nearest ¡codevector ¡in ¡a ¡codebook ¡ – Codebook ¡= ¡visual ¡vocabulary ¡ – Codevector ¡= ¡visual ¡word ¡

Fei-‑Fei ¡et ¡al. ¡2005 ¡

Sivic ¡et ¡al. ¡2005 ¡

Visual ¡vocabularies: ¡Issues ¡ • How ¡to ¡choose ¡vocabulary ¡size? ¡ – Too ¡small: ¡visual ¡words ¡not ¡ representaJve ¡of ¡all ¡patches ¡ – Too ¡large: ¡quanJzaJon ¡arJfacts, ¡ overfisng ¡ • GeneraJve ¡or ¡discriminaJve ¡ learning? ¡ • ComputaJonal ¡efficiency ¡ – Vocabulary ¡trees ¡ ¡ (Nister ¡& ¡Stewenius, ¡2006) ¡

frequency ¡ ….. ¡ codewords ¡

Words & Pictures Clustering and Bag of Words Many - PowerPoint PPT Presentation

Words & Pictures Clustering and Bag of Words Many slides adapted from Svetlana Lazebnik, Fei-Fei Li, Rob Fergus, and Antonio Torralba Document

CSE 595 Words and Pictures Tamara L. Berg SUNY Stony Brook Class Info CSE 595: Words &

How To Create Simple How To Create Simple Pictures ToExe Pictures ToExe Digital Projected Image

Activity 4 Inference from pictures Page 27 Activity 5 Feelings from pictures Page

Let me send relevant pictures to my friends while we chat. Select a picture from a

Using JPEG to Compress Still Pictures Tyler Genter December 17, 2010 Tyler Genter Using JPEG to

SONY PICTURES TELEVISION World Class Entertainment for Foxtel July 2012 SPT NETWORKS PORTFOLIO

IMF CORE APPLICATION SONY PICTURES PRESENTATION 2011-12-06 THE CURRENT APPROACH ISSUES SONY

Basketball Court Resurfacing Before Pictures Before Pictures Starting the Process Monday

SONY PICTURES TELEVISION OVERVIEW APRIL 2014 SPT Overview SONY PICTURES TELEVISION

Diseases in pictures By Prof. Pushpa Raj Sharma These pictures are the personal collection of the

Pictures: all pictures: Hajo Seng except: p. 4: Leif Ekblad, Aspie Quiz,

TRAMADOL IN LABOUR ANALGESIA SLIDES TO PICTURES tramadol in labour analgesia slides to pictures

How are you learning to Live ...and Love like Jesus?! Send In Your Samaritan Pictures!!

Pictures: all pictures: Hajo Seng except: p. 3: Wikimedia, Mirror Phase, Lacan p. 5: Mind

Pictures: all pictures: Hajo Seng except: S. 8: Wikimedia: Mirror Phase, Lacan S. 15:

The sound /u/ is usually spelt with the letter u . Try writing the words to go with these

Text 1 / 24 Introduction to the PS3 Programming the SPEs PS3-clusters Results Why is the

CSE 158 Lecture 5 Web Mining and Recommender Systems Dimensionality Reduction This week How

Geodesic Distance Distance based based Geodesic Fuzzy Clustering Clustering Fuzzy Abonyi and

Vembu extends support to Vembu extends support to Vembu v4.0 Hyper-V Cluster with v4.0 Agenda

Engines Previously We talked about the motivation behind vertical search engines,

Focused Clustering and Outlier Detection in Large Attributed Graphs ACM SIG-KDD August 26, 2014

Streaming algorithms for k -center clustering with outliers and with anonymity Richard Matthew

Feature Selection and Clustering Jiwoo Bang, Chungyong Kim, Kesheng Wu, Alexander Sim, Suren

Words & Pictures Clustering and Bag of Words Many - PowerPoint PPT Presentation

Words & Pictures Clustering and Bag of Words Many slides adapted from Svetlana Lazebnik, Fei-Fei Li, Rob Fergus, and Antonio Torralba Document

CSE 595 Words and Pictures Tamara L. Berg SUNY Stony Brook Class Info CSE 595: Words &amp;

How To Create Simple How To Create Simple Pictures ToExe Pictures ToExe Digital Projected Image

Activity 4 Inference from pictures Page 27 Activity 5 Feelings from pictures Page

Let me send relevant pictures to my friends while we chat. Select a picture from a

Using JPEG to Compress Still Pictures Tyler Genter December 17, 2010 Tyler Genter Using JPEG to

SONY PICTURES TELEVISION World Class Entertainment for Foxtel July 2012 SPT NETWORKS PORTFOLIO

IMF CORE APPLICATION SONY PICTURES PRESENTATION 2011-12-06 THE CURRENT APPROACH ISSUES SONY

Basketball Court Resurfacing Before Pictures Before Pictures Starting the Process Monday

SONY PICTURES TELEVISION OVERVIEW APRIL 2014 SPT Overview SONY PICTURES TELEVISION

Diseases in pictures By Prof. Pushpa Raj Sharma These pictures are the personal collection of the

Pictures: all pictures: Hajo Seng except: p. 4: Leif Ekblad, Aspie Quiz,

TRAMADOL IN LABOUR ANALGESIA SLIDES TO PICTURES tramadol in labour analgesia slides to pictures

How are you learning to Live ...and Love like Jesus?! Send In Your Samaritan Pictures!!

Pictures: all pictures: Hajo Seng except: p. 3: Wikimedia, Mirror Phase, Lacan p. 5: Mind

Pictures: all pictures: Hajo Seng except: S. 8: Wikimedia: Mirror Phase, Lacan S. 15:

The sound /u/ is usually spelt with the letter u . Try writing the words to go with these

Text 1 / 24 Introduction to the PS3 Programming the SPEs PS3-clusters Results Why is the

CSE 158 Lecture 5 Web Mining and Recommender Systems Dimensionality Reduction This week How

Geodesic Distance Distance based based Geodesic Fuzzy Clustering Clustering Fuzzy Abonyi and

Vembu extends support to Vembu extends support to Vembu v4.0 Hyper-V Cluster with v4.0 Agenda

Engines Previously We talked about the motivation behind vertical search engines,

Focused Clustering and Outlier Detection in Large Attributed Graphs ACM SIG-KDD August 26, 2014

Streaming algorithms for k -center clustering with outliers and with anonymity Richard Matthew

Feature Selection and Clustering Jiwoo Bang, Chungyong Kim, Kesheng Wu, Alexander Sim, Suren

CSE 595 Words and Pictures Tamara L. Berg SUNY Stony Brook Class Info CSE 595: Words &