CS688 Paper Presentation 1
Deep Image-Text Embeddings
Learning Deep Structure-Preserving Image-Text Embeddings (CVPR 2016)
Deep Image-Text Embeddings Learning Deep Structure-Preserving - - PowerPoint PPT Presentation
CS688 Paper Presentation 1 Deep Image-Text Embeddings Learning Deep Structure-Preserving Image-Text Embeddings (CVPR 2016) Woobin Im ( ) 2016-11-08 Sentence-to-image Retrieval Retrieval system Query text A cat next to a blue chair
Learning Deep Structure-Preserving Image-Text Embeddings (CVPR 2016)
2
3
4
5
6
Source: Accounting for the Relative Importance of Objects in Image Retrieval
7
Source: Associating neural word embeddings with deep image representations using Fisher Vectors
8
Source: Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
9
10
11
fc fc fc fc B-norm B-norm PCA
12
fc fc fc fc B-norm B-norm PCA
13
14
fc fc fc fc B-norm B-norm PCA
15
Source: Distributed representations of words and phrases and their compositionality
16
Word2Vec
Hybrid Gaussian-Laplacian Mixture model
Gaussian Mixture model
Concatenation
Final Vector (6000D)
PCA
Work of “Associating neural word embeddings with deep image representations using Fisher Vectors” v
17
fc fc fc fc B-norm B-norm PCA
18
image - sentence sentence - image Image structure preserving Text structure preserving
19
Source: “FaceNet: A unified embedding for face recognition and clustering” margin
20
21
fc fc fc fc B-norm B-norm PCA
22
23
image - sentence sentence - image Image structure preserving Text structure preserving
24
25
26
27
28