Inferring Restaurant Styles by Mining Crowd Sourced Photos from - - PowerPoint PPT Presentation
Inferring Restaurant Styles by Mining Crowd Sourced Photos from - - PowerPoint PPT Presentation
@IEEE BigData 2016 Inferring Restaurant Styles by Mining Crowd Sourced Photos from User-Review Websites Haofu Liao, Yuncheng Li, Tianran Hu and Jiebo Luo Department of Computer Science UNIVERSITY of ROCHESTER Part I - The Problem UNIVERSITY
UNIVERSITY of ROCHESTER
Part I - The Problem
Restaurant Style Classification
Business meetings Romantic Restaurant Photos UNIVERSITY of ROCHESTER
Multi-Instance Multi-Label Learning (MIML)
UNIVERSITY of ROCHESTER
Instance Label
- bject
Instance Label
- bject
Label Label
...
Instance Label
- bject
Instance Instance
...
Label
- bject
Label Label
...
Instance Instance Instance
...
(c) Multi-label learning (a) Traditional Supervised learning (b) Multi-instance learning (d) Multi-instance multi-label learning
Multi-Instance Multi-Label Learning (MIML)
UNIVERSITY of ROCHESTER
Business meetings
Restaurant
Special Occasion
Romantic
...
Photo 2 Photo 1 Photo N
...
- Each restaurant (object) is described by a set
- f photos (instances) and associated with
several class restaurant style tags (label).
- Conventionally, MIML is based on the
assumption that there exists a “key” instance that contributes the object’s class labels.
- For restaurant style classification, such
assumption is not guaranteed. For example, just one picture of a delicate dish does not mean the restaurant itself is romantic
UNIVERSITY of ROCHESTER
Part II - Solution
Proposed Architecture
UNIVERSITY of ROCHESTER
Label Initialization Multi-Label CNN Multi-Label CNN Pseudo Tagging Restaurant Profiling SVM SVM SVM
...
Business meetings Romantic
Overall Architecture
UNIVERSITY of ROCHESTER
Label Initialization Multi-Label CNN Multi-Lacbel CNN Pseudo Tagging Restaurant Profiling SVM SVM SVM
...
Business meetings Romantic
Multi-Label CNN
UNIVERSITY of ROCHESTER
CONV3 CONV2 CONV5 CONV1 CONV4 POOL3 POOL2 POOL5 FC6 FC7 FC8
Sigmoid Cross Entropy Loss
Data Label
AlexNet
- Sigmoid Cross Entropy Loss
Overall Architecture
UNIVERSITY of ROCHESTER
Label Initialization Multi-Label CNN Multi-Label CNN Pseudo Tagging Restaurant Profiling SVM SVM SVM
...
Business meetings Romantic
Photo 6
- 3.7
Photo 2
- 5.4
Photo 1
7.1
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5
9.5
Photo 3
0.9
Photo 7
3.2
Photo 8
- 0.2
Photo 4
- 1.1
Images Scores of tag k Images from restaurants with tag k Images from restaurants without tag k
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5 Photo 1
9.5
Photo 7 Photo 3
0.9 7.1 3.2
Photo 8 Photo 4
- 0.2
Photo 6 Photo 2
- 5.4
- 1.1
- 3.7
Ordered Steps: 1. Order images according their scores
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5 Photo 1
9.5
Photo 7 Photo 3
0.9 7.1 3.2
Photo 8 Photo 4
- 0.2
Photo 6 Photo 2
- 5.4
- 1.1
- 3.7
Ordered Steps: 1. Order images according their scores 2. Add tag k to the image that has the highest score among the images that do not have tag k. Photo 1
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5 Photo 1
9.5
Photo 7 Photo 3
0.9 7.1 3.2
Photo 8 Photo 4
- 0.2
Photo 6 Photo 2
- 5.4
- 1.1
- 3.7
Ordered Steps: 1. Order images according their scores 2. Add tag k to the image that has the highest score among the images that do not have tag k. 3. Drop tag k from the image that has the lowest score among the images that have tag k. Photo 1 Photo 2
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5 Photo 1
9.5
Photo 7 Photo 3
0.9 7.1 3.2
Photo 8 Photo 4
- 0.2
Photo 6 Photo 2
- 5.4
- 1.1
- 3.7
Ordered Steps: 1. Order images according their scores 2. Add tag k to the image that has the highest score among the images that do not have tag k. 3. Drop tag k from the image that has the lowest score among the images that have tag k. Photo 1 Photo 2
Pseudo Tagging
UNIVERSITY of ROCHESTER
Photo 5 Photo 1
9.5
Photo 7 Photo 3
0.9 7.1 3.2
Photo 8 Photo 4
- 0.2
Photo 6 Photo 2
- 5.4
- 1.1
- 3.7
Ordered Steps: 1. Order images according their scores 2. Add tag k to the image that has the highest score among the images that do not have tag k. 3. Drop tag k from the image that has the lowest score among the images that have tag k. 4. Repeat step 3 & step 4 until the scores of images reach a predefined threshold Photo 1 Photo 2
Overall Architecture
UNIVERSITY of ROCHESTER
Label Initialization Multi-Label CNN Multi-Label CNN Pseudo Tagging Restaurant Profiling SVM SVM SVM
...
Business meetings Romantic