Food Ingredients Recognition through Multi-label Learning Marc - - PowerPoint PPT Presentation

food ingredients recognition through multi label learning
SMART_READER_LITE
LIVE PREVIEW

Food Ingredients Recognition through Multi-label Learning Marc - - PowerPoint PPT Presentation

Food Ingredients Recognition through Multi-label Learning Marc Bolaos, Aina Ferr and Petia Radeva Motivation Motivation Motivation INPUT OUTPUT Computer List of Nutritional Vision Ingredients Composition Algorithm dishes


slide-1
SLIDE 1

Food Ingredients Recognition through Multi-label Learning

Marc Bolaños, Aina Ferrà and Petia Radeva

slide-2
SLIDE 2

Motivation

slide-3
SLIDE 3

Motivation

slide-4
SLIDE 4

Motivation

Computer Vision Algorithm List of Ingredients Nutritional Composition INPUT OUTPUT

slide-5
SLIDE 5

Related Work

Only one work has been proposed for ingredients recognition.

Jingjing Chen and Chong-Wah Ngo. “Deep-based ingredient recognition for cooking recipe retrieval”. In Proceedings of the 2016 ACM on Multimedia Conference, pages 32–41. ACM, 2016.

dishes ingredients

slide-6
SLIDE 6

Related Work

Only one work has been proposed for ingredients recognition. Handicaps of their proposal:

  • Both dish and ingredients information is
  • needed. Not applicable for never-seen

recipes.

  • Applicable to visible ingredients only.

dishes ingredients

Jingjing Chen and Chong-Wah Ngo. “Deep-based ingredient recognition for cooking recipe retrieval”. In Proceedings of the 2016 ACM on Multimedia Conference, pages 32–41. ACM, 2016.

slide-7
SLIDE 7

Categorical cross-entropy loss

egg

Softmax

Conventional CNN for classification

slide-8
SLIDE 8

loss Sigmoid Binary cross-entropy

egg salt paprika mustard

Our proposal: Adaptation for Multi-label recognition

slide-9
SLIDE 9

Dataset complementary to Food101:

  • 101 classes / dishes
  • 1000 images per class

A recipe for each class was downloaded from resulting in a list of ingredients per class and a total of 446 unique ingredients.

Datasets - Ingredients101

Carrot Cake Baby Back Ribs

slide-10
SLIDE 10

Dataset complementary to Food101:

  • 101 classes / dishes
  • 1000 images per class

A recipe for each class was downloaded from resulting in a list of ingredients per class and a total of 446 unique ingredients.

Datasets - Ingredients101

Carrot Cake Baby Back Ribs

slide-11
SLIDE 11

Datasets - Recipes5k

New dataset acquired from Around 50 different recipes were downloaded for each class in Food101 together with their respective picture. Resulting in 4,826 images, a list of ingredients per image and a total of 3,213 unique ingredients.

slide-12
SLIDE 12

Datasets - Recipes5k

Ingredients Simplification Two problems arise when dealing with too many labels:

  • The amount of training samples needed for learning them.
  • The ambiguity and minor differences between them.

We propose a simplified version by applying a simple removal of overly-descriptive particles (e.g. ’sliced’ or ’sauce’). Simplifying the 3,213 ingredients into 1,013.

large eggs → egg lemon zest → lemon meyer lemon juice → lemon unsalted butter → butter boiling water → water

slide-13
SLIDE 13

Results - Ingredients101

slide-14
SLIDE 14

Results - Ingredients101

slide-15
SLIDE 15

Results - Ingredients101

slide-16
SLIDE 16

Results - Recipes5k

SIMPLIFIED INGREDIENTS

slide-17
SLIDE 17

Results - Recipes5k

slide-18
SLIDE 18

Results - Recipes5k

slide-19
SLIDE 19

Neurons’ Activations

slide-20
SLIDE 20

Neurons’ Activations

slide-21
SLIDE 21

Neurons’ Activations

slide-22
SLIDE 22

Conclusions

We have proposed:

  • Model suitable for ingredients recognition through multi-label learning.
  • Two datasets for ingredients recognition benchmarking.

Advantages of our proposal with respect to the state of the art:

  • Straightforward model applicable to any highly performing CNN.
  • Dish/class information is not used for learning. Implying that the ingredients can be

inferred from never-seen dishes.

  • Can directly learn ingredients’ representation from visual appearance.
  • Can predict invisible ingredients implicitly.
slide-23
SLIDE 23

THANK YOU FOR YOUR ATTENTION

www.ub.edu/cvub/marcbolanos marc.bolanos@ub.edu