Bootstrapping Food Preferences Through an Adaptive Visual Interface - - PowerPoint PPT Presentation

bootstrapping food preferences through an adaptive visual
SMART_READER_LITE
LIVE PREVIEW

Bootstrapping Food Preferences Through an Adaptive Visual Interface - - PowerPoint PPT Presentation

Bootstrapping Food Preferences Through an Adaptive Visual Interface Longqi Yang , Yin Cui, Fan Zhang, JP Pollak, Serge Belongie, Deborah Estrin MOTIVATION Food preferences learning is important! Health and Life Obesity 113M HBP 50M


slide-1
SLIDE 1

Bootstrapping Food Preferences Through an Adaptive Visual Interface

Longqi Yang, Yin Cui, Fan Zhang, JP Pollak, Serge Belongie, Deborah Estrin

slide-2
SLIDE 2

MOTIVATION

Food preferences learning is important!

slide-3
SLIDE 3

Health and Life

*Number of Americans Living with Diet-and Inactivity-Related Diseases

Unflavored Healthy diet recommendations are of NO Benefit!

Obesity HBP Diabetes

113M 50M 15M

slide-4
SLIDE 4

Social Media and Commerce

Personalized diet profile is the Key to user experience!

slide-5
SLIDE 5

Our Vision

Clinicians

Personalize Treatment Plan

Online Groceries

Customers Targeting

Recipes

Food environment at home

Restaurants

Customized dishes

Social Network

Content personalization

Nutritionists

Healthy recommendations Personalized Diet Profile

slide-6
SLIDE 6

OUR SOLUTION

An adaptive visual interface

slide-7
SLIDE 7

Start Diet Profile

Exploration, 2 iters

10 food items

Exploration-exploitation: <15 iters

Pairwise Comparison

slide-8
SLIDE 8

ü Efficient: completed within a minute. ü Visual interface: low cognitive load, personalized and legible. ü Preference Elicitation: NO history required, NO ratings. ü Deep understanding of food images. ü Novel Online Learning Framework.

slide-9
SLIDE 9

System Design

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Online Learning Ø What images to present to the user? Ø How to update users’ preferences? Online Learning framework (LE + EE) Food Similarity Embedding Users have close preferences for similar items Ø Feature representation that can reflect similarities Food Items Harvesting Ø Food images and metadata.

slide-10
SLIDE 10

System Design: offline

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Food Items Harvesting

Ø 12,000 food items from Yummly API. Ø Images + Metadata (ingredients, nutrients etc.) Ø Outliers filtering, 10,028 items were used.

slide-11
SLIDE 11

System Design: offline

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Food Similarity Embedding

Representation: 1000 dim visual + 200 dim ingredients 1000 dim visual feature from Food-CNN Image 2 Image 1 x y

A (CNN) B (CNN)

Contrastive Loss f(x) f(y)

slide-12
SLIDE 12

System Design: offline

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Food Similarity Embedding

Representation: 1000 dim visual + 200 dim ingredients 1000 dim visual feature from Food-CNN

− ≈ 0 − > 𝑛 , , 𝓜 = 𝟐 𝟑𝒎𝑬𝟑 + 𝟐 𝟑 𝟐 − 𝒎 𝐧𝐛𝐲 (𝟏, 𝒏 − 𝑬) 𝟑 𝒎 = 𝟐 𝒎 = 𝟏

Pairs/Labels were sampled from Food-101 dataset

slide-13
SLIDE 13

System Design: offline

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Food Similarity Embedding

Representation: 1000 dim visual + 200 dim ingredients 200 dim ingredients feature Ø Lemmatization and preprocessing. Ø Filtering: Top 200 ingredients. Ø Feature vector: 0-1 vector denotes the existence of the ingredient. Visual and ingredients feature vectors are normalized separately with 𝒎𝟐 norm

slide-14
SLIDE 14

System Design: online

Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ

  • rk

Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food

  • d alread

eady expl plor

  • red

ed Backend Online Learning

Explo lora rati tion

  • n (2

2 iter eratio tions) Explo lora rati tion

  • n – Explo

loita itati tion

  • n (>2)

# Iterations

……

Food Similarity Embedding

Yuc uck Yuc uck

Visual User Interface

  • nline
  • ffline

Online Learning

Food preferences representation:

𝒒 = 𝑞8, 𝑞9, … ,𝑞 𝒯 <𝑞=

=

= 1

Distribution of preferences over all food items in 𝒯

t t t t t

𝒒?:updated preference vector after iteration t

Two tasks at each iteration t: Ø User state update: update 𝒒? based on the items presented and user’s choices at iteration t-1. Ø Images selection: Select a set of images to show at iteration t.

slide-15
SLIDE 15

System Design: online

Online Learning

Ø User state update: update 𝒒? based on the items presented and user’s choices at iteration t-1. Users’ selections Image Labeling Images selected Label “+1” Label “-1” Images not selected Images not presented Label “0”

slide-16
SLIDE 16

System Design: online

Online Learning Label propagation with regularized optimization

min

𝒗

< 𝜕=E 𝑧= − 𝑣

E H 𝒯 EI9,EJ=

+ < 1 − 𝜕=E 𝑣

E − 𝑧E H 𝒯 EI9,EJ=

Smoothness Fitting

Label Propagation and Exponentiated Gradient Algorithm (LE)

𝑣

E = < 𝜕=E𝑧= 𝒯 =I9

𝑞=

?

𝑞=

?K9×𝑓 NOP

QRS

TP

QRS

𝜕=E = 𝑓

K9 HUV 𝒈XPK𝒈XY

Ø User state update: update 𝒒? based on the items presented and user’s choices at iteration t-1.

slide-17
SLIDE 17

System Design: online

Online Learning

Ø Images selection: Select a set of images to show at iteration t. Exploration and Exploration-exploitation Algorithm (EE) Exploration (Ten images): 𝑢 ≤ 2 K-means++ Exploration-exploitation (Two images): 𝑢 > 2 One Item that user “prefer” (with high value of 𝑞) The other item that user hasn’t explored.

slide-18
SLIDE 18

System Design: online

Online Learning

user state update images selection

slide-19
SLIDE 19

EXPERIMENTS AND USER STUDY

Evaluation, findings and evidence

slide-20
SLIDE 20

Experiments: embedding

0.0 0.2 0.4 0.6 0.8 1.0

5eFall

10-2 10-1 100

PreFLsLRn(LRgarLWhPLF 6Fale)

)RRG-C11 (PAP: 0.216) Alex1eW (PAP: 0.051) 6I)T+BR: (PAP: 0.019) 5anGRP Guess (PAP: 0.01)

Clustering performance of Food-CNN (Tested on Food-101 dataset). Ø K-neighbors of each test image, calculate the precision-recall for each K

slide-21
SLIDE 21

Experiments: user study

Ø 227 anonymous users. Ø Two factors were controlled in the study. Online Perceptron (OP) Label Propagation and Exponentiated Gradient (LE) Exploration and Exploration- exploitation (EE) Random Selection (RS)

  • 1st. Algorithm:
  • 2nd. Number of iterations: 5/10/15
slide-22
SLIDE 22

Experiments: user study

Ø Algorithm to test: LE+EE

One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)

Exploration Exploration-exploitation PlateClick (10 iters) Testing (10 iters)

Ø Trials: 1/3

slide-23
SLIDE 23

Experiments: user study

One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)

Exploration Exploration-exploitation PlateClick (5 iters) Testing (10 iters)

Ø Algorithm to test: LE+EE Ø Trials: 2/3

slide-24
SLIDE 24

Experiments: user study

One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)

Exploration Exploration-exploitation PlateClick (15 iters) Testing (10 iters)

Ø Algorithm to test: LE+EE Ø Trials: 3/3

slide-25
SLIDE 25

Experiments: user study

* * ** ** *** *** * **

Prediction accuracy under different algorithms and number of iterations

slide-26
SLIDE 26

Experiments: user study

0.0 0.2 0.4 0.6 0.8 1.0

PredLctLon AccurDcy

0.0 0.2 0.4 0.6 0.8 1.0

CuPulDtLve DLstrLEutLon

LE+EE:5 LE+EE:10 LE+EE:15

Cumulative distribution of prediction accuracy for LE+EE algorithm

slide-27
SLIDE 27

Conclusions and Future work

Ø Engine for food preferences learning. Ø Applicable to general human-in-the-loop problems.

slide-28
SLIDE 28

For more information:

http://www.cs.cornell.edu/~ylongqi

Try it out online:

http://bit.ly/plateclick http://smalldata.io/ @ylongqi ylongqi@cs.cornell.edu