Bootstrapping Food Preferences Through an Adaptive Visual Interface
Longqi Yang, Yin Cui, Fan Zhang, JP Pollak, Serge Belongie, Deborah Estrin
Bootstrapping Food Preferences Through an Adaptive Visual Interface - - PowerPoint PPT Presentation
Bootstrapping Food Preferences Through an Adaptive Visual Interface Longqi Yang , Yin Cui, Fan Zhang, JP Pollak, Serge Belongie, Deborah Estrin MOTIVATION Food preferences learning is important! Health and Life Obesity 113M HBP 50M
Longqi Yang, Yin Cui, Fan Zhang, JP Pollak, Serge Belongie, Deborah Estrin
*Number of Americans Living with Diet-and Inactivity-Related Diseases
Personalize Treatment Plan
Customers Targeting
Food environment at home
Customized dishes
Content personalization
Healthy recommendations Personalized Diet Profile
Start Diet Profile
10 food items
Pairwise Comparison
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Online Learning Ø What images to present to the user? Ø How to update users’ preferences? Online Learning framework (LE + EE) Food Similarity Embedding Users have close preferences for similar items Ø Feature representation that can reflect similarities Food Items Harvesting Ø Food images and metadata.
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Food Items Harvesting
Ø 12,000 food items from Yummly API. Ø Images + Metadata (ingredients, nutrients etc.) Ø Outliers filtering, 10,028 items were used.
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Food Similarity Embedding
Representation: 1000 dim visual + 200 dim ingredients 1000 dim visual feature from Food-CNN Image 2 Image 1 x y
Contrastive Loss f(x) f(y)
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Food Similarity Embedding
Representation: 1000 dim visual + 200 dim ingredients 1000 dim visual feature from Food-CNN
− ≈ 0 − > 𝑛 , , 𝓜 = 𝟐 𝟑𝒎𝑬𝟑 + 𝟐 𝟑 𝟐 − 𝒎 𝐧𝐛𝐲 (𝟏, 𝒏 − 𝑬) 𝟑 𝒎 = 𝟐 𝒎 = 𝟏
Pairs/Labels were sampled from Food-101 dataset
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Food Similarity Embedding
Representation: 1000 dim visual + 200 dim ingredients 200 dim ingredients feature Ø Lemmatization and preprocessing. Ø Filtering: Top 200 ingredients. Ø Feature vector: 0-1 vector denotes the existence of the ingredient. Visual and ingredients feature vectors are normalized separately with 𝒎𝟐 norm
Pr Pre-trai aine ned d Deep ep Siamese se Ne Networ
Ingr gred edient ents 1000 dim 200 dim Image raw pixels Metadata Food Items Harvesting Ingredients Nutrients …… Us User er Pref efer eren ence Food
eady expl plor
ed Backend Online Learning
Explo lora rati tion
2 iter eratio tions) Explo lora rati tion
loita itati tion
# Iterations
Food Similarity Embedding
Yuc uck Yuc uck
Visual User Interface
Online Learning
Food preferences representation:
=
Distribution of preferences over all food items in 𝒯
t t t t t
Two tasks at each iteration t: Ø User state update: update 𝒒? based on the items presented and user’s choices at iteration t-1. Ø Images selection: Select a set of images to show at iteration t.
Online Learning
Online Learning Label propagation with regularized optimization
𝒗
E H 𝒯 EI9,EJ=
E − 𝑧E H 𝒯 EI9,EJ=
Label Propagation and Exponentiated Gradient Algorithm (LE)
E = < 𝜕=E𝑧= 𝒯 =I9
?
?K9×𝑓 NOP
QRS
TP
QRS
K9 HUV 𝒈XPK𝒈XY
Online Learning
Online Learning
0.0 0.2 0.4 0.6 0.8 1.0
5eFall
10-2 10-1 100
PreFLsLRn(LRgarLWhPLF 6Fale)
)RRG-C11 (PAP: 0.216) Alex1eW (PAP: 0.051) 6I)T+BR: (PAP: 0.019) 5anGRP Guess (PAP: 0.01)
Clustering performance of Food-CNN (Tested on Food-101 dataset). Ø K-neighbors of each test image, calculate the precision-recall for each K
One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)
Exploration Exploration-exploitation PlateClick (10 iters) Testing (10 iters)
One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)
Exploration Exploration-exploitation PlateClick (5 iters) Testing (10 iters)
One image from top 1% of preference value. (unexplored) The other image from bottom 1% of preference value. (unexplored)
Exploration Exploration-exploitation PlateClick (15 iters) Testing (10 iters)
* * ** ** *** *** * **
0.0 0.2 0.4 0.6 0.8 1.0
PredLctLon AccurDcy
0.0 0.2 0.4 0.6 0.8 1.0
CuPulDtLve DLstrLEutLon
LE+EE:5 LE+EE:10 LE+EE:15