Petia Radeva,
Collaboration with: Eduardo Aguilar, Marc Bolaños
University of Barcelona &
Computer Vision Center radevap@gmail.com
Uncertainty-Aware Food Recognition by Deep Learning
11:01
Uncertainty-Aware Food Recognition by Deep Learning Petia Radeva, - - PowerPoint PPT Presentation
Uncertainty-Aware Food Recognition by Deep Learning Petia Radeva, Collaboration with: Eduardo Aguilar, Marc Bolaos University of Barcelona & Computer Vision Center radevap@gmail.com 11:01 The Diabetes pandemy Diabetic people need to
Petia Radeva,
Collaboration with: Eduardo Aguilar, Marc Bolaños
University of Barcelona &
Computer Vision Center radevap@gmail.com
11:01
Diabetic people need to follow a strict record of their meals!
09:53
09:53
09:53
Sparkpeople LoseIt! MyFitnessPal Cronometer Fatsecret
24 hours dietary recall
Automatic visual food recognition tools for dietary assessment.
09:55
https://techcrunch.com/2016/09/29/lose-it-launches-snap-it-to-let-users-count-calories-in-food-photos/
How many food categories there are? Today we are speaking about 200.000 food categories, 8000 basic food (Wikipedia).
Is it possible?
09:41
Huge intra-class variations Ambiguous definition Inter-class similarities Mixed items Need of huge datasets Bad Labeled
09:41
What to do when you have a really complicate problem?
11:28
1. "Deep Residual Learning for Image Recognition" (2016) Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition 25,256 citations 2. "Deep learning" (2015) Nature 16,750 citations 3. "Going Deeper with Convolutions" (2015) Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 14,424 citations 4. "Fully Convolutional Networks for Semantic Segmentation" (2015) Proceedings of the IEEE Conf. on Computer Vision and Pattern Recognition 10,153 citations 5. "Prevalence of Childhood and Adult Obesity in the United States, 2011-2012" (2014) JAMA 8,057 citations 6. "Global, regional, and national prevalence of overweight and obesity in children and adults during 1980–2013: a systematic analysis for the Global Burden of Disease Study 2013" (2014) Lancet 7,371 citations 7. "Observation of Gravitational Waves from a Binary Black Hole Merger" (2016) Physical Review Letters 6,009 citations
11:14 12
11:17 13
11:17 14
[Gatys et al. 2015]
11:15 16
This picture made by a GAN was sold for $432,500 and it’s not even real.
11:17 17
Deep Learning’s ‘Permanent Peak’ On Gartner’s Hype Cycle
11:14 18
Toni Hey, 2009
Data Resources Models
11:14 19
11:14 20 GPU cores is based on matrix multiplication https://www.doc.ic.ac.uk/~jce317/history-machine-learning.html#top
90% of all digital data were generated last 2 years.
Every minute of the day:
Daily:
11:14 21
https://www.forbes.com/sites/bernardmarr/2018/05/21/how-much-data-do-we-create-every-day-the-mind-blowing-stats-everyone-should-read/#46be238160ba
09:56
Number of
Number of images/Database
ImageNet & Deep learning
11:57 https://www.datasetlist.com
LVIS Challenge: 2.2M masks, 16K images SocialIQ Places2: 10M images TACO: Waste in the wild FastMRI Lyft Level 5
09:56
Food256: 25.600 images (100 images/class) Classes: 256 Food101 – 101.000 images (1000 images/class) Classes: 101 Food101+FoodCAT: 146.392 (101.000+45.392) Classes: 231
150.000 images 231 categories 1.400.000 images 1000 categories ????? images 200.000 categories Food DB ImageNet Future Food DB FoodImageNet soon to come!
09:56
11:47
LeCun, Chief AI Scientist for Facebook AI Research (FAIR), and a Silver Professor at New York University A.Krijevksi et.al. 2012, Google Brain & Waymo.
11:14 28
The process of training a CNN consists of training all hyperparameters: convolutional matrices and weights of the fully connected layers.
It has the three advantages:
11:16 29
09:41
Henry Roth is a man afraid of commitment up until he meets the beautiful Lucy. They hit it off and Henry think he's finally found the girl of his dreams, until he discovers she has short-term memory loss and forgets him the next day. Multi-ta sk learning Domain adaptati
Self-tha ught learning Unsuper vised transfer learning
11:17
10:47
09:41
09:41
run-time is prohibitive
complimentary tasks.
09:42
○ The main task is fixed and weights are learned for each side-task ([1]). ○ Weight the tasks according to the homoscedastic uncertainty ([2]).
[1] X. Yin and X. Liu. Multi-task convolutional neural network for face recognition. [2] A. Kendall, Y. Gal, and R. Cipolla. Multi-task learning using uncertainty to weigh losses for scene geometry and semantics.
09:42
09:42
Gal’16
asks the model to decide what is the object using a photo of a chocolate cake.
09:42 Adapted from Gal (2016) Who is the guilty for this?
09:42
Adapted from Gal (2016)
dataset? What model structure should we use?
09:42
Gal (2016)
Aleatoric – captures the noise inherent in the observations
09:42
09:42
How to determine the total loss of the MTF?
Use aleatoric uncertainty modeling to make the model more clever! Aleatoric uncertainty – How to model it?
In total: more than 550.000 images
09:42
Eduardo Aguilar, Marc Bolaños, Petia Radeva: Regularized uncertainty-based multi-task learning model for food analysis. J. Visual Communication and Image Representation 60: 360-370 (2019)
09:42
Food category and class recognition
By Mostafa Kamal, Domenec Puig et.al.
09:42
09:42