Deep Learning in the Connected Kitchen or Launching a Computer - - PowerPoint PPT Presentation

deep learning in the connected kitchen
SMART_READER_LITE
LIVE PREVIEW

Deep Learning in the Connected Kitchen or Launching a Computer - - PowerPoint PPT Presentation

Deep Learning in the Connected Kitchen or Launching a Computer Vision program in a new vertical Hristo Bojinov, CTO Company Vision The Problem Food People disconnect Not-so-smart smart kitchen Food info not available, not


slide-1
SLIDE 1

Deep Learning in the Connected Kitchen

  • r

“Launching a Computer Vision program in a new vertical”

Hristo Bojinov, CTO

slide-2
SLIDE 2
slide-3
SLIDE 3

Company Vision

slide-4
SLIDE 4

The Problem

Food ↔ People disconnect Not-so-smart “smart kitchen” Food info not available, not actionable

slide-5
SLIDE 5

What We Do

Food, personalization, technology “Give food a voice” (⇒ Computer Vision is essential)

Icons made by Madebyoliver, Popcorn Arts, Freepik from www.flaticon.com are licensed by CC 3.0 BY

slide-6
SLIDE 6
slide-7
SLIDE 7

Computer Vision at Innit

Helps us understand users

❖ Inventory, behaviors, multi-sensor fusion, market analytics ❖ And, build a delightful user experience

Applications in storage and processing

❖ Recognize and act on food state ❖ Visible light, depth, IR

slide-8
SLIDE 8

Program Logistics

Multi-site program (HQ, academia) Food Recognition service (AWS)

❖ G2 instance backend (blend of CPU and GPU workload) ❖ Frontend orchestrates auto and manual processing ❖ Service API for 3rd party use

slide-9
SLIDE 9

CV Tech: Food Recognition System

slide-10
SLIDE 10

CV Tech: Food Recognition System

slide-11
SLIDE 11

CV Tech: Food Recognition System

Data is King!

slide-12
SLIDE 12

CV Tech: Object Detection Stage

slide-13
SLIDE 13

CV Tech: Object Detection Stage

slide-14
SLIDE 14

CV Tech: Object Detection Stage

slide-15
SLIDE 15

CV Tech: Object Detection Stage

DetectNet ➔ Easy setup and initial training ➔ Python layers, “low resolution” Faster-RCNN ➔ Multi-phase training/tuning ➔ High resolution & recall 😁 DeepMask & SharpMask

slide-16
SLIDE 16

CV Tech: Object Detection Stage

slide-17
SLIDE 17

CV Tech: Classification Stage

slide-18
SLIDE 18

CV Tech: Classification Stage

slide-19
SLIDE 19

CV Tech: Classification Stage

slide-20
SLIDE 20

CV Tech: Classification Stage

Controlled scene layout ⇒ precision In-house data collection and tools Command-line → DIGITS AlexNet → VGG

slide-21
SLIDE 21

CV Tech: Product DB Image Retrieval

slide-22
SLIDE 22

CV Tech: Product DB Image Retrieval

❖ Exact product (or attribute) matching ❖ KAZE descriptors (GPU acceleration WIP) ➢ Current need to balance CPU/GPU ➢ Order-of-magnitude acceleration ❖ Hierarchical analysis in the pipeline

slide-23
SLIDE 23

CV Research: Training on Synthetic Sets

slide-24
SLIDE 24

CV Research: Text Extraction

slide-25
SLIDE 25

In a nutshell...

❖ Focus on differentiated capabilities, in the food space ❖ Tie in with all stages of human ↔ food interaction ❖ Fusion of images & other “sensors” ❖ GPU tech a strong enabler

slide-26
SLIDE 26

Takeaways

❖ Objectives → domain constraints (good!) ❖ Sources of initial training+test data; build tools ❖ Hardware (local experiments OK, cloud for serving) ❖ Software (don’t get tied to a framework; abstract away)

slide-27
SLIDE 27

We are hiring! 🚁 hristo@innit.com

slide-28
SLIDE 28

About Innit

❖ Inform and elevate the interaction between people and food ❖ 4+ years in the making, substantial funding, IP & tech ❖ Pirch SOHO, ShopWell

About the Speaker

❖ Embedded & Security ❖ Android, Computer Vision ❖ Computer technology at Innit