IMAGE RECOGNITION WITH SYNTHETIC DATA FOR RETAIL THERE WILL BE - - PowerPoint PPT Presentation
IMAGE RECOGNITION WITH SYNTHETIC DATA FOR RETAIL THERE WILL BE - - PowerPoint PPT Presentation
IMAGE RECOGNITION WITH SYNTHETIC DATA FOR RETAIL THERE WILL BE NO MATHEMATICAL FORMULAS Chris
Chris
В ЭТОЙ ПРЕЗЕНТАЦИИ НЕ БУДЕТ НИ ОДНОЙ МАТЕМАТИЧЕСКОЙ ФОРМУЛЫ
Наличие товара на полке
THERE WILL BE
NO
MATHEMATICAL FORMULAS IN THIS PRESENTATION
Наличие товара на полке
Наличие товара на полке
Наличие товара на полке
Solving business problems
Наличие товара на полке
Correct results Control over the progress Optimizing the work of field staff Reliability of collected data
Chris
- Image recognition for objects on a supermarket shelf
is the first and key stage of automation in retail
- We, humans, get 85% of our information through
visual sensors (eyes)
- Computer vision, i.e., training mathematical models to see and
understand what they see, is a fundamental problem of automation
IMAGE RECOGNITION IN RETAIL BY ECR RESEARCH:
ABOUT 40 BLN IMAGES PER YEAR
Chris
We give the business
1. Cost reduction 2. Scalability 3. Sales growth
IN OUR SOLUTIONS, IMAGE REGOCNITION CAN BE DONE
ON AN OFFLINE MOBILE DEVICE
Наличие товара на полке
CHALLENGES FOR
IMAGE RECOGNITION
IN RETAIL APPLICATIONS
Chris
- To automatically recognize goods on the
shelf, we need to design and train deep neural networks
- To teach the model to distinguish
between Pepsi Cola and Coca-Cola, it needs to “see” thousands of labeled images from both classes
- Central problem in many applications of
neural networks: where will the labeled data come from?
LOTS OF LABELED DATA REQUIRED
- Current catalogue of goods only in
Russian-speaking (ex-USSR) retail:
- Each SKU requires
- Moreover, the network needs “real-looking” photos,
i.e., labeled products on photos from real shelves...
170 000 SKU 1000 - 5000
labeled photos
WHERE WILL ALL THIS LABELED DATA COME FROM?
Chris
- Where can you get 1 billion
labeled photos?
- Today they are labeled by hand, usually
through crowdsourcing services
- A “turker” labels ~50 photos per 8 hour
workday, ~$0.2 per photo
PARADOX: TO AUTOMATE HUMAN LABOR, WE NEED ORDERS OF MAGNITUDE MORE HUMAN LABOR
Chris
- Manual labeling – years of work, hundreds of million dollars
- Moreover, crowdsourcing labeling is unreliable (the human factor),
and you need to cross-check, labeling each photo several times
- Wrong data will certainly lead to wrong models
1 BLN LABELED PHOTOS = 120 MLN WORK HOURS
Наличие товара на полке
THE NEUROMATION SOLUTION
- We create a virtual copy of the shelf (a 3D model) and place 3D objects
- Then we can generate unlimited amounts of labeled synthetic data
- We train deep neural networks on synthetic data
- Reaching good accuracy much faster and cheaper than the competition
Advantages of synthetic data
- We generate 100% accurate labeled
data, with pixel-perfect labeling which is impossible to do by hand
- Increase the speed of automation by
- rders of magnitude
- The resulting solution is several
times cheaper than hand labeling
Examples
Examples
Наличие товара на полке
THE SYNERGY BIG DATA + IMAGE RECOGNITION
FOR FMCG RETAIL BUSINESS
Chris
BIG DATA + IMAGE RECOGNITION
OSA Hybrid Platform Image Recognition by Neuromation
Together we develop a new approach to solving business problems for retail through automation
Chris
SUMMA TECHNOLOGIA = SALES GROWTH
Chris