cloud instance selection for your gpu fleet
play

Cloud Instance Selection for Your GPU Fleet Lessons from Developing - PowerPoint PPT Presentation

Cloud Instance Selection for Your GPU Fleet Lessons from Developing Smart Kitchen Technology Hristo Bojinov, CTO Rob Laber, Lead CV Engineer Motivation Motivation Motivation The Thesis The Problem Food People disconnect Not-so-smart


  1. Cloud Instance Selection for Your GPU Fleet Lessons from Developing Smart Kitchen Technology Hristo Bojinov, CTO Rob Laber, Lead CV Engineer

  2. Motivation

  3. Motivation

  4. Motivation

  5. The Thesis

  6. The Problem Food ↔ People disconnect Not-so-smart “smart kitchen” Food info not available, not actionable

  7. The Vision

  8. What We Do Food, personalization, technology “The Way You Eat” Icons made by Madebyoliver, Popcorn Arts, Freepik from www.flaticon.com are licensed by CC 3.0 BY

  9. What We Do Food, personalization, technology “The Way You Eat” ( ⇒ Computer Vision is essential) Icons made by Madebyoliver, Popcorn Arts, Freepik from www.flaticon.com are licensed by CC 3.0 BY

  10. Pushing the Envelope on CV Go from here...

  11. Pushing the Envelope on CV … to here.

  12. Computer Vision at Innit Helps us understand users Inventory, behaviors, multi-sensor fusion, market analytics ❖ And, build a delightful user experience ❖ Applications in storage and processing Recognize and act on food state ❖ Visible light, depth, IR ❖ Multi-site program (HQ, academia)

  13. CV Research

  14. Food Recognition Service (AWS) G2 & G3 instance backend (details coming up) ❖ Frontend orchestrates auto and manual processing ❖ Service API for 3rd party use ❖

  15. CV Tech: Food Recognition System Computer Service Vision Layer Backend Offline Web Data Processing Portal Store

  16. CV Tech: Food Recognition System

  17. CV Tech: Object Detection Stage

  18. CV Tech: Object Detection Stage

  19. CV Tech: Object Detection Stage

  20. CV Tech: Object Detection Stage

  21. CV Tech: Classification Stage

  22. CV Tech: Classification Stage

  23. CV Tech: Classification Stage

  24. CV Tech: Classification Stage Controlled scene layout ⇒ precision In-house data collection and tools CNN based

  25. CV Tech: Product DB Image Retrieval

  26. CV Tech: Product DB Image Retrieval ❖ Exact product (or attribute) matching ❖ KAZE descriptors (GPU acceleration WIP, stay tuned) ➢ Current need to balance CPU/GPU ➢ Order-of-magnitude acceleration ❖ Hierarchical analysis in the pipeline ❖ + Precise matching using CNN

  27. Summary: The Challenge Complex Scenes Segmentation / object detection required ❖ Must handle occlusions, uncertainty ❖ Highly variable lighting conditions ❖ ❖ Product DB not 100% accurate Non-image feeds? ❖

  28. Cloud instances: types, plans, sizes...

  29. So what is all this? Computer Service Vision Layer Backend Offline Web Data Processing Portal Store

  30. Decoupled Service Architecture Computer Service Vision Layer Backend Offline Web Data Processing Portal Store

  31. Decoupled Service Architecture Computer Service Vision Layer Backend

  32. Decoupled Service Architecture ❖ Single responsibility Computer Service Vision Layer Backend

  33. Decoupled Service Architecture ❖ Single responsibility Computer Service Vision Layer Backend ❖ Stateless GPUs

  34. Decoupled Service Architecture ❖ Single responsibility Computer Service Vision Layer Backend ❖ Stateless GPUs ❖

  35. Decoupled Service Architecture Computer Service Vision Layer Backend

  36. Decoupled Service Architecture ❖ Non-vision tasks Computer Service Vision Layer Backend

  37. Decoupled Service Architecture ❖ Non-vision tasks Computer Service Vision Layer Backend ❖ Horizontal scaling

  38. Decoupled Service Architecture ❖ Non-vision tasks Computer Service Vision Layer Backend ❖ Horizontal scaling ❖ Reserved instances

  39. Decoupled Service Architecture

  40. Decoupled Service Architecture WHY?

  41. Decoupled Service Architecture Computer Service Vision Layer Backend

  42. Decoupled Service Architecture Service Layer Computer Service Vision Layer Backend Service Layer

  43. Decoupled Service Architecture Service Layer Cost-effective Computer Service Vision buffer Layer Backend Service Layer

  44. Decoupled Service Architecture To summarize... Computer Service Vision Layer Backend

  45. Decoupled Service Architecture To summarize... Computer Service Vision Layer Backend ❖ Reserved instances

  46. Decoupled Service Architecture To summarize... Computer Service Vision Layer Backend ❖ Reserved instances ❖ Optimize hardware usage

  47. Decoupled Service Architecture To summarize... Computer Service Vision Layer Backend ❖ Reserved instances Delegate tasks to ❖ Optimize hardware save money! usage

  48. Isolated Background Jobs Computer Service Vision Layer Backend Offline Web Data Processing Portal Store

  49. Isolated Background Jobs Clustering

  50. Isolated Background Jobs Big Data

  51. Isolated Background Jobs Neural Net Training

  52. Isolated Background Jobs Offline Data Processing Store

  53. Isolated Background Jobs ❖ One-off or periodic Offline Data Processing Store

  54. Isolated Background Jobs ❖ One-off or periodic ❖ Resource intensive Offline Data Processing Store

  55. Isolated Background Jobs ❖ One-off or periodic ❖ Resource intensive ❖ Need it now? Offline Data Processing Store

  56. Spot Pricing?

  57. Spot Pricing? ❖ Auctioned resources

  58. Spot Pricing? ❖ Auctioned resources ❖ Price changes with resource availability

  59. Spot Pricing? ❖ Auctioned resources ❖ Price changes with resource availability ❖ Interrupted jobs

  60. Isolated Background Jobs ❖ One-off or periodic Spot pricing saves money! ❖ Resource intensive ❖ Need it now? Offline Data Processing Store

  61. Isolated Background Jobs

  62. Trade-Offs

  63. Trade-Offs ❖ Cost vs. Performance

  64. Trade-Offs ❖ Cost vs. Performance ❖ Local vs. Cloud

  65. Trade-Offs ❖ Cost vs. Performance ❖ Local vs. Cloud ❖ Cost vs. Complexity

  66. Trade-Offs ❖ Cost vs. Performance ❖ Local vs. Cloud ❖ Cost vs. Complexity ❖ Now vs. Later

  67. Takeaways ❖ Complex workload → use differentiated EC2 instances ❖ Profile GPU-dependent tasks, choose carefully ❖ Consider online / offline loads, spot instances

  68. Join the party! � hristo@innit.com

  69. About Innit Inform and elevate the interaction between people and food ❖ ❖ 5+ years in the making, substantial funding, IP & tech Partnerships (appliances, FMCG, culinary) ❖ About the Presenters R&D and Technology at Innit ❖

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend