real time image recognition
play

Real-Time Image Recognition Nikita Shamgunov, CEO, MemSQL - PowerPoint PPT Presentation

Real-Time Image Recognition Nikita Shamgunov, CEO, MemSQL In-Memory Computing Summit 2017 1 The future of computing is visual 2 and also numerical :) 3 4 5 6 7 Putting image recognition to work today How It Works 10


  1. Real-Time 
 Image Recognition Nikita Shamgunov, CEO, MemSQL In-Memory Computing Summit 2017 1

  2. The future of computing is visual 2

  3. and also numerical :) 3

  4. 4

  5. 5

  6. 6

  7. 7

  8. Putting image recognition to work today

  9. How It Works 10

  10. Real-Time Image Recognition Workflow ▪ Train the model with Spark, TensorFlow, and Gluon ▪ Use the Model to extract feature vectors from images • Model + Image => FV ▪ You can store every feature vector in a MemSQL table CREATE TABLE features ( id bigint(11) NOT NULL AUTO_INCREMENT, image binary(4096) DEFAULT NULL, KEY id (id) USING CLUSTERED COLUMNSTORE ) 11

  11. Working with Feature Vectors For every image, we store an ID and a normalized feature vector in a MemSQL table called features . ID | Feature Vector x | 4KB To find similar images, we use this SQL query SELECT id FROM features WHERE DOT_PRODUCT (feature * <input> ) > 0.9 12

  12. Understanding Dot Product ▪ Dot Product is an algebraic operation • SUM(Xi*Yi) TODO: Put a formula ▪ With the specific model and normalized feature vectors DOT PRODUCT results in a similarity score • The closer the score is to 1 the more similar are the images 13

  13. Performance Enhancing Techniques Achieving best-in-class Dot Product implementation ▪ SIMD-powered ▪ Data compression ▪ Query parallelism ▪ Scale out ▪ Result: Processing at Memory Bandwidth Speed 14

  14. Performance Numbers ▪ Memory Speed: 50GB/sec ▪ Each vector 4K ▪ 12.5 Million Images a second per node or ▪ 1 Billion images a second on 100 node cluster 15

  15. Demo

  16. Demo Architecture ML Images Model Framework Real-time 
 ML image 
 Framework recognition Persistent, Queryable Format 17

  17. SELECT id FROM features WHERE DOT_PRODUCT(image, 0xa334efa…) 18

  18. About MemSQL

  19. MemSQL: The Real-Time Data Warehouse ▪ Scalable ▪ Deployment • Petabyte scale • MemSQL Cloud • High concurrency • Any public cloud • System of record • On-premises ▪ Real-time ▪ Developer Edition • Operational • Unlimited scale • Limited high availability Compatible ▪ and security features • ETL • Business Intelligence • Kafka • Spark 20

  20. 2017 Magic Quadrant for Data Management Solutions for Analytics 21

  21. About ML Training

  22. ML training is available through a variety of frameworks, including Spark MLlib, TensorFlow, Gluon, and Caffe. 23

  23. 24

  24. Understanding ML Frameworks and MemSQL ML Frameworks MemSQL Fast, large scale Fast, large scale General processing engines Real-time data warehouse Great for training Great for real-time scoring 25

  25. Example: MemSQL Spark Connector Highly parallel, high throughput, bi-directional 26

  26. Thank you! @NikitaShamgunov www.memsql.com

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend