6.808 Mobile and Sensor Computing (aka IoT Systems), Lecture 14: Split Computing / Continuous Object Recognition



SLIDE 1

6.808 Mobile and Sensor Computing (aka IoT Systems)

Lecture 14: Split Computing / Continuous Object Recognition

SLIDE 2

Logistics & Norm Setting

  • What to do now?
  • Turn on your video (if your connection allows it)
  • Mute your mic (unless you are the active speaker)
  • Open the “Participant” list
  • Make sure your full name is shown
  • If you have a question:
  • Use the chat feature to either write the question or to indicate your interest in asking the question
  • James will be monitoring the chat
  • Unmute -> ask question -> mute again
  • Same procedure for answering questions
  • We will post this online
SLIDE 3

Continuous, Real-Time Object Recognition on Mobile Devices

Glimpse

Tiffany Chen, Lenin Ravindranath, Shuo Deng, Victor Bahl, Hari Balakrishnan

SLIDE 4

Continuous, Real-Time Recognition Apps

  • Apps that continuously locate and label objects in a video stream.

SLIDE 5

Continuous, Real-Time Recognition Apps

  • Face recognition
  • Augmented reality shopping
  • Augmented reality tourist app
  • Driver assistance

SLOW

SLIDE 6

Earlier Designs: Picture-Based Object Recognition
SLIDE 8

Earlier Designs: Picture-Based Object Recognition

(Example overlay: “calories: 180”)

SLIDE 9

Video-Based Object Recognition

SLIDE 10

Video-Based Object Recognition

(Example overlays: “Top seller”, “Buy 1 get 1 free”)

SLIDE 11

Glimpse

  • Continuous, real-time object recognition on mobile devices in a video stream

SLIDE 12

Glimpse

  • Continuous, real-time object recognition on mobile devices in a video stream
  • Continuously identify and locate objects in each frame

[Figure: successive video frames with faces labeled “Alice” and “Bob”]

SLIDE 13

Object Recognition Pipeline

SLIDE 14

Object Recognition Pipeline

Detection → Feature Extraction → Classification

SLIDE 18

Object Recognition Pipeline

Detection → Feature Extraction → Classification

Stop Sign

SLIDE 19

Before Convolutional Neural Network

Detection → Feature Extraction → Classification

SLIDE 20

Feature Extraction

Feature engineering

Before Convolutional Neural Network

SLIDE 22

Feature Extraction
Feature engineering

Before Convolutional Neural Network

SLIDE 23

Convolutional Neural Network

Feature Extraction

Feature learning

Berkeley Caffe: http://caffe.berkeleyvision.org/

SLIDE 24

Object Recognition Pipeline

Detection → Feature Extraction → Classification

Stop Sign

SLIDE 25

Object Recognition Pipeline

Detection → Feature Extraction → Classification

  • Computationally expensive and memory-intensive
  • Server is 700x faster than Google Glass
  • Scalability
  • We need to offload the recognition pipeline to servers

Stop Sign

SLIDE 26

Client-Server Architecture

[Diagram: the client camera sends each frame over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]
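To make the latency problem concrete, here is a minimal Python simulation of the naive offload loop (my sketch, not code from the paper): the client sends every frame, but each server result returns a fixed number of frames later, so what the client can display always describes an older frame.

```python
from collections import deque

def naive_offload(frames, recognize, delay_frames):
    """Simulate a client that offloads every frame to a server whose
    result arrives `delay_frames` frames later. Returns, for each frame,
    the most recent result the client can actually display at that time."""
    in_flight = deque()          # (arrival_frame_index, result)
    displayed = []
    last_result = None
    for i, frame in enumerate(frames):
        in_flight.append((i + delay_frames, recognize(frame)))
        # Deliver any server responses that have arrived by now.
        while in_flight and in_flight[0][0] <= i:
            last_result = in_flight.popleft()[1]
        displayed.append(last_result)
    return displayed

# Toy "recognition": the label is just the frame content, uppercased.
frames = ["f0", "f1", "f2", "f3", "f4", "f5"]
shown = naive_offload(frames, lambda f: f.upper(), delay_frames=2)
print(shown)  # [None, None, 'F0', 'F1', 'F2', 'F3']
```

With a two-frame delay, every displayed label trails the live video by the round-trip time; this is the gap the active cache is designed to close.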

SLIDE 27

End-to-End Latency Lowers Accuracy

[Figure: “Expected” vs. “In reality…”]

SLIDE 28

Client-Server Architecture

[Diagram: the client camera sends each frame over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]

Challenges

  • 1. End-to-end latency lowers object recognition accuracy
SLIDE 29

Client-Server Architecture

[Diagram: the client camera sends each frame over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]

Challenges

  • 1. End-to-end latency lowers object recognition accuracy
  • 2. Bandwidth and battery efficiency
SLIDE 30

Glimpse Architecture

[Diagram: the client camera feeds an Active Cache; frames go over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]

  • 1. Active Cache combats e2e latency and regains accuracy
SLIDE 31

Glimpse Architecture

[Diagram: the client camera feeds an Active Cache; trigger frames go over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]

  • 1. Active Cache combats e2e latency and regains accuracy
  • 2. Trigger Frame reduces bandwidth usage
SLIDE 33

End-to-End Latency Lowers Accuracy

Is it possible to combat latency and regain accuracy?

SLIDE 34

Relocate Moving Object with Tracking

  • Object tracking on the client to re-locate the object

Frame 0 → Frame 12 (delay = 360 ms)

SLIDE 35

Relocate Moving Object with Tracking

  • Object tracking on the client to re-locate the object

Frame 0 → Frame 12 (delay = 360 ms)

Fast

SLIDE 36

Relocate Moving Object with Tracking

  • Object tracking on the client to re-locate the object
  • Fails to work when object displacement is large

SLIDE 37

Relocate Moving Object with Tracking

  • Object tracking on the client to re-locate the object
  • Fails to work when object displacement is large

Frame 0 → Frame 30 (delay = 1 sec)
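A toy version of client-side tracking illustrates the failure mode (my 1-D sketch, not Glimpse's tracker, which tracks feature points): find the object in the next frame by minimizing sum-of-absolute-differences within a small search window around its old position. When the object moves farther than the search radius, the match lands in the wrong place.

```python
def track_1d(prev_frame, next_frame, pos, size, radius):
    """Find a patch of `size` samples (starting at `pos` in prev_frame)
    in next_frame by minimizing sum-of-absolute-differences, searching
    only within `radius` of the old position."""
    patch = prev_frame[pos:pos + size]
    best_pos, best_cost = pos, float("inf")
    lo = max(0, pos - radius)
    hi = min(len(next_frame) - size, pos + radius)
    for p in range(lo, hi + 1):
        cost = sum(abs(a - b) for a, b in zip(patch, next_frame[p:p + size]))
        if cost < best_cost:
            best_pos, best_cost = p, cost
    return best_pos

frame0 = [0]*10 + [9, 9, 9] + [0]*17   # object at position 10
small  = [0]*12 + [9, 9, 9] + [0]*15   # moved by 2: inside the radius
large  = [0]*25 + [9, 9, 9] + [0]*2    # moved by 15: outside the radius

print(track_1d(frame0, small, pos=10, size=3, radius=5))   # 12: tracked correctly
print(track_1d(frame0, large, pos=10, size=3, radius=5))   # 5: lost (object is actually at 25)
```

The second call shows the large-displacement failure: the object has left the search window, so every candidate looks equally bad and the tracker returns a wrong position.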

SLIDE 38

Regain Accuracy with Active Cache

  • Cache and run tracking through the cached frames
SLIDE 39

Regain Accuracy with Active Cache

  • Cache and run tracking through the cached frames

[Diagram: Frame 0 is in flight to the server (network delay = 1 sec) while new frames accumulate in the Active Cache]

SLIDE 40

Regain Accuracy with Active Cache

  • Cache and run tracking through the cached frames

[Diagram: the server’s result for Frame 0 (“Alice”) returns after the 1 sec network delay]

SLIDE 41

Regain Accuracy with Active Cache

  • Cache and run tracking through the cached frames

[Diagram: the “Alice” result for Frame 0 arrives at Frame 30 (network delay = 1 sec); the client runs tracking from Frame 0 through the cached frames up to Frame 30]
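The catch-up step can be sketched as follows (a simplified Python model; the stand-in tracker just accumulates per-frame displacement, whereas Glimpse's real tracker operates on image feature points):

```python
def catch_up(stale_box, cached_displacements):
    """When a stale server result arrives (a bounding box for an old
    frame), roll it forward by applying the motion observed in each
    cached frame since then. Boxes are (x, y, w, h); displacements (dx, dy)."""
    x, y, w, h = stale_box
    for dx, dy in cached_displacements:
        x, y = x + dx, y + dy
    return (x, y, w, h)

# Server answered for frame 0; we are now 4 frames later, with one
# estimated motion vector per cached frame.
box_frame0 = (100, 50, 40, 40)
motions = [(3, 0), (3, 1), (2, 1), (4, 0)]
print(catch_up(box_frame0, motions))  # (112, 52, 40, 40)
```

The label itself is reused unchanged; only the bounding box is moved to where tracking says the object is now.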

SLIDE 42

Regain Accuracy with Active Cache

  • Cache and run tracking through the cached frames

Tracking through all cached frames takes too long!
SLIDE 44

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

SLIDE 45

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 1. How many frames to select?
  • 2. Which frames to select?
SLIDE 46

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 1. How many frames to select?

‧ s_selected: active cache processing time vs. tracking accuracy

SLIDE 47

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 1. How many frames to select?

‧ s_selected: active cache processing time vs. tracking accuracy

What is the maximum number of frames that can be tracked?

e = execution time for processing any frame in the active cache
N frames per second => have 1/N seconds before the next frame
=> can process s_selected = (1/N)/e frames

SLIDE 48

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 1. How many frames to select?

‧ s_selected: active cache processing time vs. tracking accuracy

What is the maximum number of frames that can be tracked? What if I’m okay with increasing the latency a bit?

e = execution time for processing any frame in the active cache
N frames per second => have 1/N seconds before the next frame
If I’m fine with a lag of t frames => can process s_selected = (t/N)/e frames
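Plugging numbers into s_selected = (t/N)/e makes the budget concrete (a small Python helper; the 11 ms per-frame tracking cost is an assumed figure for illustration, not from the paper):

```python
def frames_trackable(fps, per_frame_cost_s, lag_frames=1):
    """How many cached frames can be tracked without falling more than
    `lag_frames` frames behind a stream at `fps` frames per second, when
    tracking one cached frame costs `per_frame_cost_s` seconds.
    Implements s_selected = (t/N) / e from the slide."""
    return int((lag_frames / fps) / per_frame_cost_s)

# At 30 fps with an assumed 11 ms tracking cost per cached frame:
print(frames_trackable(30, 0.011))               # 3 frames before the next frame arrives
print(frames_trackable(30, 0.011, lag_frames=3)) # 9 frames if a 3-frame lag is acceptable
```

Tolerating a little extra lag (larger t) buys proportionally more cached frames to track through, which is the latency/accuracy knob the slide describes.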

SLIDE 49

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 2. Given s_selected, which frames to select?

‧Temporal redundancy between frames

SLIDE 50

Adaptive Frame Selection

Given n_cached frames, select s_selected frames so that we can catch up without sacrificing tracking performance

  • 2. Given s_selected, which frames to select?

‧ Temporal redundancy between frames
‧ Use frame differencing to quantify movement and select frames that capture as much movement as possible
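One way to realize this selection (an illustrative Python sketch; the slide names the frame-differencing idea, but the specific equal-motion-quantile scheme here is my assumption): score each cached frame by how much it differs from its predecessor, then pick frames at evenly spaced fractions of the cumulative motion, so fast-moving stretches contribute more selected frames.

```python
def select_frames(frames, s):
    """Pick up to s frame indices that split the total frame-difference
    'motion' into roughly equal chunks (more picks where motion is high).
    Frames are flattened grayscale pixel lists."""
    motion = [sum(abs(a - b) for a, b in zip(frames[i], frames[i - 1]))
              for i in range(1, len(frames))]
    total = sum(motion) or 1            # avoid division by zero on static scenes
    picked, acc, k = [], 0, 1
    for i, m in enumerate(motion, start=1):
        acc += m
        if acc >= k * total / s:        # crossed the next motion quantile
            picked.append(i)
            k += 1
    return picked

# Five tiny frames; all the movement happens between frames 1 and 3.
frames = [[0, 0, 0], [0, 0, 0], [9, 9, 0], [9, 9, 9], [9, 9, 9]]
print(select_frames(frames, s=2))  # [2, 3]: both picks land where the motion happened
```

Static stretches are skipped entirely, which is exactly the temporal redundancy the slide wants to exploit.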

SLIDE 51

Active Cache Short Question

  • Does Glimpse reduce the end-to-end latency of object recognition?

SLIDE 52

Active Cache Short Question

  • Does active cache reduce the end-to-end latency of object recognition?
  • No. It hides the latency: tracking stale server results forward through the cached frames makes the display look current, even though recognition itself is still delayed.

SLIDE 53

Glimpse Architecture

[Diagram: the client camera feeds an Active Cache; trigger frames go over the network to the server pipeline (Detection → Feature Extraction → Classification); labels and bounding boxes return to the client display]

  • 1. Active Cache combats e2e latency and regains accuracy
  • 2. Trigger Frame reduces bandwidth usage
SLIDE 54

Reduce Bandwidth Usage with Trigger Frames

‧Strategically send certain trigger frames to the server

SLIDE 55

Reduce Bandwidth Usage with Trigger Frames

‧ Strategically send certain trigger frames to the server

1. Measuring scene changes from the previously processed frame

SLIDE 56

Reduce Bandwidth Usage with Trigger Frames

‧ Strategically send certain trigger frames to the server

1. Measuring scene changes from the previously processed frame
2. Detecting tracking failure
  • Feature points deviate when the size, angle, or appearance of the object changes
  • Compute the standard deviation of the distances moved by all tracked points between two frames; a large spread signals tracking failure

SLIDE 57

Reduce Bandwidth Usage with Trigger Frames

‧ Strategically send certain trigger frames to the server

1. Measuring scene changes from the previously processed frame
2. Detecting tracking failure

‧ Limiting the number of frames in-flight
  • 1 frame in-flight strikes the best balance between bandwidth and accuracy
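The three conditions combine into a simple gate (a hedged Python sketch; the threshold values are placeholders I chose for illustration, not values from the paper):

```python
from statistics import pstdev

def is_trigger(scene_change, point_distances, frames_in_flight,
               scene_thresh=0.2, spread_thresh=5.0, max_in_flight=1):
    """Decide whether the current frame should be sent to the server.
    scene_change: fraction of the frame that changed since the last
    processed frame. point_distances: distance each tracked feature
    point moved between the last two frames. frames_in_flight: frames
    already sent and still awaiting a server response."""
    if frames_in_flight >= max_in_flight:
        return False                  # cap bandwidth: at most one frame in flight
    if scene_change > scene_thresh:
        return True                   # the scene itself changed
    # A large spread in point motion means points are deviating,
    # i.e. tracking is failing and the server should re-detect.
    if len(point_distances) >= 2 and pstdev(point_distances) > spread_thresh:
        return True
    return False

print(is_trigger(0.05, [2.0, 2.1, 1.9], frames_in_flight=0))  # False: stable scene, tracking fine
print(is_trigger(0.05, [1.0, 14.0, 2.0], frames_in_flight=0)) # True: tracked points deviating
print(is_trigger(0.9,  [2.0, 2.1, 1.9], frames_in_flight=1))  # False: a frame is already in flight
```

Note the in-flight cap is checked first: even a big scene change waits until the outstanding frame's response returns, which is how the one-frame-in-flight rule bounds bandwidth.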

SLIDE 58

Evaluation

  • Object recognition pipelines
  • 1. Face recognition
  • 2. Road sign recognition
SLIDE 59

Evaluation

  • Object recognition pipelines
  • 1. Face recognition
  • 2. Road sign recognition
  • Datasets
  • 1. Face Dataset:
  • 26 videos recorded with a smartphone
  • 30 minutes, 54K frames, and 36K faces
  • Scenarios: shopping with friends and waiting at a subway station
  • 2. Road Sign Dataset:
  • 4 walking videos from YouTube, recorded with Google Glass
  • 35 minutes, 63K frames, and 5K road signs
SLIDE 60

Evaluation

  • Evaluation Metrics
  • Intersection over union (IOU) to measure recognition accuracy: IOU = area(Oi ∩ Gi) / area(Oi ∪ Gi)

Oi: bounding box of the detected object i; Gi: bounding box of object i’s ground truth

  • Correct if IOU > 50% and the label matches the ground truth

[Figure: example boxes with IOU = 50%]
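IOU for axis-aligned boxes is straightforward to compute (a standard formulation in Python; representing boxes as (x, y, w, h) tuples is my choice for the sketch):

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes (x, y, w, h)."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    # Intersection rectangle (zero-sized if the boxes do not overlap).
    ix = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    iy = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = ix * iy
    union = aw * ah + bw * bh - inter
    return inter / union

detected     = (0, 0, 2, 2)
ground_truth = (1, 1, 2, 2)
score = iou(detected, ground_truth)
print(score)       # 1/7 ≈ 0.143
print(score > 0.5) # False: this detection would not count as correct
```

Under the slide's rule, a detection counts as correct only when this score exceeds 0.5 and the predicted label also matches the ground truth.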

SLIDE 61

Evaluation

  • Evaluation Metrics
  • Precision = (# of objects correctly labeled and located) / (total # of objects detected)
  • Recall = (# of objects correctly labeled and located) / (total # of objects in the ground truth)

SLIDE 63

Evaluation

  • Evaluation Metrics
  • Precision = (# of objects correctly labeled and located) / (total # of objects detected)
  • Recall = (# of objects correctly labeled and located) / (total # of objects in the ground truth)

Example: # faces in the ground truth: 4; # faces detected: 3; # faces correctly labeled and detected: 2
Precision: 2/3, Recall: 2/4
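The worked example maps directly to code (a small Python sketch of the two metrics):

```python
from fractions import Fraction

def precision_recall(correct, detected, ground_truth):
    """Precision and recall from the three counts used on the slide."""
    return (Fraction(correct, detected),      # of what we detected, how much was right
            Fraction(correct, ground_truth))  # of what was there, how much we found

p, r = precision_recall(correct=2, detected=3, ground_truth=4)
print(p, r)  # 2/3 1/2
```

Using exact fractions keeps the example's 2/3 and 2/4 (= 1/2) visible without floating-point noise.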

SLIDE 64

Evaluation

  • Network conditions
  • Wi-Fi, Verizon’s LTE, and AT&T’s LTE networks

SLIDE 65

Results Outline

  • 1. Face recognition
  • 2. Road sign recognition
  • 3. Face recognition with hardware-assisted face detection

SLIDE 66

Active Cache Achieves High Accuracy

‧ Face dataset ‧ Wi-Fi (end-to-end delay: 430 ms)
SLIDE 68

Trigger Frame Reduces Bandwidth Usage without Sacrificing Accuracy

‧ Face dataset ‧ Wi-Fi (end-to-end delay: 430 ms)
SLIDE 70

Trigger Frame Consistently Reduces Bandwidth Usage

‧Face Dataset (Wi-Fi)

SLIDE 71

Glimpse Achieves Higher Accuracy and Lower Bandwidth Usage

‧Road sign dataset ‧ Wi-Fi (End-to-end delay: 520 ms)

SLIDE 72

Glimpse Achieves Higher Accuracy and Lower Bandwidth Usage

‧ Road sign dataset ‧ Wi-Fi (end-to-end delay: 520 ms)

Lower than precision, why?

SLIDE 73

Hardware-Assisted Object Detection

  • Mobile devices are now equipped with object detection hardware
  • Is Glimpse still helpful?
SLIDE 74

Glimpse Improves Accuracy even with Detection Hardware on Devices

‧Face dataset (Wi-Fi) ‧Face detection in hardware

Why less than before?

SLIDE 75

Glimpse

  • Glimpse enables continuous, real-time object recognition on mobile devices
  • Glimpse achieves high recognition accuracy by maintaining an active cache of frames on the client
  • Glimpse reduces bandwidth consumption by strategically sending only certain trigger frames

SLIDE 76
  • CoreML (Apple)
  • MediaPipe (cross-platform)
SLIDE 78

Live Streaming is Gaining Popularity

SLIDE 79

Search Capability is Very Limited

Location Tags/Username

SLIDE 80

Search Capability is Very Limited

Location Tags/Username

Example tags: “it ended ☺☺☺”, “Hola buenas noches” (“Hello, good night”), “Back at it again ☹☹☹”

SLIDE 81

Search Capability is Very Limited

Location Tags/Username

I want to search based on the contents!

Example tags: “it ended ☺☺☺”, “Hola buenas noches” (“Hello, good night”), “Back at it again ☹☹☹”

SLIDE 82

  • Why now?
  • 1. Cameras everywhere: dashcams, GoPro, phones
  • 2. Advances in computer vision: CNNs
  • 3. Faster compute: GPUs
SLIDE 83

System Design

Object Recognition Pipeline → Online Indexer → Ranker

SLIDE 84

Challenges

  • 1. Scalability
  • more than 1M incoming streams (30 fps)
  • 2. Liveliness and Relevance
  • contents keep changing – perhaps an incremental online indexer
  • the returned videos must match what users want

Object Recognition Pipeline → Online Indexer → Ranker

SLIDE 85

Unleash the Power of Cameras – What are the possible apps?

  • Indoor localization
  • Pothole detection
  • Activity recognition
  • Map-matching
  • Agriculture IoT
  • Smart city camera networks
  • Beyond cameras?
  • RF reconstruction
SLIDE 86

Concerns

SLIDE 87

Announcements

  • All project equipment ordered & shipped
  • If you don’t receive (some of your) items by Friday, ping us

  • Lab 4 due today (April 8)
  • Midterm next Mon (April 13)
  • Open book and notes
  • During lecture time
  • More midterm instructions coming this week