Mobile AR/VR with Edge-based Deep Learning - Jiasi Chen - PowerPoint PPT Presentation



SLIDE 1

Mobile AR/VR with Edge-based Deep Learning

Jiasi Chen
Department of Computer Science & Engineering, University of California, Riverside
CNSM, Oct. 23, 2019
SLIDE 2

Outline

  • What is AR/VR?
  • Edge computing can provide...
  • 1. Real-time object detection for mobile AR
  • 2. Bandwidth-efficient VR streaming with deep learning
  • Future directions

2

SLIDE 3

What is AR/VR?

3

SLIDE 4

Multimedia is…

4

Pipeline: content creation → compression → storage → distribution (Internet) → end users
Media types: audio, on-demand video, live video, virtual and augmented reality

SLIDE 5

What is AR/VR?

5

The reality-virtuality continuum: reality | augmented reality | augmented virtuality | virtual reality (the middle of the continuum is "mixed reality")

SLIDE 6

Who’s Using Virtual Reality?

High-end hardware: HTC Vive, PlayStation VR
Smartphone-based hardware: Google Daydream, Google Cardboard

6

SLIDE 7

Why VR now?

7

Movies: (1) have to go somewhere → (2) watch it at home → (3) carry it with you
VR: CAVE (1992) → Virtuality gaming (1990s) → Oculus Rift (2016)

Similar portability trend for VR, driven by hardware advances from the smartphone revolution.

SLIDE 8

Who’s Using Augmented Reality?

High-end hardware: Microsoft HoloLens, Google Glass
Smartphone-based: Pokemon Go, Snapchat filters (face detection), Google Translate (text processing)

8

SLIDE 9

Is it all just fun and games?

  • AR/VR has applications in many areas: education, data visualization, public safety
  • What are the engineering challenges?
  • AR: process input from the real world (related to computer vision, robotics)
  • VR: output the virtual world to your display (related to computer graphics)

9

SLIDE 10

How AR/VR Works

VR:
  • 1. Virtual world generation
  • 3. Render
  • 4. Display

AR:
  • 1. Device tracking
  • 2. Real object detection
  • 4. Render
  • 5. Display

10

SLIDE 11

What systems functionality is currently available in AR/VR?

11

SLIDE 12

Systems Support for VR

VR pipeline: 1. Virtual world generation → 3. Render → 4. Display

  • Game engines: Unity, Unreal
  • Mobile GPU
  • Qualcomm VR/AR chips

12

SLIDE 13

Systems Support for AR

AR pipeline: 1. Device tracking → 2. Real object detection → 4. Render → 5. Display

  • Platforms: Google ARCore, Apple ARKit, Microsoft HoloLens, Magic Leap, smartphones
  • Computer vision / machine learning libraries: Vuforia, OpenCV, TensorFlow

13

SLIDE 14

What AR/VR functionality is needed by researchers?

14

SLIDE 15

Research Space in AR

AR pipeline: 1. Device tracking → 2. Real object detection → 4. Render → 5. Display

Real object detection: typically done using deep learning (in research; not yet in industry)
  • Slow: ~600 ms per frame on a smartphone
  • Energy drain: ~1% battery per minute on a smartphone
  • MARLIN (SenSys’19), Liu et al. (MobiCom’19), DeepDecision (INFOCOM’18), DeepMon (MobiSys’17)

Device tracking: typically done using SLAM (combining camera + IMU sensors)
  • Slow: ~30 ms per frame on a smartphone
  • Energy drain: > 1.5 W on a smartphone
  • ShareAR (HotNets’19), MARVEL (SenSys’18), OverLay (MobiSys’15)

Can edge computing help?

15

SLIDE 16

Research Space in AR

(Figures: example of slow object detection; comparison of different apps’ energy drain)

16

Take-home message: Machine learning is useful in AR

  • As part of the AR processing pipeline (object detection)
  • At the expense of energy
SLIDE 17

Research Space in VR

  • 1a. Virtual world generation (on a content/edge server)
  • 1b. Transmission over the network (Internet)
  • 3. Render
  • 4. Display (on the mobile device)

Rubiks (MobiSys’18), FLARE (MobiCom’18), Characterization (SIGCOMM workshop’17), FlashBack (MobiSys’16)

High bandwidth: Up to 25 Mbps on YouTube at max resolution

Can machine learning help with VR traffic optimization?

17

Take-home message: Machine learning is useful in VR

  • To help with user predictions, traffic management
SLIDE 18

Outline

  • Overview of AR/VR
  • Edge computing can provide...
  • 1. Real-time object detection for mobile AR
  • 2. Bandwidth-efficient VR streaming with deep learning
  • Future directions

18

SLIDE 19

How AR Works

  • 1. Device tracking
  • 2. Real object detection
  • 4. Render
  • 5. Display
  • Object detection is a computational bottleneck for AR
  • Current AR is only able to detect flat planes or specific object instances
  • Can we do more powerful processing on a server?

19

SLIDE 20

Reducing lag for augmented reality

  • Augmented and virtual reality require a lot of computational power
  • Run expensive computer vision and machine learning algorithms

20

Run on the device? Too slow!
Run on the cloud (datacenter across the Internet, e.g. AWS)? Too far → too slow!
Run on the edge (edge compute node)?

Xukan Ran, Haoliang Chen, Xiaodan Zhu, Zhenming Liu, Jiasi Chen, “DeepDecision: A Mobile Deep Learning Framework”, IEEE INFOCOM, 2018.

SLIDE 21

Challenges with current approaches

  • Current approaches for machine learning on mobile devices:
  • Local-only processing (Apple Photos, Google Translate; GPU speedup): slow! (~600 ms/frame)
  • Remote-only processing (Apple Siri, Amazon Alexa): doesn’t work when the network is bad
  • Our observations:
  • Different AR apps have different accuracy and latency requirements
  • Network latency is often higher than CPU/GPU processing time on the edge server
  • Video streams and deep learning models can scale gracefully

21

SLIDE 22

Problem Statement

  • Problem: How should the mobile device be configured to meet the lag requirements of the AR app and the user?
  • Solution: Periodically profile, optimize, and update the configuration

22

  • 1. Offline performance characterization
  • 2. Online optimization
  • 3. Update the configuration

SLIDE 23

Optimize decision

Constraints:
  • Current network conditions
  • Application requirements

Degrees of freedom:
  • Offloading decision
  • Neural net model size
  • Video resolution

Metrics:
  • Detection accuracy
  • Time
  • Energy consumption

Online decision framework

23

  • Video characteristics: frame rate, resolution, bit rate
  • Deep learning characteristics: model size, model latency / energy, model accuracy
  • Network condition: bandwidth, latency
  • App requirements: latency, accuracy, energy
SLIDE 24

System design

24

Front-end device: runs tiny deep learning on the input live video; shows the output display. Edge server: runs big deep learning. The online decision framework combines the offline performance characterization with the user’s battery constraint, the current network conditions, and the app’s latency and accuracy requirements.

SLIDE 25

AR Object Detection Quality Metrics

25

  • Accuracy
  • Classification and location both important for AR
  • Intersection over union (IoU) metric
  • Ground truth: big deep learning running on the highest resolution

  • Timing
  • Latency: time from when we sent the frame to getting the result
  • Frame rate: 1 / time between consecutive frames
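The IoU accuracy metric above is simple to compute; a minimal sketch, assuming boxes are (x1, y1, x2, y2) corner tuples (the box format is an assumption for illustration, not the system’s actual representation):

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

IoU is 1.0 for identical boxes, 0.0 for disjoint ones, so it captures both classification location quality in a single score.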
SLIDE 26
  • 1. Offline performance characterization:

How do latency and energy change with video resolution?

26

Energy and latency increase quadratically with the number of pixels (∝ pixels²) for local processing
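One use of this quadratic relationship is extrapolating profiled latency to an unprofiled resolution; a minimal sketch (the baseline numbers in the test below are illustrative placeholders, not measured profiles):

```python
def scale_latency(lat_ms_ref, pixels_ref, pixels_target):
    """Extrapolate per-frame latency assuming latency grows with (pixel count)^2,
    matching the quadratic trend observed for local processing."""
    return lat_ms_ref * (pixels_target / pixels_ref) ** 2
```

For example, a latency profiled at 640×480 would be predicted to grow 16x at 1280×960 (4x the pixels, squared).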

SLIDE 27
  • 1. Offline performance characterization:

How does accuracy change with bit rate and resolution?

27

Accuracy increases more with resolution than with bit rate, especially for big deep learning. (Figures: big deep learning vs. tiny deep learning)

  • Encoded videos at different bitrates and resolutions
SLIDE 28
  • 1. Performance characterization:

How does accuracy change with latency?

28

Accuracy decreases as latency increases.

(Figure: the result from deep learning is stale after the processing delay, e.g. a detection computed at t = 0 ms arriving at t = 100 ms)

  • Measured accuracy as deep learning processing latency increased
SLIDE 29
  • 2. Online decision framework: Optimization problem

Maximize:  g + α Σ_j b_j(q, s, m_j) · z_j

Subject to:
  • m_j = m_j(q) + M if model j is remote (add network transmission time M); m_j(q) if local
  • Σ_j m_j(q) · z_j ≤ 1/g   (finish processing a frame before the next frame arrives)
  • Σ_j c_j(q, s, g) · z_j ≤ ℬ   (don’t use more than ℬ battery)
  • b_j(q, s, m_j) ≥ B · z_j, ∀j   (meet the application accuracy requirement B)
  • g ≥ G   (meet the application frame rate requirement G)
  • s · z_j ≤ S   (don’t use more than S bandwidth)
  • Σ_j z_j = 1
  • q, s, g ≥ 0; z_j ∈ {0, 1}

29

Variables: q: video resolution; s: video bit rate; g: frame rate; z_j: which deep learning model to run (local or remote).
From the offline performance characterization: b_j(q, s, m_j): accuracy function of model j; m_j(q): latency function of model j; c_j(q, s, g): battery function of model j. Together these give the end-to-end latency (local processing time + network transmission time).
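Because the configuration space (model × resolution × frame rate) is small, the online decision can be sketched as a brute-force search over profiled configurations. This is an illustrative simplification: the profile functions are placeholders, and the bit-rate variable s is dropped for brevity.

```python
def choose_config(models, resolutions, frame_rates, req_acc, req_fps, battery_budget):
    """Enumerate (model, resolution, frame rate) configurations and return the
    feasible one maximizing frame rate + alpha * accuracy, or None if infeasible."""
    alpha = 1.0
    best, best_score = None, float("-inf")
    for m in models:                          # candidate models (local / remote)
        for q in resolutions:
            for g in frame_rates:
                acc = m["accuracy"](q)        # profiled accuracy b_j(q)
                lat = m["latency"](q)         # profiled per-frame latency m_j(q), in seconds
                power = m["power"](q, g)      # profiled battery drain c_j(q, g)
                if acc < req_acc:             # meet the app accuracy requirement
                    continue
                if g < req_fps:               # meet the app frame rate requirement
                    continue
                if lat > 1.0 / g:             # finish a frame before the next one arrives
                    continue
                if power > battery_budget:    # battery constraint
                    continue
                score = g + alpha * acc
                if score > best_score:
                    best, best_score = (m["name"], q, g), score
    return best
```

Rerunning this search periodically, as conditions change, matches the profile/optimize/update loop described earlier.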

SLIDE 30

30


SLIDE 31

Key Take-Aways

31

  • Real-time video analysis using local deep learning is slow (~600 ms/frame on current smartphones)
  • The relationship between the degrees of freedom and the metrics is complex, and requires profiling
  • Choose the right device configuration (resolution, frame rate, deep learning model) to meet QoE requirements

SLIDE 32

Outline

  • Overview of AR/VR
  • Edge computing can provide...
  • 1. Real-time object detection for mobile AR
  • 2. Bandwidth-efficient VR streaming using deep learning
  • Future directions

32

SLIDE 33

How VR Works

  • Can we only send what is needed?
  • How do we know what to send?

33

  • 1a. Virtual world generation (on a content/edge server)
  • 1b. Transmission over the network (Internet)
  • 3. Render
  • 4. Display (on the mobile device)
SLIDE 34

34

360-degree Video Example

  • https://www.youtube.com/watch?v=sT0hVLEe5mU
SLIDE 35

35

Only a portion of the scene is viewed

SLIDE 36

Motivation

  • 360° videos are becoming popular
  • Predicted to become a $108B industry by 2021 [1]
  • More engaging and interesting for the user
  • Off-the-shelf hardware and software for content creators
  • 360° camera hardware
  • Automatic stitching software
  • Many companies/websites serving 360° videos

36

[1] https://www.digi-capital.com/news/2017/01/after-mixed-year-mobile-ar-to-drive-108-billion-vrar-market-by-2021/
SLIDE 37

Challenges

  • 360° videos take more bandwidth
  • Higher resolution: 360° videos cover all spatial directions
  • Portions out of the field-of-view are wasted
  • How can we reduce the bandwidth requirements?
  • 1. Chop up the scene into tiles
  • 2. Predict the field-of-view beforehand
  • 3. Send the appropriate tiles to the client in advance
  • How can we predict the future field-of-view of the user?
  • Machine learning / time series analysis

37
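Steps 1-3 above can be sketched as a tile selector: given the predicted viewing direction, fetch only the tiles that overlap the field of view. The equirectangular 8×4 tile grid, the square field of view, and the coarse boundary sampling here are all illustrative assumptions:

```python
def tiles_to_fetch(yaw_deg, pitch_deg, n_cols=8, n_rows=4, fov_deg=100.0):
    """Return (col, row) indices of equirectangular tiles overlapping a square
    field of view centered at the predicted (yaw, pitch), in degrees."""
    tile_w, tile_h = 360.0 / n_cols, 180.0 / n_rows
    half = fov_deg / 2.0
    tiles = set()
    steps = 8  # sample the field of view on a coarse grid
    for i in range(steps + 1):
        for j in range(steps + 1):
            y = (yaw_deg - half + fov_deg * i / steps) % 360.0       # wrap yaw
            p = min(89.999, max(-89.999, pitch_deg - half + fov_deg * j / steps))
            col = int(y // tile_w)
            row = int((p + 90.0) // tile_h)
            tiles.add((col, row))
    return sorted(tiles)
```

The server would then send only these tiles (plus, in practice, low-quality fallbacks for the rest, in case the prediction is wrong).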

SLIDE 38

How much bandwidth do 360° videos need?

  • Collected dataset of ~4600 YouTube 360° and regular videos
  • Duration
  • Resolution
  • Bit rate
  • Motion vector
  • Measured variability of bit rates over time of 360° and regular videos
  • Compared the motion vectors of 360° and regular videos
  • Calculated effective resolution of 360° videos based on field-of-view

38

Shahryar Afzal, Jiasi Chen, K.K. Ramakrishnan, “Characterization of 360-degree videos,” ACM SIGCOMM Workshop on Virtual Reality and Augmented Reality Network, 2017

SLIDE 39

Duration

39

360° videos are short:
  • a new medium
  • complex to produce

(Figures: aggregate and per-category median duration, in seconds)

SLIDE 40

Resolution

(Figures: number of distinct resolutions per video; fraction of videos encoded at each resolution)

40

  • DASH: multiple resolutions of each video are stored on the server
  • 360° videos have more resolutions
  • 360° videos tend to have higher resolutions

SLIDE 41

Bit rate

  • What is the bit rate of the maximum resolution?

41

(Figure: bit rate at the maximum resolution)

High bit rates for 360° videos

SLIDE 42

System Design

Client (VR player): gyroscope and accelerometer capture the user’s head movements; downloaded tiles are rendered for display.
Server: user prediction (from the current user’s and other users’ historical data) feeds the streaming optimization, which decides which tiles to fetch; tile delivery then uses the VR video metadata and the prediction of where the user will look.

42

SLIDE 43

User Prediction

  • Combined 3 publicly available datasets of users watching 360° videos
  • Used LSTM machine learning model for time series prediction
  • Data representation + cleaning matters!

43

1Xavier Corbillon, Francesca De Simone, and Gwendal Simon, “360-Degree Video Head Movement Dataset”, ACM MMSys, 2017.

(Sample dataset [1], with head orientation shown in quaternion, spherical, and Euclidean representations)
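To illustrate why the data representation matters, here is a minimal conversion from a head-orientation quaternion to spherical (yaw, pitch) angles. The y-up convention with the camera looking down -z is an assumption for illustration; datasets differ in their conventions.

```python
import math

def quat_to_yaw_pitch(w, x, y, z):
    """Rotate the forward vector (0, 0, -1) by unit quaternion (w, x, y, z)
    and return the (yaw, pitch) viewing angles in degrees (y-up convention)."""
    u, v = (x, y, z), (0.0, 0.0, -1.0)
    # Rotated vector v' = v + w*t + u x t, where t = 2 (u x v)
    t = (2 * (u[1] * v[2] - u[2] * v[1]),
         2 * (u[2] * v[0] - u[0] * v[2]),
         2 * (u[0] * v[1] - u[1] * v[0]))
    vp = (v[0] + w * t[0] + u[1] * t[2] - u[2] * t[1],
          v[1] + w * t[1] + u[2] * t[0] - u[0] * t[2],
          v[2] + w * t[2] + u[0] * t[1] - u[1] * t[0])
    yaw = math.degrees(math.atan2(vp[0], -vp[2]))
    pitch = math.degrees(math.asin(max(-1.0, min(1.0, vp[1]))))
    return yaw, pitch
```

Angles wrap around (e.g. 359° and 1° are close), which is one reason representation and cleaning affect prediction quality.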

SLIDE 44

Prediction

44

  • Two dots video
SLIDE 45

User Prediction Results

  • Average loss: average loss of the prediction across all frames across all users in the test set (in degrees)
  • Future value indicates how many frames ahead we are predicting

45
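A minimal sketch of the average-loss computation, assuming predictions and ground truth are (yaw, pitch) pairs in degrees and the loss is the great-circle angle between the two viewing directions (the exact loss function used may differ):

```python
import math

def angular_error_deg(pred, actual):
    """Great-circle angle (degrees) between two (yaw, pitch) viewing directions."""
    def to_vec(yaw_deg, pitch_deg):
        yaw, pitch = math.radians(yaw_deg), math.radians(pitch_deg)
        return (math.cos(pitch) * math.sin(yaw),
                math.sin(pitch),
                -math.cos(pitch) * math.cos(yaw))
    a, b = to_vec(*pred), to_vec(*actual)
    dot = sum(x * y for x, y in zip(a, b))          # cosine of the angle
    return math.degrees(math.acos(max(-1.0, min(1.0, dot))))

def average_loss(preds, actuals):
    """Average angular loss across all frames/users in a test set."""
    errs = [angular_error_deg(p, a) for p, a in zip(preds, actuals)]
    return sum(errs) / len(errs)
```

Working on direction vectors avoids the yaw wrap-around problem that a naive degree difference would have.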


SLIDE 52

Does video content matter?

  • Why are the losses so different between the two videos?
  • Can the content of the video help us predict more accurately?
  • We plot the heat map of the head position of the users for each video
  • Videos where users don’t look around much → lower prediction error

52

SLIDE 53

Heat maps

53

Paris

SLIDE 54

Heat maps

54

Rollercoaster

SLIDE 55

Heat maps

55

Rhino

SLIDE 56

Heat maps

56

Venise

SLIDE 57

Heat maps

57

Timelapse

SLIDE 58

Key Take-Aways

58

  • 360-degree VR videos are large (up to 25 Mbps)
  • Machine learning or time series prediction can help predict user behavior and avoid wasted bandwidth
  • Data representation and pre-processing matter!
  • ... Is machine learning really the optimal choice?

SLIDE 59

Future Directions

59

SLIDE 60

New application: Multi-User AR

How to create a synchronized world view for multiple users?

  • 1. Device tracking
  • 2. Real object detection
  • 4. Render
  • 5. Display

60

SLIDE 61

What does AR network traffic look like?

  • AR traffic mainly involves sending device tracking information

  • Unpredictable, because of user interactions
  • Large bursts (>20 Mb) corresponding to tracking data

Xukan Ran, Carter Slocum, Maria Gorlatova, Jiasi Chen, “ShareAR: Communication-efficient Multi-user Mobile Augmented Reality”, ACM HotNets, 2019.

How can networks manage this type of traffic?

61

SLIDE 62

What should AR network architectures look like?

  • Current AR platforms (Google, Apple, Microsoft) use cloud or P2P network architectures

  • Focus is on device tracking computations
  • → communication vs. computation vs. privacy tradeoffs

Can edge computing help device tracking- based AR systems?

62

SLIDE 63

What are AR quality-of-experience metrics?

  • How to evaluate whether an AR/VR system is performing well?
  • Needed to evaluate the performance of traffic management schemes
  • For video, we have MOS, PSNR, SSIM, stalls, bit rate
  • What are the equivalent quality-of-experience metrics for AR/VR?
  • Motion-to-photon latency
  • Bit rate?
  • Just noticeable difference?
  • Immersion?
  • ...?

63

SLIDE 64

Summary

  • VR != AR != video streaming
  • Machine learning is helpful in certain aspects of AR/VR
  • As part of the AR processing pipeline (object detection)
  • To solve problems in VR (user prediction)
  • Edge computing is helpful in certain aspects of AR/VR
  • Reduce the computational load on the AR devices
  • Trade off between computation, communication, privacy
  • Many interesting research problems remain
  • Managing multi-user AR traffic
  • Defining user quality-of-experience metrics
  • ...?

Thank you! Questions?

64