PARSEC: Streaming 360 o Videos Using Super-Resolution Mallesham - - PowerPoint PPT Presentation

parsec streaming 360 o videos using super resolution
SMART_READER_LITE
LIVE PREVIEW

PARSEC: Streaming 360 o Videos Using Super-Resolution Mallesham - - PowerPoint PPT Presentation

PARSEC: Streaming 360 o Videos Using Super-Resolution Mallesham Dasari, Arani Bhattacharya, Santiago Vargas, Pranjal Sahu, Aruna Balasubramanian, Samir R. Das Department of Computer Science https://www3.cs.stonybrook.edu/~mdasari/parsec 360 o


slide-1
SLIDE 1

PARSEC: Streaming 360o Videos Using Super-Resolution

Mallesham Dasari, Arani Bhattacharya, Santiago Vargas, Pranjal Sahu, Aruna Balasubramanian, Samir R. Das Department of Computer Science

https://www3.cs.stonybrook.edu/~mdasari/parsec

slide-2
SLIDE 2

360 360o Vi Video Streaming

Immersive Experience

qCentral to many immersive applications (e.g., VR/AR)

$ Billion Market

Popularity of 360o Video is on the Rise!

Image credit: Oculus

http://blog.dsky.co/tag/head-tracking/

slide-3
SLIDE 3

180o 360o

Image from Rollercoaster video

Gr Gran and Challe allenge

q 360o videos require 8x bandwidth compared to regular videos for the same perceived quality

110o 80o 25Mbps 200Mbps

http://blog.dsky.co/tag/head-tracking/

slide-4
SLIDE 4

Cu Curr rrent Sol Solution

  • ns

qViewport-adaptive streaming

  • Divide video into tiles

(e.g., 192x192 pixels)

4 Flare [MobiCom’18], Rubiks [MobiSys’18], MOSAIC [IFIP Networking’19] PANO [SIGCOMM’19], ClusTile [INFOCOM’19]

slide-5
SLIDE 5

Cu Curr rrent Sol Solution

  • ns

qViewport-adaptive streaming

  • Divide video into tiles

(e.g., 192x192 pixels)

  • Predict viewport tiles

based on head tracking and video saliency analysis

  • Stream only viewport

specific tiles using ABR algorithm

5 Flare [MobiCom’18], Rubiks [MobiSys’18], MOSAIC [IFIP Networking’19] PANO [SIGCOMM’19], ClusTile [INFOCOM’19]

slide-6
SLIDE 6

Li Limi mitation

  • ns of
  • f Cu

Curr rrent Sol Solution

  • ns

qViewport Prediction (VP)

  • Predicting user head movement is

hard

  • Fetch more tiles to avoid the tile

misses

  • Fetching more tiles competes for

bandwidth and reduces video quality

qNetwork is the only resource for achieving good video quality

6

Can we improve client’s video quality without relying much on network?

1 2 3 Prediction Window (seconds) 20 40 60 80 100 Accuracy (%) Mosaic [IFIP Networking’19] Flare [MobiCom’18]

slide-7
SLIDE 7

Op Opportunity1 y1: Super-re resolution

qUse low resolution image/video, hallucinate the details to produce high resolution

  • Idea dates to the 90s
  • Currently benefiting from deep neural networks (DNNs)

7

https://amundtveit.com/2017/06/04/deep-learning-for-image-super-resolution-scale-up/

DNNs are computationally expensive

slide-8
SLIDE 8

Op Opportunity2 y2: Computation

qSignificant improvement in GPU capacity over the decade

  • Often underutilized

qLeverage this compute capacity on the client to do super- resolution

8

NAS [OSDI’2018]

Is this compute power enough to do super-resolution?

slide-9
SLIDE 9

Su Super-re resolution Challenges

9

q Large variance in quality enhancement q Bulky DNN models

  • Slower inference (e.g.,

less than 2FPS for a 1- minute 4k video)

  • Large model sizes

Model trained for one-minute video duration

How to make the models smaller, faster & better?

slide-10
SLIDE 10

Li Lightweight Mi Micr cro-mod models fo for Su Super-re resolution

qTrain a model for each segment qFetch the model along with segment download qEnhance the quality of few viewport-specific tiles instead of whole frame

10 Reconstructed High-Quality Tiles

192x192

. . .

360 Video Segment

3840x1920

(20x10 Tiling)

Original High-Quality Tiles Down- Sampled Tiles Compressed ULR Tiles

192x192 24x24 24x24

. . . . . . . . .

slide-11
SLIDE 11

Li Lightweight Mi Micr cro-mod models fo for Su Super-re resolution

11

qBenefits

üSmall model footprint üFaster inference

q Additional challenges

  • Still only few tile/sec

inference rate

q Key Questions

  • Which tiles to download

and at what quality?

  • Which tiles to generate

(using super-resolution)?

  • Which tiles to ignore?

Need a new ABR algorithm that combines compute and network resources

slide-12
SLIDE 12

Ne Neural-Aw Aware ABR

12

Video Server Quality1 Quality2 Quality3 ULR tiles Client Player Request tiles Respond tiles Player Buffer Viewport Prediction 0.09 0.43 0.31 0.19 0.21 0.02 0.09 0.24 0.98 0.98 0.63 0.08 0.14 0.21 0.99 0.96 0.56 0.11 0.02 0.13 0.23 0.34 0.27 0.12 0.09 0.43 0.31 0.19 0.21 0.02 0.09 0.24 0.98 0.98 0.63 0.08 0.14 0.21 0.99 0.96 0.56 0.11 0.02 0.13 0.23 0.34 0.27 0.12 GPU (SR model)

Compute Capacity Network Capacity

[IFIP Networking’19]

slide-13
SLIDE 13

Ne Neural-Aw Aware ABR

13

Maximize

How to Find a Solution Fast? Greedy Algorithm

slide-14
SLIDE 14

Pu Putting Eve verything To Together

14

Offline Super Resolution Model Training

HEVC Encoded Segments

HTTP Server

ULR Tiles & Micro- Models

Network State Viewport Prediction Generated and Downloaded Tile Qualities Compute Capacity

Only these components present in state-of-the- art 360° video streaming

Client

Render and Display Decoded Playback Buffer Neural-Aware ABR Algorithm Inference Scheduler for Generated Tiles Bitrate Selection for Downloaded Tiles

slide-15
SLIDE 15

Im Implem plemen entatio tion n & & Evalua aluatio tion

  • Linux server
  • Node.js
  • Client
  • Pixel3 phone
  • Super-resolution model
  • Keras with Tensorflow

backend

  • Diverse network

conditions

  • Real traces: WiFi &

4G/LTE

  • FCC & Belgium traces
  • 360o video dataset
  • 10 videos
  • MMSYS’17 head

movement dataset

15

slide-16
SLIDE 16

Pe Performance Comparison

  • VP_Only [NOSSDAV’17]
  • Download only

viewport-specific tiles

  • FLARE [MobiCom’18]
  • Fetch additional tiles to

accommodate VP inaccuracy

  • NAS-regular [OSDI’18]
  • A recent regular video

streaming system using super-resolution

  • NAS-360
  • A modified version of

NAS-regular for 360o video

16

slide-17
SLIDE 17

Pe Performance Comp Compari rison

  • n

Average Quality y and Tile Misses

17

30% improvement compared to Flare [MobiCom’18] 26% improvement at the 90th percentile compared to Flare [MobiCom’18]

slide-18
SLIDE 18

Ov Overall QoE QoE Pe Performance

18

37% improvement compared to Flare [MobiCom’18]

slide-19
SLIDE 19

Im Impac pact t of Comput putatio tion

19

PARSEC performs better as we increase the computing power

slide-20
SLIDE 20

Con Conclusion

  • n
  • PARSEC
  • A panoramic video streaming system
  • DNN based super-resolution
  • Neural-aware ABR algorithm
  • PARSEC provides high QoE compared to the state-
  • f-the-art solutions

20

https://www3.cs.stonybrook.edu/~mdasari/parsec For more details please visit: