Video Streams based on User Access Pattern Ngo Quang Minh Khiem - - PowerPoint PPT Presentation

video streams based on user
SMART_READER_LITE
LIVE PREVIEW

Video Streams based on User Access Pattern Ngo Quang Minh Khiem - - PowerPoint PPT Presentation

Adaptive Encoding of Zoomable Video Streams based on User Access Pattern Ngo Quang Minh Khiem Guntur Ravindra Wei Tsang Ooi National University of Singapore Zoomable Video Zoomable Video with Bitstream Switching (x,y,w,h) Server Client


slide-1
SLIDE 1

Adaptive Encoding of Zoomable Video Streams based on User Access Pattern

Ngo Quang Minh Khiem Guntur Ravindra Wei Tsang Ooi National University of Singapore

slide-2
SLIDE 2

Zoomable Video

slide-3
SLIDE 3

Zoomable Video with Bitstream Switching

slide-4
SLIDE 4

Server Client (x,y,w,h)

GOAL: Minimize bandwidth to transmit RoIs

slide-5
SLIDE 5

Dynamic Cropping of ROI

Encode video once Support any RoI cropping

Tiled Streaming (TS) Monolithic Streaming (MS)

slide-6
SLIDE 6

Tiled Streaming One tile = k x k macroblocks Encode each tile as independantly decodable video streams Tiles overlapping with the RoI are transmitted

slide-7
SLIDE 7

Monolithic Streaming Data outside RoI need for decoding RoI Single monolithic video

slide-8
SLIDE 8

Trade-offs with TS and MS

TS

Bigger tile  More waste More bits Smaller tile  Less compression More bits

slide-9
SLIDE 9

Longer MV  More dependency More bits Shorter MV  Less compression More bits

MS

slide-10
SLIDE 10

RoI Access Pattern

Reduce bandwidth further, given RoI access statistics?

slide-11
SLIDE 11

Questions in this paper

  • Tiled Streaming
  • Different tile size in the same frame?
  • Monolithic Streaming
  • Different motion search range?
  • How?
slide-12
SLIDE 12

Adaptive Encoding

Given RoI access statistics, adapt the encoding parameters such that the expected bandwidth E needed to transmit a RoI is minimized

R r

r p r c E ) ( ) (

c(r): compressed size of RoI r p(r): access probability of RoI r

slide-13
SLIDE 13

Log user selection of RoI (Online) Adaptive Encoded Video RoI Access Pattern Encoded Video

slide-14
SLIDE 14

Adaptive Encoding

Adaptive Tiling (AT) Monolithic Streaming with RoI-aware Coding (MS-PB)

slide-15
SLIDE 15

Adaptive Tiling

Given RoI access pattern, tile the video such that E is minimized

T t

t p t c E ) ( ) (

c(t): compressed size of tile t p(t): access probability of tile t

slide-16
SLIDE 16

Intuition

Allowing tiles of different sizes can reduce bandwidth

Regular tiling with 2x2 tiles Adaptive tiling 2 4 1 3 RoI accessed by most users Merge tiles 1,2,3 and 4

slide-17
SLIDE 17

Greedy Heuristic Tiling

  • Start with regular 1x1 tiles
  • Merge a tile with its neighbors if expected

bandwidth is reduced

  • Merge newly-formed tile with its neighbors

bandwidth is reduced

slide-18
SLIDE 18

t1

c(t1) = 9 p(t1) = 0.8

t2

c(t2) = 6 p(t2) = 0.8

t12

c(t12) = 11 p(t12) = 1

) )c(t p(t ) )c(t p(t ) )c(t p(t

12 12 2 2 1 1

 

slide-19
SLIDE 19

Resulting tile map RoI Access Pattern

slide-20
SLIDE 20

Monolithic Streaming with RoI-aware Coding

  • Referenced MBs form large region outside RoI
  • Short motion vector: less bandwidth efficient
  • Probabilistic boxing motion vector (MS-PB)
slide-21
SLIDE 21

Intuition

  • P(A) – P(AB) > P(B)
  • Increase in size of A when sending R2 is marginal
  • P(A) – P(AB) < P(B)
  • Increase in size of A when sending R2 is higher
  • [P(A)-P(AB)] S(A) > P(B) S(B)

P(A), P(B): sending A, B P(AB) : A and B in same RoI P(A) – P(AB): sending A independent of B R2 R1 B A

slide-22
SLIDE 22

Motion Vector Spread after MS-PB

slide-23
SLIDE 23

Evaluation

  • Evaluate AT and MS-PB in terms of
  • Bandwidth efficiency
  • Compression efficiency
  • Benchmark methods
  • Per-RoI
  • Tiled Streaming
  • Monolithic Streaming
slide-24
SLIDE 24

Video Sequences

Rush-Hour (500 frames) Bball (200 frames) Rainbow (350 frames) Tractor (688 frames)

slide-25
SLIDE 25

Experiment Setup

  • RoI size: 320x192 pel
  • Video resolution 1920x1080 pel
  • Evaluation is conducted by a training-testing

framework

  • Training and test sets have the same distribution
  • One training and test set for each GoP
slide-26
SLIDE 26

0.5 1 1.5 2 2.5 3 3.5 4 Bball Rainbow

Expected Data Rate (Mbps) Test Video

Expected Data Rate for Different Videos without B-Frames

PerRoI MS-PB MS AT TS4x4

slide-27
SLIDE 27

0.5 1 1.5 2 2.5 3 3.5 4 4.5 Bball Rainbow

Expected Data Rate (Mbps) Test Video

Expected Data Rate for Different Videos with 2 B-Frames

PerRoI MS-PB MS AT TS4x4

slide-28
SLIDE 28

20 40 60 80 100 120 140 160 Bball Rainbow

File Size (MB) Test Video

Compressed Video File Size with 2 B-Frames

PerRoI MS-PB MS AT TS16x16 20 40 60 80 100 120 140 Bball Rainbow

File Size (MB) Test Video

Compressed Video File Size without B-Frames

PerRoI MS-PB MS AT TS16x16

slide-29
SLIDE 29

Presence of B-frame

Motion Vector Spread without B-frame Motion Vector Spread with 2 B-frame Without B-frame

MS-PB < MS

With B-frame

MS-PB ≈ MS

slide-30
SLIDE 30

Conclusion & Future Work

  • Propose an adaptive encoding approach based on

user access patterns

  • Reduce bandwidth by 21% (MS-PB) and 27% (AT)
  • Limiting motion vector is beneficial to zoomable

video with wide spread of dependency

  • Future work:
  • Computational complexity
  • Diverse user interest of RoI
  • Frequency of Adaptation
slide-31
SLIDE 31

Thank you

  • Questions?
  • Feedback/Suggesetion?