Holograms are the Next Video
Philip A. Chou, 8i Labs, Inc. ACM Multimedia Systems Conference 13 June 2018
Holograms are the Next Video Philip A. Chou, 8i Labs, Inc. ACM - - PowerPoint PPT Presentation
Holograms are the Next Video Philip A. Chou, 8i Labs, Inc. ACM Multimedia Systems Conference 13 June 2018 Princess Leia Star Wars Episode IV, 1977 The Holodeck Star Trek Next Generation, Episode 12, 1988 Black Panther, 2018 No, you
Philip A. Chou, 8i Labs, Inc. ACM Multimedia Systems Conference 13 June 2018
Star Wars Episode IV, 1977 Princess Leia
Star Trek Next Generation, Episode 12, 1988 The Holodeck
Black Panther, 2018
Solo, 2018 “No, you can’t wipe `em off. They’re holograms.” – Tobias Becket to Chewbacca.
“A new microscopic principle,” Nature, 1948.
from Ancient Greek (hólos, whole) + (grammḗ, letter, line, writing, message)
Decode Encode
https://en.wikipedia.org/wiki/Holography
𝑤 𝑣 𝜄 𝜚 𝑤 𝑣 𝜄 𝜚
𝑤 𝑣 𝑡 𝑢
VR puts you in a Virtual World AR puts virtual objects in your world
On-Demand Live Broadcast Telecommunication
On-Demand Live Broadcast Telecommunication
On-Demand Live Broadcast Telecommunication
Buzz Aldrin: Cycling Pathways to Mars
Arun Netravali, Head Visual Communications Research Dept. IEEE Transactions on Computers, 1974
Today: > 1 Trillion photos/year Photo from JPEG (Macau, October 2017): Celebration of 25th Anniversary
Hologram compression today is like video compression in 1988
3.9 6.0 13 27 Mbps 3.5 6.0 11 20 Mbps 3.5 6.0 9 18 Mbps
1. Static (single-frame) 3. Dynamic Acquisition (e.g., from Mobile Mapping Systems)
2. Dynamic (multi-frame)
10010001 10010001 11001001 10010001
221,136,255 255,153,255 255,102,255 153,153,255
e.g., Y, U, V
Ricardo L. de Queiroz and Philip A. Chou, “Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform,” IEEE Trans. Image Processing, Aug 2016. Maja Krivokuca, Maxim Koroteev, Philip A. Chou, Robert Higgs, and Charles Loop, “A Volumetric Approach to Point Cloud Compression,” in preparation.
𝑑 and ∪ 𝑇𝑗 ∈ ℬ)
in 𝑇
𝜈 = 𝜈 = 2
𝑦2 𝑦1 𝑦2 𝑦1
∫ 𝑔 𝒚 𝑒𝜈 𝒚 = lim
Δ→0 Δ 𝑜
𝜈( 𝒚 | 𝑔 𝒚 ≥ 𝑜Δ ) =
𝑗
𝑔 𝒚𝑗
𝑜Δ 𝜈 𝒚 | 𝑔 𝒚 ≥ 𝑜Δ 𝑔(𝑦) 𝑦 Δ
2
Scaling functions Integer shifts of scaling functions span space of functions that are
𝑞 − 1 over unit intervals
𝑊 𝑊
1
𝑊
2
𝑊
2
𝑊
1
𝑊 𝑊
0 ⊕ 𝑋 0 = 𝑊 1
𝑊
1 ⊕ 𝑋 1 = 𝑊 2
Nested subspaces
Using Lebesgue Measure Using Counting Measure
𝑊 𝑋 𝑋
1 1 1 2 1 1/2 1 −1 − 2 −1 −1 − 2
2 × 1 3 × 2 3 × 1 2
𝑊 𝑊
1
𝑊
2 Using Lebesgue Measure Using Counting Measure
Level 7 (15604 coeffs) Level 6 (3821 coeffs) Level 5 (917 coeffs) Level 8 (62073 coeffs) Level 9 (237965 coeffs)
Level 7 (30455 coeffs) Level 6 (7213 coeffs) Level 5 (1699 coeffs) Level 8 (125244 coeffs) Level 9 (497199 coeffs)
Comparison to Zhang, Florencio, and Loop, “Point cloud attribute compression with graph transform,” ICIP 2014
Xiang Zhang, Philip A. Chou, Ming-Ting Sun, Maolong Yang, et al., “Surface Light Field Compression using a Point Cloud Codec,” submitted to IEEE JETCAS special issue on immersive video, and to appear at ICIP 2018.
Multiview representation Lenslet representation
𝑔 𝒒, 𝝏 , representing the color of surface point 𝒒 = (𝑦, 𝑧, 𝑨) when viewed from direction 𝝏 = (𝜄, 𝜚).
for each 𝒒 generalizes lenslet representation.
some image basis
points to reduce spatial redundancy
W.-C. Chen, et al., “Light field mapping: efficient representation and hardware rendering of surface light fields,” SIGGRAPH 2002
vie view map
𝑔 𝝏 𝒒
Linear combination of basis functions: 𝑔 𝝏 𝒒 = ∑𝑗 𝐻𝑗 𝝏 𝛽𝑗(𝒒)
Basis functions
𝒅 𝑯𝜷 𝑯 𝜷
Coefficients Observations B-spline wavelet basis functions
𝜷 = argmin
𝜷
𝑯𝜷 − 𝒅 2 + 𝜇 𝜷 2 + 𝛾 𝜷 − ഥ 𝜷 2
𝜄 sin 𝜚
Synthetic datasets: Can, Die Natural datasets: Elephant, Fish
N=1, 0.30 MB N=8, 0.62 MB N=32, 1.71 MB N=128, 3.90 MB
N=1, 0.24 MB N=8, 0.53 MB N=32, 1.57 MB N=128, 4.02 MB
Fish Elephant
Jounsup Park, Philip A. Chou, and Jenq-Neng Hwang, “Rate-Utility Optimized Streaming of Volumetric Media for Augmented Reality,” arXiv:1804.09864. Also submitted to IEEE JETCAS special issue on immersive video, and to appear at Globecom 2018.
Streaming
QCIF (176x144) streaming video over 56 Kbps in 1997
Hologram streaming today is like video streaming in 1997
https://bitmovin.com/bitmovin-receives-excellence-dash-award-tile-based-streaming-vr-360-video/
Capture Stage VPC 3D Tiles Multiple Representations Client Buffer Manager Reconstructed VPC User’s viewport Decoding Rendering Request data chunks Viewport information Encoding Representation Tile Data delivery
Play out New data Buffer
Object Object Object Object Segment Manifest Segment Segment Represen- tation Represen- tation Represen- tation GOF GOF GOF GOF Tile Tile Tile ⋮ Tile Index ⋮ ⋮ ⋮ ⋮ File
𝑋
𝑚𝑓𝑏𝑒(𝑢)
𝑋
𝑢𝑠𝑏𝑗𝑚(𝑢)
𝑢0 𝑢 𝜐0 𝜐 Media Timeline Playout Timeline 𝑢𝑗 𝑢𝑗+1 𝑢𝑗−1
Maximize 𝑉 𝑠
1, … , 𝑠 𝐿 = ∑𝑙=1 𝐿
𝑉𝑙 𝑠
𝑙
subject to 𝐶 𝑠
1, … , 𝑠 𝐿 = ∑𝑙=1 𝐿
𝐶𝑙(𝑠
𝑙) ≤ 𝐶𝑗𝑢𝐷𝑝𝑣𝑜𝑢𝑢
max
𝑠1,…,𝑠𝑙 𝑉 𝑠 1, … , 𝑠 𝐿 − 𝜇𝐶 𝑠 1, … , 𝑠 𝐿
= max
𝑠1,…,𝑠𝑙
∑𝑙=1
𝐿
[𝑉𝑙 𝑠
𝑙 − 𝜇𝐶𝑙 𝑠 𝑙 ] = ∑𝑙=1 𝐿
max
𝑠 [𝑉𝑙 𝑠 − 𝜇𝐶𝑙 𝑠 ]
𝑠
𝑙(𝜇) = argmax 𝑠
𝑉𝑙 𝑠 − 𝜇𝐶𝑙 𝑠
𝐶(4) 𝑉(0) 𝑉 4 , 𝑉(5) 𝑉(2) Utility 𝒯 መ 𝒯 5 3 4 2 1 𝐶(1) 𝐶(2) 𝐶(3) 𝐶(5) 𝐶(0) 𝑉(1) 𝑉(3) Bits 𝜇45
𝐶(4) 𝑉(0) 𝑉 4 , 𝑉(5) 𝑉(2) Utility 𝒯 መ 𝒯 5 3 4 2 1 𝐶(2) 𝐶(3) 𝐶(5) 𝐶(1) 𝑉(1) 𝑉(3) Bits 𝜇45
𝑙 = 𝑣 𝑆𝑠𝑙 × 𝑄𝑙 𝑤 × 𝑀𝑃𝐸𝑙(𝑠 𝑙, 𝑤)
𝑆 > 0 𝑆 = 0
𝑓𝑠𝑠(𝑤)
if 𝑙 currently visible from 𝑤 𝑄𝑙
𝑓𝑠𝑠(𝑤)
2
𝑒𝑗𝑡𝑢(𝑤)
𝑝𝑐𝑘𝑓𝑑𝑢.𝑠𝑓𝑞𝑠𝑓𝑡𝑓𝑜𝑢𝑏𝑢𝑗𝑝𝑜 𝑠 .𝑥𝑗𝑒𝑢ℎ∗𝑒𝑗𝑡𝑢(𝑤) 𝑝𝑐𝑘𝑓𝑑𝑢.𝑛𝑏𝑦𝑋𝑗𝑒𝑢ℎ∗𝑝𝑐𝑘𝑓𝑑𝑢.𝑑𝑣𝑐𝑓𝑈𝑝𝑃𝑐𝑘𝑓𝑑𝑢𝑇𝑑𝑏𝑚𝑓
𝑒𝑗𝑡𝑞𝑚𝑏𝑧.ℎ𝑝𝑠𝑨𝑄𝑗𝑦𝑓𝑚𝑡 𝑤𝑗𝑓𝑥[𝑤].𝑔𝑠𝑣𝑡𝑢𝑣𝑛.ℎ𝑝𝑠𝑨𝐺𝑃𝑊
Basic utility, based on bitrate
Probability of user seeing tile, based on user prediction model Level of Detail provided by tile Radians subtended Tile voxels per Radian Display pixels per Radian For user view 𝑤
Test Dataset 1 2 3 4 5 Queen 3 5* 15* 30 55* Loot 3.5* 5 8* 16 27* Redandblack 3.5* 6 9* 18 30* Soldier 3.5* 6 11* 20 37.1* Longdress 3.9* 6 13* 27 42.7*
Call for Proposals for Point Cloud Coding V2, ISO/IEC JTC1/SC29 WG11 Doc. N16763, Hobart, 2017
(a) TBA-Bitrates (b) BBA-Bitrates (c) WBA (proposed)-Bitrates (a) TBA-Buffer Occupancy (b) BBA-Buffer Occupancy (c) WBA (proposed)-Buffer Occupancy
GoF
GoF
Hologram compression today is like video compression in 1988 Hologram streaming today is like video streaming in 1997
(Hint: If you’ve seen it for video, you’ll see it for holograms.)
generated content
Philip A. Chou, 8i Labs, Inc. ACM Multimedia Systems Conference 13 June 2018