100x Evolution of Vid ideo Codec Chips
Jinjia Zhou1, Dajiang Zhou2, Satoshi Goto2
1Hosei University, Tokyo, Japan 2Waseda University, Kitakyushu, Japan
Codec Chips Tribute to Prof. Goto Jinjia Zhou 1 , Dajiang Zhou 2 , - - PowerPoint PPT Presentation
100x Evolution of Vid ideo Codec Chips Tribute to Prof. Goto Jinjia Zhou 1 , Dajiang Zhou 2 , Satoshi Goto 2 1 Hosei University, Tokyo, Japan 2 Waseda University, Kitakyushu, Japan Prof. Goto has been my supervisor from 2008 to 2015. (M.S ->
Jinjia Zhou1, Dajiang Zhou2, Satoshi Goto2
1Hosei University, Tokyo, Japan 2Waseda University, Kitakyushu, Japan
2
first to use SiS DRAM (VLSI’07 and JSSC’09)
(VLSI’12)
JSSC’14)
JSSC’16)
3
4
Source video data Encoder Receiver Compressed video stream Decoder Compression Decompression Video camera Display device Restored video data stream Transmit. Channel/ Storage 100% 100% ~1% ~1% ~1%
5
Enc./Dec. Chip TV conference Surveillance Automotive Mobile/Portable Small frame delay Ultra-low power High compression High video quality Free-point view …. ……
Source of the images: http://www.artesanosdecastillalamancha.org/wp-content/uploads/2015/06/28.png http://www.caradvice.com.au/67890/2011-brakes-camera-action-pedestrian-detection-automated-platooning/photos/
Home entertainment
6
25~30fps
≥120fps
7680 pixels
7
1990 1995 2000 2005 2010
MPEG-1 MPEG-4 H.261 H.263 (+/++) H.262 MPEG-2 H.264 MPEG-4 AVC
H.265 HEVC ITU-T standards MPEG standards Joint ITU-T & MPEG standards Compression ratio
~50:1 ~100:1
~200:1 2013
8
throughput in the communication channel
high complexity
9
H.264 (2003) ~480Mbps ~240Mbps RAW HEVC (2013) 48000Mbps
1 20 307.2 0.1 1 10 100 1000 1080p/MPEG-2 4K/H.264 8K/HEVC Complexity/pixel Throughput Overall complexity
10
NHK 8K codec (2007) NHK 8K encoder (2013) Our target: Single chip/chipset
11
consumed by DRAM traffic
12
Codec DRAM Memory
traffic
13
14
Transform & Quant.
Deblocking Filter Reference Frames Frame Output Motion Compensation Motion Estimation Intra Prediction
Coding
Source Frm. Decoder
Memory bandwidth requirements Computational complexity Data dependencies
15
System Algorithm Architecture Circuits Device Evaluation
Bus/interface optimization, 3DLSI Processing order optimization Embedded compression 2-D cache Reduce complexity Trade-off b/w time & quality Hardware friendliness, … Alleviate data dependencies Processing order optimization Predictive execution, …
Reduce memory access / Increase memory bandwidth
<16>
Word Block
<17>
Merged
<18>
automated by BIBO given both word addressing and block scan follow a Z-scan order
<19>
Push
<20>
Pull
<21>
<22>
Source: http://www.f.waseda.jp/goto/html/chip.html 23
24
25
27.6 1990 249 3981 1000 2000 3000 4000 MIT ASSCC'08 Ours ISSCC'12 Ours VLSIC'12 NTU VLSIC'13 MIT ISSCC'13 NTT VLSIC'15 Ours ISSCC'16
Mpixel/s
H.264 decoder HEVC decoder H.264 encoder HEVC encoder
26
27