CMPT365 Multimedia Systems 1
Media Compression
- Video Coding Standards
Spring 2017
CMPT 365 Multimedia Systems Media Compression - Video Coding - - PowerPoint PPT Presentation
CMPT 365 Multimedia Systems Media Compression - Video Coding Standards Spring 2017 Edited from slides by Dr. Jiangchuan Liu CMPT365 Multimedia Systems 1 Video Coding Standards H.264/AVC CMPT365 Multimedia Systems 2 Coding Rate and
CMPT365 Multimedia Systems 1
Spring 2017
CMPT365 Multimedia Systems 2
CMPT365 Multimedia Systems 3
8 16 64 384 1.5 5 20
kbit/s Mbit/s
Very low bitrate
Mobile videophone Videophone
ISDN videophone Digital TV HDTV Video CD
CMPT365 Multimedia Systems 4
❒
❍
standards for advanced moving image coding methods appropriate for conversational and non-conversational audio/visual applications.
❒
Group)
❍
standards for compression and coding, decompression, processing, and coded representation of moving pictures, audio, and their combination
❒
❍
ITU-T H.262~ISO/IEC 13818-2(mpeg2) Generic Coding of Moving Pictures and Associated Audio.
❍
ITU-T H.263~ISO/IEC 14496-2(mpeg4) WG - work group SG – sub group ISO/IEC JTC 1/SC 29/WG 1 Coding of Still Pictures ISO/IEC JTC 1/SC 29/WG 11
CMPT365 Multimedia Systems 5
CMPT365 Multimedia Systems 6
CMPT365 Multimedia Systems 7
❍ – 352 × 240 for NTSC video at 30 fps ❍ – 352 × 288 for PAL video at 25 fps ❍ – It uses 4:2:0 chroma subsampling
CMPT365 Multimedia Systems 8
CMPT365 Multimedia Systems 9
CMPT365 Multimedia Systems 10
(one from the forward and one from the backward prediction).
and the two corresponding matching MBs are averaged (indicated by ‘%’ in the figure) before comparing to the Target MB for generating the prediction error.
frames, then only one MV and its corresponding MB will be used from either the forward or backward prediction.
CMPT365 Multimedia Systems 11
❒ Fig 11.2: B-frame Coding Based on Bidirectional Motion
CMPT365 Multimedia Systems 12
❒ Fig 11.3: MPEG Frame Sequence.
CMPT365 Multimedia Systems 13
❒
❒ ❒
CMPT365 Multimedia Systems 14
CMPT365 Multimedia Systems 15
CMPT365 Multimedia Systems 16
❒ Fig 11.4: Slices in an MPEG-1 Picture.
CMPT365 Multimedia Systems 17
❒
❒
1
2
CMPT365 Multimedia Systems 18
16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 8 16 19 22 26 27 29 34 16 16 22 24 27 29 34 37 19 22 26 27 29 34 34 38 22 22 26 27 29 34 37 40 22 26 27 29 32 25 40 48 26 27 29 32 35 40 48 58 26 27 29 34 38 46 56 69 27 29 35 38 46 56 69 83
CMPT365 Multimedia Systems 19
CMPT365 Multimedia Systems 20
frames — because temporal redundancy is exploited in inter-frame compression.
directional prediction and (b) the lowest priority given to B-frames.
CMPT365 Multimedia Systems 21
❒ Fig 11.5: Layers of MPEG-1 Video Bitstream.
CMPT365 Multimedia Systems 22
CMPT365 Multimedia Systems 23
❒
Table 11.5: Profiles and Levels in MPEG-2 Level Simple profile Main profile SNR Scalable profile Spatially Scalable profile High Profile 4:2:2 Profile Multiview Profile High * * * High 1440 * * * * Main * * * * * * Low * * *
Level Max. Resolution Max fps Max pixels/sec Max coded Data Rate (Mbps) Application High 1920 × 1152 60 62.7 × 106 80 film production High 1440 1440 × 1152 60 47.0 × 106 60 consumer HDTV Main 720 × 576 30 10.4 × 106 15 Studio TV Low 352 × 288 30 3.0 × 106 4 consumer tape equiv.
CMPT365 Multimedia Systems 24
❍ – In a Frame-picture, all scanlines from both fields are
❍ – If each field is treated as a separate picture, then it is
CMPT365 Multimedia Systems 25
(a) Frame−picture vs. Field−pictures (b) Field Prediction for
CMPT365 Multimedia Systems 26
CMPT365 Multimedia Systems 27
3.
4.
5.
CMPT365 Multimedia Systems 28
CMPT365 Multimedia Systems 29
❒ Fig 11.7: Zigzag and Alternate Scans of DCT Coefficients for
CMPT365 Multimedia Systems 30
CMPT365 Multimedia Systems 31
decoded to obtain basic video quality.
dependent on the base layer or the previous enhancement layer.
❍ – Networks with very different bit-rates. ❍ – Networks with variable bit rate (VBR) channels. ❍ – Networks with noisy connections.
CMPT365 Multimedia Systems 32
1.
2.
3.
4.
5.
CMPT365 Multimedia Systems 33
1.
2.
3.
CMPT365 Multimedia Systems 34
❒ Fig 11.8 (a): MPEG-2 SNR Scalability (Encoder).
CMPT365 Multimedia Systems 35
❒ Fig 11.8 (b): MPEG-2 SNR Scalability (Decoder).
CMPT365 Multimedia Systems 36
CMPT365 Multimedia Systems 37
❒Fig. 11.9: Encoder for MPEG-2 Spatial Scalability. ❒(a) Block Diagram. (b) Combining Temporal and Spatial Predictions
CMPT365 Multimedia Systems 38
CMPT365 Multimedia Systems 39
❒ Fig 11.10: Encoder for MPEG-2 Temporal Scalability.
CMPT365 Multimedia Systems 40
❒ Fig 11.10 (Cont’d): Encoder for MPEG-2 Temporal Scalability
CMPT365 Multimedia Systems 41
CMPT365 Multimedia Systems 42
CMPT365 Multimedia Systems 43
CMPT365 Multimedia Systems 44
1.
2.
CMPT365 Multimedia Systems 45
CMPT365 Multimedia Systems 46
❒ Fig. 11.11: Composition and Manipulation of MPEG-4 Videos.
CMPT365 Multimedia Systems 47
Slides Credit: http://www.oipf.tv/docs/mpegif/smpteenvivio.pdf
CMPT365 Multimedia Systems 48
(a)
(b)
(c)
CMPT365 Multimedia Systems 49
CMPT365 Multimedia Systems 50
CMPT365 Multimedia Systems 51
1.
Video-object Sequence (VS)—delivers the complete MPEG-4 visual scene, which may contain 2-D or 3-D natural or synthetic objects.
2.
Video Object (VO) — a particular object in the scene, which can be
background of the scene.
3.
Video Object Layer (VOL) — facilitates a way to support (multi- layered) scalable coding. A VO can have multiple VOLs under scalable coding, or have a single VOL under non-scalable coding.
4.
Group of Video Object Planes (GOV) — groups Video Object Planes together (optional level).
5.
Video Object Plane (VOP) — a snapshot of a VO at a particular moment.
CMPT365 Multimedia Systems 52
CMPT365 Multimedia Systems 53
❒
CMPT365 Multimedia Systems 54
prediction is employed, or B-VOPs if bi-directional predictions are employed.
❒
CMPT365 Multimedia Systems 55
(a)
(b)
(c)
CMPT365 Multimedia Systems 56
❒ Fig. 11.15: Bounding Box and Boundary Macroblocks of VOP.
CMPT365 Multimedia Systems 57
CMPT365 Multimedia Systems 58
CMPT365 Multimedia Systems 59
decoder only once at the beginning of the video sequence.
and parameters describing the camera movements thus far, it can reconstruct the scene in an efficient manner.
a sequence of video frames.
CMPT365 Multimedia Systems 60
❒
Piper image courtesy of Simon Fraser University Pipe Band.
CMPT365 Multimedia Systems 61
CMPT365 Multimedia Systems 62
CMPT365 Multimedia Systems 63
CMPT365 Multimedia Systems 64
❒ Fig. 11.21: 2D Mesh Object Plane (MOP) Encoding Process
CMPT365 Multimedia Systems 65
❒ Fig. 11.24: A breadth-first order of MOP triangles for 2D
CMPT365 Multimedia Systems 66
❒ Fig. 11.25: Mesh-based texture mapping for
CMPT365 Multimedia Systems 67
CMPT365 Multimedia Systems 68
A video frame Background VOP VOP VOP
Alpha map SA DCT
CMPT365 Multimedia Systems 69
Object 2 Object 1 Object 3 Object 4
CMPT365 Multimedia Systems 70