Image and Video Coding: Exam Preparation bitstream encoder - PowerPoint PPT Presentation

Image and Video Coding: Exam Preparation bitstream encoder decoder

What Type of Exam? - When and Where? General Online exam via WebEx (or any other system) Possible dates: After August, 3 Write e-mail to heiko.schwarz@hhi.fraunhofer.de with 2–3 suggestions for an exam date Choose Your Type of Exam 1 Conventional Oral Exam Oral exam of 30 minutes Explain fundamental concepts of image and video coding Answer questions to topics discussed in lectures 2 Alternative: Implementation Project Implement an improvement for our image or video codec Present your implementation and simulation results Answer questions to implemented coding tool and your implementation Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 2 / 13

Possible Questions for Oral Exam Topic: Human Visual Perception and Raw Data Formats How are images and videos represented in raw data formats? Calculate the raw data rate (in bits per second) of a full HD (1920 × 1080) video with 50 pictures per second and a bit depth of 8 bits per sample? Why do we need at least 3 color components for representing color images? What is the XYZ color format and how can it be converted into an RGB format? What does the chromaticity diagram show? There are multiple RGB format, how do they differ? What is the “color gamut” of a display or representation format? Is it possible to represent all colors (perceivable by human beings) with an RGB display? What is “gamma encoding” and why is it used? Why are images and videos typically coded in the Y’CbCr format (instead of an RGB format)? What is the YCbCr 4:2:0 chroma sampling format and why is it used? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 3 / 13

Possible Questions for Oral Exam Topic: Transform Coding of Sample Blocks What are the components of a transform encoder and decoder? Why is the quantization done in the transform domain instead of the original sample space? What is the main goal of the transform? What are orthogonal transforms and why are non-orthogonal transforms not used in lossy coding? What is the Karhunen Loève transform (KLT) and why is it often considered as optimal transform? Why do we mainly use separable transforms in image and video coding? Why is the DCT-II used in most image and video codecs? What signal characteristics determine the effectiveness of transform coding? What is the purpose of quantization? What is a uniform reconstruction quantizer (URQ) and why is it used in most lossy codecs? What is the quantization step size (for URQs) and how does it impact the bit rate and reconstruction quality? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 4 / 13

Possible Questions for Oral Exam Topic: Entropy Coding What is the goal of entropy coding? What influences the efficiency of entropy coding? Explain the Huffman algorithm for a simple given probability mass function. What are advantages of conditional entropy coding? What are advantages and disadvantages of block entropy coding? What is entropy, conditional entropy, block entropy, entropy rate? Why are neither scalar Huffman coding, conditional Huffman coding, nor block Huffman coding used for coding the quantization indexes in image and video codecs? Explain the concept of run-level coding (as used in JPEG or MPEG-2 Video) and its advantages. Why is the zig-zag scan (or a similar diagonal scan) used for transform coefficient levels? What are the advantages of arithmetic coding? What design aspects impact the efficiency of arithmetic coding for quantization indexes? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 5 / 13

Possible Questions for Oral Exam Topic: Advanced Intra-Picture Coding Why is it important to utilize statistical dependencies between transform blocks? Explain the effect of the prediction of the DC quantization index used in JPEG. What are the advantages of intra prediction in the sample domain? Explain the idea of directional intra prediction modes. How can the intra prediction mode be selected in an encoder? How does the block size impact the effectiveness of intra prediction and transform coding? Why do modern codecs support the concept of variable block sizes? Why are only simple partitioning concepts supported in modern codecs? Explain the idea of quadtree partitioning. How can an encoder determine a suitable quadtree partitioning for a given block. What are the main design aspects in modern video coding standards that lead to a substantial coding efficiency improvement relative to JPEG for images? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 6 / 13

Possible Questions for Oral Exam Topic: Encoder Control What parameters have to chosen in a video encoder? Explain the difficulty in selecting coding parameters in a video encoder (based on examples). Explain why encoders that are based on distortion minimization are not very efficient. Explain the concept of Lagrangian optimization. Explain the basic approach of a Lagrangian encoder control. How can the Lagrangian concept be used for optimizing the selection of coding modes or intra prediction modes motion vectors and reference indexes a block partitioning quantization indexes Explain why using a “rounding offset” in encoder quantization represents a suitable low-complexity alternative to rate-distortion optimized quantization. Explain some ideas for implementing a fast motion estimation. Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 7 / 13

Possible Questions for Oral Exam Topic: Motion-Compensated Prediction and Hybrid Video Coding Draw a block diagram of a hybrid video encoder and explain the main components. Explain the main idea of motion-compensated prediction. Why do modern video coding standards use quarter-sample accurate motion vectors? Why do we need interpolation for sub-sample accurate motion vectors? Why can longer interpolation filters provide better coding efficiency than a simple bi-linear interpolation? Why are motion vectors predictively coded? Explain an example for motion vector prediction. Explain an example for a coding mode, for which the motion data are not explicitly coded, but derived based on already coded data. Why do all video coding standards support intra coding modes in predictively coded pictures? Why are video encoders much more complex than video decoders? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 8 / 13

Possible Questions for Oral Exam Topic: Advanced Inter-Picture Prediction Explain the concept of “multiple reference pictures”. How can we select the reference picture for a block in a video encoder? Explain the concept of bi-prediction and why it can improve coding efficiency. Why provide video codecs the possibility to select between conventional uni-prediction and bi-prediction on a block by block basis? What are B pictures? Why is it typically advantageous to code video pictures in an order that differs from the acquisition and display order? Explain an example for such an improved coding structure. What are “random access points” and why do we need them in broadcast and streaming? Explain the difference between random access points with “closed GOP” and “open GOP”. What is the goal and the basic concept of a deblocking filter? Why are filters such as a deblocking filter applied inside the motion compensation loop, and not as post filters? Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 9 / 13

Possible Implementation Tasks for our Image and Video Codec Possible Improvements for our Image Codec Implementation of additional directional intra prediction modes (at least two), with Lagrangian decision between the intra prediction modes Implementation of the planar prediction mode (as in HEVC) as additional intra prediction mode, with Lagrangian decision between the intra prediction modes Simple version of variable block sizes with two partitioning options: code B × B block as B × B block (as currently implemented) code B × B block as four ( B / 2 ) × ( B / 2 ) blocks Use Lagrangian decision for selecting between these two partitioning options Extend codec for PGM images to codec for PPM images and compare two variants: Independent coding of R, G, and B color components Color transform to YCbCr (or YCoCg) and coding of these components (with or without chroma subsampling), reconstruction of R, G, and B after decoding and output as PPM image Test you own ideas ... Heiko Schwarz (Freie Universität Berlin) — Image and Video Coding: Exam Preparation 10 / 13

Image and Video Coding: Exam Preparation bitstream encoder - PowerPoint PPT Presentation

Image and Video Coding: Exam Preparation bitstream encoder decoder What Type of Exam? - When and Where? General Online exam via WebEx (or any other system) Possible dates: After August, 3 Write e-mail to heiko.schwarz@hhi.fraunhofer.de with

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

Image and Video Coding: Encoder Control D D = - R d R Problem Statement / Scope of Image

Image and Video Coding: Introduction bitstream encoder decoder Motivation Image and Video

Image and Video Coding: Improved Inter-Picture Prediction Review of Hybrid Video Coding Last

Image and Video Coding: Hybrid Video Coding s n 1 [ x , y ] s n [ x , y ] m k = ( m x , m

ADVANCED MULTIMEDIA ADVANCED MULTIMEDIA CODING CODING Fernando Pereira Instituto Superior

Formal Modeling in Cognitive Science 1 Coding Theorems Lecture 28: Kraft Inequality; Source Coding

Image and Video Coding: Motion Estimation and Coding 4 5 6 B C D 1 D 0 3 7 A current 2

VIDEO SIGNALS Lossless coding g LOSSLESS CODING LOSSLESS CODING The goal of lossless image

Image and Video Coding: Video Coding Standards s k [ x , y ] u k [ x , y ] quantization indexes q

Dynamical systems Expanding maps on the circle. Coding Jana Rodriguez Hertz ICTP 2018 coding

Exam4 Information and Guidance General Topics General Exam Information Exam types

Quicksort Sorting Lower Bound Exam Exam Exam Exam 2 2 tomorrow evening 2 2 tomorrow

Image and Video Coding: Transform Coefficient Coding 18 6 2 0 1 0 0 0 2 0 1 0 0 0 0 0 1 2 0 0 0

Learning-Based Image/Video Coding Lu Yu Zhejiang University Outlines System architecture of

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

COMP 3403 Algorithm Analysis Part 5 Chapter 9 Jim Diamond CAR 409 Jodrey School of

ANALOG AND DIGITAL VIDEO Henning Schulzrinne Columbia University COMS 6181 - Spring 2015 with

D ATA C OMPRESSION May. 7, 2015 Acknowledgement: The course slides are

1 Data structures for decoder: Construction of canonical Huffman: (sketch) The array

In the name of Allah the compassionate, the merciful Digital Image Processing S. Kasaei Kasaei

TSBK01 J RGEN A HLBERG - History - How many samples/pixels/bits? I MAGE CODING AND DATA 3. A

Architectural Specialization for Inter-Iteration Loop Dependence Patterns Shreesha Srinath,

Robust Header Compression (ROHC) 53rd IETF Minneapolis, March 2002 Chairs: Carsten Bormann