DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira - PowerPoint PPT Presentation

DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira Instituto Superior Técnico Audiovisual Communication, Fernando Pereira, 2020/2021

DCT Based Image Coding: Let’s Create Bits ! Statistical Redundancy Spatial Quantization Redundancy Coding tables tables bits Block Entropy DCT Quantization PCM splitting coder component samples 8×8 8×8 samples 8×8 DCT coeffs ≠ block quantized DCT coeffs Transmission Irrelevancy or storage Quantization Coding tables tables Inverse Entropy Block IDCT quantization decoder assembling Audiovisual Communication, Fernando Pereira, 2020/2021

The JPEG Sequencial Mode: one Single Scan The image is made available at a single resolution, at a single quality, in a single scan ! No flexibility to serve other ‘clients’ with different needs in terms of resolution and quality … Audiovisual Communication, Fernando Pereira, 2020/2021

What is Transformed ? The Samples ! Transform is applied to each image component, block after block ...   87 89 101 106 118 130 142 155   85 91 101 105 116 129 135 149     86 92 96 105 112 128 131 144   Same process (in 92 88 102 101 116 129 135 147   Y =   parallel) for 88 94 94 98 113 122 130 139   luminance and the   88 95 98 97 113 119 133 141   92 99 98 106 107 118 135 145 chrominances !       89 95 98 107 104 112 130 144 Audiovisual Communication, Fernando Pereira, 2020/2021

Where does Compression come from ?  REDUNDANCY – Regards the similarities, correlation and predictability of samples and symbols corresponding to the image/audio/video data. -> redundancy reduction does not involve any information loss, implying it is a reversible process – > lossless coding  IRRELEVANCY – Regards the part of the information which is imperceptible for the visual or auditory human systems. -> irrelevancy reduction involves removing non-redundant information, implying it is an irreversible process -> lossy coding Source coding exploits these two concepts: for this, it is necessary to know the source statistics and the human visual/auditory systems characteristics. Audiovisual Communication, Fernando Pereira, 2020/2021

Exploiti ploiting ng th the Spati patial al Re Redundancy dundancy Audiovisual Communication, Fernando Pereira, 2020/2021

Discrete Cosine Transform (DCT) The DCT is one of the several sinusoidal transforms available; its basis functions correspond to discretized sinusoisal functions. DCT Transform Image coefficients block Image DCT block Transform coefficients The DCT is the most used transform for image and video coding since its performance is close to the KLT performance for highly correlated signals; moreover, there are fast implementation algorithms available. Audiovisual Communication, Fernando Pereira, 2020/2021

DCT Bidimensional Basis Functions (N=8) All existing and future image blocks can be rather efficienctly represented with these 64 (8×8) basic images !!! You see here 64 8x8 sample blocks ! Audiovisual Communication, Fernando Pereira, 2020/2021

  87 89 101 106 118 130 142 155 64 PCM samples are   85 91 101 105 116 129 135 149   transformed into 64   86 92 96 105 112 128 131 144 DCT coefficients !   Luminance 92 88 102 101 116 129 135 147     Samples, Y = 88 94 94 98 113 122 130 139 But more     88 95 98 97 113 119 133 141 perceptual   92 99 98 106 107 118 135 145 compression       89 95 98 107 104 112 130 144 friendly ! DCT   898.0000 - 149.5418 26.6464 - 14.0897 0.7500 - 5.7540 3.5750 0.0330   12.1982 - 16.5235 - 7.6122 5.2187 - 0.2867 - 1.9909 8.4265 1.2591     5.3355 - 2.6557 2.3410 - 9.9277 2.4614 4.4558 - 3.1945 - 3.1640   DCT  1.9463 - 2.7271 1.5106 2.8421 - 2.1336 - 2.7203 - 2.7510 5.4051  Coefficients =   0.7500 - 2.0745 0.8610 0.2085 2.5000 1.8446 2.0787 2.4750     7.9536 - 2.6624 2.6308 0.4010 0.4772 3.3000 1.7394 0.3942   - 4.1042 - 0.1650 - 0.6945 0.0601 0.0628 - 0.7874 - 0.8410 0.3496       - 3.4688 2.3804 0.1559 0.8696 0.1142 - 0.5240 - 3.9974 - 5.6187 Audiovisual Communication, Fernando Pereira, 2020/2021

How Does the DCT Work ? Spatial Domain, samples Frequency Domain, DCT coefficients X X X X X X X X x y a X X X X X X X X C f d B c DCT X X X X X X X X H k X X X X X X X X Y i p X X X X X X X X w q d X X X X X X X X X X X X X X X X n m X X X X X X X X z 8×8×8=512 bits Audiovisual Communication, Fernando Pereira, 2020/2021

Easy/Cheap versus Though/Expensive Blocks 8×8 samples 8×8 samples 8×8 samples All blocks above have the same price (8×8×8)=512 bits in the PCM/spatial domain because redundancy is not exploited ! In the DCT/frequency domain, simpler blocks will be cheaper and vice- versa because ‘information’ is bought with more DCT coefficients and associated rate. Audiovisual Communication, Fernando Pereira, 2020/2021

Exploiti ploiting ng th the Pe Perce rceptual ptual Ir Irrelevance relevance Audiovisual Communication, Fernando Pereira, 2020/2021

How Does DCT Coding Work ? Quantization Level for Samples DCT DCT Quantized Coefficients (spatial domain) coefficients Round (S/Q) Sij s ij Sqij ≠ ≠ Transmission Quantization or tables storage Qij Reconstructed Dec. samples Level for IDCT DCT Quantized (spatial e.g. R = Sq*Q coefficients domain) coefficients r ij Sqij Rij Inverse quantization Audiovisual Communication, Fernando Pereira, 2020/2021

Quantization Matrices For transparent quality, JPEG suggests to quantize the DCT coefficients using the values for the ‘minimum perceptual difference’ (for each coefficient) multiplied by 2; for more compression, a multiple of them may be used. The quantization matrixes have to be always transmitted or at least signalled. 16 11 10 16 24 40 51 61 17 18 24 47 99 99 99 99 12 12 14 19 26 58 60 55 18 21 26 66 99 99 99 99 24 26 56 99 99 99 99 99 14 13 16 24 40 57 69 56 47 66 99 99 99 99 99 99 14 17 22 29 51 87 80 62 99 99 99 99 99 99 99 99 18 22 37 56 68 109 103 77 99 99 99 99 99 99 99 99 24 35 55 64 81 104 113 92 99 99 99 99 99 99 99 99 49 64 78 87 103 121 120 101 99 99 99 99 99 99 99 99 72 92 95 98 112 100 103 99 Situation: Luminance and crominance with 2:1 horizontal subsampling; samples with 8 bits ( Lohscheller ) Audiovisual Communication, Fernando Pereira, 2020/2021

From DCT Coeffs to Quantized DCT Coeffs   898.0000 - 149.5418 26.6464 - 14.0897 0.7500 - 5.7540 3.5750 0.0330   12.1982 - 16.5235 - 7.6122 5.2187 - 0.2867 - 1.9909 8.4265 1.2591     5.3355 - 2.6557 2.3410 - 9.9277 2.4614 4.4558 - 3.1945 - 3.1640   1.9463 - 2.7271 1.5106 2.8421 - 2.1336 - 2.7203 - 2.7510 5.4051     0.7500 - 2.0745 0.8610 0.2085 2.5000 1.8446 2.0787 2.4750     7.9536 - 2.6624 2.6308 0.4010 0.4772 3.3000 1.7394 0.3942   - 4.1042 - 0.1650 - 0.6945 0.0601 0.0628 - 0.7874 - 0.8410 0.3496       - 3.4688 2.3804 0.1559 0.8696 0.1142 - 0.5240 - 3.9974 - 5.6187   56 - 14 3 - 1 0 0 0 0 Quantizing   Finally, the with 1 - 1 - 1 0 0 0 0 0   waited   selected 0 0 0 0 0 0 0 0 miracle !   quantization  0 0 0 0 0 0 0 0  matrix …   0 0 0 0 0 0 0 0     0 0 0 0 0 0 0 0 The rate is reduced,   eventually at no 0 0 0 0 0 0 0 0     quality cost !   0 0 0 0 0 0 0 0 Audiovisual Communication, Fernando Pereira, 2020/2021

The DCT Pipeline Sample blocks Decoded sample blocks Audiovisual Communication, Fernando Pereira, 2020/2021

Photo with the compression rate decreasing, and hence quality increasing, from left to right. DCT coefficients selection and quantization matrix are the secrets ! And not defined by the standard … Audiovisual Communication, Fernando Pereira, 2020/2021

Qual ualit ity y Assessm essment ent Audiovisual Communication, Fernando Pereira, 2020/2021

Visual (Subjective) Inspection Audiovisual Communication, Fernando Pereira, 2020/2021

Objective Quality Assessment Compression X(m,n) Y(m,n) Decoded Original/reference 2 255  PSNR(dB) 10 log 10 MSE There are other M N Objective evaluation 1   y  objective 2 MSE ( x ) ij ij x and y are the original and quality metrics ! MN   i 1 j 1 decoded data Audiovisual Communication, Fernando Pereira, 2020/2021

Audiovisual Communication, Fernando Pereira, 2020/2021

DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira - PowerPoint PPT Presentation

DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira Instituto Superior Tcnico Audiovisual Communication, Fernando Pereira, 2020/2021 DCT Based Image Coding: Lets Create Bits ! Statistical Redundancy Spatial Quantization

Algorithms of Scientific Computing Discrete Cosine Transform (DCT) Michael Bader Summer Term

ECE 417, Lecture 6: Discrete Cosine Transform Mark Hasegawa-Johnson 9/6/2019 Outline DCT

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Transform Coding - Overview Principle of block-wise transform coding Properties of orthonormal

Cosine (1.2 continued) Objectives: 1. Determine the range and period for sine and cosine and use

2 Unit Bridging Course Day 10 Circular Functions III The cosine function, identities and

Mesh Simplification Mesh Simplification 1 Spring 2010 The The Law The The Law Law of Law

Sine/Cosine using Sine/Cosine using CORDIC Algorithm CORDIC Algorithm Prof. Kris Gaj Gaj

Fast Fourier Transform Discrete-time windowing Discrete Fourier Transform Relationship

Discrete Fourier Transform Graduate School of Culture Technology (GSCT) Juhan Nam 1 Outlines

Fourier Transform for Partial Differential Equations Introduction: Fourier Transform

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Topic 5: Discrete-Time Fourier Transform (DTFT) o DT Fourier Transform o Overview of Fourier

Fast Fourier Transform Fourier Series & Transform Summary Discrete-time windowing X [

Optimizing Discrete Wavelet Transform Optimizing Discrete Wavelet Transform on the Cell Broadband

DCPS: Building Expert Readers through Content-Rich Curriculum

Data Management Systems Query Processing Introduction Execution models Optimization I

Introduction to Relational Algebra Elmasri/Navathe ch 6 Padron-McCarthy/Risch ch 10 Silvia

Taking Storage for a Ride Ren W. Schmidt, Storage Platform NOVEMBER 4, 2016 About me Uber

CSEE 6861 CAD of Digital Systems Handout: Lecture #14 4/28/16 Prof. Steven M. Nowick

Layout Hotspot Detection with Feature Tensor Generation and Deep Biased Learning Haoyu Yang 1 ,

Null Space Gradient Flows for Constrained Optimization with Applications to Shape Optimization

MuMMI : Multiple Metrics Modeling Infrastructure Valerie Taylor, Xingfu Wu, Charles Lively

DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira - PowerPoint PPT Presentation

DISCRETE COSINE TRANSFORM Laboratory session Fernando Pereira Instituto Superior Tcnico Audiovisual Communication, Fernando Pereira, 2020/2021 DCT Based Image Coding: Lets Create Bits ! Statistical Redundancy Spatial Quantization

Algorithms of Scientific Computing Discrete Cosine Transform (DCT) Michael Bader Summer Term

ECE 417, Lecture 6: Discrete Cosine Transform Mark Hasegawa-Johnson 9/6/2019 Outline DCT

Fourier Series and Transform Overview Why Fourier transform? Trigonometric functions Who is

Topic 10: The Z Transform o Introduction to Z Transform o Relationship to the Fourier transform o

Transform Coding - Overview Principle of block-wise transform coding Properties of orthonormal

Cosine (1.2 continued) Objectives: 1. Determine the range and period for sine and cosine and use

2 Unit Bridging Course Day 10 Circular Functions III The cosine function, identities and

Mesh Simplification Mesh Simplification 1 Spring 2010 The The Law The The Law Law of Law

Sine/Cosine using Sine/Cosine using CORDIC Algorithm CORDIC Algorithm Prof. Kris Gaj Gaj

Fast Fourier Transform Discrete-time windowing Discrete Fourier Transform Relationship

Discrete Fourier Transform Graduate School of Culture Technology (GSCT) Juhan Nam 1 Outlines

Fourier Transform for Partial Differential Equations Introduction: Fourier Transform

SMART GOVERNMENT INVOICING: INVOICE PROCESSING PLATFORM LEAD. TRANSFORM. DELIVER LEAD. TRANSFORM.

Topic 5: Discrete-Time Fourier Transform (DTFT) o DT Fourier Transform o Overview of Fourier

Fast Fourier Transform Fourier Series &amp; Transform Summary Discrete-time windowing X [

Optimizing Discrete Wavelet Transform Optimizing Discrete Wavelet Transform on the Cell Broadband

DCPS: Building Expert Readers through Content-Rich Curriculum

Data Management Systems Query Processing Introduction Execution models Optimization I

Introduction to Relational Algebra Elmasri/Navathe ch 6 Padron-McCarthy/Risch ch 10 Silvia

Taking Storage for a Ride Ren W. Schmidt, Storage Platform NOVEMBER 4, 2016 About me Uber

CSEE 6861 CAD of Digital Systems Handout: Lecture #14 4/28/16 Prof. Steven M. Nowick

Layout Hotspot Detection with Feature Tensor Generation and Deep Biased Learning Haoyu Yang 1 ,

Null Space Gradient Flows for Constrained Optimization with Applications to Shape Optimization

MuMMI : Multiple Metrics Modeling Infrastructure Valerie Taylor, Xingfu Wu, Charles Lively

Fast Fourier Transform Fourier Series & Transform Summary Discrete-time windowing X [