Mode-dependent Rate-distortion Optimized Transforms Using Graph - PowerPoint PPT Presentation

Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Methods Keng-Shih Lu and Antonio Ortega October 22, 2019 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Outline Background: Graph Signal Processing 1 Mode-dependent Data-driven Transforms 2 Fast GFTs based on Graph Symmetries 3 Efficient RD Approximation using Laplacian Operators 4 Conclusion 5 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Graph Signal Processing (GSP) Graph Fourier Transform (GFT) (a.k.a. graph-based transforms) Laplacian matrix L = D − W + S Examples: GFT basis functions U : eigenvectors of L ( L = UΛU ⊤ ) GFTs of G D and G A are DCT and ADST Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Graph Signal Processing (GSP) Graph Fourier Transform (GFT) (a.k.a. graph-based transforms) Laplacian matrix L = D − W + S Examples: GFT basis functions U : eigenvectors of L ( L = UΛU ⊤ ) GFTs of G D and G A are DCT and ADST Probabilistic interpretations: Graph ← → Gaussian Markov Random Field (GMRF) Large edge weight ← → high correlation GFT on graph signal ← → decorrelation (PCA) of GMRF data Designing graph weights ← → parameter estimation for a GMRF Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

DCT and ADST Discrete cosine transform (DCT) Asymmetric discrete sine transform (ADST) (a) Graph (a) Graph Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

DCT and ADST Discrete cosine transform (DCT) Asymmetric discrete sine transform (ADST) (a) Graph (a) Graph (b) u 1 (b) u 1 Each node corresponds to one pixel Large self-loop ← → small value in u 1 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

GSP for Image and video compression Prior work Graph template transforms for texture images [Pavez et. al. 2015] Piecewise smooth image compression [Hu et. al. 2015] Generalized GFTs for intra predicted video coding [Hu et. al. 2015] Edge-adaptive GFTs for inter predicted video coding [Egilmez et. al. 2015] In this talk: graph-based methods for AV1/AV2 Rate-distortion optimized transforms (with graph-based regularizations) Transforms obtained are mode-dependent Achieved compression gains on AV1/AV2 Fast GFT designs Fast RD approximation Achieved speedup in transform type search Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Rate-Distortion Optimized Transforms (RDOT) RDOT [Effros et. al., 1999], [Zhao et. al. 2012], [Zou et. al. 2013], Goal: learn a transform in a system using multiple transforms (e.g. AV1) Main idea: use RD-based transform selection during learning Procedure : for each iteration Note Can be easily extended to multiple learned transforms Lloyd-like algorithm − → solution depends on the initialization Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Training RDOT for AV1 Goal: introduce a new 1D transform for each inter/intra block Intra–block statistics are highly mode-dependent We train MD-RDOT: mode-dependent RDOTs Inter–block statistics are symmetric Learn RDOT and FLIPRDOT together New transform types: 2D combinations of Each intra mode: MD-RDOT & DCT Inter: RDOT, FLIPRDOT, and DCT Implementation details Training data: 2D residues extracted from AV1 We use weighted sum of squared transform coefficients for classification Proxy of the RD cost Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Graph-based Regularizations Idea: force the RDOT to be a GFT Learning a graph from data (covariance matrix S ) L is a Laplacian − log det ( L ) + trace ( LS ) minimize Convex problem with iterative solver [Egilmez et. al. 2018] Transforms with different regularization settings KLT: no regularization GFT: with graph Laplacian constraints LGT: line graph transform (graph Laplacian with line graph topology) Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Resulting Transform Bases (a) KLT for inter (b) GFT for inter (c) LGT for inter Observations: when using regularization constraints Similar shape to KLT But more localized basis functions with sharper transitions Fewer parameters to choose Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Experimental Results Graph-based regularization Compression gain on AV1 w.r.t. training set size Training set size per mode 12500 25000 50000 100000 KLT 0.7317% 0.6922% 0.8476% 0.7749% GFT 0.7480% 0.6935% 0.6235% 0.4233% LGT 0.5527% 0.5401% 0.7235% 0.5698% Graph-based transforms may outperform KLT when training set is small Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Experimental Results Graph-based regularization Compression gain on AV1 w.r.t. training set size Training set size per mode 12500 25000 50000 100000 KLT 0.7317% 0.6922% 0.8476% 0.7749% GFT 0.7480% 0.6935% 0.6235% 0.4233% LGT 0.5527% 0.5401% 0.7235% 0.5698% Graph-based transforms may outperform KLT when training set is small AV2 Experiment–CONFIG MODE DEP TX RDOT with KLT applied Compression gains on AOM lowres test set Overall Key frames With sep. KLT 0.70% 0.64% With sep. & non-sep. KLTs 0.79% 1.09% Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Fast GFT? Example: DCT Key components cos π / 4 cos π / 4 cos π / 4 cos π / 4 cos π / 4 cos π / 4 − cos π / 4 − cos π / 4 (a) Givens rotation sin π / 8 sin π / 8 - cos π / 8 cos π / 8 − sin 3 π / 8 − sin 3 π / 8 - cos 3 π / 8 cos 3 π / 8 sin π / 16 sin π / 16 - cos π / 16 cos π / 16 − cos π / 4 − cos π / 4 sin 5 π / 16 sin 5 π / 16 - - cos π / 4 cos π / 4 cos 5 π / 16 cos 5 π / 16 - cos π / 4 cos π / 4 − sin 5 π / 16 − sin 5 π / 16 (b) Haar unit - cos π / 4 cos π / 4 cos 3 π / 16 cos 3 π / 16 − sin 7 π / 16 − sin 7 π / 16 - cos 7 π / 16 cos 7 π / 16 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Fast GFT? Example: DCT Key components cos π / 4 cos π / 4 cos π / 4 cos π / 4 cos π / 4 cos π / 4 − cos π / 4 − cos π / 4 (a) Givens rotation sin π / 8 sin π / 8 - cos π / 8 cos π / 8 − sin 3 π / 8 − sin 3 π / 8 - cos 3 π / 8 cos 3 π / 8 sin π / 16 sin π / 16 - cos π / 16 cos π / 16 − cos π / 4 − cos π / 4 sin 5 π / 16 sin 5 π / 16 - - cos π / 4 cos π / 4 cos 5 π / 16 cos 5 π / 16 - cos π / 4 cos π / 4 − sin 5 π / 16 − sin 5 π / 16 (b) Haar unit - cos π / 4 cos π / 4 cos 3 π / 16 cos 3 π / 16 − sin 7 π / 16 − sin 7 π / 16 - cos 7 π / 16 cos 7 π / 16 Graph structural property − → fast GFT? We will focus on butterfly stages with Haar units Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

GFTs with Haar Units Theorem GFT has a left butterfly stage ⇐ ⇒ graph is symmetric See [Lu and Ortega, TSP 2019] for formal definition of symmetry (node pairing) Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Examples of Fast GFTs Fast GFTs on 1D blocks : symmetric line graph Fast GFTs on 2D blocks : symmetric grid graph (a) Up-down symmetric (c) Centrosymmetric (b) Diagonal symmetric Each symmetry ⇒ multiplications reduced by half Leads to fast separable & non-separable transforms Coding gain achieved in [Gnutti et. al., 2018] Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Rate-distortion Optimization RD cost evaluation: D + λ × R For each ( partition , mode , tx type ) , we need transform & quantization & entropy coding ⇒ Brute force is very computationally expensive Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A. Ortega / 24

Mode-dependent Rate-distortion Optimized Transforms Using Graph - PowerPoint PPT Presentation

Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Methods Keng-Shih Lu and Antonio Ortega October 22, 2019 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A.

Labor Classification Yrs Rate 1 Rate 2 Rate 3 Rate 4 Rate 5 Rate 6 Rate 7 Rate 8 Rate 9

Chapter 10 Rate Distortion Theory Peng-Hua Wang Graduate Inst. of Comm. Engineering National

Motion Estimation by Affine Transforms Motion Estimation by Affine Transforms Motion Estimation

Why Dependent Origination? So what is dependent origination? Dependent on ignorance, there

Control of switch-mode converters Current Programmed Mode control CPM Mor M. Peretz, Switch-Mode

JUST THE MATHS SLIDES NUMBER 16.7 LAPLACE TRANSFORMS 7 (An appendix) by A.J.Hobson One

Drawing on the Web CSS CSCI-UA 380 Transforms, Transitions, and Animation Drawing on the Web

A Rate-Distortion One-Class Model and its Applications to Clustering Fernando Pereira 1 Koby

Magnetic Distortion Distortion of of Magnetic HPD Images HPD Images smund Skjveland

Using Distortion in 3D Using Distortion in 3D Sheelagh Carpendale Sheelagh Carpendale

Digital Pre-Distortion Derek Kozel What is Digital Pre-Distortion (DPD) A technique for

CMB Spectral Distortion Computations using the Greens function package of CosmoTherm Primordial

Temporal Distortion Temporal Distortion Perspective) Perspective) t t Blue view Blue view y

Non Linear Distortion and Dynamic Range Issues Non Linear Distortion and Dynamic Range Issues in

Resolving Profile Distortion Resolving Profile Distortion for Electron-based IPMs for

NATIVE MODE PROGRAMMING Fiona Reid Overview What is native mode? What codes are suitable

Interrupts "Interrupt - An event that alters (or interrupts) the normal fetch-decode-execute

Outline Please check the web page about your recitation group s meeting place.

Interrupt-Driven Input/Output Textbook: Chapter 11 (Interrupts) ARM Cortex-M4 User Guide

Write Fast, Read in the Past: Causal Consistency for Client-side Apps with SwiftCloud App App

Lecture 6 Polar Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan

The Minisatellite Transformation Problem: The Run-Length-Encoding Approach and Further

6.975 Week 5 Universal Compression Via Grammar Based Codes Presenter: Emin Martinian Grammar

Ring-LWE Implementation Tobias Oder 1 , Tobias Schneider 2 , Thomas Pppelmann 3 , Tim Gneysu

Mode-dependent Rate-distortion Optimized Transforms Using Graph - PowerPoint PPT Presentation

Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Methods Keng-Shih Lu and Antonio Ortega October 22, 2019 Mode-dependent Rate-distortion Optimized Transforms Using Graph Signal Processing Metho K. Lu and A.

Labor Classification Yrs Rate 1 Rate 2 Rate 3 Rate 4 Rate 5 Rate 6 Rate 7 Rate 8 Rate 9

Chapter 10 Rate Distortion Theory Peng-Hua Wang Graduate Inst. of Comm. Engineering National

Motion Estimation by Affine Transforms Motion Estimation by Affine Transforms Motion Estimation

Why Dependent Origination? So what is dependent origination? Dependent on ignorance, there

Control of switch-mode converters Current Programmed Mode control CPM Mor M. Peretz, Switch-Mode

JUST THE MATHS SLIDES NUMBER 16.7 LAPLACE TRANSFORMS 7 (An appendix) by A.J.Hobson One

Drawing on the Web CSS CSCI-UA 380 Transforms, Transitions, and Animation Drawing on the Web

A Rate-Distortion One-Class Model and its Applications to Clustering Fernando Pereira 1 Koby

Magnetic Distortion Distortion of of Magnetic HPD Images HPD Images smund Skjveland

Using Distortion in 3D Using Distortion in 3D Sheelagh Carpendale Sheelagh Carpendale

Digital Pre-Distortion Derek Kozel What is Digital Pre-Distortion (DPD) A technique for

CMB Spectral Distortion Computations using the Greens function package of CosmoTherm Primordial

Temporal Distortion Temporal Distortion Perspective) Perspective) t t Blue view Blue view y

Non Linear Distortion and Dynamic Range Issues Non Linear Distortion and Dynamic Range Issues in

Resolving Profile Distortion Resolving Profile Distortion for Electron-based IPMs for

NATIVE MODE PROGRAMMING Fiona Reid Overview What is native mode? What codes are suitable

Interrupts &quot;Interrupt - An event that alters (or interrupts) the normal fetch-decode-execute

Outline Please check the web page about your recitation group s meeting place.

Interrupt-Driven Input/Output Textbook: Chapter 11 (Interrupts) ARM Cortex-M4 User Guide

Write Fast, Read in the Past: Causal Consistency for Client-side Apps with SwiftCloud App App

Lecture 6 Polar Coding I-Hsiang Wang Department of Electrical Engineering National Taiwan

The Minisatellite Transformation Problem: The Run-Length-Encoding Approach and Further

6.975 Week 5 Universal Compression Via Grammar Based Codes Presenter: Emin Martinian Grammar

Ring-LWE Implementation Tobias Oder 1 , Tobias Schneider 2 , Thomas Pppelmann 3 , Tim Gneysu

Interrupts "Interrupt - An event that alters (or interrupts) the normal fetch-decode-execute