Algorithms in the Real World Data Compression 4 Page 1 Compression - PowerPoint PPT Presentation

Algorithms in the Real World Data Compression 4 Page 1

Compression Outline Introduction : Lossy vs. Lossless, Benchmarks, … Information Theory : Entropy, etc. Probability Coding : Huffman + Arithmetic Coding Applications of Probability Coding : PPM + others Lempel-Ziv Algorithms : LZ77, gzip, compress, … Other Lossless Algorithms: Burrows-Wheeler Lossy algorithms for images: JPEG, MPEG, ... – Scalar and vector quantization – JPEG and MPEG Compressing graphs and meshes: BBK 296.3 Page 2

Scalar Quantization Quantize regions of values into a single value: output output input input uniform non uniform Quantization is lossy Can be used, e.g., to reduce # of bits for a pixel Page 3

Vector Quantization: Example representative codevectors Input vectors are (Height, Weight) pairs. Map each input vector to a representative “codevector”. Codevectors are stored in a codebook. Page 4

Vector Quantization input output generate output generate input vector vector codevector index find of code- index closest vector compress decompress code- index index vector codebook codebook decode encode Page 5

Vector Quantization What do we use as vectors? • Color (Red, Green, Blue) – Can be used, for example to reduce 24bits/pixel to 12bits/pixel – Used in some terminals to reduce data rate from the CPU (colormaps) • k consecutive samples in audio • Block of k x k pixels in an image How do we decide on a codebook • Typically done with clustering Page 6

Linear Transform Coding Want to encode values over a region of time or space – typically used for images or audio – represented as a vector [x 1 ,x 2 ,…] Select a set of linear basis functions ϕ i that span the space – sin, cos, spherical harmonics, wavelets, … – defined at discrete points Page 7

Linear Transform Coding Coefficients: x ( j ) x a ∑ ∑ Θ = φ = i j i j ij j j th i resulting coefficien t Θ = i th x j input valu e = j th a ij transform coefficien t i j ( ) = = φ ij Ax Θ = In matrix notation: − 1 x A = Θ Where A is an n x n matrix, and each row defines a basis function Page 8

Example: Cosine Transform 1 j ( ) 0 j ( ) φ 2 j ( ) φ φ … x ( j ) ∑ Θ = φ i j i Θ i x j j j small values ? Page 9

Other Transforms Polynomial: 1 x x 2 Wavelet (Haar): Page 10

How to Pick a Transform Goals: – Decorrelate (remove repeated patterns in data) – Low coefficients for many terms – Some terms affect perception more than others Why is using a Cosine or Fourier transform across a whole image bad? -- If there is no periodicity in the image, large coefficients for high-frequency terms How might we fix this? -- use basis functions that are not as smoothly periodic Page 11

Usefulness of Transform Typically transforms A are orthonormal : A -1 = A T Properties of orthonormal transforms : • ∑ x 2 = ∑ Θ 2 (energy conservation) Would like to compact energy into as few coefficients as possible 1 2 ∑ σ i (the transform coding gain ) n G = TC 1 arithmetic mean/geometric mean ( ) n 2 ∏ σ i σ i = ( Θ i - Θ av ) The higher the gain, the better the compression Page 12

Case Study: JPEG A nice example since it uses many techniques: – Transform coding (Discrete Cosine Transform) – Scalar quantization – Difference coding – Run-length coding – Huffman or arithmetic coding JPEG (Joint Photographic Experts Group) was designed in 1991 for lossy and lossless compression of color or grayscale images . The lossless version is rarely used. Can be adjusted for compression ratio (typically 10:1) 296.3 Page 13

JPEG in a Nutshell Typically down- sample I and Q original image inter- planes by a phase Brightness factor of 2 in 0.59 Green + three planes each dimension – 0.30 Red + quadra- of 8-bit lossy. Factor of 0.11 Blue ture pixel values 4 compression for I and Q, 2 overall. (two-dimensional DCT) break each plane into 8x8 blocks of pixels Page 14

JPEG: Quantization Table 16 11 10 16 24 40 51 61 12 12 14 19 26 58 60 55 14 13 16 24 40 57 69 56 14 17 22 29 51 87 80 62 18 22 37 56 68 109 103 77 24 35 55 64 81 104 113 92 49 64 78 87 103 121 120 101 72 92 95 98 112 100 103 99 Divide each coefficient by factor shown. Also divided through uniformly by a quality factor which is under control. Page 15

JPEG: Block scanning order Scan block of coefficients in zig-zag order • Use difference coding upper left (DC) coefficient • between consecutive blocks Uses run-length coding for sequences of zeros for • rest of block Page 16

JPEG: example .125 bits/pixel (factor of 192) Page 17

Case Study: MPEG Pretty much JPEG with interframe coding Three types of frames – I = intra frame (approx. JPEG) anchors – P = predictive coded frames – based on previous I or P frame in output order – B = bidirectionally predictive coded frames - based on next and/or previous I or P frames in output order ordered Example: chronologically in input Type: I B B P B B P B B P B B I Output 1 3 4 2 6 7 5 9 10 8 12 13 11 Order: I frames are used for random access. In the sequence, each B frame appears after any frame on which it depends. Page 18

MPEG matching between frames Page 19

MPEG: Compression Ratio 356 x 240 image Type Size Compression I 18KB 7/1 P 6KB 20/1 B 2.5KB 50/1 Average 4.8KB 27/1 30 frames/sec x 4.8KB/frame x 8 bits/byte = 1.2 Mbits/sec + .25 Mbits/sec (stereo audio) HDTV has 15x more pixels = 18 Mbits/sec Page 20

MPEG in the “real world” • DVDs – Adds “encryption” and error correcting codes • Direct broadcast satellite • HDTV standard – Adds error correcting code on top • Storage Tech “Media Vault” – Stores 25,000 movies Encoding is much more expensive than decoding. Still requires special purpose hardware for high resolution and good compression. Page 21

Compression Summary How do we figure out the probabilities – Transformations that skew them • Guess value and code difference • Move to front for temporal locality • Run-length • Linear transforms (Cosine, Wavelet) • Renumber (graph compression) – Conditional probabilities • Neighboring context In practice one almost always uses a combination of techniques Page 22

Algorithms in the Real World Data Compression 4 Page 1 Compression - PowerPoint PPT Presentation

Algorithms in the Real World Data Compression 4 Page 1 Compression Outline Introduction : Lossy vs. Lossless, Benchmarks, Information Theory : Entropy, etc. Probability Coding : Huffman + Arithmetic Coding Applications of Probability Coding :

Real graduates, Real graduates, real transitions, real transitions, real stories: real

Algorithms X. Zhang Fordham Univ. 1 Real World applications of algorithms Algorithms for

Graph Algorithms Chapter 22 1 CPTR 430 Algorithms Graph Algorithms Why Study Graph Algorithms?

Greedy Algorithms Chapter 16 1 CPTR 430 Algorithms Greedy Algorithms Greedy Algorithms For

Algorithms Chapter 3 Chapter Summary Algorithms n Example Algorithms n Algorithmic Paradigms

Real Students Real World Real Work Real Life: A Plan for a Holistic Approach to Supporting

WORLD WORLD WORLD WORLD WORLD WORLD En End of of the Br Bron onze Age ME MEETI NG 8

Real Estate Centers Real Estate Centers Hampton Roads Real Estate Hampton Roads Real Estate

Real- Real -Time Systems Time Systems Real- -Time Systems Time Systems Real

Real Real- -Time Systems Time Systems Designing a real- Designing a real -time system time

Real- Real -time systems time systems Real- Real -time programming time programming

Frontal Crash Protection Frontal Crash Protection Real World Experience with Real World

General remarks Algorithms Algorithms Oliver Oliver Week 8 Kullmann Kullmann Greedy Greedy

QUELLAVECO A WORLD CLASS COPPER PROJECT TOM MCCULLEY, CEO QUELLAVECO 27 NOVEMBER 2018 Real

Algorithms using real numbers (continued) Noter ch.4 Algorithms using real numbers

Real-World Protocols Prof. Tom Austin San Jos State University Real-World Protocols Next,

Paym yment ent Met Methods ods Presen ented ed by: : Michele ele Madison ison, , Partner

Chemring Group PLC Results for the six months to 30 April 2018 Interim FY18 scorecard Revenue

Borough Board December 17, 2015 Agenda Introduction Mario J. Paredes ACP Chief Executive

the 2014-2016 Biennium Budget A Summary of New Spending Initiatives and Savings Strategies for the

Staking Claims in Nanotech Lamar Bush, Biology, Santa Barbara City College Kasim Alimahomed,

Transit and Rail Innovative Funding and Finance Update Sasha Page, March 20, 2018 VALUE CAPTURE

Investor Presentation Q1-FY2021 Cautionary statement Forward-Looking Information This

MeasuringImageQuality HeathNielson MeasuringQuality What