Information theory and coding 1.00 Image, video and audio - PowerPoint PPT Presentation

Entropy coding review – Huffman Information theory and coding – 1.00 Image, video and audio compression 0 1 0.40 0.60 Markus Kuhn 0 1 0 1 v w 0.25 0.20 0.20 u 0 1 0.35 x Computer Laboratory 0.10 0.15 0 1 Huffman’s algorithm constructs an optimal code-word tree for a set of http://www.cl.cam.ac.uk/Teaching/2003/InfoTheory/mgk/ symbols with known probability distribution. It iteratively picks the two y z elements of the set with the smallest probability and combines them into 0.05 0.05 a tree by adding a common root. The resulting tree goes back into the set, labeled with the sum of the probabilities of the elements it combines. Michaelmas 2003 – Part II The algorithm terminates when less than two elements are left. 3 Structure of modern audiovisual Other variable-length code tables communication systems Huffman’s algorithm generates an optimal code table. Disadvantage: this code table (or the distribution from which is was generated) needs to be stored or transmitted. Adaptive variants of Huffman’s algorithm modify the coding tree in the encoder and decoder Sensor+ Perceptual Entropy Channel Signal synchronously, based on the distribution of symbols encountered so far. This enables one-pass ✲ ✲ ✲ ✲ coding sampling coding coding processing and avoids the need to transmit or store a code table, at the cost of starting with a less efficient encoding. Unary code ❄ Encode the natural number n as the bit string 1 n 0 . This code is optimal Noise Channel ✲ when the probability distribution is p ( n ) = 2 − ( n +1) . Example: 3 , 2 , 0 → 1110 , 110 , 0 ❄ Golomb code Perceptual Entropy Channel Human Display ✛ ✛ ✛ ✛ senses decoding decoding decoding Select an encoding parameter b . Let n be the natural number to be encoded, q = ⌊ n/b ⌋ and r = n − qb . Encode n as the unary code word for q , followed by the ( log 2 b )-bit binary code word for r . Where b is not a power of 2, encode the lower values of r in ⌊ log 2 b ⌋ bits, and the rest in ⌈ log 2 b ⌉ bits, such that the leading digits distinguish the two cases. 2 4

Arithmetic coding Examples: b = 1 : 0, 10, 110, 1110, 11110, 111110, . . . (this is just the unary code) b = 2 : 00, 01, 100, 101, 1100, 1101, 11100, 11101, 111100, 111101, . . . Several advantages: b = 3 : 00, 010, 011, 100, 1010, 1011, 1100, 11010, 11011, 11100, 111010, . . . b = 4 : 000, 001, 010, 011, 1000, 1001, 1010, 1011, 11000, 11001, 11010, . . . → Length of output bitstring can approximate the theoretical in- Golomb codes are optimal for geometric distributions of the form p ( n ) = u n ( u − 1) (e.g., run lengths of Bernoulli experiments) if b is chosen suitably for a given u . formation content of the input to within 1 bit. S.W. Golomb: Run-length encodings. IEEE Transactions on Information Theory, IT-12(3):399– 401, July 1966. → Performs well with probabilities > 0.5, where the information Elias gamma code per symbol is less than one bit. Start the code word for the positive integer n with a unary-encoded → Interval arithmetic makes it easy to change symbol probabilities length indicator m = ⌊ log 2 n ⌋ . Then append from the binary notation (no need to modify code-word tree) ⇒ convenient for adaptive of n the rightmost m digits (to cut off the leading 1). coding 1 = 0 4 = 11000 7 = 11011 10 = 1110010 2 = 100 5 = 11001 8 = 1110000 11 = 1110011 Can be implemented efficiently with fixed-length arithmetic by rounding 3 = 101 6 = 11010 9 = 1110001 . . . probabilities and shifting out leading digits as soon as leading zeros appear in interval size. Usually combined with adaptive probability P. Elias: Universal codeword sets and representations of the integers. IEEE Transactions on Information Theory, IT-21(2)194–203, March 1975. estimation. More such variable-length integer codes are described by Fenwick in IT-48(8)2412–2417, August Huffman coding remains popular because of its simplicity and lack of patent licence issues. 2002. (Available on http://ieeexplore.ieee.org/ ) 5 7 Entropy coding review – arithmetic coding Coding of sources with memory and correlated symbols Partition [0,1] according 0.0 0.35 0.55 0.75 0.9 0.95 1.0 Run-length coding: to symbol probabilities: u v w x y z Encode text wuvw . . . as numeric value (0.58. . . ) in nested intervals: ↓ 1.0 0.75 0.62 0.5885 0.5850 z z z z z y y y y y 5 7 12 3 3 x x x x x Predictive coding: encoder decoder w w w w w f(t) g(t) g(t) f(t) − + v v v v v predictor predictor P(f(t−1), f(t−2), ...) P(f(t−1), f(t−2), ...) u u u u u Delta coding (DPCM): P ( x ) = x n � Linear predictive coding: P ( x 1 , . . . , x n ) = a i x i 0.55 0.0 0.55 0.5745 0.5822 i =1 6 8

Fax compression Group 4 MMR fax code → 2-dimensional code, references previous line International Telecommunication Union specifications: → Vertical mode encodes transitions that have shifted up to ± 3 pixels horizontally. → Group 1 and 2: obsolete analog 1970s fax systems, required − 1 → 010 0 → 1 1 → 011 several minutes for uncompressed transmission of each page. − 2 → 000010 2 → 000011 − 3 → 0000010 3 → 0000011 → Group 3: fax protocol used on the analogue telephone network → Pass mode skip edges in previous line that have no equivalent (9.6–14.4 kbit/s), with “modified Huffman” (MH) compression in current line (0001) of run-length codes. → Horizontal mode uses 1-dimensional run-lengths independent Modern G3 analog fax machines also support the better G4 and JBIG encodings. of previous line (001 plus two MH-encoded runs) → Group 4: enhanced fax protocol for ISDN (64 kbit/s), intro- duced “modified modified relative element address designate (READ)” (MMR) coding. ITU-T Recommendations, such as the ITU-T T.4 and T.6 documents that standardize the fax coding algorithms, are available on http://www.itu.int/ITU-T/publications/recs.html . 9 11 Group 3 MH fax code JBIG (Joint Bilevel Experts Group) → lossless algorithm for 1–6 bits per pixel pixels white code black code • Run-length encoding plus modified Huffman 0 00110101 0000110111 → main applications: fax, scanned text documents code 1 000111 010 2 0111 11 • Fixed code table (from eight sample pages) → context-sensitive arithmetic coding 3 1000 10 • separate codes for runs of white and black 4 1011 011 pixels → adaptive context template for better prediction efficiency with 5 1100 0011 • termination code in the range 0–63 switches 6 1110 0010 rastered photographs (e.g. in newspapers) between black and white code 7 1111 00011 8 10011 000101 • makeup code can extend length of a run by → support for resolution reduction and progressive coding 9 10100 000100 a multiple of 64 10 00111 0000100 • termination run length 0 needed where run → “deterministic prediction” avoids redundancy of progr. coding 11 01000 0000101 length is a multiple of 64 12 001000 0000111 • single white column added on left side be- → “typical prediction” codes common cases very efficiently 13 000011 00000100 fore transmission 14 110100 00000111 • makeup codes above 1728 equal for black 15 110101 000011000 → typical compression factor 20, 1.1–1.5 × better than Group 4 and white 16 101010 0000010111 . . . . . . . . . fax, about 2 × better than “ gzip -9 ” and about ≈ 3–4 × better • 12-bit end-of-line marker: 000000000001 63 00110100 000001100111 (can be prefixed by up to seven zero-bits than GIF (all on 300 dpi documents). 64 11011 0000001111 to reach next byte boundary) 128 10010 000011001000 Information technology — Coded representation of picture and audio information — progressive Example: line with 2 w, 4 b, 200 w, 3 b, EOL → 192 010111 000011001001 bi-level image compression. International Standard ISO 11544:1993. 1000 | 011 | 010111 | 10011 | 10 | 000000000001 . . . . . . . . . Example implementation: http://www.cl.cam.ac.uk/~mgk25/jbigkit/ 1728 010011011 0000001100101 10 12

Information theory and coding 1.00 Image, video and audio - PowerPoint PPT Presentation

Entropy coding review Huffman Information theory and coding 1.00 Image, video and audio compression 0 1 0.40 0.60 Markus Kuhn 0 1 0 1 v w 0.25 0.20 0.20 u 0 1 0.35 x Computer Laboratory 0.10 0.15 0 1 Huffmans

Formal Modeling in Cognitive Science 1 Coding Theorems Lecture 28: Kraft Inequality; Source Coding

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

ADVANCED MULTIMEDIA ADVANCED MULTIMEDIA CODING CODING Fernando Pereira Instituto Superior

Dynamical systems Expanding maps on the circle. Coding Jana Rodriguez Hertz ICTP 2018 coding

Coding and Applications in Sensor Networks Why coding? Information compression

Overview Coding and Information Theory What is information theory? Entropy Coding Chris

Coding and Applications in Sensor Networks Coding and Applications in Sensor Networks Why coding?

An Introduction to (Network) Coding Theory Anna-Lena Horlemann-Trautmann University of St.

Risk-Based Coding and Reimbursement What is Risk-Based Coding? Risk-Based Coding Overview A

Applications of Random Coding and Algebraic Coding Theories to Universal Lossless Source Coding

Entropy Coding Definition of Entropy Three Entropy coding techniques: (taken from the

Image and Video Coding: Motion Estimation and Coding 4 5 6 B C D 1 D 0 3 7 A current 2

Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jrgen

Image and Video Coding: Hybrid Video Coding s n 1 [ x , y ] s n [ x , y ] m k = ( m x , m

Image and Video Coding: Improved Inter-Picture Prediction Review of Hybrid Video Coding Last

CCN & Network Coding Cedric Westphal Huawei and UCSC ICN & Network Coding - RFC7933

Detecting changes in Detecting changes in the rate the rate of a of a Poisson process

More on Reconstructing from Random Traces: Insertions and Deletions Sampath Kannan and Andrew

Week 14 - Monday What did we talk about last time? Heap implementation Heap sort

Sorting... more 2 Announcements Homework posted, due next Sunday 3 Quicksort Runtime: Worst

INTEGRATING RANKED SET SAMPLING IN MEMORY CONTROL CHARTS: A STUDY ON PEPSI COLA Muazu Ramat

Create Sampling Distributions from Single Die V0G 11/16/2016 V0G Create Sampling Distribution

Chapter 23: The Accuracy of Averages What can we say about the average of the draws? The

Outline Experimentation (6.1) Data Presentation (6.2) ata ese tat o (6 ) Sample

Information theory and coding 1.00 Image, video and audio - PowerPoint PPT Presentation

Entropy coding review Huffman Information theory and coding 1.00 Image, video and audio compression 0 1 0.40 0.60 Markus Kuhn 0 1 0 1 v w 0.25 0.20 0.20 u 0 1 0.35 x Computer Laboratory 0.10 0.15 0 1 Huffmans

Formal Modeling in Cognitive Science 1 Coding Theorems Lecture 28: Kraft Inequality; Source Coding

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

ADVANCED MULTIMEDIA ADVANCED MULTIMEDIA CODING CODING Fernando Pereira Instituto Superior

Dynamical systems Expanding maps on the circle. Coding Jana Rodriguez Hertz ICTP 2018 coding

Coding and Applications in Sensor Networks Why coding? Information compression

Overview Coding and Information Theory What is information theory? Entropy Coding Chris

Coding and Applications in Sensor Networks Coding and Applications in Sensor Networks Why coding?

An Introduction to (Network) Coding Theory Anna-Lena Horlemann-Trautmann University of St.

Risk-Based Coding and Reimbursement What is Risk-Based Coding? Risk-Based Coding Overview A

Applications of Random Coding and Algebraic Coding Theories to Universal Lossless Source Coding

Entropy Coding Definition of Entropy Three Entropy coding techniques: (taken from the

Image and Video Coding: Motion Estimation and Coding 4 5 6 B C D 1 D 0 3 7 A current 2

Speech &amp; Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jrgen

Image and Video Coding: Hybrid Video Coding s n 1 [ x , y ] s n [ x , y ] m k = ( m x , m

Image and Video Coding: Improved Inter-Picture Prediction Review of Hybrid Video Coding Last

CCN &amp; Network Coding Cedric Westphal Huawei and UCSC ICN &amp; Network Coding - RFC7933

Detecting changes in Detecting changes in the rate the rate of a of a Poisson process

More on Reconstructing from Random Traces: Insertions and Deletions Sampath Kannan and Andrew

Week 14 - Monday What did we talk about last time? Heap implementation Heap sort

Sorting... more 2 Announcements Homework posted, due next Sunday 3 Quicksort Runtime: Worst

INTEGRATING RANKED SET SAMPLING IN MEMORY CONTROL CHARTS: A STUDY ON PEPSI COLA Muazu Ramat

Create Sampling Distributions from Single Die V0G 11/16/2016 V0G Create Sampling Distribution

Chapter 23: The Accuracy of Averages What can we say about the average of the draws? The

Outline Experimentation (6.1) Data Presentation (6.2) ata ese tat o (6 ) Sample

Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jrgen

CCN & Network Coding Cedric Westphal Huawei and UCSC ICN & Network Coding - RFC7933