Wireless Communication Systems @CS.NCTU Lecture 5: Compression - PowerPoint PPT Presentation

Wireless Communication Systems @CS.NCTU Lecture 5: Compression Instructor: Kate Ching-Ju Lin ( 林靖茹 ) Chap. 7-8 of “Fundamentals of Multimedia” Some reference from http://media.ee.ntu.edu.tw/courses/dvt/15F/ 1

Outline • Concepts of data compression • Lossless Compression • Lossy Compression • Quantization 2

Why compression? • Audio, image, and video require huge storage and network bandwidth if not compressed Application uncompressed compressed Audio conference 64kbps 16-64kbps Video conference 30.41Mbps 64-768kbps Digital video on CD-ROM (30fps) 60.83Mbps 1.5-4Mbps HDTV (59.94fps) 1.33Gbps 20Mbps Remove redundancy! 3

Compression Concepts Source Channel 01011000… Encoder Encoder original Channel Source 01011001… Encoder Encoder reconstructed 4

Compression Concepts • Source Coding • Also known as data compression • The objective is to reduce the size of messages • Achieved by removing redundancy • Entropy encoding: minimize the size of messages according to a probability model • Channel Coding • Also known as error correction • Repetition codes, parity codes, Reed-Solomon codes, etc. • Ensure the decoder can still recover the original data even with errors and (or) losses • Should consider the probability of errors happening during transmission (e.g., random loss or burst loss) 5

Considerations for Compression • Lossless vs. Lossy • Quality vs. bit-rate • Variable bit rate (VBR) vs. constant bit rate (CBR) • Robustness • Combat noisy channels • Complexity • Encoding and decoding efficiency 6

Compression Performance size before • Compression ratio = size after • Signal quality • Signal-to-noise ratio Mean square error SNR = 10 log 10 ( σ 2 N n = 1 ) s X σ 2 ( x i − y i ) 2 σ 2 N n • Peak-Signal-to-noise ratio i =1 σ 2 peak PSNR = 10 log 10 ( ) σ 2 n • Mean Opinion Score (MOS) • very annoying, annoying, slightly annoying, perceptible but not annoying, imperceptible • Goal: • Higher signal quality with higher compression ratio 7

Compression Technologies • Statistical redundancy • Lossless compression • Also known as entropy coding • Build on the probabilistic characteristics of signals • Perceptual redundancy • Lossy compression • Lead to irreversible distortion • Complex and depends on context or applications 8

Information Theory • Consider an information source with alphabet S = {s 1 , s 2 , …, s n }, the self-information contained in s i is defined as 1 i ( s i ) = log 2 p i where p i is teh probability that symbol s i in S will occur • Key idea of variable length coding • Frequent symbols à represented by less bits • Infrequent symbols à represented by more bits Low probability p i à Large amount of information High probability p i à Small amount of information 9

Information Theory - Entropy • Entropy η of an information source • Expected self-information of the whole source n n 1 X X η = H ( S ) = p i ∗ i ( s i ) = p i log 2 p i i =1 i =1 n X = − p i log 2 p i i =1 • Measure the disorder of a system à more entropy, more disorder • Greater entropy when the distribution is flat • Smaller entropy when the distribution is more peaked • Shannon’s theory: best lossless compression generates an average number of bits equal to entropy Claude Elwood Shannon, “A mathematical theory of communication,” Bell 10 System Technical Journal, vol. 27, pp. 379-423 and 623-656, Jul. and Oct. 1948

Properties of Compression • Unique decodable • Encode: y = f(x) • Decode: x = f -1 (y) à there exists only a single solution • A code is not unique decodable x: symbol if f(x i ) = f(x j ) = y for some x i ≠ x j y: codeword • Instantaneous code • Also called prefix-free code or prefix code • Any codeword cannot be the prefix of any other codeword, i.e., y i not the prefix of y j for all y i ≠ y j • Why good? • When a message is sent, the recipient can decode the message unambiguously from the beginning 11

Properties – Examples • Non-unique decodable s 1 = 0 s 2 = 01 0011 could be s 4 s 3 or s 1 s 1 s 3 s 3 = 11 s 4 = 00 • Non-Instantaneous code s 4 s 1 = 0 Coded sequence: 0111111 …. 11111 s 2 = 01 s 3 = 011 à Decode until receiving all bits s 4 = 11 12

Lossless Compression • Commonly known as entropy coding • Algorithms • Huffman coding • Adaptive Huffman coding • Arithmetic coding • Run-length coding • Golomb and Rice coding • DPCM 14

Huffman Coding • Proposed by David A. Huffman in 1952 • Adopted in many applications, such as fax machines, JPEG and MPEG • Bottom-up manner: build a binary coding tree • left branches are coded 0 • right branches are coded 1 1 0 • High-level idea s 1 p max • Each leaf node is a symbol 1 0 • Each path is a codeword s 2 1 • Less frequent symbol à longer codeword path 1 s k p min

Huffman Coding • Algorithm 1. Sort all symbols according to their probabilities 2. Repeat until only one symbol left a) Pick the two symbols with the smallest probabilities b) Add the two symbols as childe nodes c) Remove the two symbols from the list d) Assign the sum of the children's probabilities to the parent e) Insert the parent node to the list 16

Huffman Coding – Example Symbol Count Probability Code A 15 0.375 0 B 7 0.175 100 C 7 0.175 101 P 4 (45) D 6 0.150 110 1 E 5 0.125 111 0 P 3 (25) 1 1. {A, B, C, D, E} 0 2. {A, B, C, P 1 } P 2 (14) P 1 (11) 3. {A, P 2 , P 1 } 1 1 0 0 4. {A, P 3 } E(5) A(20) B(7) C(7) D(6) 5. {P 4 } 17

Huffman Coding – Pro and Cons • Pros • Unique decodable • Prefix code • Optimality: average codeword length of a message approaches its entropy à shown η ≤ E[L] ≤ η +1 • Cons • Every code has an integer bit length • Why inefficient? • If a symbol occurs very frequently log 2 (1/p) close to 0 à but still need one bit 18

Arithmetic Coding • Usually outperform Huffman coding • Encode the whole message as one unit • High-level idea • Each message is represented by an interval [a,b), 0 ≤ a,b ≤ 1 • Longer message à shorter interval à more bits to represent a smaller real number • Shorter message à longer interval à less bits to represent a greater real number • Example Symbol low high range 0 1.0 1.0 C 0.3 0.5 0.2 A 0.30 0.34 0.04 E 0.322 0.334 0.012 E 0.3286 0.3322 0.0036 19 $ 0.33184 0.33220 0.00036

Arithmetic Coding – Encoding • Maintain a probability table • Frequent symbol à larger range • Need a terminator symbol $ • Algorithm : • Initialize low = 0, high = 1, range = 1 • Repeat for each symbol • low = low + range * range min (symbol) • high = low + range * range max (symbol) • Range = high - low Encode a message CAEE$ Sym probability range A 0.2 [0, 0.2) Symbol low high range B 0.1 [0.2, 0.3) 0 1.0 1.0 C 0.2 [0.3, 0.5) C 0.3 0.5 0.2 D 0.05 [0.5, 0.55) A 0.30 0.34 0.04 E 0.3 [0.55, 0.85) E 0.322 0.334 0.012 F 0.05 [0.85, 0.9) E 0.3286 0.3322 0.0036 $ 0.1 [0.9, 1) $ 0.33184 0.33220 0.00036

Arithmetic Coding – Encoding • Illustration 1 0.5 0.34 0.334 0.3322 0.3322 $ $ $ $ $ $ 0.9 F F F F F F 0.85 E E E E E E 0.55 D D D D D D 0.5 C C C C C C 0.3 B B B B B B 0.2 A A A A A A 0.3 0.3 0 0.322 0.3286 0.33184 21

Arithmetic Coding – Decoding • Algorithm • while not $ 1. Find a symbol s so that range min (s) ≤ value ≤ range max (s) 2. Output s 3. low = range min (s) 4. high = range max (s) 5. range =high – low 6. value = (value – low) / range 22

Arithmetic Coding – Properties • When the intervals shrink, we need very high- precision number for encoding • Might not be feasible • Need a special terminator symbol $ • Need to protect $ in noisy channels 23

Run-Length Coding • Input sequence: 0,0,-3,5,0,-2,0,0,0,0,2,-4,0,0,0,1 • Run-length sequence: (2,-3)(0,5)(1,-2)(4,2)(0,-4)(3,1) Number of zeros next non-zero value • Many variations • Reduce the number of samples to code • Implementation is simple 24

Compression Technologies • Statistical redundancy • Lossless compression • Also known as entropy coding • Build on the probabilistic characteristics of signals • Perceptual redundancy • Lossy compression • Lead to irreversible distortion • Complex and depends on context or applications 26

Rate-Distortion Function • Numerical measure for signal quality • SNR • PSNR • How to evaluate the tradeoff between compression ratio and signal quality? • Rate-distortion function (D = 0 means lossless0 Source:http://jov.arvojournals.org/article.aspx ?articleid=2213283

Transform Coding • Remove spatial redundancy • Spatial image data are transformed in to a different representation: transformed domain • Make the image data easier to be compressed • Transformation (T) itself does not compress data • Compression is from quantization! original domain transformed domain T ⟹ Y X Greater entropy Smaller entropy Need mores bits Need less bits 28

Wireless Communication Systems @CS.NCTU Lecture 5: Compression - PowerPoint PPT Presentation

Wireless Communication Systems @CS.NCTU Lecture 5: Compression Instructor: Kate Ching-Ju Lin ( ) Chap. 7-8 of Fundamentals of Multimedia Some reference from http://media.ee.ntu.edu.tw/courses/dvt/15F/ 1 Outline Concepts of

WIRELESS SYSTEMS CO.,LTD Wireless System Company Wireless & Computer & Telecom Solution

Security in Wireless Ecosystems Security in Wireless Ecosystems Wade Trappe Wireless Ecosystems

SK Telecom 1 U U U U U U U- U - - communication - - - - - communication

Ulwimo ULtimate WIreless MObility Developments within indoor wireless communication

An Introduction to Wireless Technologies Part 1 F. Ricci Content Wireless communication

Wireless LAN Setup & Optimizing Wireless Client in Linux Hacking and Cracking Wireless

Wireless and Mobile Networks Wireless and 802.11 LANs wireless links: shared, fading,

1 Wireless standards Path Loss Phenomena Wireless standards Path Loss Phenomena Open

EE107 Spring 2019 Lecture 7 Wireless Communication Embedded Networked Systems Sachin Katti

Wireless Communication Systems @CS.NCTU Lecture 1: Introduction to Wireless Networks Instructor:

Modern Wireless Networks Wireless Fundamentals ICEN 574 Spring 2019 Prof. Dola Saha 1

CS 4453 Computer Networks Chapter 5 Wireless and Mobile Networks 2015 Winter A wireless network

Wireless monitoring device Goal Wireless monitoring device monitors existing wireless network

Hydra An Open Source Wireless Testbed Hyrum K. Wright, Robert D. Grant Wireless Communication

Performance Measurement of Performance Measurement of Wireless LAN Using Open Source Wireless

An Introduction to Wireless Technologies Part 1 F. Ricci 2010/2011 Content Wireless

Priority Queue implementation Creating Heaps

Text Operations Text Operations Berlin Chen 2003 References: 1. Modern Information Retrieval,

Random Access Archives for Efficient Compression of Many Small Files or Avoiding the void

4.4. Arithmetic coding Advantages: Reaches the entropy (within computing precision)

Data Compression (Chapters 4-6) presented by Tapani Raiko Feb 26, 2004 Contents (Data

V.2 Index Compression Heaps law (empirically observed and postulated): Size of the vocabulary

Information Theory Lecture 3 Lossless source coding algorithms: Huffman: CT5.68

1 Data structures for decoder: Construction of canonical Huffman: (sketch) The array

Wireless Communication Systems @CS.NCTU Lecture 5: Compression - PowerPoint PPT Presentation

Wireless Communication Systems @CS.NCTU Lecture 5: Compression Instructor: Kate Ching-Ju Lin ( ) Chap. 7-8 of Fundamentals of Multimedia Some reference from http://media.ee.ntu.edu.tw/courses/dvt/15F/ 1 Outline Concepts of

WIRELESS SYSTEMS CO.,LTD Wireless System Company Wireless &amp; Computer &amp; Telecom Solution

Security in Wireless Ecosystems Security in Wireless Ecosystems Wade Trappe Wireless Ecosystems

SK Telecom 1 U U U U U U U- U - - communication - - - - - communication

Ulwimo ULtimate WIreless MObility Developments within indoor wireless communication

An Introduction to Wireless Technologies Part 1 F. Ricci Content Wireless communication

Wireless LAN Setup &amp; Optimizing Wireless Client in Linux Hacking and Cracking Wireless

Wireless and Mobile Networks Wireless and 802.11 LANs wireless links: shared, fading,

1 Wireless standards Path Loss Phenomena Wireless standards Path Loss Phenomena Open

EE107 Spring 2019 Lecture 7 Wireless Communication Embedded Networked Systems Sachin Katti

Wireless Communication Systems @CS.NCTU Lecture 1: Introduction to Wireless Networks Instructor:

Modern Wireless Networks Wireless Fundamentals ICEN 574 Spring 2019 Prof. Dola Saha 1

CS 4453 Computer Networks Chapter 5 Wireless and Mobile Networks 2015 Winter A wireless network

Wireless monitoring device Goal Wireless monitoring device monitors existing wireless network

Hydra An Open Source Wireless Testbed Hyrum K. Wright, Robert D. Grant Wireless Communication

Performance Measurement of Performance Measurement of Wireless LAN Using Open Source Wireless

An Introduction to Wireless Technologies Part 1 F. Ricci 2010/2011 Content Wireless

Priority Queue implementation Creating Heaps

Text Operations Text Operations Berlin Chen 2003 References: 1. Modern Information Retrieval,

Random Access Archives for Efficient Compression of Many Small Files or Avoiding the void

4.4. Arithmetic coding Advantages: Reaches the entropy (within computing precision)

Data Compression (Chapters 4-6) presented by Tapani Raiko Feb 26, 2004 Contents (Data

V.2 Index Compression Heaps law (empirically observed and postulated): Size of the vocabulary

Information Theory Lecture 3 Lossless source coding algorithms: Huffman: CT5.68

1 Data structures for decoder: Construction of canonical Huffman: (sketch) The array

WIRELESS SYSTEMS CO.,LTD Wireless System Company Wireless & Computer & Telecom Solution

Wireless LAN Setup & Optimizing Wireless Client in Linux Hacking and Cracking Wireless