Algorithms in the Real World Error Correcting Codes I Overview - PowerPoint PPT Presentation

Algorithms in the Real World Error Correcting Codes I – Overview – Hamming Codes – Linear Codes Page 1

General Model message m Errors introduced by the noisy channel: coder • changed fields in the codeword (e.g., a flipped bit) codeword c • missing fields in the noisy codeword (e.g., a lost channel byte). Called erasures codeword’ c’ How the decoder deals with errors. decoder • error detection vs. • error correction message m or error Page 2

Applications • Storage : CDs, DVDs, “hard drives”, • Wireless : Cell phones, wireless links • Satellite and Space : TV, Mars rover, … • Digital Television : DVD, MPEG2 layover • High Speed Modems : ADSL, DSL, .. Reed-Solomon codes are by far the most used in practice, including pretty much all the examples mentioned above. Algorithms for decoding are quite sophisticated. Page 3

Block Codes Each message and codeword message m is of fixed size k ∑ = codeword alphabet coder k =|m| n = |c| q = | ∑ | C ⊆ Σ n (codewords) codeword c (note: generally C ≠ Σ n ) noisy channel Δ (x,y) = number of positions s.t. x i ≠ y i codeword’ c’ d = min{ Δ (x,y) : x,y ∈ C, x ≠ y} decoder s = max{ Δ (c,c’)} that the code can correct message m or error Code described as: (n,k,d) q Page 4

Hierarchy of Codes C forms a linear subspace of ∑ n linear of dimension k C is linear and cyclic c 0 c 1 c 2 …c n-1 is a codeword implies c 1 c 2 …c n-1 c 0 is a codeword BCH Bose-Chaudhuri-Hochquenghem Hamming Reed-Solomon These are all block codes (operate on fixed-length strengths). Page 5

Binary Codes Today we will mostly be considering ∑ = {0,1} and will sometimes use (n,k,d) as shorthand for (n,k,d) 2 In binary Δ ( x,y) is often called the Hamming distance Page 6

Hypercube Interpretation Consider codewords as vertices on a hypercube. 110 111 codeword 010 011 d = 2 = min distance 100 n = 3 = dimensionality 101 2 n = 8 = number of nodes 001 000 The distance between nodes on the hypercube is the Hamming distance Δ . The minimum distance is d. 001 is equidistance from 000, 011 and 101. For s-bit error detection d ≥ s + 1 (1-bit in this example) For s-bit error correction d ≥ 2s + 1 (none here) Page 7

Error Detection with Parity Bit A (k+1,k,2) 2 code Encoding : m 1 m 2 …m k ⇒ m 1 m 2 …m k p k+1 where p k+1 = m 1 ⊕ m 2 ⊕ … ⊕ m k d = 2 since the parity is always even (it takes two bit changes to go from one codeword to another). Detects one-bit error since this gives odd parity Cannot be used to correct 1-bit error since any odd-parity word is equal distance Δ to k+1 valid codewords. Page 8

Error Correcting One-Bit Messages How many bits do we need to correct a one-bit error on a one-bit message? 110 111 10 010 11 011 100 101 01 001 00 000 3 bits 2 bits 0 -> 000, 1-> 111 0 -> 00, 1-> 11 (n=3,k=1,d=3) (n=2,k=1,d=2) In general need d ≥ 3 to correct one error. Why? Page 9

Example of (6,3,3) 2 systematic code Definition : A Systematic code message codeword is one in which the message appears in the codeword 000 000 000 001 001 011 Same in any bit of message 010 010 101 implies two bits of difference 011 011 110 in extra codeword columns. 100 100 110 101 101 101 110 110 011 111 111 000 Page 10

Error Correcting Multibit Messages We will first discuss Hamming Codes Detect and correct 1-bit errors. Codes are of form: (2 r -1, 2 r -1 – r, 3) for any r > 1 e.g. (3,1,3), (7,4,3), (15,11,3), (31, 26, 3), … which correspond to 2, 3, 4, 5, … “parity bits” (i.e. n-k) The high-level idea is to “localize” the error. Any specific ideas? Page 11

Hamming Codes: Encoding Localize error position to top or bottom half 1xxx or 0xxx m 15 m 14 m 13 m 12 m 11 m 10 m 9 p 8 m 7 m 6 m 5 m 3 p 0 p 8 = m 15 ⊕ m 14 ⊕ m 13 ⊕ m 12 ⊕ m 11 ⊕ m 10 ⊕ m 9 Localize error position to x1xx or x0xx m 15 m 14 m 13 m 12 m 11 m 10 m 9 p 8 m 7 m 6 m 5 p 4 m 3 p 0 p 4 = m 15 ⊕ m 14 ⊕ m 13 ⊕ m 12 ⊕ m 7 ⊕ m 6 ⊕ m 5 Localize error position to xx1x or xx0x m 15 m 14 m 13 m 12 m 11 m 10 m 9 p 8 m 7 m 6 m 5 p 4 m 3 p 2 p 0 p 2 = m 15 ⊕ m 14 ⊕ m 11 ⊕ m 10 ⊕ m 7 ⊕ m 6 ⊕ m 3 Localize error position to xxx1 or xxx0 m 15 m 14 m 13 m 12 m 11 m 10 m 9 p 8 m 7 m 6 m 5 p 4 m 3 p 2 p 1 p 0 p 1 = m 15 ⊕ m 13 ⊕ m 11 ⊕ m 9 ⊕ m 7 ⊕ m 5 ⊕ m 3 Page 12

Hamming Codes: Decoding m 15 m 14 m 13 m 12 m 11 m 10 m 9 p 8 m 7 m 6 m 5 p 4 m 3 p 2 p 1 p 0 We don’t need p 0 , so we have a (15,11,?) code. After transmission, we generate b 8 = p 8 ⊕ m 15 ⊕ m 14 ⊕ m 13 ⊕ m 12 ⊕ m 11 ⊕ m 10 ⊕ m 9 b 4 = p 4 ⊕ m 15 ⊕ m 14 ⊕ m 13 ⊕ m 12 ⊕ m 7 ⊕ m 6 ⊕ m 5 b 2 = p 2 ⊕ m 15 ⊕ m 14 ⊕ m 11 ⊕ m 10 ⊕ m 7 ⊕ m 6 ⊕ m 3 b 1 = p 1 ⊕ m 15 ⊕ m 13 ⊕ m 11 ⊕ m 9 ⊕ m 7 ⊕ m 5 ⊕ m 3 With no errors, these will all be zero With one error b 8 b 4 b 2 b 1 gives us the error location. e.g. 0100 would tell us that p 4 is wrong, and 1100 would tell us that m 12 is wrong Page 13

Hamming Codes Can be generalized to any power of 2 – n = 2 r – 1 (15 in the example) – (n-k) = r (4 in the example) – d = 3 (discuss later) – Can correct one error, but can’t tell difference between one and two! – Gives (2 r -1, 2 r -1-r, 3) code Extended Hamming code – Add back the parity bit at the end – Gives (2 r , 2 r -1-r, 4) code – Can correct one error and detect 2 – (not so obvious) Page 14

Lower bound on parity bits How many nodes in hypercube do we need so that d = 3? Each of the 2 k codewords and its n neighbors must be distinct from any other codeword or neighbor n k need 2 ( n 1 ) 2 ≥ + n k log ( n 1 ) ≥ + + 2 n k log ( n 1 ) ≥ + + ⎡ ⎤ 2 In previous hamming code 15 ≥ 11 + ⎡ log 2 (15+1) ⎤ = 15 Hamming Codes are called perfect codes since they match the lower bound exactly Page 15

Lower bound on parity bits What about fixing 2 errors (i.e. d=5)? Each of the 2 k codewords its neighbors and its n n neighbors’ neighbors are distinct, giving: ⎛ ⎞ ⎛ ⎞ 1 ⎜ ⎟ ⎜ ⎟ + + ⎜ ⎟ ⎜ ⎟ 1 2 ⎝ ⎠ ⎝ ⎠ n k 2 ( 1 n n ( n 1 ) / 2 ) 2 ≥ + + − n k log ( 1 n n ( n 1 ) / 2 ) ≥ + + + − 2 k 2 log n 1 ≥ + − 2 Generally to correct s errors: n n n ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ n k log 2 ( 1 ! ) ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ≥ + + + + + ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ 1 2 s ⎝ ⎠ ⎝ ⎠ ⎝ ⎠ Page 16

Lower Bounds: a side note The lower bounds assume arbitrary placement of bit errors. In practice errors are likely to be correlated, e.g. evenly spaced or clustered: x x x x x x x x x x x x Can we do better if we assume regular errors ? We will come back to this later when we talk about Reed-Solomon codes. In fact, this is the main reason why Reed-Solomon codes are used much more than Hamming-codes. Page 17

Linear Codes If ∑ is a field, then ∑ n is a vector space Definition : C is a linear code if it is a linear subspace of ∑ n of dimension k. This means that there is a set of k independent vectors v i ∈ ∑ n (1 ≤ i ≤ k) that span the subspace. i.e., every codeword can be written as: c = a 1 v 1 + … + a k v k a i ∈ ∑ The sum of two codewords is a codeword. Page 18

Linear Codes Vectors for the (7,4,3) 2 Hamming code: m 7 m 6 m 5 p 4 m 3 p 2 p 1 v 1 = 1 0 0 1 0 1 1 v 2 = 0 1 0 1 0 1 0 v 3 = 0 0 1 1 0 0 1 v 4 = 0 0 0 0 1 1 1 If the message is x 1 x 2 x 3 x 4 , the codeword is x 1 v 1 + x 2 v 2 + x 3 v 3 + x 4 v 4 where x i v i is scalar-vector multiplication using boolean AND for multiplication, + is boolean XOR How can we see that d = 3? (Not obvious.) Page 19

Generator and Parity Check Matrices Generator Matrix : A k x n matrix G such that: C = {xG | x ∈ ∑ k } Made from stacking the spanning vectors Parity Check Matrix : An (n – k) x n matrix H such that: C = {y ∈ ∑ n | Hy T = 0} Codewords are the nullspace of H Parity check matrices always exist for linear codes Page 20

Advantages of Linear Codes • Encoding is efficient (vector-matrix multiply) • Error detection is efficient (vector-matrix multiply) • Syndrome (Hy T ) has error information • Gives q n-k sized table for decoding (syndrome is (n-k) x 1) Useful if n-k is small Page 21

Example and “Standard Form” For the Hamming (7,4,3) 2 code: 1 0 0 1 0 1 1 ⎡ ⎤ ⎢ ⎥ 0 1 0 1 0 1 0 ⎢ ⎥ G = 0 0 1 1 0 0 1 ⎢ ⎥ ⎢ ⎥ 0 0 0 0 1 1 1 ⎢ ⎥ ⎣ ⎦ By swapping columns 4 and 5 it is in the form I k ,A. A code with a matrix in this form is systematic , and G is in “ standard form ” 1 0 0 0 1 1 1 ⎡ ⎤ ⎢ ⎥ 0 1 0 0 1 1 0 ⎢ ⎥ G = 0 0 1 0 1 0 1 ⎢ ⎥ ⎢ ⎥ 0 0 0 1 0 1 1 ⎢ ⎥ ⎣ ⎦ Page 22

Algorithms in the Real World Error Correcting Codes I Overview - PowerPoint PPT Presentation

Algorithms in the Real World Error Correcting Codes I Overview Hamming Codes Linear Codes Page 1 General Model message m Errors introduced by the noisy channel: coder changed fields in the codeword (e.g., a flipped bit)

Real graduates, Real graduates, real transitions, real transitions, real stories: real

Algorithms X. Zhang Fordham Univ. 1 Real World applications of algorithms Algorithms for

Graph Algorithms Chapter 22 1 CPTR 430 Algorithms Graph Algorithms Why Study Graph Algorithms?

Greedy Algorithms Chapter 16 1 CPTR 430 Algorithms Greedy Algorithms Greedy Algorithms For

Algorithms Chapter 3 Chapter Summary Algorithms n Example Algorithms n Algorithmic Paradigms

Real Students Real World Real Work Real Life: A Plan for a Holistic Approach to Supporting

WORLD WORLD WORLD WORLD WORLD WORLD En End of of the Br Bron onze Age ME MEETI NG 8

Real Estate Centers Real Estate Centers Hampton Roads Real Estate Hampton Roads Real Estate

Real- Real -Time Systems Time Systems Real- -Time Systems Time Systems Real

Real Real- -Time Systems Time Systems Designing a real- Designing a real -time system time

Real- Real -time systems time systems Real- Real -time programming time programming

Frontal Crash Protection Frontal Crash Protection Real World Experience with Real World

General remarks Algorithms Algorithms Oliver Oliver Week 8 Kullmann Kullmann Greedy Greedy

QUELLAVECO A WORLD CLASS COPPER PROJECT TOM MCCULLEY, CEO QUELLAVECO 27 NOVEMBER 2018 Real

Algorithms using real numbers (continued) Noter ch.4 Algorithms using real numbers

Real-World Protocols Prof. Tom Austin San Jos State University Real-World Protocols Next,

Failure Analysis Behind the Scene 1. Some thoughts on Failure Analysis Preparing the

EPAs: A Framework to Guide Learning and Assessment Across the Continuum of Medical Education

SBI Holdings, Inc. 2012 Information Meeting May 28 Tokyo June 1 Nagoya June 4 Osaka June 5

CB European Quality Fund Quarterly update 30 September 2015 Included in Morningstar Morningstar

Community Safety Night 1 On behalf of the Upper Avenue Community Association and the South

Integration of GRACE and PYTHIA Shigeru ODAKA High Energy Accelerator Research Organization (KEK)

Four of a Kind For the Love of Math and Computer Science Allen OHara For the Love of Math and

Constructing n -Engel Lie rings Serena Cical` o University of Trento Advisor: Willem A. de