slide-1
SLIDE 1

Empirical Properties of Good Channel Codes

Qinghua (Devon) Ding June 8, 2020

The Chinese University of Hong Kong

slide-2
SLIDE 2

1

slide-3
SLIDE 3

Introduction

slide-4
SLIDE 4

Shannon’s Channel Coding Theorem


slide-5
SLIDE 5

Shannon’s Channel Coding Theorem

$C = \max_{P_X} I(X;Y)$

  • $R > C \Rightarrow \forall \mathcal{C},\ P(m \neq \hat{m}) \to 1$.
  • $R < C \Rightarrow \exists \mathcal{C},\ P(m \neq \hat{m}) \to 0$.

slide-6
SLIDE 6

Shannon’s Channel Coding Theorem

$C = \max_{P_X} I(X;Y)$

  • $R > C \Rightarrow \forall \mathcal{C},\ P(m \neq \hat{m}) \to 1$.
  • $R < C \Rightarrow \exists \mathcal{C},\ P(m \neq \hat{m}) \to 0$.

$P^*_X = \arg\max_{P_X} I(X;Y)$ need not be unique.

slide-7
SLIDE 7

Result I: Characterization of the CAID $P^*_X$

Consider a channel $W = (p_1, \ldots, p_{|\mathcal{X}|})$ whose columns $p_i = W(\cdot \mid i)$ are the transition distributions, and denote $r = (H(p_1), \ldots, H(p_m))$ with $m = |\mathcal{X}|$.

slide-8
SLIDE 8

Result I: Characterization of the CAID $P^*_X$

Consider a channel $W = (p_1, \ldots, p_{|\mathcal{X}|})$ and denote $r = (H(p_1), \ldots, H(p_m))$. Suppose we are given some $P^*_X \in \arg\max_{P_X} I(X;Y)$ (e.g. computed by the Blahut-Arimoto algorithm).
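As a concrete illustration, a minimal Blahut-Arimoto sketch in Python (NumPy); the BSC(0.1) test channel and the iteration count are illustrative assumptions, not from the talk:

```python
import numpy as np

def blahut_arimoto(W, n_iter=500):
    """Blahut-Arimoto iteration for a DMC.

    W[y, x] = W(y | x): columns are the transition distributions p_x.
    Returns (capacity in bits, a capacity-achieving input distribution).
    """
    m = W.shape[1]
    p = np.full(m, 1.0 / m)                  # start from the uniform input
    for _ in range(n_iter):
        q = W @ p                            # output distribution P_Y
        ratio = np.divide(W, q[:, None], out=np.ones_like(W), where=W > 0)
        d = np.sum(W * np.log2(ratio), axis=0)  # D(W(.|x) || P_Y) per input x
        p = p * np.exp2(d)                   # multiplicative update
        p /= p.sum()
    q = W @ p
    ratio = np.divide(W, q[:, None], out=np.ones_like(W), where=W > 0)
    d = np.sum(W * np.log2(ratio), axis=0)
    return float(p @ d), p

# Example: BSC(0.1), whose capacity is 1 - H(0.1), with uniform CAID
W_bsc = np.array([[0.9, 0.1],
                  [0.1, 0.9]])
C, p_star = blahut_arimoto(W_bsc)
```

The update multiplies each input mass by $2^{D(W(\cdot|x)\,\|\,P_Y)}$ and renormalizes, which converges to a capacity-achieving input distribution.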

slide-9
SLIDE 9

Result I: Characterization of the CAID $P^*_X$

Consider a channel $W = (p_1, \ldots, p_{|\mathcal{X}|})$ and denote $r = (H(p_1), \ldots, H(p_m))$. The whole set of capacity-achieving input distributions is

$\mathcal{P}^*_X = \left\{ P^*_X + \ker \begin{pmatrix} W \\ r \end{pmatrix} \right\} \cap \mathbb{R}^m_+$.¹

¹A non-linear equation system for $P^*_X$ is developed in [Mur53] and its follow-up works. There is no analytical solution in general.
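The characterization can be checked numerically; a sketch using the 4-ary noisy typewriter channel (whose CAID is non-unique) as an assumed example, with the kernel computed from the SVD:

```python
import numpy as np

# 4-ary noisy typewriter: column x of W puts mass 1/2 on outputs x and x+1 mod 4
W = 0.5 * (np.eye(4) + np.roll(np.eye(4), 1, axis=0))
r = -np.sum(W * np.log2(np.where(W > 0, W, 1.0)), axis=0)  # r_x = H(p_x)

# ker([W; r]): right singular vectors with zero singular value
A = np.vstack([W, r])
_, S, Vt = np.linalg.svd(A)
kernel = Vt[S < 1e-10]          # here: one direction, proportional to (1,-1,1,-1)

def mi_bits(p, W):
    """I(X;Y) in bits for input p; columns of W are W(.|x)."""
    q = W @ p
    ratio = np.divide(W, q[:, None], out=np.ones_like(W), where=W > 0)
    return float(np.sum(p * np.sum(W * np.log2(ratio), axis=0)))

# Moving from the uniform CAID along the kernel (staying nonnegative)
# keeps I(X;Y) at capacity; here it reaches the CAID (1/2, 0, 1/2, 0).
p_unif = np.full(4, 0.25)
delta = kernel[0]
p_pert = p_unif + (0.25 / delta[0]) * delta
```

Both `p_unif` and `p_pert` attain the capacity of 1 bit, so the affine slice through $P^*_X$ along $\ker\binom{W}{r}$, intersected with the nonnegative orthant, is exactly the CAID set.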

slide-10
SLIDE 10

Optimizing Input Distribution

Suppose $W$ has a unique CAID $P^*_X$ with $\mathrm{supp}(P^*_X) = \mathcal{X}' \subset \mathcal{X}$.

slide-11
SLIDE 11

Optimizing Input Distribution

Suppose $W$ has a unique CAID $P^*_X$ with $\mathrm{supp}(P^*_X) = \mathcal{X}' \subset \mathcal{X}$.

  • Claim. The distributions $\{p_i, i \in \mathcal{X}'\}$ must be linearly independent.

slide-12
SLIDE 12

Optimizing Input Distribution

Suppose $W$ has a unique CAID $P^*_X$ with $\mathrm{supp}(P^*_X) = \mathcal{X}' \subset \mathcal{X}$.

  • Claim. The distributions $\{p_i, i \in \mathcal{X}'\}$ must be linearly independent.

Proof by contrapositive (details later).

slide-13
SLIDE 13

Property of Random Code Ensemble

$C = \max_{P_X} I(X;Y)$

  • $R > C \Rightarrow \forall \mathcal{C},\ P(m \neq \hat{m}) \to 1$.
  • $R < C \Rightarrow \exists \mathcal{C},\ P(m \neq \hat{m}) \to 0$.

slide-14
SLIDE 14

Property of Random Code Ensemble

$C = \max_{P_X} I(X;Y)$

  • $R > C \Rightarrow \forall \mathcal{C},\ P(m \neq \hat{m}) \to 1$.
  • $R < C \Rightarrow \exists \mathcal{C},\ P(m \neq \hat{m}) \to 0$.

The random code ensemble is capacity-achieving.

slide-15
SLIDE 15

Property of Random Code Ensemble

Random codes: each codeword symbol drawn i.i.d. from $P^*_X$, where $P^*_X \in \arg\max_{P_X} I(X;Y)$.

slide-16
SLIDE 16

Property of Random Code Ensemble

Random codes: each codeword symbol drawn i.i.d. from $P^*_X$, where $P^*_X \in \arg\max_{P_X} I(X;Y)$.

Empirical independence: $\#\{i : (x_i, x'_i) = (a, b)\} \approx n P^*_X(a) P^*_X(b)$.²

²This condition is different from [HV93, PV13, SV97].

slide-17
SLIDE 17

Property of Random Code Ensemble

Random codes: each codeword symbol drawn i.i.d. from $P^*_X$, where $P^*_X \in \arg\max_{P_X} I(X;Y)$.

Empirical independence: $\#\{i : (x_i, x'_i) = (a, b)\} \approx n P^*_X(a) P^*_X(b)$.²

  • Observation. Random codes have "most" (a $1 - o(1)$ fraction of) codeword pairs empirically independent, w.h.p.

²This condition is different from [HV93, PV13, SV97].
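A quick Monte Carlo sanity check of the observation, as a sketch; the uniform CAID on {0, 1}, the block length, and the thresholds are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n, M = 2000, 40                          # block length, number of codewords
code = rng.integers(0, 2, size=(M, n))   # symbols i.i.d. from the uniform P*_X

def joint_type_gap(x, xp):
    """L1 distance between the joint type of (x, x') and P*_X x P*_X."""
    counts = np.zeros((2, 2))
    np.add.at(counts, (x, xp), 1)        # empirical joint counts over positions
    return float(np.abs(counts / len(x) - 0.25).sum())

gaps = np.array([joint_type_gap(code[i], code[j])
                 for i in range(M) for j in range(i + 1, M)])
frac_indep = float(np.mean(gaps < 0.1))  # fraction of delta-independent pairs
```

With these parameters essentially every pair is $\delta$-independent, matching the $1 - o(1)$ claim.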

slide-18
SLIDE 18

Property of Random Code Ensemble

Random codes: each codeword symbol drawn i.i.d. from $P^*_X$, where $P^*_X \in \arg\max_{P_X} I(X;Y)$.

$\#\{i : (x_i, x'_i, x''_i) = (a, b, c)\} \approx n P^*_X(a) P^*_X(b) P^*_X(c)$.

Generalization to $k = O(1)$: random codes have "most" codeword $k$-tuples empirically independent, w.h.p.

slide-19
SLIDE 19

Result II: Necessary Conditions for Good Codes

A capacity-achieving code (or good code): $R = C - \epsilon$ and $P(m \neq \hat{m}) \to 0$.

slide-20
SLIDE 20

Result II: Necessary Conditions for Good Codes

A capacity-achieving code (or good code): $R = C - \epsilon$ and $P(m \neq \hat{m}) \to 0$.

Theorem (Property of Good Codes). For any DMC with unique $P^*_X$, any good code for it must have a $1 - o(1)$ fraction of its codeword $k$-tuples empirically independent.

slide-21
SLIDE 21

Result II: Necessary Conditions for Good Codes

A capacity-achieving code (or good code): $R = C - \epsilon$ and $P(m \neq \hat{m}) \to 0$.

Theorem (Property of Good Codes). For any DMC with unique $P^*_X$, any good code for it must have a $1 - o(1)$ fraction of its codeword $k$-tuples empirically independent.

Similar results hold for the AWGN channel.

slide-22
SLIDE 22

Advertisement: parallel work [ZVJ20] on quadratically constrained two-way adversarial channels, ISIT 2020. https://sites.google.com/view/yihan/


slide-24
SLIDE 24

Result III: Non-universality of Good Codes

Two channels $W$ and $W'$ are similar iff $P^*_X = P'^*_X$ and $C = C'$.

slide-25
SLIDE 25

Result III: Non-universality of Good Codes

Two channels $W$ and $W'$ are similar iff $P^*_X = P'^*_X$ and $C = C'$.

  • Observation. The random code ensemble with symbols i.i.d. $\sim P^*_X$ achieves capacity with vanishing error probability for all similar channels.

slide-26
SLIDE 26

Result III: Non-universality of Good Codes

Two channels $W$ and $W'$ are similar iff $P^*_X = P'^*_X$ and $C = C'$.

  • Observation. The random code ensemble with symbols i.i.d. $\sim P^*_X$ achieves capacity with vanishing error probability for all similar channels.

Theorem (Non-universality of Good Codes). There exist similar DMCs $W, W'$ and a code $\mathcal{C}$ that is capacity-achieving for $W$, such that no expurgation of $\mathcal{C}$ with the same rate is good for $W'$.

slide-27
SLIDE 27

Result III: Non-universality of Good Codes



slide-30
SLIDE 30

Proof Ideas

slide-31
SLIDE 31

Proof of the Characterization Result

For a DMC $W$, given some $P^*_X \in \arg\max_{P_X} I(X;Y)$, we have

$\mathcal{P}^*_X = \left\{ P^*_X + \ker \begin{pmatrix} W \\ r \end{pmatrix} \right\} \cap \mathbb{R}^m_+$.

slide-32
SLIDE 32

Proof of the Characterization Result

For a DMC $W$, given some $P^*_X \in \arg\max_{P_X} I(X;Y)$, we have

$\mathcal{P}^*_X = \left\{ P^*_X + \ker \begin{pmatrix} W \\ r \end{pmatrix} \right\} \cap \mathbb{R}^m_+$.

Proof by standard linear algebra.

slide-33
SLIDE 33

Proof of the Characterization Result

Generalizing to the $k$-use channel, we have

$\mathcal{P}^*_{X^k} = \left\{ P^{*\otimes k}_X + \ker \begin{pmatrix} W^{\otimes k} \\ r^{(k)} \end{pmatrix} \right\} \cap \mathbb{R}^{m^k}_+$.

slide-34
SLIDE 34

Proof of the Characterization Result

Generalizing to the $k$-use channel, we have

$\mathcal{P}^*_{X^k} = \left\{ P^{*\otimes k}_X + \ker \begin{pmatrix} W^{\otimes k} \\ r^{(k)} \end{pmatrix} \right\} \cap \mathbb{R}^{m^k}_+$.

Consider the following noisy typewriter channel:

$W = \begin{pmatrix} 1/2 & 1/2 & & \\ & 1/2 & 1/2 & \\ & & 1/2 & 1/2 \\ 1/2 & & & 1/2 \end{pmatrix}$

Although $C$ and $P^*_Y$ tensorize, $\mathcal{P}^*_{X^k}$ does not tensorize.
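The non-tensorization can be seen concretely for the 4-ary noisy typewriter: tensor products of one-use CAIDs, and any mixture of them, are CAIDs of the two-use channel, yet the mixture is not a product distribution. A numerical sketch (the specific distributions are standard examples, assumed here):

```python
import numpy as np

# 4-ary noisy typewriter: column x puts mass 1/2 on outputs x and x+1 mod 4
W = 0.5 * (np.eye(4) + np.roll(np.eye(4), 1, axis=0))
W2 = np.kron(W, W)                      # the 2-use channel W (x) W

def mi_bits(p, W):
    """I(X;Y) in bits for input p; columns of W are W(.|x)."""
    q = W @ p
    ratio = np.divide(W, q[:, None], out=np.ones_like(W), where=W > 0)
    return float(np.sum(p * np.sum(W * np.log2(ratio), axis=0)))

p_unif = np.full(4, 0.25)               # one CAID of W
p_alt = np.array([0.5, 0.0, 0.5, 0.0])  # another CAID of W
# A mixture of product CAIDs: still capacity-achieving for the 2-use channel ...
p_mix = 0.5 * np.kron(p_unif, p_unif) + 0.5 * np.kron(p_alt, p_alt)
# ... but not a product of its marginals:
marg = p_mix.reshape(4, 4).sum(axis=1)
is_product = np.allclose(p_mix, np.kron(marg, marg))
```

Since $I$ is concave in $P_X$ and both product inputs attain the 2-use capacity of 2 bits, the mixture attains it too, giving a non-product element of $\mathcal{P}^*_{X^2}$.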

slide-35
SLIDE 35

Proof of the Linear Independence Lemma

Suppose $W$ has a unique CAID $P^*_X$ with $\mathrm{supp}(P^*_X) = \mathcal{X}' \subset \mathcal{X}$.

  • Claim. The distributions $\{p_i, i \in \mathcal{X}'\}$ must be linearly independent.

Proof by contrapositive.

slide-36
SLIDE 36

Proof of the Linear Independence Lemma

Suppose linear independence does not hold.

slide-37
SLIDE 37

Proof of the Linear Independence Lemma

Suppose linear independence does not hold. We can find a feasible direction $\delta \neq 0$ such that $\langle r, \delta \rangle = 0$ and $W\delta = 0$.

slide-38
SLIDE 38

Proof of the Linear Independence Lemma

Suppose linear independence does not hold. We can find a feasible direction $\delta \neq 0$ such that $\langle r, \delta \rangle = 0$ and $W\delta = 0$. Then $I(X;Y) = C$ under the input $P^*_X + \epsilon\delta$ for small enough $\epsilon$ (both $P_Y = W P_X$ and $H(Y|X) = \langle r, P_X \rangle$ are unchanged along $\delta$), a contradiction!

slide-39
SLIDE 39

Proof of the Empirical Independence Property

Consider a discrete memoryless channel $W$ with unique $P^*_X$.

slide-40
SLIDE 40

Proof of the Empirical Independence Property

Consider a discrete memoryless channel $W$ with unique $P^*_X$.

  • Claim. Any good code $\mathcal{C}$ for $W$ has the property that $\forall \delta > 0$,

$P_{x_1,\ldots,x_k \sim \mathcal{C}}\left( \| \tau_{x_1,\ldots,x_k} - P^{*\otimes k}_X \|_1 > \delta \right) \to 0$.

slide-41
SLIDE 41

Proof of the Empirical Independence Property

Consider a discrete memoryless channel $W$ with unique $P^*_X$.

  • Claim. Any good code $\mathcal{C}$ for $W$ has the property that $\forall \delta > 0$,

$P_{x_1,\ldots,x_k \sim \mathcal{C}}\left( \| \tau_{x_1,\ldots,x_k} - P^{*\otimes k}_X \|_1 > \delta \right) \to 0$.

Proof by considering the $k$-use channel.

slide-42
SLIDE 42

Proof of the Empirical Independence Property


slide-46
SLIDE 46

Proof of the Empirical Independence Property (Cont'd)

Consider the AWGN$(P, N)$ channel, denoted $W$.

slide-47
SLIDE 47

Proof of the Empirical Independence Property (Cont'd)

Consider the AWGN$(P, N)$ channel, denoted $W$.

  • Claim. Any good code $\mathcal{C}$ for $W$ has the property that $\forall \delta > 0$,

$P_{X_1,\ldots,X_k \sim \mathcal{C}}\left( \exists i \neq j,\ |\langle X_i, X_j \rangle| > \delta n \right) \to 0$.

slide-48
SLIDE 48

Proof of the Empirical Independence Property (Cont'd)

Consider the AWGN$(P, N)$ channel, denoted $W$.

  • Claim. Any good code $\mathcal{C}$ for $W$ has the property that $\forall \delta > 0$,

$P_{X_1,\ldots,X_k \sim \mathcal{C}}\left( \exists i \neq j,\ |\langle X_i, X_j \rangle| > \delta n \right) \to 0$.

Proof by contrapositive.
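A sketch of what this concentration looks like for a random Gaussian codebook; the parameters and seed are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
n, k, P = 10_000, 8, 1.0                      # block length, tuple size, power
X = rng.normal(0.0, np.sqrt(P), size=(k, n))  # entries i.i.d. N(0, P)

# Normalized Gram matrix: diagonal ~ P, off-diagonal ~ O(1/sqrt(n))
G = (X @ X.T) / n
off_diag = np.abs(G[~np.eye(k, dtype=bool)])
max_corr = float(off_diag.max())              # max |<X_i, X_j>| / n over pairs
```

The off-diagonal entries have standard deviation about $P/\sqrt{n} = 0.01$ here, so every pairwise normalized inner product sits far below any fixed $\delta$.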

slide-49
SLIDE 49

Properties of Good Channel Codes

Suppose the codewords are empirically correlated.

slide-50
SLIDE 50

Properties of Good Channel Codes

Suppose the codewords are empirically correlated. Then we can extract a large subcode that is good for another channel with power $P' < P$, a contradiction!

slide-51
SLIDE 51

Non-universality of Good Channel Codes

slide-52
SLIDE 52

Non-universality of Good Channel Codes

Consider the similar channels BEC$(H(p))$ and BSC$(p)$: both have the uniform CAID and capacity $1 - H(p)$.²

²The figures are from [CT12].

slide-53
SLIDE 53

Non-universality of Good Channel Codes

Consider the similar channels BEC$(H(p))$ and BSC$(p)$.²

  • Claim. There exists a good code for BEC$(H(p))$ such that no expurgated subcode of the same rate can be good for BSC$(p)$.

²The figures are from [CT12].

slide-54
SLIDE 54

Non-universality of Good Channel Codes

Consider the similar channels BEC$(H(p))$ and BSC$(p)$.²

  • Claim. There exists a good code for BEC$(H(p))$ such that no expurgated subcode of the same rate can be good for BSC$(p)$.

Proof by construction.

²The figures are from [CT12].
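The similarity of BEC$(H(p))$ and BSC$(p)$ is easy to verify numerically; a sketch with $p = 0.11$ as an arbitrary illustrative choice:

```python
import numpy as np

def h2(p):
    """Binary entropy in bits."""
    return float(-p * np.log2(p) - (1 - p) * np.log2(1 - p))

def mi_bits(p_in, W):
    """I(X;Y) in bits for input p_in; columns of W are W(.|x)."""
    q = W @ p_in
    ratio = np.divide(W, q[:, None], out=np.ones_like(W), where=W > 0)
    return float(np.sum(p_in * np.sum(W * np.log2(ratio), axis=0)))

p = 0.11
eps = h2(p)                        # erasure probability H(p)

W_bsc = np.array([[1 - p, p],
                  [p, 1 - p]])
W_bec = np.array([[1 - eps, 0.0],  # rows: output 0, output 1, erasure
                  [0.0, 1 - eps],
                  [eps, eps]])

u = np.array([0.5, 0.5])           # the common (uniform) CAID
C_bsc, C_bec = mi_bits(u, W_bsc), mi_bits(u, W_bec)
# Both equal 1 - H(p): same CAID, same capacity, hence similar channels.
```

Despite this similarity, the claim above shows a good code for one need not contain a good subcode for the other.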

slide-55
SLIDE 55

Non-universality of Good Codes

We can show that a certain superposition code good for BEC$(H(p))$ can be bad for BSC$(p)$, even for more powerful decoders.

slide-56
SLIDE 56

Conclusions

slide-57
SLIDE 57

Conclusions

  1. Capacity-achieving input distributions.

slide-58
SLIDE 58

Conclusions

  1. Capacity-achieving input distributions.
  2. Necessary conditions on good codes.

slide-59
SLIDE 59

Conclusions

  1. Capacity-achieving input distributions.
  2. Necessary conditions on good codes.
  3. Non-universality of good codes.

slide-60
SLIDE 60

References i

References

[CT12] Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. John Wiley & Sons, 2012.

[HV93] Te Sun Han and Sergio Verdú. Approximation theory of output statistics. IEEE Transactions on Information Theory, 39(3):752–772, 1993.

[Mur53] Saburo Muroga. On the capacity of a discrete channel. I. Mathematical expression of capacity of a channel which is disturbed by noise in its every one symbol and expressible in one state diagram. Journal of the Physical Society of Japan, 8(4):484–494, 1953.

slide-61
SLIDE 61

References ii

[PV13] Yury Polyanskiy and Sergio Verdú. Empirical distribution of good channel codes with nonvanishing error probability. IEEE Transactions on Information Theory, 60(1):5–21, 2013.

[SV97] Shlomo Shamai and Sergio Verdú. The empirical distribution of good codes. IEEE Transactions on Information Theory, 43(3):836–846, 1997.

[ZVJ20] Yihan Zhang, Shashank Vatedka, and Sidharth Jaggi. Quadratically constrained two-way adversarial channels. arXiv preprint arXiv:2001.02575, 2020.