Multicore Implementation of LDPC Decoders based on ADMM Algorithm


SLIDE 1

Multicore Implementation of LDPC Decoders based on ADMM Algorithm

ICASSP 2016 - Implementation of Signal Processing Systems
March 23, 2016

Imen DEBBABI1, Nadia KHOUJA1, Fethi TLILI1, Bertrand LE GAL2 and Christophe JEGO2

1 - SUP’COM, GRESCOM Lab, University of Carthage, Tunisia
2 - Bordeaux-INP, IMS lab., CNRS UMR 5218, University of Bordeaux, France

SLIDE 2

The LP decoding for LDPC codes


SLIDE 3

Introduction to LDPC codes

๏ LDPC codes are well-known ECCs working on data blocks,

  • K information bits;
  • N transmitted values,
  • (N-K) redundant values,

๏ The LDPC code structure is defined by an H matrix,

  • It specifies which VNs/CNs are involved in each computation,
  • It is visually represented as a Tanner graph.

๏ The common approach for decoding is based on the MP algorithm;

  • The MP algorithm is iterative,
  • Messages are exchanged between VNs and CNs along the Tanner graph edges.

H: 4 × 8 parity-check matrix with rows C0–C3 and columns V0–V7, each row containing three 1s.

Figure: Tanner graph representation (variable nodes V0–V7, check nodes C0–C3).
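As an illustration of how such an H matrix can be stored in a software decoder, the sketch below keeps, for each check node, the indices of its connected variable nodes (a compressed-row view of H). This is a minimal, hypothetical example: the names and the 1-positions of the 4 × 8 toy code are illustrative only and are not the data structures of the presented decoder.

```cpp
#include <cstdint>
#include <vector>

// One entry per check node: the indices of the variable nodes it checks.
// Equivalent to storing only the positions of the 1s in each row of H.
using CheckNodeList = std::vector<std::vector<uint32_t>>;

// Hypothetical 4 x 8 toy code with three 1s per row (check degree dc = 3).
// The exact positions below are made up for the sake of the example.
CheckNodeList buildToyH() {
    return {
        {0, 2, 4},   // C0
        {1, 3, 5},   // C1
        {2, 5, 6},   // C2
        {0, 6, 7},   // C3
    };
}
```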

SLIDE 4

Related works on LDPC decoding

๏ During the last decade, many works were proposed around LDPC codes:

  • Efficient MP decoding algorithms,
      • The SPA algorithm is efficient but complex to implement,
      • MS, OMS, NMS, 2NMS, lambda-min, ANMS, etc.
  • Advanced computation schedules,
      • Flooding, TDMP, conditional activation, etc.
  • Efficient hardware/software decoders,
      • Hardware (ASIC, FPGA) for efficiency,
      • Software (CPU & GPU) for flexibility.

๏ The Linear Programming (LP) approach for LDPC decoding is a « recent » way.

SLIDE 5

LP decoding of LDPC codes

๏ Linear programming formulation of LDPC decoding,

  • First proposed in [1],
  • Huge memory & computation complexities (they increase mainly with the N, N−K and deg(Ci) parameters),
  • Limited to very short frames (N < 200),

๏ Interesting FER performance

  • Even against the SPA algorithm, especially in error floors,
  • ML certificate when a frame is successfully decoded (frame not decoded otherwise).

๏ Lower complexity formulations,

  • Initial LP ADMM algorithm [2],
  • Good FER performance of ADMM-l2 against SPA [3],
  • Reduced complexity s-ADMM-l2 [4],

๏ LP LDPC decoding is affordable for implementation purposes.

[1] J. Feldman, Decoding Error-Correcting Codes via Linear Programming. PhD thesis, Massachusetts Institute of Technology, 2003.

SLIDE 6

LP decoding of LDPC codes

Fig. 1. FER comparison of ADMM-l2 penalized decoders with SPA decoders on the AWGN channel: (a) WiMAX 1152 × 288 rate-0.75B LDPC code, (b) WiMAX 576 × 288 LDPC code.

[2] X. Zhang and P. H. Siegel, “Efficient iterative LP decoding of LDPC codes with alternating direction method of multipliers,” IEEE International Symposium on Information Theory (ISIT), 2013.
[3] X. Jiao, H. Wei, J. Mu, and C. Chen, “Improved ADMM penalized decoder for irregular low-density parity-check codes,” IEEE Communications Letters, June 2015.
[4] H. Wei, X. Jiao, and J. Mu, “Reduced-complexity linear programming decoding based on ADMM for LDPC codes,” IEEE Communications Letters, June 2015.

SLIDE 7

The ADMM decoding algorithm


SLIDE 8

Formulation of the ADMM decoding algorithm

๏ The ADMM algorithm is an MP-based formulation of the LP problem,

  • Proposed in [2], with improved error-correction performance in [3],
  • The formulation keeps the LP decoding features,
  • Traditional flooding schedule,
  • Based on a Euclidean projection;

๏ Based on 4 distinct kernels

  • Kernel 1 initializes the decoder;
  • Kernel 2 processes all VNs;
  • Kernel 3 processes all CNs;
  • Kernel 4 takes the hard decisions;

๏ Kernels 2 and 3 are iterated k times (number of iterations)

  • Most of the computation complexity is located there;

Algorithm 1: Flooding-based ADMM-l2 algorithm.

 1: Kernel 1: Initialization
 2: ∀j ∈ J, i ∈ Nc(j): z(0)_{j→i} = 0.5, λ(0)_{j→i} = 0
 3: ∀i ∈ I: n_i = γ_i / μ
 4: for k = 1 → q while stop criterion = false do
 5:   Kernel 2: For all variable nodes in the code
 6:   for all i ∈ I, j ∈ Nv(i) do
 7:     t(k)_i = Σ_{j ∈ Nv(i)} ( z(k−1)_{j→i} − λ(k−1)_{j→i} )
 8:     L(k)_{i→j} = Π_{[0,1]}( ( t(k)_i − n_i − α/μ ) / ( d_{v_i} − 2α/μ ) )
 9:   end for
10:   Kernel 3: For all check nodes in the code
11:   for all j ∈ J, i ∈ Nc(j) do
12:     z(k)_{j→i} = Π_{P_{dc_j}}[ ρ L(k)_{i→j} + (1 − ρ) z(k−1)_{j→i} + λ(k−1)_{j→i} ]
13:     λ(k)_{j→i} = λ(k−1)_{j→i} + ρ L(k)_{i→j} + (1 − ρ) z(k−1)_{j→i} − z(k)_{j→i}
14:   end for
15: end for
16: Kernel 4: Hard decisions from soft values
17: ∀i ∈ I: ĉ_i = ( Σ_{j ∈ Nv(i)} L_{i→j} ) > 0.5

[2] X. Zhang and P. H. Siegel, “Efficient iterative LP decoding of LDPC codes with alternating direction method of multipliers,” IEEE International Symposium on Information Theory (ISIT), 2013.
[3] X. Jiao, H. Wei, J. Mu, and C. Chen, “Improved ADMM penalized decoder for irregular low-density parity-check codes,” IEEE Communications Letters, June 2015.
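To make the kernel structure concrete, here is a minimal scalar C++ sketch of Kernels 2 and 3 for one iteration, written directly from Algorithm 1. It assumes flat per-edge arrays for z, λ and L, one edge list per node, and a project_parity_polytope() helper standing in for the Euclidean projection Π_P; all names are hypothetical, and the sketch ignores the SIMD/OpenMP optimizations discussed later in the deck.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Per-edge messages, indexed by a global edge identifier.
struct Messages { std::vector<float> z, lambda, L; };

// Clamp a value to [0, 1] (the Pi_[0,1] operator of line 8).
static inline float clamp01(float v) { return std::min(1.0f, std::max(0.0f, v)); }

// Placeholder: the real operation is the projection of Algorithm 2;
// here we only clamp each value so that the sketch compiles and runs.
void project_parity_polytope(float* omega, int dc) {
    for (int k = 0; k < dc; ++k) omega[k] = clamp01(omega[k]);
}

// Kernel 2: one update of every variable node (lines 6-9 of Algorithm 1).
void vn_kernel(const std::vector<std::vector<int>>& vn_edges,  // edge ids of each VN
               const std::vector<float>& n,                    // n_i = gamma_i / mu
               float alpha_over_mu, Messages& m) {
    for (std::size_t i = 0; i < vn_edges.size(); ++i) {
        float t = 0.0f;
        for (int e : vn_edges[i]) t += m.z[e] - m.lambda[e];        // t_i
        const float dv = static_cast<float>(vn_edges[i].size());
        const float Li = clamp01((t - n[i] - alpha_over_mu) / (dv - 2.0f * alpha_over_mu));
        for (int e : vn_edges[i]) m.L[e] = Li;                      // one broadcast message
    }
}

// Kernel 3: one update of every check node (lines 11-14 of Algorithm 1).
void cn_kernel(const std::vector<std::vector<int>>& cn_edges, float rho, Messages& m) {
    std::vector<float> omega, z_new;
    for (const auto& edges : cn_edges) {
        const int dc = static_cast<int>(edges.size());
        omega.resize(dc);
        for (int k = 0; k < dc; ++k) {
            const int e = edges[k];
            omega[k] = rho * m.L[e] + (1.0f - rho) * m.z[e] + m.lambda[e];
        }
        z_new = omega;                                              // copy, then project
        project_parity_polytope(z_new.data(), dc);
        for (int k = 0; k < dc; ++k) {
            const int e = edges[k];
            m.lambda[e] = omega[k] - z_new[k];                      // lambda = omega - z
            m.z[e] = z_new[k];
        }
    }
}
```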


SLIDE 11

The VN and CN computation kernels

Figure: message exchanges between one VN and its check nodes (each CN sends a (λ, z) pair, the VN sends back L messages).

VN update — one broadcast message per VN:

  γ_i = ( Σ_j (λ_j + z_j) − LLR_i / μ − α/μ ) / ( deg_VN − 2α/μ )

CN update — two « messages » (λ, z) per VN:

  ω_i = ρ · L(k)_{i→j} + (1 − ρ) z(k−1)_j + λ(k−1)_j
  z = Π_{P_{dc_j}}(ω)
  λ(k)_{j→i} = ω_i − z_i
  L(k)_{j→i} = (z(k)_j)_i − (λ(k)_j)_i


SLIDE 13

The « Euclidean projection » task

๏ The Euclidean projection operation is not trivial at all,

  • Lots of arithmetic operations,
  • 4 return statements, but the last one in the sequence is the most used (about 80% of the calls);

๏ It is composed of:

  • some (small) parallelizable sections,
  • some sequential parts with data dependencies,

๏ Besides arithmetic operations, it requires:

  • data clipping to the [0.0, 1.0] range,
  • data sorting (deg_cn values), required twice,

๏ And this is already the simplified version of the Euclidean projection…

Algorithm 2: Projection to the convex polytope.

 1: function Projection(x : dc float values)
 2:   if ∀j ∈ [0, dc[, x_j ≤ 0 then
 3:     return {0, 0, …, 0}
 4:   else if ∀j ∈ [0, dc[, x_j ≥ 1 then
 5:     return {1, 1, …, 1}
 6:   end if
 7:   {x^r, p^r} = sort x in ascending order and store positions
 8:   x^rc = clamp(x^r, [0, 1])
 9:   cp = Σ_{i=0}^{dc−1} x^rc_i
10:   f = ⌊cp⌋ − (⌊cp⌋ mod 2)
11:   sc = Σ_{i=0}^{f} x^rc_i − Σ_{i=f+1}^{dc−1} x^rc_i
12:   if sc ≤ r then
13:     return reorder({x^rc, p^r})
14:   end if
15:   ∀j ∈ [0, dc[: y_j = (x^rc_j − 1) if j ≤ f, x^rc_j otherwise
16:   {y^r, p^r} = sort y in ascending order and store positions
17:   set β_max = ½ (y^r_{f+1} − y^r_{f+2})
18:   construct a set of breakpoints B = { y^r_i | 0 ≤ i ≤ dc−1; 0 ≤ y^r_i ≤ β_max }
19:   ∀j ∈ [0, dc[: y^r_j(β) = clamp(y^r_j − β, [0, 1]) if j ≤ f, clamp(y^r_j + β, [0, 1]) otherwise
20:   march through the breakpoints to find i such that Σ_{j=0}^{dc−1} y^r_j(β) ≤ r
21:   find β_opt ∈ [β_{i−1}, β_i] by solving Equation (4.28) in [39]
22:   return reorder(y^r(β_opt), p^r)
23: end function
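As a rough illustration of the control flow above, the sketch below implements the early-exit tests and the first sort/clamp stage of Algorithm 2 in plain C++, and leaves the breakpoint search of lines 15-22 out. The function name is hypothetical, it is a partial sketch of the structure only and not the optimized decoder code.

```cpp
#include <algorithm>
#include <numeric>
#include <vector>

// Partial sketch of Algorithm 2 (projection of a degree-dc vector onto the
// check polytope). Only the early exits and the sort/clamp stage are shown;
// the breakpoint search of lines 15-22 is intentionally omitted.
std::vector<float> project_onto_parity_polytope(const std::vector<float>& x) {
    const int dc = static_cast<int>(x.size());

    // Lines 2-6: trivial cases, all inputs below 0 or all above 1.
    if (std::all_of(x.begin(), x.end(), [](float v) { return v <= 0.0f; }))
        return std::vector<float>(dc, 0.0f);
    if (std::all_of(x.begin(), x.end(), [](float v) { return v >= 1.0f; }))
        return std::vector<float>(dc, 1.0f);

    // Line 7: sort in ascending order while keeping the original positions.
    std::vector<int> pos(dc);
    std::iota(pos.begin(), pos.end(), 0);
    std::sort(pos.begin(), pos.end(), [&](int a, int b) { return x[a] < x[b]; });

    // Line 8: clamp the sorted values to [0, 1].
    std::vector<float> xrc(dc);
    for (int i = 0; i < dc; ++i)
        xrc[i] = std::min(1.0f, std::max(0.0f, x[pos[i]]));

    // Lines 9-22: membership test and breakpoint search omitted in this sketch.

    // Line 13 style exit: undo the permutation (the full algorithm only takes
    // this path when sc <= r; otherwise it continues with lines 15-22).
    std::vector<float> out(dc);
    for (int i = 0; i < dc; ++i) out[pos[i]] = xrc[i];
    return out;
}
```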

SLIDE 14

Comparison with traditional LDPC decoding algorithms

Amount of computations involved in VN/CN processing for different LDPC decoding algorithms (MSA, SPA, ADMM [30] and ADMM-l2 [37]; one VN and one CN column per algorithm):

  • add & sub: 2dv − 1 | 2dv − 1 | 2dv, 4dc | 2dv + 2, 4dc
  • multiply & div: 4dc | 1, 2dc | 2, 2dc
  • arctanh / tanh: 2dc
  • min, max, abs, xor, cmp: 9dc | 6dc | 2 | 2
  • projection*: 1 | 1
  • memory accesses: 2dv + 1, 2dc | 2dv + 1, 2dc | 2dv + 2, 5dc | 2dv + 2, 5dc
  • memory reads: − | − | 2dv + 1, 3dc | 2dv + 1, 3dc
  • memory writes: − | − | 1, 2dc | 1, 2dc

Compared to the SPA and MS algorithms, the VN processing complexity is slightly higher.

Execution time profiling of a « naive » ADMM software implementation (% of the total decoding time), obtained with X. Liu's open-source C++ ADMM decoder (sites.google.com/site/xishuoliu/codes):

  Code         | SNR = 1.5 dB: VN, CN, Proj., Sort | SNR = 2 dB: VN, CN, Proj., Sort
  576 × 288    | 15, 85, 53, 38.5                  | 16, 84, 50, 41
  1152 × 288   | 14, 86, 60, 45                    | 15, 85, 59, 44
  2304 × 1152  | 15, 86, 54, 36                    | 16, 84, 49, 38.5
  2640 × 1320  | 15, 85, 52, 38                    | 17, 83, 47.5, 41
  4000 × 2000  | 15, 85, 51, 38                    | 18, 82, 46, 41.5

SLIDE 15

Comparison with traditional LDPC decoding algorithms

Compared to SPA, the CN processing complexity is « quite » similar (arctanh versus Euclidean projection).

SLIDE 16

Comparison with traditional LDPC decoding algorithms

However, compared to the MS algorithm, ADMM is much more complex from the CN point of view.

SLIDE 17

Comparison with traditional LDPC decoding algorithms

From a decoding point of view, CN processing consumes more than 80% of the execution time.

SLIDE 18

Comparison with traditional LDPC decoding algorithms

The Euclidean projection takes more than 60% of the CN processing time.

SLIDE 19

Comparison with traditional LDPC decoding algorithms

The two data sorting tasks consume 80% of the Euclidean projection time.

SLIDE 20

Software implementation of the ADMM-l2 decoding algorithms


SLIDE 21

Features of the targeted multicore architecture (Intel Core-i7)

๏ This work focuses on multicore (Intel x86) targets,

  • As efficient as (or more than) GPUs for ECCs [5, 6],

๏ Two parallel programming features,

  • SIMD intrinsics with AVX2 (Single Instruction, Multiple Data),
  • SPMT using OpenMP (Single Program, Multiple Threads),

๏ On the targeted multicore device,

  • SIMD: 8 floats can be processed in // using AVX2;
  • SPMT: 4 physical processor cores,

๏ Implementation challenges,

  • Take advantage of the parallelization features (usage rate of the SIMD units and of the processor cores);
  • Minimize the computation complexity and the memory footprint.

Figure: SIMD processing example — parallel element-wise addition, parallel division, parallel tree (horizontal) addition and no-cost float extraction from a SIMD register.

[5] B. Le Gal, C. Leroux and C. Jego. Multi-Gb/s software decoding of Polar Codes. IEEE Transactions on Signal Processing, pages 349–359, January 2015.
[6] B. Le Gal and C. Jego. High-throughput multi-core LDPC decoders based on x86 processor. IEEE Transactions on Parallel and Distributed Systems, May 2015.
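The figure above alludes to element-wise SIMD operations followed by a horizontal (tree) reduction and a cheap scalar extraction. The snippet below is a minimal AVX2 illustration of that pattern, independent of the decoder code; the function name and data are made up.

```cpp
#include <immintrin.h>

// Divide 8 pairs of floats element-wise with AVX2, then reduce the 8 partial
// results to one scalar: parallel ops, tree addition, no-cost float extraction.
float simd_sum_of_ratios(const float* a, const float* b) {
    __m256 va = _mm256_loadu_ps(a);            // load 8 floats
    __m256 vb = _mm256_loadu_ps(b);
    __m256 vr = _mm256_div_ps(va, vb);         // 8 divisions in parallel

    // Horizontal (tree) addition: fold the upper half onto the lower half.
    __m128 lo = _mm256_castps256_ps128(vr);
    __m128 hi = _mm256_extractf128_ps(vr, 1);
    __m128 s  = _mm_add_ps(lo, hi);              // 4 partial sums
    s = _mm_add_ps(s, _mm_movehl_ps(s, s));      // 2 partial sums
    s = _mm_add_ss(s, _mm_shuffle_ps(s, s, 1));  // final sum in lane 0
    return _mm_cvtss_f32(s);                     // no-cost float extraction
}
```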

SLIDE 22

The parallelism levels in the ADMM decoding algorithm

Figure: Tanner graph representations illustrating the three parallelism levels.

  • An « easy » parallelization is possible inside the CN and VN nodes, but it depends on the CN/VN degree.
  • A « more complex » parallelization is also possible across CNs and VNs; it needs to reorder data at runtime to fill the SIMD registers.
  • A « quite easy » parallelization is also possible across frames; the overall processing, Euclidean projection included, can be parallelized in this way.
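To give an idea of the second level (across nodes), the sketch below gathers the k-th edge value of 8 check nodes of identical degree into one contiguous block, so that a single AVX2 instruction advances 8 CNs at once. The data layout, names and the dc ≤ 32 bound are assumptions for illustration, not the layout actually used in the presented decoder.

```cpp
#include <immintrin.h>

// Eight check nodes of identical degree dc are processed together. For each
// message position k, the k-th edge value of each of the 8 CNs is gathered
// (runtime data reordering), then one AVX2 operation computes omega for 8 CNs.
// Assumes dc <= 32.
void cn_group_omega(const int edge_index[8][32],    // global edge id per CN / position
                    const float* L, const float* z_old, const float* lambda_old,
                    float rho, int dc, float omega_t[][8]) {
    const __m256 vrho = _mm256_set1_ps(rho);
    const __m256 v1mr = _mm256_set1_ps(1.0f - rho);
    for (int k = 0; k < dc; ++k) {
        alignas(32) float l8[8], z8[8], a8[8];
        for (int c = 0; c < 8; ++c) {                // gather step (data reordering)
            const int e = edge_index[c][k];
            l8[c] = L[e]; z8[c] = z_old[e]; a8[c] = lambda_old[e];
        }
        // omega = rho * L + (1 - rho) * z + lambda, for 8 CNs in parallel.
        __m256 v = _mm256_add_ps(
            _mm256_add_ps(_mm256_mul_ps(vrho, _mm256_load_ps(l8)),
                          _mm256_mul_ps(v1mr, _mm256_load_ps(z8))),
            _mm256_load_ps(a8));
        _mm256_storeu_ps(omega_t[k], v);             // transposed layout: row k = 8 CNs
    }
}
```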


SLIDE 25

The first (naive) decoder implementation

๏ In the 1st implementation, parallelization was performed inside the CNs/VNs,

๏ For VN elements,

  ➡ Semi-// sum of the input messages,
  ➡ Sequential message generation,

๏ For CN elements,

  ➡ Semi-// ωi computations from the messages,
  ➡ Semi-parallel Euclidean projection,
  ➡ Semi-// message generation,

๏ It may speed up the processing but,

  • The usage rate of the SIMD unit is lower than 100%,
  • VN degrees are usually in {2, 3, 4, 6},
  • CN degrees are usually in {6, 7, 8, 11, 12},
  • Some processing parts generate scalar results.

Figure: VN and CN update kernels (equations as on Slide 11), with the SIMD parallelism applied inside each node.


SLIDE 27

The second (improved) decoder implementation

๏ In the 2nd implementation, parallelization is applied inside and across the CNs/VNs,

๏ For VN elements,

  ➡ Fully-// sum of the input messages,
  ➡ Fully-// message generation,

๏ For CN elements,

  ➡ Fully-// ωi computations from the messages,
  ➡ Semi-parallel Euclidean projection,
      ✓ Fully-// 1st data sorting (done before the projection),
  ➡ Fully-// message generation,

๏ It speeds up the processing but,

  ✓ The usage rate of the SIMD unit is often equal to 100%,
  ✓ Some processing parts remain un-parallelized,
  • It requires transposing the CN information before and after the projection task.

Figure: VN and CN update kernels (equations as on Slide 11), with the SIMD parallelism applied across several nodes.


SLIDE 29

Common optimizations for the parallelization approaches

Algorithm 2 — Projection to the convex polytope (see Slide 13).

Fig. 2. Average number of cycles of (a) reference sorting functions for 6 floats and (b) sorting functions for 6 floats keeping the input positions, for qsort, insertion, bubble sort, sorting networks (swap) and rank order: about 302, 101, 23, 17 and 35 cycles in (a), and 412, 131, 87, 59 and 48 cycles in (b).

SIMD parallelization was applied on some loops; however:

  • Partial SIMD usage: degc is often lower than the SIMD width;
  • Some loops produce scalar values and require horizontal computations.

Both sorting steps, which are sequential, were optimized by selecting the best data sorting algorithm for each need.

H-matrix transformations group the CNs with the same degree (required for inter-xN parallelization). The message access interleaving was modified to remove unaligned memory transactions (inter-xN parallelization).
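As an illustration of the « sorting networks » option in Fig. 2, the sketch below sorts 6 floats with a fixed sequence of compare-exchange operations while keeping the original input positions, as the projection requires. The 12-comparator network used here is a standard published network for 6 inputs and the names are illustrative; it is not necessarily the exact code of the presented decoder.

```cpp
#include <utility>

// Compare-exchange: order (v[a], v[b]) ascending and keep the original
// input positions in sync, as required by the projection.
static inline void ce(float v[6], int p[6], int a, int b) {
    if (v[a] > v[b]) { std::swap(v[a], v[b]); std::swap(p[a], p[b]); }
}

// Fixed 12-comparator sorting network for 6 values: no data-dependent
// comparison schedule, hence friendly to superscalar/SIMD execution.
void sort6_with_positions(float v[6], int p[6]) {
    for (int i = 0; i < 6; ++i) p[i] = i;          // remember input positions
    ce(v, p, 0, 1); ce(v, p, 2, 3); ce(v, p, 4, 5);
    ce(v, p, 0, 2); ce(v, p, 3, 5); ce(v, p, 1, 4);
    ce(v, p, 0, 1); ce(v, p, 2, 3); ce(v, p, 4, 5);
    ce(v, p, 1, 2); ce(v, p, 3, 4);
    ce(v, p, 2, 3);
}
```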


SLIDE 32

The SPMT parallelization strategies

๏ The Intel Core-i7 has several physical cores, each with its own SIMD unit,

๏ Processing different VNs/CNs in //,

  ✓ Computations are processed in //,
  ✓ It necessitates costly synchronizations at runtime,
  • Which reduce the decoder throughput compared to a single-thread implementation.

๏ Processing different frames in //,

  ✓ Computations are processed in //,
  ✓ No synchronization is required during decoding,
  ✓ The memory footprint increases (cache misses),

๏ Improved system scalability,

  ✓ Threads always process different frames,
  ✓ The OpenMP API makes it easy to move to devices with more or fewer physical cores.

Figure: one ADMM LDPC decoder instance per physical core.
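A minimal sketch of this frame-level SPMT strategy with OpenMP, assuming a hypothetical AdmmDecoder class that owns all per-frame state, so that no synchronization is needed between threads:

```cpp
#include <omp.h>
#include <vector>

// Hypothetical decoder object holding all per-frame state (messages, buffers).
struct AdmmDecoder {
    // Kernels 1-4 of Algorithm 1 would run here; omitted in this sketch.
    void decode(const float* /*llr_in*/, int* /*bits_out*/, int /*max_iters*/) {}
};

// Decode a batch of frames: one private decoder per thread, one frame per task.
void decode_batch(const std::vector<const float*>& llrs,
                  const std::vector<int*>& outputs, int max_iters) {
    const int n_frames = static_cast<int>(llrs.size());
    #pragma omp parallel
    {
        AdmmDecoder decoder;                     // private state: no sharing, no locks
        #pragma omp for schedule(dynamic)
        for (int f = 0; f < n_frames; ++f)
            decoder.decode(llrs[f], outputs[f], max_iters);
    }
}
```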


SLIDE 34

Experiments


SLIDE 35

The target platform for experiments

๏ Evaluation platform,

  ✓ Intel Haswell Core-i7 4960HQ CPU,
  ✓ 4 Physical Cores (PC) and 4 Logical Cores (LC),
  ✓ Turbo Boost at 3.6 GHz when a single core is active, 3.4 GHz otherwise,
  ✓ 256 KB of L2 cache, 6 MB of L3 cache,
  ✓ 16 GB of DDR3 running at 1600 MHz,

๏ Software decoders are compiled with the Intel C++ compiler 2016,

๏ Presented LDPC code results,

  ✓ IEEE 802.16e codes (2304 × 1152 and 576 × 288),
  ✓ 32-bit floating-point data format,
  ✓ At most 200 decoding iterations are executed.

SLIDE 36

Measure of the ADMM-l2 decoder throughputs

Fig. 4. Average number of iterations vs. throughput evolution: (a) 2304 × 1152 LDPC code, (b) 576 × 288 LDPC code.

Fig. 3. ADMM-l2 optimized decoder measured throughputs w.r.t. the number of threads (1, 2, 4, 8): (a) 2304 × 1152 code, (b) 576 × 288 code.

Evaluation on a single processor core:

  • Low throughput at low SNR values, due to the 200 decoding iterations,
  • Throughput increases with the SNR value thanks to the stopping criterion,
  • Throughput reaches about 3 Mbps at 2.0 dB and up to 6 Mbps at 4.0 dB for both codes.

Evaluation on P processor cores:

  • Throughput scales quite well with the number of physical processor cores [1 => 4],
  • ×P speed-ups are not strictly reached, due to L3 cache pollution between processor cores,
  • The 8-thread experiment shows that logical cores slightly improve the decoding throughput.


SLIDE 38

Conclusion & Future works


SLIDE 39

Conclusions on the current work

๏ The ADMM-l2 algorithm provides interesting FER performance,

๏ The ADMM-l2 algorithm is composed of massively parallel computations,

  • The flooding schedule makes parallelization quite straightforward,

๏ High computation complexity of the CN kernels,

  • The Euclidean projection contains sequential parts,

๏ Throughput performance is honorable on the x86 target for medium SNR values.

Continuous research effort to reach higher throughputs for a large set of applications! Sources available in open source: http://github.com/blegal

SLIDE 40

Since the submission … & future works

๏ Reducing the decoding computation complexity,

  • Layered scheduling techniques (horizontal [7] or vertical [8]),
  • Simplifying the Euclidean projection processing???

๏ Switching to many-core devices?

  • More computation parallelism, but other hardware constraints to manage:
      • Instruction replay,
      • Memory latency, etc.

๏ Switching to a hardware design?

  • ADMM works well with floating-point values, not yet with fixed-point ones…

[7] I. Debbabi, B. Le Gal, N. Khouja, F. Tlili and C. Jego. Fast Converging ADMM Penalized Algorithm for LDPC Decoding. IEEE Communications Letters, February 2016.
[8] I. Debbabi, B. Le Gal, N. Khouja, F. Tlili and C. Jego. Comparison of different schedulings for the ADMM based LDPC decoding. In Proceedings of the 9th International Symposium on Turbo Codes & Iterative Information Processing, Brest, France, September 2016. (submitted)