CODES FOR ALL SEASONS
Emina Soljanin, Bell Labs


SLIDE 1

CODES FOR ALL SEASONS Emina Soljanin, Bell Labs

SLIDE 2

IN THE CLOUD?

CODES

Emina @ Bell Labs

SLIDE 3

Codes at all Levels and for all Layers

Files have chunks. . . . Chunks have packets. . . . Packets have symbols. . . .

Codes we can’t refuse.

3 / 52

SLIDE 4

Theory vs. Systems (Practice, Networking)

The moon is more useful than the sun! Why? We need the light more during the night than during the day.

SLIDE 5

Codes for . . .

. . . CLOUDS, FAILURES, CROWDS . . . MULTICAST . . . FADING

Great codes are not born great, they grow great:

◮ . . . all . . . : Fork-Join Queues
◮ . . . all users: RaptorQ
◮ . . . all SNRs: HARQ

SLIDE 6

An IR-HARQ Example

[Figure: IR-HARQ over four transmissions, showing the codeword at the transmitter and the symbols accumulated at the receiver after transmissions #1 through #4; the transmitted image is a portrait of Claude Shannon, "the father of information theory".]

SLIDE 7

IR-HARQ in Practice

◮ Considered one of the most important features of cdma2000:
  ◮ on the forward link in Release 0, and
  ◮ on both the reverse and forward links in Revision A.
◮ The mother code is the rate-1/5 turbo code of IS-95.
◮ Decoding is
  ◮ attempted if the total received power is deemed sufficient,
  ◮ done by performing only a few iterations of message passing.
◮ There are
  ◮ prescribed puncturing patterns and error prediction methods,
  ◮ either predetermined or no transparent transmission rules.

SLIDE 8

Garden State (Video) Content Delivery

Multicast is often the most cost-effective delivery method, e.g., in stadiums. Verizon demoed LTE Multicast during the 2014 Super Bowl week.

SLIDE 9

Fountain (Rateless) Codes

Claim to Fame

Enable reliable communication over multiple, unknown channels:

1. They are rateless ⇒ their redundancy can be flexibly adapted to changing channel/network conditions, e.g., as in wireless.
2. In the case of multiple erasure channels, as in Multimedia Broadcast/Multicast Service (MBMS), they can be made to universally achieve the channel capacity for all erasure rates.

Fountain codes are simple to encode and decode.

Isn’t Hybrid ARQ Enough?
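
The fountain-code mechanics can be sketched in a few lines of Python. This is an illustrative toy, not the Raptor construction: source symbols are small integers, XOR plays the role of the GF(2) sum, and the degree distribution is uniform rather than the robust soliton used in practice.

```python
import random

def fountain_encode(source, rng):
    """Yield an endless stream of coded symbols (rateless).

    Each coded symbol XORs a random subset of the k source symbols;
    the subset ("neighbors") is sent alongside so the decoder knows
    which sources were combined.
    """
    k = len(source)
    while True:
        degree = rng.randint(1, k)
        neighbors = rng.sample(range(k), degree)
        value = 0
        for i in neighbors:
            value ^= source[i]
        yield neighbors, value

def peel_decode(k, coded):
    """Belief-propagation ("peeling") decoder for the erasure channel."""
    decoded = [None] * k
    pending = [(set(ns), v) for ns, v in coded]
    progress = True
    while progress and any(d is None for d in decoded):
        progress = False
        for ns, v in pending:
            live = {i for i in ns if decoded[i] is None}
            rest = 0
            for i in ns - live:
                rest ^= decoded[i]
            if len(live) == 1:          # degree-one symbol: peel it off
                (i,) = live
                decoded[i] = v ^ rest
                progress = True
    return decoded
```

Whichever coded symbols survive the erasures, the decoder peels degree-one symbols until the source is recovered (or it stalls, in which case it simply waits for more symbols, which is exactly the rateless property).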

SLIDE 10

Rateless

SLIDE 11

What is Ratelessness and is it Overrated?

Ratelessness matters and means different things to different people:

◮ the encoder can produce a potentially infinite stream of symbols
◮ each code symbol is statistically identical

Can fixed-rate codes be made rateless?

[Plot: throughput vs. BEC erasure rate ǫ, comparing the BEC capacity 1 − ǫ, an LDPC code of rate R with BP threshold ǫ∗, and an LT code (length 10^4, BP decoding).]

SLIDE 12

The (Un)Bearable Ratelessness of Coding

Two eMBMS Phases: Multicast Delivery & Unicast Repair:

SLIDE 13

What are Raptor Codes?

for the conception, development, and analysis of practical rateless codes

SLIDE 14

Content Download Model and Objective

Goal:

Download a content chunk of kp packets, with ks symbols per packet.

Inner and Outer Codes & Channels:

◮ At the inner level, there is a (possibly rateless) (ns, ks) code. Symbols are sent through a channel with erasure probability ǫs.
◮ At the outer level, there is a (possibly rateless) (np, kp) code. What channel do the packets go through?

To guarantee the file download, one code has to be rateless. If the inner code is rateless, there is no need for any outer coding. Feedback is instantaneous, noiseless, and possible after each symbol.

SLIDE 15

Infinite Incremental Redundancy (IIR)

◮ The inner (physical-layer) code is rateless
  ⇔ coded symbols are transmitted until ks of them are received
  ⇒ #transmissions/packet is (ks, 1 − ǫs) negative binomial (NB).
◮ The outer (packet-level) code is not needed (no dropped packets).
◮ #transmissions/chunk, T_IIR, is the sum of kp i.i.d. NB(ks, 1 − ǫs) RVs
  ⇒ also negative binomial, with parameters (kp · ks, 1 − ǫs)
  ⇒ E[T_IIR] = ks · kp / (1 − ǫs).
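
A quick Monte Carlo sanity check of this mean (a sketch; the parameter values kp = 20, ks = 30, ǫs = 0.2 are made up for illustration):

```python
import random

def iir_chunk_transmissions(kp, ks, eps, rng):
    """Count symbol transmissions to deliver one chunk under IIR:
    for each of the kp packets, send symbols over a BEC(eps) until
    ks of them get through (rateless inner code, no outer code)."""
    total = 0
    for _ in range(kp):
        received = 0
        while received < ks:
            total += 1
            if rng.random() > eps:   # symbol survives the erasure channel
                received += 1
    return total
```

Averaging this count over many chunks should land close to ks · kp / (1 − ǫs), i.e., 30 · 20 / 0.8 = 750 for the assumed values.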

SLIDE 16

Finite Redundancy (FR)

◮ A fixed-rate (ns, ks) code is used at the physical layer (per packet).
  ⇒ Each packet is erased independently with probability
    ǫp = Σ_{j=ns−ks+1}^{ns} C(ns, j) · ǫs^j · (1 − ǫs)^(ns−j).
◮ The outer (packet-level) code is rateless
  ⇔ coded packets are transmitted until kp of them are received
  ⇒ #packet-transmissions/chunk is NB(kp, 1 − ǫp).
◮ #symbol-transmissions/chunk, T_FR, is not NB, but still
  E[T_FR] = (kp / (1 − ǫp)) · ns.

How do we compare E[T_IIR] = ks · kp / (1 − ǫs) vs. E[T_FR] = ns · kp / (1 − ǫp)
(ns vs. ks & ǫs vs. ǫp)?
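
The two means can be tabulated directly from these formulas (a sketch; the parameter values kp = 100, ks = 30, ǫs = 0.2 and the inner-code lengths ns are made up):

```python
from math import comb

def packet_erasure_prob(ns, ks, eps_s):
    """epsilon_p: probability a packet fails, i.e. fewer than ks of its
    ns coded symbols survive the symbol erasure channel BEC(eps_s)."""
    return sum(comb(ns, j) * eps_s**j * (1 - eps_s)**(ns - j)
               for j in range(ns - ks + 1, ns + 1))

def expected_transmissions(kp, ks, ns, eps_s):
    """E[T_IIR] and E[T_FR] per the formulas on this slide."""
    eps_p = packet_erasure_prob(ns, ks, eps_s)
    e_iir = ks * kp / (1 - eps_s)        # rateless inner code
    e_fr = ns * kp / (1 - eps_p)         # fixed-rate inner, rateless outer
    return e_iir, e_fr
```

Sweeping ns shows E[T_IIR] ≤ E[T_FR] at every inner-code length, which is what the comparison on the next slide proves in general.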

SLIDE 17

IIR vs. FR in Point-to-Point Transmission

E[T_IIR] ≤ E[T_FR]

Proof: Let S be the number of received symbols after ns transmissions.
⇒ S is binomial, with expectation ns · (1 − ǫs).
⇒ 1 − ǫp = P(S ≥ ks) ≤ ns · (1 − ǫs) / ks,
by the definition of ǫp and Markov’s inequality,
so E[T_FR] = ns · kp / (1 − ǫp) ≥ ks · kp / (1 − ǫs) = E[T_IIR].

Rateless coding at the inner rather than at the outer layer results in fewer channel uses on average for chunk download.

SLIDE 18

Multicast Scenarios

Point-to-Point Like:

Infinite Incremental Redundancy (IIR):
◮ transmit coded symbols until ks are received by all users
  ⇒ no coding or re-transmission at the packet level is needed.

Fixed Redundancy (FR):
◮ a fixed-rate (ns, ks) code is used at the physical layer
  ⇒ coding or retransmission at the packet level is required;
◮ transmit coded packets until kp are received by all users.

Rateless coding may sometimes be used at both layers, e.g., transmit coded symbols until ks are received by some ℓ users, and transmit coded packets until kp are received by all users.

SLIDE 19

IIR vs. FR With u Users (normalized average download time)

[Plot: normalized average download time E[T] vs. number of users, for Fixed Rate (FR) and Infinite Incremental Redundancy (IIR); kp = 100, ks = 100, ǫs ∈ {0.1, 0.2, 0.3, 0.4, 0.5}.]

SLIDE 20

IIR vs. FR in Multicast (normalized average download time)

[Plot: normalized average download time E[T] vs. number of users, for Fixed Rate (FR) and Infinite Incremental Redundancy (IIR); kp = 100, ks = 1000, ǫs ∈ {0.1, 0.2, 0.3, 0.4, 0.5}.]

SLIDE 21

Codes for the Cloudy Season

We want our content to be reliably stored and quickly accessible, and that requires energy for storage, transmission, and computation. Coding minimizes the resources needed to maintain the guaranteed reliability. But what does it do to accessibility?

SLIDE 22

Getting Content from the Cloud(s)

SLIDE 23

(n, k) Multiple Broadcasts Content Access Model

◮ Users request the same content (file F), stored in the cloud.
◮ Server s, s = 1, . . . , n,
  1. acquires F from the cloud at time Ws
     (Ws are i.i.d. RVs, exponentially distributed with mean W),
  2. delivers F by broadcast to the users in time Ds
     (Ds are i.i.d. RVs, highly concentrated around the mean D).
  ⇒ The file-F download time from server s is Ws + Ds.
◮ F is split into k blocks and encoded into n blocks s.t. any k out of n blocks are sufficient for content reconstruction.
◮ Each user’s request is sent to all n servers.

When do k out of n servers deliver their F/k-size blocks?

SLIDE 24

Order Statistics

◮ For i.i.d. RVs {X1, X2, · · · , Xn}, the kth smallest among them is an RV known as the kth order statistic, Xk,n.
◮ When the Xi are exponential with mean W, the mean and variance of Xk,n are
  E[Xk,n] = W (Hn − Hn−k) and V[Xk,n] = W² (Hn^(2) − H(n−k)^(2)),
  where Hn and Hn^(2) are (generalized) harmonic numbers:
  Hn = Σ_{j=1}^{n} 1/j and Hn^(2) = Σ_{j=1}^{n} 1/j².

When k decreases, for fixed n, E[Xk,n] decreases; good for diversity, but what about parallelism?
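
These order-statistic formulas are easy to verify numerically (a sketch; the values n = 10, k = 4, W = 1 are arbitrary):

```python
import random

def harmonic(n, power=1):
    """Generalized harmonic number H_n (power=1) or H_n^(2) (power=2)."""
    return sum(1 / j**power for j in range(1, n + 1))

def mean_kth_order_stat(k, n, W):
    """E[X_{k,n}] = W * (H_n - H_{n-k}) for n iid exponentials of mean W."""
    return W * (harmonic(n) - harmonic(n - k))

def simulate(k, n, W, trials, rng):
    """Empirical mean of the kth smallest of n exponential samples."""
    total = 0.0
    for _ in range(trials):
        total += sorted(rng.expovariate(1 / W) for _ in range(n))[k - 1]
    return total / trials
```

For n = 10, k = 4, W = 1 the formula gives H10 − H6 ≈ 0.479, and the empirical average converges to the same value.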

SLIDE 25

(n, k) Multiple Broadcasts Response Time

Theorem:

1. The mean download completion time is given by
   Tn,k = W (Hn − Hn−k) + D/k, where Hn = 1 + 1/2 + · · · + 1/n.
2. Tn,k depends on W vs. D, and is minimized when
   k∗ ≈ (−D + √(D² + 4nWD)) / (2W).
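
The theorem can be checked by brute force over integer k (a sketch; the values n = 10, W = 1, D = 2 are hypothetical, and each server delivers its F/k-size block in time D/k per the model):

```python
from math import sqrt

def harmonic(n):
    """H_n = 1 + 1/2 + ... + 1/n."""
    return sum(1 / j for j in range(1, n + 1))

def mean_download_time(n, k, W, D):
    """T_{n,k} = W*(H_n - H_{n-k}) + D/k, as in the theorem."""
    return W * (harmonic(n) - harmonic(n - k)) + D / k

def k_star(n, W, D):
    """The closed-form (approximate, continuous) minimizer of T_{n,k}."""
    return (-D + sqrt(D**2 + 4 * n * W * D)) / (2 * W)

# Brute-force minimization over integer k for the assumed values.
n, W, D = 10, 1.0, 2.0
best_k = min(range(1, n + 1), key=lambda k: mean_download_time(n, k, W, D))
```

For these values k∗ ≈ 3.58, and the discrete minimizer is k = 4, so rounding the closed form recovers the brute-force answer.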

SLIDE 26

What is Really the Optimal k?

[Plot: expected response time vs. k for (W = 2, D = 1) and for (W = 1, D = 2).]

SLIDE 27

Who’d Know?

You know a lot of geniuses, y’know. You should meet some stupid people once in a while, y’know. You could learn something!

SLIDE 28

Queueing for Content

SLIDE 29

Two Queueing Models

Single M/M/1 Queue:

◮ Requests arrive at rate λ according to a Poisson process.
◮ Job service times have an exponential distribution with rate µ.
◮ Many metrics of interest are well understood for this model, e.g., the response time is exponential with rate µ − λ.

Fork-Join FHW Queue:

◮ Jobs are split on arrival and must be joined before departure.
◮ In the FHW (Flatto, Hahn, Wright) model, at each queue,
  ◮ requests arrive at rate λ according to a Poisson process,
  ◮ job service times have an exponential distribution with rate µ.
◮ It is seen as a key model for parallel/distributed systems, e.g., RAID.
◮ There is renewed interest in the problem (e.g., map-reduce).
◮ Few analytical results exist, but various approximations are known.
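
The M/M/1 response-time claim is easy to check with a Lindley-style recursion (a sketch; λ = 1, µ = 3 are assumed example values):

```python
import random

def mm1_response_times(lam, mu, n_jobs, rng):
    """Simulate a FIFO M/M/1 queue and return each job's response
    time (waiting plus service). Arrivals are Poisson(lam); service
    times are exponential with rate mu."""
    t_arrive = 0.0   # arrival time of the current job
    t_free = 0.0     # time at which the server next becomes idle
    times = []
    for _ in range(n_jobs):
        t_arrive += rng.expovariate(lam)
        start = max(t_arrive, t_free)       # wait if the server is busy
        t_free = start + rng.expovariate(mu)
        times.append(t_free - t_arrive)
    return times
```

With λ = 1 and µ = 3 the mean response time should settle near 1/(µ − λ) = 0.5, matching the exponential-with-rate-(µ − λ) claim above.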

SLIDE 30

The (n, k) Fork-Join System

Architecture:

◮ F is split into k blocks and encoded into n blocks s.t. any k out of n blocks are sufficient for content reconstruction.
◮ The n coded blocks are stored on n disks.

Operation:

◮ User requests for F are forked to all n disks.
◮ Downloads from any k disks jointly enable reconstruction of F.
⇒ The arrival rate at each of the n queues is λ, and the service rate is kµ.

SLIDE 31

(3, 2) Fork-Join System Architecture

◮ Content F is split into equal parts a and b, and stored on 3 disks as a, b, and a + b ⇒ each disk stores half the size of F.
◮ User requests for F are forked to all 3 disks.
◮ Downloads from any 2 disks jointly enable reconstruction of F.

Storage is 50% higher, but download time (per disk & overall) is reduced.
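
The reconstruction step is just XOR arithmetic; a minimal sketch over byte strings (the a + b block is the bytewise XOR of a and b):

```python
def encode_3_2(a, b):
    """Store file F = (a, b) on three disks as a, b, and a XOR b."""
    return [a, b, bytes(x ^ y for x, y in zip(a, b))]

def reconstruct(blocks):
    """Recover (a, b) from any 2 of the 3 stored blocks; the missing
    block is passed as None."""
    a, b, p = blocks
    if a is None:
        a = bytes(x ^ y for x, y in zip(b, p))   # a = b XOR (a XOR b)
    if b is None:
        b = bytes(x ^ y for x, y in zip(a, p))   # b = a XOR (a XOR b)
    return a, b
```

Whichever single disk is slow (or down), the first two blocks to arrive suffice, which is exactly the diversity the queueing analysis exploits.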

SLIDE 32

(3, 2) Fork-Join System Operation

[Diagram: jobs 1-4 arrive, are forked (F) into tasks at three parallel queues, each with arrival rate λ and service rate kµ, and are joined (J) once enough tasks finish; leftover tasks are abandoned.]

SLIDE 33

Stability of (n, k) Fork-Join FHW System

The rate of arrivals λ and the service rate kµ per node must satisfy
λ − λ(n − k)/n < kµ ⇔ λ < nµ.

SLIDE 34

The (n, 1) Fork-Join FHW System

◮ The effective arrival rate per node is λ/n.
◮ The system behaves as n independent M/M/1 queues
  ⇒ the node response time is exponential with mean 1/(µ − λ/n)
  ⇒ the system response time is exponential with mean 1/(nµ − λ).
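
Under this decomposition the system response time is the minimum over the n nodes, which a short simulation confirms (a sketch; n = 10, λ = 1, µ = 3 are assumed values):

```python
import random

def n1_response(n, lam, mu, rng):
    """One (n, 1) fork-join response time under the M/M/1 decomposition:
    each node's response time is exponential with mean 1/(mu - lam/n),
    and the job completes when the fastest node finishes."""
    rate = mu - lam / n
    return min(rng.expovariate(rate) for _ in range(n))
```

The empirical mean should approach 1/(nµ − λ), the mean of an exponential with rate nµ − λ.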

SLIDE 35

Response Time Histogram for 10^4 Downloads

(10, k) fork-join queue, FHW model, λ = 1, µ = 3.

[Histograms: empirical response-time pdfs for k = 1, k = 5, and k = 10, plus an overlay comparing all three for the 10-fork, k-join queue.]

SLIDE 36

Storage Space vs. Download Time in (10, k) Systems

[Plot: fraction of completed downloads vs. response time; M/M/1, request rate λ = 1, µ = 3 per unit download. Curves: single disk (baseline, unit storage), k = 10 (the same total storage), k = 5 (double the total storage), k = 1 (10× increase in storage).]

SLIDE 37

Doubling Storage Space Shortens Download Time

[Plot: fraction of completed downloads vs. response time for a single disk and for k = 1, k = 2, and k = 5.]

SLIDE 38

FHW (n, k) Fork-Join System

◮ k = n means there is no redundancy ⇒ fork-join FHW queue.
◮ k = 1 means replication ⇒ n independent M/M/1 queues.
◮ 1 < k < n means coding ⇒
  1. there is no independence and the system is not memoryless ⇒ hard to derive analytical results,
  2. but there is enough independence to benefit from diversity.

We are interested in the mean response time Tn,k. Previous work has attempted finding Tn,n, but only bounds are known.

SLIDE 39

Upper Bound on Response Time Tn,k

Consider a modified (n, k) fork-join system in which a completed task does not exit its queue until k tasks of the same job are completed (cf. split-merge system).

The (n, k) split-merge system
◮ has response time greater than its fork-join counterpart, and
◮ is equivalent to an M/G/1 queue with service time Sk, the kth order statistic of n i.i.d. exponentials with rate kµ, with mean and variance
  E[Sk] = (Hn − Hn−k) / (kµ) and V[Sk] = (Hn^(2) − H(n−k)^(2)) / (kµ)².

⇒ An upper bound on Tn,k is given by the Pollaczek-Khinchin formula:
  Tn,k ≤ E[Sk] + λ (V[Sk] + E[Sk]²) / (2 (1 − λ E[Sk])).
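
Plugged into code, the bound reads as follows (a sketch; n = 10, k = 5, λ = 1, µ = 3 are assumed example values):

```python
def harmonic(n, power=1):
    """Generalized harmonic number: sum of 1/j**power for j = 1..n."""
    return sum(1 / j**power for j in range(1, n + 1))

def split_merge_upper_bound(n, k, lam, mu):
    """Pollaczek-Khinchin bound on T_{n,k} via the split-merge system,
    an M/G/1 queue whose service time S_k is the kth order statistic
    of n exponentials with rate k*mu."""
    es = (harmonic(n) - harmonic(n - k)) / (k * mu)             # E[S_k]
    vs = (harmonic(n, 2) - harmonic(n - k, 2)) / (k * mu) ** 2  # V[S_k]
    assert lam * es < 1, "split-merge queue must be stable"
    return es + lam * (vs + es**2) / (2 * (1 - lam * es))
```

The bound always exceeds the bare mean service time E[Sk], since the queueing term is nonnegative.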

SLIDE 40

Lower Bound on Response Time Tn,k

Stages of Job Processing: (Varki et al. approach)

◮ A job goes through k stages of processing, one for each task.
◮ At stage j, 0 ≤ j ≤ k − 1, the job has completed j tasks.
◮ The service rate of a job in stage j is at most (n − j)kµ.

Tn,k ≥ Σ_{j=0}^{k−1} 1 / ((n − j)kµ − λ)   ← sum of the response times of the k stages
     = (1/(kµ)) [ Hn − Hn−k + ρ · (H_{n(n−ρ)} − H_{(n−k)(n−k−ρ)}) ],   where ρ = λ/µ.
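
The stage sum is equally direct to evaluate (a sketch, with the same assumed example values n = 10, k = 5, λ = 1, µ = 3):

```python
def fork_join_lower_bound(n, k, lam, mu):
    """Stage-based lower bound on T_{n,k}: the job passes through k
    stages; in stage j it has j tasks done and its aggregate service
    rate is at most (n - j)*k*mu, so stage j can finish no faster
    than an M/M/1 queue with that service rate would allow."""
    assert lam < n * mu, "the (n, k) fork-join system must be stable"
    return sum(1 / ((n - j) * k * mu - lam) for j in range(k))
```

For n = 10, k = 5, λ = 1, µ = 3 the sum evaluates to about 0.0434, sitting just below the split-merge upper bound for the same parameters.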

SLIDE 41

Tightness of the Bounds

(10, k) fork-join queue, FHW model, λ = 1; left panel µ = 3, right panel µ = 1.001.

[Plots: normalized mean response time vs. k; simulation compared with the lower and upper bounds.]

SLIDE 42

Diversity – the Power of Choosing All

[Plot: mean response time vs. the average time to download one unit of content (1/µ), on a logarithmic scale; curves for the (10, 1) fork-join system, the (20, 2) fork-join system, Power-of-2, and Power-of-10 (LWL job assignment).]

SLIDE 43

When is this Bound Valid, Tight, Applicable?

SLIDE 44

What is a Good Model for Service Time?

SLIDE 45

α-Job-Dependent Service Time

[Plot: mean response time vs. k for α = 1, α = 1/2, and α = 0.]

SLIDE 46

Heavy Tailed Service Time

[Plot: mean response time vs. k for Pareto service with α = ∞, α = 4, and α = 1.8, and for exponential service.]

SLIDE 47

What is "coding gain" beyond the physical layer?

[Plots: BER vs. SNR, and the SNR necessary for a BER of 10^−12 vs. code rate, for RS(255, k).]

SLIDE 48

A Coding Tale of a Tail at Scale

Splitting jobs into smaller tasks allows parallel task execution, but increases randomness in the system, hence the tail. Coding cuts the tail. Which jobs permit cutting the tail?

SLIDE 49

What (and Who) Should be Noted Down?

Hybrid ARQ may not be enough.
  (w. M. Heindelmaier, TUM)

Ratelessness may be unbearable.
  (w. I. Andriyanova, ENSEA, & S. Kokalj, ONR; EPFL, CNRS, and NSF)

Redundancy may reduce delay.
  (w. G. Joshi, MIT, & Y. Liu, U-W Madison; Bell Labs and DIMACS)

SLIDE 50

A Man For All Seasons

Thomas, I chose the right man for chancellor! I should in fairness add that my taste in music is reputedly deplorable. Your taste in music is excellent. It exactly coincides with my own!

SLIDE 51

What is the Future for Coding Theorists?

SLIDE 52