The Source Coding Theorem Mathias Winther Madsen - - PowerPoint PPT Presentation

the source coding theorem
SMART_READER_LITE
LIVE PREVIEW

The Source Coding Theorem Mathias Winther Madsen - - PowerPoint PPT Presentation

The Source Coding Theorem Mathias Winther Madsen mathias.winther@gmail.com Institute for Logic, Language, and Computation University of Amsterdam March 2015 The Convergence of Averages Problem Which of the following is more probable? 1. an


slide-1
SLIDE 1

The Source Coding Theorem

Mathias Winther Madsen mathias.winther@gmail.com

Institute for Logic, Language, and Computation University of Amsterdam

March 2015

slide-2
SLIDE 2

The Convergence of Averages

Problem

Which of the following is more probable?

  • 1. an average of 4,000 in 1,000 dice rolls;
  • 2. an average of 4,000,000 in 1,000,000 dice rolls.
slide-3
SLIDE 3

The Convergence of Averages

The Weak Law of Large Numbers

For every ε > 0 and α > 0 there is a t such that Pr

  • n

i=1 Xi

n − E[X]

  • > ε
  • ≤ α.

10 20 2 4 6 10 20 50 100

slide-4
SLIDE 4

Sequence Probabilities

Problem

With the point probabilities x t s e p(x) .25 .50 .25 Given that we draw 10 letters from this distribution,

  • 1. what is Pr(stetsesses)?
  • 2. what is the most probable sequence?
slide-5
SLIDE 5

Sequence Probabilities · s t e t s e s s e s

0.2 0.4 0.6 0.8 1 Probability

slide-6
SLIDE 6

Sequence Probabilities · s t e t s e s s e s

10−6 10−5 10−4 10−3 10−2 10−1 100 Probability

slide-7
SLIDE 7

Sequence Probabilities · s t e t s e s s e s

−20 −15 −10 −5 Logarithmic probability

slide-8
SLIDE 8

Sequence Probabilities · s t e t s e s s e s

−3 −2 −1 Logarithmic probability

slide-9
SLIDE 9

Typical Sequences

Definition

The entropy of a random variable X is H = E

  • log

1 p(X)

  • = −E [log p(X)] .

Definition

An ε-typical sequence of length n is a sequence for which

  • log

1 p(x1, x2, . . . , xn) − Hn

  • < ε.
slide-10
SLIDE 10

Typical Sequences

The Asymptotic Equipartition Property

Eventually, everything has the same probability.

The Source Coding Theorem

For large n, there are only 2Hn sequences worth caring about.

slide-11
SLIDE 11

Typical Sequences