SLIDE 1
The Source Coding Theorem
Mathias Winther Madsen mathias.winther@gmail.com
Institute for Logic, Language, and Computation University of Amsterdam
March 2015
SLIDE 2 The Convergence of Averages
Problem
Which of the following is more probable?
- 1. an average of 4,000 in 1,000 dice rolls;
- 2. an average of 4,000,000 in 1,000,000 dice rolls.
SLIDE 3 The Convergence of Averages
The Weak Law of Large Numbers
For every ε > 0 and α > 0 there is a t such that Pr
i=1 Xi
n − E[X]
10 20 2 4 6 10 20 50 100
SLIDE 4 Sequence Probabilities
Problem
With the point probabilities x t s e p(x) .25 .50 .25 Given that we draw 10 letters from this distribution,
- 1. what is Pr(stetsesses)?
- 2. what is the most probable sequence?
SLIDE 5
Sequence Probabilities · s t e t s e s s e s
0.2 0.4 0.6 0.8 1 Probability
SLIDE 6
Sequence Probabilities · s t e t s e s s e s
10−6 10−5 10−4 10−3 10−2 10−1 100 Probability
SLIDE 7
Sequence Probabilities · s t e t s e s s e s
−20 −15 −10 −5 Logarithmic probability
SLIDE 8
Sequence Probabilities · s t e t s e s s e s
−3 −2 −1 Logarithmic probability
SLIDE 9 Typical Sequences
Definition
The entropy of a random variable X is H = E
1 p(X)
Definition
An ε-typical sequence of length n is a sequence for which
1 p(x1, x2, . . . , xn) − Hn
SLIDE 10
Typical Sequences
The Asymptotic Equipartition Property
Eventually, everything has the same probability.
The Source Coding Theorem
For large n, there are only 2Hn sequences worth caring about.
SLIDE 11
Typical Sequences