Azumas Inequality Will Perkins March 28, 2013 Azumas Inequality - - PowerPoint PPT Presentation

azuma s inequality
SMART_READER_LITE
LIVE PREVIEW

Azumas Inequality Will Perkins March 28, 2013 Azumas Inequality - - PowerPoint PPT Presentation

Azumas Inequality Will Perkins March 28, 2013 Azumas Inequality Theorem (Azumas Inequality) Let X n be a Martingale so that | X i X i 1 | d i (with probability 1). Then Pr[ | X n X 0 | t ] 2 e t 2 / 2 D 2


slide-1
SLIDE 1

Azuma’s Inequality

Will Perkins March 28, 2013

slide-2
SLIDE 2

Azuma’s Inequality

Theorem (Azuma’s Inequality) Let Xn be a Martingale so that |Xi − Xi−1| ≤ di (with probability 1). Then Pr[|Xn − X0| ≥ t] ≤ 2e−t2/2D2 where D2 = n

i=1 d2 i .

slide-3
SLIDE 3

Azuma’s Inequality

Theorem (Azuma’s Inequality) Let Xn be a Martingale so that |Xi − Xi−1| ≤ di (with probability 1). Then Pr[|Xn − X0| ≥ t] ≤ 2e−t2/2D2 where D2 = n

i=1 d2 i .

If all the di’s are 1, we get an analogue of the Chernoff Bound: Pr[|Xn − X0| ≥ t] ≤ 2e−t2/2n

slide-4
SLIDE 4

Azuma’s Inequality

Proof: Assume for simplicty that X0 = 0. We will prove one side of the inequality.

slide-5
SLIDE 5

Azuma’s Inequality

Proof: Assume for simplicty that X0 = 0. We will prove one side of the inequality. 1. Use the exponential Markov inequality: Pr[Xn ≥ t] ≤ e−λtEeλXn

slide-6
SLIDE 6

Azuma’s Inequality

  • 2. Find a bound for EeλXn.
slide-7
SLIDE 7

Azuma’s Inequality

  • 2. Find a bound for EeλXn.

EeλXn = E[E[eλ(Xn−Xn−1)+λXn−1|Fn−1]] = E[eλXn−1E[eλ(Xn−Xn−1)|Fn−1]]

slide-8
SLIDE 8

Azuma’s Inequality

Now find a bound for the one term, E[eλ(Xn−Xn−1)|Fn−1]: Let y = (Xn − Xn−1)/dn. −1 ≤ y ≤ 1 with probability 1. By convexity of ex, ednλy ≤ 1 + y 2 ednλ + 1 − y 2 e−dnλ

slide-9
SLIDE 9

Azuma’s Inequality

Now find a bound for the one term, E[eλ(Xn−Xn−1)|Fn−1]: Let y = (Xn − Xn−1)/dn. −1 ≤ y ≤ 1 with probability 1. By convexity of ex, ednλy ≤ 1 + y 2 ednλ + 1 − y 2 e−dnλ E[ednλy|Fn−1] ≤ 1 2ednλ + 1 2e−dnλ since E[y|Fn−1] = 0 (Martingale Property). = cosh(dnλ) ≤ eλ2d2

n/2

slide-10
SLIDE 10

Azuma’s Inequality

  • 3. This gives us:

EeλXn ≤ eλ2d2

n/2EeλXn−1

and now we can repeat the same thing n − 1 more times.

slide-11
SLIDE 11

Azuma’s Inequality

  • 3. This gives us:

EeλXn ≤ eλ2d2

n/2EeλXn−1

and now we can repeat the same thing n − 1 more times. EeλXn ≤ eλ2 d2

i /2 = eλ2D2/2

and so Pr[Xn ≥ t] ≤ e−λteλ2D2/2

slide-12
SLIDE 12

Azuma’s Inequality

  • 4. Now optimize over λ:

f (λ) = λ2D2/2 − λt f ′(λ) = λD2 − t so setting λ = t/D2 mimimizes the exponent, and gives us: Pr[Xn ≥ t] ≤ e−t2/2D2

slide-13
SLIDE 13

Azuma’s Inequality

  • 4. Now optimize over λ:

f (λ) = λ2D2/2 − λt f ′(λ) = λD2 − t so setting λ = t/D2 mimimizes the exponent, and gives us: Pr[Xn ≥ t] ≤ e−t2/2D2 The same thing works to show Pr[Xn ≤ −t] ≤ e−t2/2D2

slide-14
SLIDE 14

Chromatic number of a random graph

The chromatic number of a graph, χ(G), is the smallest k so that G can be properly colored with k colors. Examples:

1 A bipartite graph has chromatic number 2. 2 A planar graph as chromatic number at most 4 (the famous 4

color theorem)

slide-15
SLIDE 15

Chromatic number of a random graph

The chromatic number of a graph, χ(G), is the smallest k so that G can be properly colored with k colors. Examples:

1 A bipartite graph has chromatic number 2. 2 A planar graph as chromatic number at most 4 (the famous 4

color theorem) Q: What is the chromatic number of the random graph G(n, p)? This is an old and difficult problem that is not yet fully solved.

slide-16
SLIDE 16

Chromatic number of a random graph

It is difficult to even compute Eχ(G). Nevertheless, Azuma’s Inequality will give us something: Theorem Pr[|χ(G) − Eχ(G)| ≥ r √ n − 1] ≤ 2e−r2/2

slide-17
SLIDE 17

Chromatic number of a random graph

It is difficult to even compute Eχ(G). Nevertheless, Azuma’s Inequality will give us something: Theorem Pr[|χ(G) − Eχ(G)| ≥ r √ n − 1] ≤ 2e−r2/2 This theorem states that the chromatic number is concentrated within O(√n) from its mean, whatever that is, whp.

slide-18
SLIDE 18

Chromatic number of a random graph

Proof: We are working on the probability space defined by G(n, p) - Ω = {0, 1}(n

2), F is all subsets, and P is the product measure in

which each edge appears with probability p.

slide-19
SLIDE 19

Chromatic number of a random graph

Proof: We are working on the probability space defined by G(n, p) - Ω = {0, 1}(n

2), F is all subsets, and P is the product measure in

which each edge appears with probability p. To define a martingale we need a filtration. There are two especially useful filtrations for a random graph: the vertex exposure filtration and the edge exposure filtration.

slide-20
SLIDE 20

Edge Exposure Filtration

Let F0 = {Ω, ∅}. Let Fk = σ(e1, . . . ek) where ei is the ith edge of the n

2

  • possible

edges. Notice that F(n

2) = F, all subsets of Ω. So the filtration has length

n

2

  • .
slide-21
SLIDE 21

Vertex Exposure Filtration

Let F1 = {Ω, ∅}. Let Fk = σ({e : e ⊂ {v1, . . . vk}) where vi is the ith vertex of the n vertices. Here Fn = F and the filtration has length n − 1. Notice that we can order the vertices and edges so that the vertex filtration is a subsequence of the edge filtration.

slide-22
SLIDE 22

The Martingale

We will use the vertex filtration. Let Xk = E[χ(G)|Fk]. Then X1 = Eχ(G) Xn = χ(G) Xk is a (Doob’s) martingale with respect to Fk

slide-23
SLIDE 23

The Martingale

We will use the vertex filtration. Let Xk = E[χ(G)|Fk]. Then X1 = Eχ(G) Xn = χ(G) Xk is a (Doob’s) martingale with respect to Fk Can we bound |Xk − Xk−1|?

slide-24
SLIDE 24

The Martingale

We will use the vertex filtration. Let Xk = E[χ(G)|Fk]. Then X1 = Eχ(G) Xn = χ(G) Xk is a (Doob’s) martingale with respect to Fk Can we bound |Xk − Xk−1|?

  • Yes. |Xk − Xk−1| ≤ 1. Why? Say G1 and G2 are identical except

for a set of edges containing a fixed vertex v. Then |χ(G1) − χ(G2)| ≤ 1, because v can always be given a completely new color to preserve a proper coloring. We call this the vertex Lipschitz condition.

slide-25
SLIDE 25

Chromatic number of a random graph

Now we can apply Azuma’s Inequality to Xk, with D2 = (n − 1). Pr[|Xn − X1| ≥ t] ≤ 2e−t2/2(n−1)

  • r

Pr[|Xn − X1| ≥ r √ n − 1] ≤ 2e−r2/2

slide-26
SLIDE 26

Chromatic number of a random graph

Now we can apply Azuma’s Inequality to Xk, with D2 = (n − 1). Pr[|Xn − X1| ≥ t] ≤ 2e−t2/2(n−1)

  • r

Pr[|Xn − X1| ≥ r √ n − 1] ≤ 2e−r2/2 What other graph functions satisfy either an edge or vertex Lipschitz condition?

slide-27
SLIDE 27

Isoperimetric Inequalities

The Classic Isoperimetry Problem: Of all 2D shapes with area 1, which has the smallest boundary? Ans: the circle!

slide-28
SLIDE 28

Isoperimetric Inequalities

The Classic Isoperimetry Problem: Of all 2D shapes with area 1, which has the smallest boundary? Ans: the circle! Another way of writing this is to say that if a region in the plane has area x, then its boundary must be at least 2√πx. This is an isoperimetric inequality. [Check for a rectangle]

slide-29
SLIDE 29

Isoperimetric Inequalities

The Hamming Cube is the space {0, 1}n with the Hamming metric: d(x, y) is the number of coordinates in which x and y

  • differ. Neighbors are points in the cube that differ in one
  • coordinate. The boundary of a subset of the cube is the set of all

points in the subset that neighbor a point outside the subset.

slide-30
SLIDE 30

Isoperimetric Inequalities

The Hamming Cube is the space {0, 1}n with the Hamming metric: d(x, y) is the number of coordinates in which x and y

  • differ. Neighbors are points in the cube that differ in one
  • coordinate. The boundary of a subset of the cube is the set of all

points in the subset that neighbor a point outside the subset. A generalization of a boundary is the r-enlargement of a set A. We define Ar = {x : d(x, A) ≤ r} In particular, A ⊆ Ar.

slide-31
SLIDE 31

Isoperimetric Inequalities

The Hamming Cube is the space {0, 1}n with the Hamming metric: d(x, y) is the number of coordinates in which x and y

  • differ. Neighbors are points in the cube that differ in one
  • coordinate. The boundary of a subset of the cube is the set of all

points in the subset that neighbor a point outside the subset. A generalization of a boundary is the r-enlargement of a set A. We define Ar = {x : d(x, A) ≤ r} In particular, A ⊆ Ar. An isoperimetric inequality would show that if A is large, then Ar must be very large.

slide-32
SLIDE 32

Isoperimetric Inequalities

Theorem Let A ⊂ {0, 1}n. Let |A| ≥ ǫ2n and define λ so that e−λ2/2 = ǫ. Then if r = 2λ√n, |Ar| ≥ (1 − ǫ)2n

slide-33
SLIDE 33

Isoperimetric Inequalities

Theorem Let A ⊂ {0, 1}n. Let |A| ≥ ǫ2n and define λ so that e−λ2/2 = ǫ. Then if r = 2λ√n, |Ar| ≥ (1 − ǫ)2n Notice that this says that if some subset has an ǫ fraction of the total volume of the Hamming cube, then almost all the hypercube is within distance O(√n) from some point in the set.

slide-34
SLIDE 34

Isoperimetric Inequalities

Proof: We need a random variable and a filtration. Let X be the distance of a randomly chosen point x from A. [The distance of a point x from a set is the minimum distance d(x, y) over all points y ∈ A].

slide-35
SLIDE 35

Isoperimetric Inequalities

Proof: We need a random variable and a filtration. Let X be the distance of a randomly chosen point x from A. [The distance of a point x from a set is the minimum distance d(x, y) over all points y ∈ A]. Define a filtration Fk by revealing one coordinate of x at a time.

slide-36
SLIDE 36

Isoperimetric Inequalities

Proof: We need a random variable and a filtration. Let X be the distance of a randomly chosen point x from A. [The distance of a point x from a set is the minimum distance d(x, y) over all points y ∈ A]. Define a filtration Fk by revealing one coordinate of x at a time. Then Xk = E[X|Fk] is a martingale with X0 = EX Xn = X.

slide-37
SLIDE 37

Isoperimetric Inequalities

Proof: We need a random variable and a filtration. Let X be the distance of a randomly chosen point x from A. [The distance of a point x from a set is the minimum distance d(x, y) over all points y ∈ A]. Define a filtration Fk by revealing one coordinate of x at a time. Then Xk = E[X|Fk] is a martingale with X0 = EX Xn = X. Show that |Xk − Xk−1| ≤ 1.

slide-38
SLIDE 38

Isoperimetric Inequalities

Azuma’s Inequality tells us two things:

1

Pr[X − EX < −λ√n] < e−λ2/2 = ǫ

2

Pr[X − EX > λ√n] < e−λ2/2 = ǫ

slide-39
SLIDE 39

Isoperimetric Inequalities

Azuma’s Inequality tells us two things:

1

Pr[X − EX < −λ√n] < e−λ2/2 = ǫ

2

Pr[X − EX > λ√n] < e−λ2/2 = ǫ But what is EX?

slide-40
SLIDE 40

Isoperimetric Inequalities

Azuma’s Inequality tells us two things:

1

Pr[X − EX < −λ√n] < e−λ2/2 = ǫ

2

Pr[X − EX > λ√n] < e−λ2/2 = ǫ But what is EX? Actually we know that Pr[X = 0] ≥ ǫ since |A| ≥ ǫ2n. So (1) tells us that EX ≤ λ√n. Then (2) gives: Pr[X > 2λ√n] < e−λ2/2 from which we can conclude the theorem.