azuma s inequality
play

Azumas Inequality Will Perkins March 28, 2013 Azumas Inequality - PowerPoint PPT Presentation

Azumas Inequality Will Perkins March 28, 2013 Azumas Inequality Theorem (Azumas Inequality) Let X n be a Martingale so that | X i X i 1 | d i (with probability 1). Then Pr[ | X n X 0 | t ] 2 e t 2 / 2 D 2


  1. Azuma’s Inequality Will Perkins March 28, 2013

  2. Azuma’s Inequality Theorem (Azuma’s Inequality) Let X n be a Martingale so that | X i − X i − 1 | ≤ d i (with probability 1). Then Pr[ | X n − X 0 | ≥ t ] ≤ 2 e − t 2 / 2 D 2 where D 2 = � n i =1 d 2 i .

  3. Azuma’s Inequality Theorem (Azuma’s Inequality) Let X n be a Martingale so that | X i − X i − 1 | ≤ d i (with probability 1). Then Pr[ | X n − X 0 | ≥ t ] ≤ 2 e − t 2 / 2 D 2 where D 2 = � n i =1 d 2 i . If all the d i ’s are 1, we get an analogue of the Chernoff Bound: Pr[ | X n − X 0 | ≥ t ] ≤ 2 e − t 2 / 2 n

  4. Azuma’s Inequality Proof: Assume for simplicty that X 0 = 0. We will prove one side of the inequality.

  5. Azuma’s Inequality Proof: Assume for simplicty that X 0 = 0. We will prove one side of the inequality. 1. Use the exponential Markov inequality: Pr[ X n ≥ t ] ≤ e − λ t E e λ X n

  6. Azuma’s Inequality 2. Find a bound for E e λ X n .

  7. Azuma’s Inequality 2. Find a bound for E e λ X n . E e λ X n = E [ E [ e λ ( X n − X n − 1 )+ λ X n − 1 |F n − 1 ]] = E [ e λ X n − 1 E [ e λ ( X n − X n − 1 ) |F n − 1 ]]

  8. Azuma’s Inequality Now find a bound for the one term, E [ e λ ( X n − X n − 1 ) |F n − 1 ]: Let y = ( X n − X n − 1 ) / d n . − 1 ≤ y ≤ 1 with probability 1. By convexity of e x , e d n λ y ≤ 1 + y e d n λ + 1 − y e − d n λ 2 2

  9. Azuma’s Inequality Now find a bound for the one term, E [ e λ ( X n − X n − 1 ) |F n − 1 ]: Let y = ( X n − X n − 1 ) / d n . − 1 ≤ y ≤ 1 with probability 1. By convexity of e x , e d n λ y ≤ 1 + y e d n λ + 1 − y e − d n λ 2 2 E [ e d n λ y |F n − 1 ] ≤ 1 2 e d n λ + 1 2 e − d n λ since E [ y |F n − 1 ] = 0 (Martingale Property). = cosh( d n λ ) ≤ e λ 2 d 2 n / 2

  10. Azuma’s Inequality 3. This gives us: E e λ X n ≤ e λ 2 d 2 n / 2 E e λ X n − 1 and now we can repeat the same thing n − 1 more times.

  11. Azuma’s Inequality 3. This gives us: E e λ X n ≤ e λ 2 d 2 n / 2 E e λ X n − 1 and now we can repeat the same thing n − 1 more times. E e λ X n ≤ e λ 2 � d 2 i / 2 = e λ 2 D 2 / 2 and so Pr[ X n ≥ t ] ≤ e − λ t e λ 2 D 2 / 2

  12. Azuma’s Inequality 4. Now optimize over λ : f ( λ ) = λ 2 D 2 / 2 − λ t f ′ ( λ ) = λ D 2 − t so setting λ = t / D 2 mimimizes the exponent, and gives us: Pr[ X n ≥ t ] ≤ e − t 2 / 2 D 2

  13. Azuma’s Inequality 4. Now optimize over λ : f ( λ ) = λ 2 D 2 / 2 − λ t f ′ ( λ ) = λ D 2 − t so setting λ = t / D 2 mimimizes the exponent, and gives us: Pr[ X n ≥ t ] ≤ e − t 2 / 2 D 2 The same thing works to show Pr[ X n ≤ − t ] ≤ e − t 2 / 2 D 2

  14. Chromatic number of a random graph The chromatic number of a graph, χ ( G ), is the smallest k so that G can be properly colored with k colors. Examples: 1 A bipartite graph has chromatic number 2. 2 A planar graph as chromatic number at most 4 (the famous 4 color theorem)

  15. Chromatic number of a random graph The chromatic number of a graph, χ ( G ), is the smallest k so that G can be properly colored with k colors. Examples: 1 A bipartite graph has chromatic number 2. 2 A planar graph as chromatic number at most 4 (the famous 4 color theorem) Q: What is the chromatic number of the random graph G ( n , p )? This is an old and difficult problem that is not yet fully solved.

  16. Chromatic number of a random graph It is difficult to even compute E χ ( G ). Nevertheless, Azuma’s Inequality will give us something: Theorem √ n − 1] ≤ 2 e − r 2 / 2 Pr[ | χ ( G ) − E χ ( G ) | ≥ r

  17. Chromatic number of a random graph It is difficult to even compute E χ ( G ). Nevertheless, Azuma’s Inequality will give us something: Theorem √ n − 1] ≤ 2 e − r 2 / 2 Pr[ | χ ( G ) − E χ ( G ) | ≥ r This theorem states that the chromatic number is concentrated within O ( √ n ) from its mean, whatever that is, whp.

  18. Chromatic number of a random graph Proof: We are working on the probability space defined by G ( n , p ) - Ω = { 0 , 1 } ( n 2 ), F is all subsets, and P is the product measure in which each edge appears with probability p .

  19. Chromatic number of a random graph Proof: We are working on the probability space defined by G ( n , p ) - Ω = { 0 , 1 } ( n 2 ), F is all subsets, and P is the product measure in which each edge appears with probability p . To define a martingale we need a filtration. There are two especially useful filtrations for a random graph: the vertex exposure filtration and the edge exposure filtration.

  20. Edge Exposure Filtration Let F 0 = { Ω , ∅} . � n � Let F k = σ ( e 1 , . . . e k ) where e i is the i th edge of the possible 2 edges. Notice that F ( n 2 ) = F , all subsets of Ω. So the filtration has length � n � . 2

  21. Vertex Exposure Filtration Let F 1 = { Ω , ∅} . Let F k = σ ( { e : e ⊂ { v 1 , . . . v k } ) where v i is the i th vertex of the n vertices. Here F n = F and the filtration has length n − 1. Notice that we can order the vertices and edges so that the vertex filtration is a subsequence of the edge filtration.

  22. The Martingale We will use the vertex filtration. Let X k = E [ χ ( G ) |F k ]. Then X 1 = E χ ( G ) X n = χ ( G ) X k is a (Doob’s) martingale with respect to F k

  23. The Martingale We will use the vertex filtration. Let X k = E [ χ ( G ) |F k ]. Then X 1 = E χ ( G ) X n = χ ( G ) X k is a (Doob’s) martingale with respect to F k Can we bound | X k − X k − 1 | ?

  24. The Martingale We will use the vertex filtration. Let X k = E [ χ ( G ) |F k ]. Then X 1 = E χ ( G ) X n = χ ( G ) X k is a (Doob’s) martingale with respect to F k Can we bound | X k − X k − 1 | ? Yes. | X k − X k − 1 | ≤ 1. Why? Say G 1 and G 2 are identical except for a set of edges containing a fixed vertex v . Then | χ ( G 1 ) − χ ( G 2 ) | ≤ 1, because v can always be given a completely new color to preserve a proper coloring. We call this the vertex Lipschitz condition.

  25. Chromatic number of a random graph Now we can apply Azuma’s Inequality to X k , with D 2 = ( n − 1). Pr[ | X n − X 1 | ≥ t ] ≤ 2 e − t 2 / 2( n − 1) or √ n − 1] ≤ 2 e − r 2 / 2 Pr[ | X n − X 1 | ≥ r

  26. Chromatic number of a random graph Now we can apply Azuma’s Inequality to X k , with D 2 = ( n − 1). Pr[ | X n − X 1 | ≥ t ] ≤ 2 e − t 2 / 2( n − 1) or √ n − 1] ≤ 2 e − r 2 / 2 Pr[ | X n − X 1 | ≥ r What other graph functions satisfy either an edge or vertex Lipschitz condition?

  27. Isoperimetric Inequalities The Classic Isoperimetry Problem: Of all 2D shapes with area 1, which has the smallest boundary? Ans: the circle!

  28. Isoperimetric Inequalities The Classic Isoperimetry Problem: Of all 2D shapes with area 1, which has the smallest boundary? Ans: the circle! Another way of writing this is to say that if a region in the plane has area x , then its boundary must be at least 2 √ π x . This is an isoperimetric inequality. [Check for a rectangle]

  29. Isoperimetric Inequalities The Hamming Cube is the space { 0 , 1 } n with the Hamming metric: d ( x , y ) is the number of coordinates in which x and y differ. Neighbors are points in the cube that differ in one coordinate. The boundary of a subset of the cube is the set of all points in the subset that neighbor a point outside the subset.

  30. Isoperimetric Inequalities The Hamming Cube is the space { 0 , 1 } n with the Hamming metric: d ( x , y ) is the number of coordinates in which x and y differ. Neighbors are points in the cube that differ in one coordinate. The boundary of a subset of the cube is the set of all points in the subset that neighbor a point outside the subset. A generalization of a boundary is the r -enlargement of a set A . We define A r = { x : d ( x , A ) ≤ r } In particular, A ⊆ A r .

  31. Isoperimetric Inequalities The Hamming Cube is the space { 0 , 1 } n with the Hamming metric: d ( x , y ) is the number of coordinates in which x and y differ. Neighbors are points in the cube that differ in one coordinate. The boundary of a subset of the cube is the set of all points in the subset that neighbor a point outside the subset. A generalization of a boundary is the r -enlargement of a set A . We define A r = { x : d ( x , A ) ≤ r } In particular, A ⊆ A r . An isoperimetric inequality would show that if A is large, then A r must be very large.

  32. Isoperimetric Inequalities Theorem Let A ⊂ { 0 , 1 } n . Let | A | ≥ ǫ 2 n and define λ so that e − λ 2 / 2 = ǫ . Then if r = 2 λ √ n, | A r | ≥ (1 − ǫ )2 n

  33. Isoperimetric Inequalities Theorem Let A ⊂ { 0 , 1 } n . Let | A | ≥ ǫ 2 n and define λ so that e − λ 2 / 2 = ǫ . Then if r = 2 λ √ n, | A r | ≥ (1 − ǫ )2 n Notice that this says that if some subset has an ǫ fraction of the total volume of the Hamming cube, then almost all the hypercube is within distance O ( √ n ) from some point in the set.

  34. Isoperimetric Inequalities Proof: We need a random variable and a filtration. Let X be the distance of a randomly chosen point x from A . [The distance of a point x from a set is the minimum distance d ( x , y ) over all points y ∈ A ].

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend