cs70 jean walrand lecture 34
play

CS70: Jean Walrand: Lecture 34. Conditional Expectation CS70: Jean - PowerPoint PPT Presentation

CS70: Jean Walrand: Lecture 34. Conditional Expectation CS70: Jean Walrand: Lecture 34. Conditional Expectation 1. Review: joint distribution, LLSE 2. Definition of Conditional expectation 3. Properties of CE 4. Applications: Diluting,


  1. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ;

  2. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ;

  3. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ;

  4. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] .

  5. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof:

  6. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (a),(b) Obvious

  7. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (a),(b) Obvious (c) E [ Yh ( X ) | X = x ] = ∑ Y ( ω ) h ( X ( ω ) Pr [ ω | X = x ] ω

  8. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (a),(b) Obvious (c) E [ Yh ( X ) | X = x ] = ∑ Y ( ω ) h ( X ( ω ) Pr [ ω | X = x ] ω = ∑ Y ( ω ) h ( x ) Pr [ ω | X = x ] ω

  9. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (a),(b) Obvious (c) E [ Yh ( X ) | X = x ] = ∑ Y ( ω ) h ( X ( ω ) Pr [ ω | X = x ] ω = ∑ Y ( ω ) h ( x ) Pr [ ω | X = x ] = h ( x ) E [ Y | X = x ] ω

  10. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued)

  11. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (d) E [ h ( X ) E [ Y | X ]] = ∑ h ( x ) E [ Y | X = x ] Pr [ X = x ] x

  12. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (d) E [ h ( X ) E [ Y | X ]] = ∑ h ( x ) E [ Y | X = x ] Pr [ X = x ] x = ∑ h ( x ) ∑ yPr [ Y = y | X = x ] Pr [ X = x ] x y

  13. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (d) E [ h ( X ) E [ Y | X ]] = ∑ h ( x ) E [ Y | X = x ] Pr [ X = x ] x = ∑ h ( x ) ∑ yPr [ Y = y | X = x ] Pr [ X = x ] x y = ∑ h ( x ) ∑ yPr [ X = x , y = y ] x y

  14. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (d) E [ h ( X ) E [ Y | X ]] = ∑ h ( x ) E [ Y | X = x ] Pr [ X = x ] x = ∑ h ( x ) ∑ yPr [ Y = y | X = x ] Pr [ X = x ] x y = ∑ h ( x ) ∑ yPr [ X = x , y = y ] x y = ∑ h ( x ) yPr [ X = x , y = y ] x , y

  15. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (d) E [ h ( X ) E [ Y | X ]] = ∑ h ( x ) E [ Y | X = x ] Pr [ X = x ] x = ∑ h ( x ) ∑ yPr [ Y = y | X = x ] Pr [ X = x ] x y = ∑ h ( x ) ∑ yPr [ X = x , y = y ] x y = ∑ h ( x ) yPr [ X = x , y = y ] = E [ h ( X ) Y ] . x , y

  16. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued)

  17. Properties of CE E [ Y | X = x ] = ∑ yPr [ Y = y | X = x ] y Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Proof: (continued) (e) Let h ( X ) = 1 in (d).

  18. Properties of CE Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] .

  19. Properties of CE Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Note that (d) says that E [( Y − E [ Y | X ]) h ( X )] = 0 .

  20. Properties of CE Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Note that (d) says that E [( Y − E [ Y | X ]) h ( X )] = 0 . We say that the estimation error Y − E [ Y | X ] is orthogonal to every function h ( X ) of X .

  21. Properties of CE Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Note that (d) says that E [( Y − E [ Y | X ]) h ( X )] = 0 . We say that the estimation error Y − E [ Y | X ] is orthogonal to every function h ( X ) of X . We call this the projection property.

  22. Properties of CE Theorem (a) X , Y independent ⇒ E [ Y | X ] = E [ Y ] ; (b) E [ aY + bZ | X ] = aE [ Y | X ]+ bE [ Z | X ] ; (c) E [ Yh ( X ) | X ] = h ( X ) E [ Y | X ] , ∀ h ( · ) ; (d) E [ h ( X ) E [ Y | X ]] = E [ h ( X ) Y ] , ∀ h ( · ) ; (e) E [ E [ Y | X ]] = E [ Y ] . Note that (d) says that E [( Y − E [ Y | X ]) h ( X )] = 0 . We say that the estimation error Y − E [ Y | X ] is orthogonal to every function h ( X ) of X . We call this the projection property. More about this later.

  23. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1.

  24. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] .

  25. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] . We find E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ]

  26. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] . We find E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] = 2 + 5 X + 7 XE [ Y | X ]+ 11 X 2 + 13 X 3 E [ Z 2 | X ]

  27. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] . We find E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] = 2 + 5 X + 7 XE [ Y | X ]+ 11 X 2 + 13 X 3 E [ Z 2 | X ] = 2 + 5 X + 7 XE [ Y ]+ 11 X 2 + 13 X 3 E [ Z 2 ]

  28. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] . We find E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] = 2 + 5 X + 7 XE [ Y | X ]+ 11 X 2 + 13 X 3 E [ Z 2 | X ] = 2 + 5 X + 7 XE [ Y ]+ 11 X 2 + 13 X 3 E [ Z 2 ] = 2 + 5 X + 11 X 2 + 13 X 3 ( var [ Z ]+ E [ Z ] 2 )

  29. Application: Calculating E [ Y | X ] Let X , Y , Z be i.i.d. with mean 0 and variance 1. We want to calculate E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] . We find E [ 2 + 5 X + 7 XY + 11 X 2 + 13 X 3 Z 2 | X ] = 2 + 5 X + 7 XE [ Y | X ]+ 11 X 2 + 13 X 3 E [ Z 2 | X ] = 2 + 5 X + 7 XE [ Y ]+ 11 X 2 + 13 X 3 E [ Z 2 ] = 2 + 5 X + 11 X 2 + 13 X 3 ( var [ Z ]+ E [ Z ] 2 ) = 2 + 5 X + 11 X 2 + 13 X 3 .

  30. Application: Diluting

  31. Application: Diluting At each step, pick a ball from a well-mixed urn.

  32. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball.

  33. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n .

  34. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ?

  35. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N

  36. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball)

  37. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise.

  38. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N )

  39. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N .

  40. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N . Consequently,

  41. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N . Consequently, E [ X n + 1 ] = E [ E [ X n + 1 | X n ]]

  42. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N . Consequently, E [ X n + 1 ] = E [ E [ X n + 1 | X n ]] = ρ E [ X n ] , n ≥ 1 .

  43. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N . Consequently, E [ X n + 1 ] = E [ E [ X n + 1 | X n ]] = ρ E [ X n ] , n ≥ 1 . ⇒ E [ X n ] = ρ n − 1 E [ X 1 ] =

  44. Application: Diluting At each step, pick a ball from a well-mixed urn. Replace it with a blue ball. Let X n be the number of red balls in the urn at step n . What is E [ X n ] ? Given X n = m , X n + 1 = m − 1 w.p. m / N (if you pick a red ball) and X n + 1 = m otherwise. Hence, E [ X n + 1 | X n = m ] = m − ( m / N ) = m ( N − 1 ) / N = X n ρ , with ρ := ( N − 1 ) / N . Consequently, E [ X n + 1 ] = E [ E [ X n + 1 | X n ]] = ρ E [ X n ] , n ≥ 1 . ⇒ E [ X n ] = ρ n − 1 E [ X 1 ] = N ( N − 1 ) n − 1 , n ≥ 1 . = N

  45. Diluting Here is a plot:

  46. Diluting Here is a plot:

  47. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 .

  48. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result.

  49. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k .

  50. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked.

  51. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 .

  52. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } .

  53. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } . Then, X n = Y n ( 1 )+ ··· + Y n ( N ) .

  54. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } . Then, X n = Y n ( 1 )+ ··· + Y n ( N ) . Hence, E [ X n ] = E [ Y n ( 1 )+ ··· + Y n ( N )]

  55. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } . Then, X n = Y n ( 1 )+ ··· + Y n ( N ) . Hence, E [ X n ] = E [ Y n ( 1 )+ ··· + Y n ( N )] = NE [ Y n ( 1 )]

  56. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } . Then, X n = Y n ( 1 )+ ··· + Y n ( N ) . Hence, E [ X n ] = E [ Y n ( 1 )+ ··· + Y n ( N )] = NE [ Y n ( 1 )] = NPr [ Y n ( 1 ) = 1 ]

  57. Diluting By analyzing E [ X n + 1 | X n ] , we found that E [ X n ] = N ( N − 1 N ) n − 1 , n ≥ 1 . Here is another argument for that result. Consider one particular red ball, say ball k . At each step, it remains red w.p. ( N − 1 ) / N , when another ball is picked. Thus, the probability that it is still red at step n is [( N − 1 ) / N ] n − 1 . Let Y n ( k ) = 1 { ball k is red at step n } . Then, X n = Y n ( 1 )+ ··· + Y n ( N ) . Hence, E [ X n ] = E [ Y n ( 1 )+ ··· + Y n ( N )] = NE [ Y n ( 1 )] NPr [ Y n ( 1 ) = 1 ] = N [( N − 1 ) / N ] n − 1 . =

  58. Application: Mixing

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend