SLIDE 1

18.175: Lecture 7 Sums of random variables

Scott Sheffield

MIT


SLIDE 2

Outline

Definitions
Sums of random variables


SLIDE 3

Outline

Definitions
Sums of random variables


SLIDE 4

Recall expectation definition

Given a probability space (Ω, F, P) and a random variable X (i.e., a measurable function X from Ω to R), we write EX = ∫ X dP.

Expectation is always defined if X ≥ 0 a.s., or if the integrals of max{X, 0} and min{X, 0} are separately finite.
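In symbols (a standard restatement, not spelled out on the slide), with X⁺ = max{X, 0} and X⁻ = −min{X, 0}:

    % X = X^+ - X^-, and both parts are nonnegative, so their integrals
    % always make sense; EX is defined when at least one is finite.
    \mathbb{E}X \;=\; \int_\Omega X \, dP
            \;=\; \int_\Omega X^{+} \, dP \;-\; \int_\Omega X^{-} \, dP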


SLIDE 5

Strong law of large numbers

Theorem (strong law): If X1, X2, . . . are i.i.d. real-valued random variables with expectation m, and An := n^{-1} ∑_{i=1}^n Xi are the empirical means, then lim_{n→∞} An = m almost surely.

Last time we defined independence. We showed how to use Kolmogorov to construct an infinite sequence of i.i.d. random variables on a measure space with a natural σ-algebra (in which the existence of a limit of the Xi is a measurable event). So we’ve come far enough to say that the statement makes sense.
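To make the statement concrete, here is a minimal simulation sketch (my illustration, not from the lecture): the empirical means of i.i.d. uniform(0, 1) draws settle down to the true mean m = 0.5.

    # Illustration only: empirical means A_n of i.i.d. uniform(0, 1)
    # draws, which the strong law says converge to m = 0.5.
    import random

    def empirical_means(n_samples, seed=0):
        rng = random.Random(seed)
        total = 0.0
        means = []
        for n in range(1, n_samples + 1):
            total += rng.random()      # one uniform(0, 1) sample X_n
            means.append(total / n)    # A_n = n^{-1} (X_1 + ... + X_n)
        return means

    means = empirical_means(100_000)
    print(means[99], means[9_999], means[99_999])   # drifting toward 0.5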


SLIDE 6

Recall some definitions

• Two events A and B are independent if P(A ∩ B) = P(A)P(B).
• Random variables X and Y are independent if for all Borel sets C and D we have P(X ∈ C, Y ∈ D) = P(X ∈ C)P(Y ∈ D), i.e., the events {X ∈ C} and {Y ∈ D} are independent.
• Two σ-fields F and G are independent if A and B are independent whenever A ∈ F and B ∈ G. (This definition also makes sense if F and G are arbitrary algebras, semi-algebras, or other collections of measurable sets.)


SLIDE 7

Recall some definitions

• Say events A1, A2, . . . , An are independent if for each I ⊂ {1, 2, . . . , n} we have P(∩_{i∈I} Ai) = ∏_{i∈I} P(Ai).
• Say random variables X1, X2, . . . , Xn are independent if for any measurable sets B1, B2, . . . , Bn, the events {Xi ∈ Bi} are independent.
• Say σ-algebras F1, F2, . . . , Fn are independent if any collection of events (one from each σ-algebra) is independent. (This definition also makes sense if the Fi are algebras, semi-algebras, or other collections of measurable sets.)
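A classic example (added here for illustration, not from the slides) of why the definition quantifies over every subset I of indices: three events can be pairwise independent without being mutually independent.

    # Two fair coin flips: A = first is heads, B = second is heads,
    # C = the two flips agree. Pairwise independent, not mutually so.
    from itertools import product

    omega = list(product("HT", repeat=2))   # 4 equally likely outcomes

    def P(event):
        return sum(1 for w in omega if event(w)) / len(omega)

    A = lambda w: w[0] == "H"
    B = lambda w: w[1] == "H"
    C = lambda w: w[0] == w[1]

    print(P(A), P(B), P(C))                                   # 0.5 each
    print(P(lambda w: A(w) and B(w)), P(A) * P(B))            # 0.25 == 0.25
    print(P(lambda w: A(w) and C(w)), P(A) * P(C))            # 0.25 == 0.25
    print(P(lambda w: B(w) and C(w)), P(B) * P(C))            # 0.25 == 0.25
    print(P(lambda w: A(w) and B(w) and C(w)),
          P(A) * P(B) * P(C))                                 # 0.25 != 0.125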


SLIDE 8

Recall Kolmogorov

Kolmogorov extension theorem: if we have a consistent family of probability measures on the spaces (R^n, ℛ^n), then we can extend them uniquely to a probability measure on R^N. Proved using the semi-algebra variant of Carathéodory's extension theorem.


SLIDE 9

Extend Kolmogorov

• The Kolmogorov extension theorem is not generally true if we replace (R, ℛ) with an arbitrary measurable space. But it is okay if we use standard Borel spaces.
• Durrett calls such spaces nice: a measurable space (S, S) is nice if there is a one-to-one map φ from S to R such that φ and φ⁻¹ are both measurable.
• Are there any interesting nice measure spaces? Theorem: yes, lots. In fact, if S is a complete separable metric space M (or a Borel subset of such a space) and S is the set of Borel subsets of S, then (S, S) is nice. (Separable means containing a countable dense set.)


SLIDE 10

Standard Borel spaces

• Main idea of proof: reduce to the case that the diameter is less than one (e.g., by replacing d(x, y) with d(x, y)/(1 + d(x, y))). Then map M continuously into [0, 1]^N by taking a countable dense set q1, q2, . . . and mapping x to (d(q1, x), d(q2, x), . . .). Then give a measurable one-to-one map from [0, 1]^N to [0, 1] via binary expansion (to send an N × N-indexed matrix of 0's and 1's to an N-indexed sequence of 0's and 1's).
• In practice: say I want to let Ω be the set of closed subsets of a disc, or planar curves, or functions from one set to another, etc. If I want to construct a natural σ-algebra F, I just need to produce a metric that makes Ω complete and separable (and if I have to enlarge Ω to make it complete, that might be okay). Then I check that the events I care about belong to this σ-algebra.
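A finite-precision sketch (my illustration; truncating sidesteps the measure-zero set of reals with two binary expansions, which the real proof must handle) of the binary-expansion step for two coordinates: interleaving the digits of (u, v) ∈ [0, 1]² packs both coordinates into one real.

    def bits(x, n):
        """First n binary digits of x in [0, 1)."""
        out = []
        for _ in range(n):
            x *= 2
            d = int(x)
            out.append(d)
            x -= d
        return out

    def interleave(u, v, n=26):
        """Alternate the binary digits of u and v into one number."""
        z, scale = 0.0, 0.5
        for du, dv in zip(bits(u, n), bits(v, n)):
            z += du * scale; scale /= 2   # odd positions from u
            z += dv * scale; scale /= 2   # even positions from v
        return z

    print(interleave(0.5, 0.25))   # one real encoding both coordinates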


SLIDE 11

Fubini's theorem

• Consider σ-finite measure spaces (X, A, µ1) and (Y, B, µ2). Let Ω = X × Y and let F be the product σ-algebra. Check: there is a unique measure µ on F with µ(A × B) = µ1(A)µ2(B).
• Fubini's theorem: if f ≥ 0 or ∫ |f| dµ < ∞, then

  ∫_X ∫_Y f(x, y) µ2(dy) µ1(dx) = ∫_{X×Y} f dµ = ∫_Y ∫_X f(x, y) µ1(dx) µ2(dy).

• Main idea of proof: check that the definition makes sense: if f is measurable, show that the restriction of f to the slice {(x, y) : x = x0} is measurable as a function of y, and that the integral over the slice is measurable as a function of x0. Check Fubini for indicators of rectangular sets, and use the π-λ theorem to extend to indicators of measurable sets. Extend to simple, bounded, L¹ (or non-negative) functions.
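A toy numerical check (not from the slides) of the identity in its discrete form, for a product of two finite measures: summing over y then x agrees with summing over x then y.

    # Discrete Fubini: iterate in either order over a product measure.
    mu1 = {0: 0.2, 1: 0.8}            # measure on X
    mu2 = {0: 0.5, 1: 0.3, 2: 0.2}    # measure on Y

    def f(x, y):
        return (x + 1) * (y + 2)

    lhs = sum(sum(f(x, y) * mu2[y] for y in mu2) * mu1[x] for x in mu1)
    rhs = sum(sum(f(x, y) * mu1[x] for x in mu1) * mu2[y] for y in mu2)
    print(lhs, rhs)   # identical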


SLIDE 12

Non-measurable Fubini counterexample

What if we take a total ordering ≺ of the reals in [0, 1] (such that for each y the set {x : x ≺ y} is countable) and consider the indicator function of {(x, y) : x ≺ y}?
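Spelled out (added here; the slide leaves it as a question): for each fixed y the section {x : x ≺ y} is countable, hence Lebesgue-null, while for each fixed x the section {y : x ≺ y} has countable complement. So the two iterated integrals disagree, which is possible only because the set is not product measurable:

    % iterated integrals of the indicator of {(x, y) : x \prec y}
    \int_0^1 \Bigl( \int_0^1 \mathbf{1}_{\{x \prec y\}} \, dx \Bigr) dy
        = \int_0^1 0 \, dy = 0,
    \qquad
    \int_0^1 \Bigl( \int_0^1 \mathbf{1}_{\{x \prec y\}} \, dy \Bigr) dx
        = \int_0^1 1 \, dx = 1.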


SLIDE 13

More observations

• If the Xi are independent with distributions µi, then (X1, . . . , Xn) has distribution µ1 × · · · × µn.
• If the Xi are independent and satisfy either Xi ≥ 0 for all i or E|Xi| < ∞ for all i, then

  E ∏_{i=1}^n Xi = ∏_{i=1}^n E Xi.
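A quick Monte Carlo sanity check (illustration only, with distributions of my choosing): for independent X uniform on [0, 2] and Y normal with mean 1, E[XY] should match E[X]E[Y] = 1.

    # E[XY] vs E[X]E[Y] for independent samples.
    import random

    rng = random.Random(1)
    n = 200_000
    xs = [rng.uniform(0, 2) for _ in range(n)]   # E[X] = 1
    ys = [rng.gauss(1, 1) for _ in range(n)]     # E[Y] = 1

    def mean(v):
        return sum(v) / len(v)

    print(mean([x * y for x, y in zip(xs, ys)]))   # approx 1.0
    print(mean(xs) * mean(ys))                     # approx 1.0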


SLIDE 14

Outline

Definitions
Sums of random variables


SLIDE 15

Outline

Definitions
Sums of random variables


SLIDE 16

Summing two random variables

Say we have independent random variables X and Y with density functions fX and fY.

• Now let's try to find FX+Y(a) = P{X + Y ≤ a}.
• This is the integral over {(x, y) : x + y ≤ a} of f(x, y) = fX(x)fY(y). Thus,

  P{X + Y ≤ a} = ∫_{−∞}^{∞} ∫_{−∞}^{a−y} fX(x) fY(y) dx dy = ∫_{−∞}^{∞} FX(a − y) fY(y) dy.

• Differentiating both sides gives

  fX+Y(a) = d/da ∫_{−∞}^{∞} FX(a − y) fY(y) dy = ∫_{−∞}^{∞} fX(a − y) fY(y) dy.

  The latter formula makes some intuitive sense: we are integrating over the set of (x, y) pairs that add up to a.

• Can also write P(X + Y ≤ z) = ∫ F(z − y) dG(y), where F and G are the distribution functions of X and Y.

SLIDE 17

Summing i.i.d. uniform random variables

• Suppose that X and Y are i.i.d. and uniform on [0, 1], so fX = fY = 1 on [0, 1]. What is the probability density function of X + Y?

  fX+Y(a) = ∫_{−∞}^{∞} fX(a − y) fY(y) dy = ∫_0^1 fX(a − y) dy,

  which is the length of [0, 1] ∩ [a − 1, a]. That's a when a ∈ [0, 1], 2 − a when a ∈ [1, 2], and 0 otherwise.
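A Monte Carlo check (illustration only) of this triangle density: a histogram of sampled values of X + Y against the formula above.

    # Compare a histogram of X + Y to the triangle density f(a).
    import random

    rng = random.Random(0)
    n = 500_000
    sums = [rng.random() + rng.random() for _ in range(n)]

    width = 0.1
    for k in range(20):                  # 20 bins covering [0, 2]
        lo = k * width
        est = sum(lo <= s < lo + width for s in sums) / (n * width)
        mid = lo + width / 2
        exact = mid if mid <= 1 else 2 - mid
        print(f"[{lo:.1f}, {lo + width:.1f}): est {est:.3f}, exact {exact:.3f}")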


SLIDE 18

Summing two normal variables

• X is normal with mean zero and variance σ1², and Y is normal with mean zero and variance σ2²:

  fX(x) = (2πσ1²)^{−1/2} e^{−x²/(2σ1²)} and fY(y) = (2πσ2²)^{−1/2} e^{−y²/(2σ2²)}.

• We just need to compute fX+Y(a) = ∫_{−∞}^{∞} fX(a − y) fY(y) dy. We could compute this directly.
• Or we could argue with a multi-dimensional bell curve picture that if X and Y have variance 1 then fσ1X+σ2Y is the density of a normal random variable (and note that variances and expectations are additive).
• Or use the fact that if the Ai ∈ {−1, 1} are i.i.d. coin tosses, then N^{−1/2} ∑_{i=1}^{σ²N} Ai is approximately normal with variance σ² when N is large.
• Generally: if independent random variables Xj are normal (µj, σj²), then ∑_{j=1}^n Xj is normal (∑_{j=1}^n µj, ∑_{j=1}^n σj²).
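A quick numerical check (illustration only, with variances of my choosing) of the variance additivity used above: for independent centered normals, Var(X + Y) = σ1² + σ2².

    # Sample variance of X + Y vs sigma1^2 + sigma2^2.
    import random

    rng = random.Random(2)
    s1, s2 = 0.6, 1.1
    zs = [rng.gauss(0, s1) + rng.gauss(0, s2) for _ in range(300_000)]
    var = sum(z * z for z in zs) / len(zs)   # mean is 0, so E[Z^2] = Var(Z)
    print(var, s1**2 + s2**2)                # both near 1.57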


SLIDE 19

MIT OpenCourseWare http://ocw.mit.edu

18.175 Theory of Probability

Spring 2014

For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
