SLIDE 1
CS246 (Winter 2014) Mining Massive Data Sets
Probability reminders
Sammy El Ghazzal (selghazz@stanford.edu) Disclaimer These notes may contain typos, mistakes or confusing points. Please contact the author so that we can improve them for next year.
1 Definition: a few reminders
Definition (Sample space, Event space, Probability measure). This definition contains the basic definitions of probability theory:
- Sample space (usually denoted as Ω): the set of all possible outcomes.
- Event space (usually denoted as F): a family of subsets of Ω (possibly all subsets of Ω).
- Probability Measure Function P: a function that goes from F to R.
It must have the following properties:
- 1. P (Ω) = 1.
- 2. ∀A ∈ F, 0 ≤ P (A) ≤ 1.
- 3. P (A ∪ B) = P (A) + P (B) − P (A ∩ B).
- 4. For a set of disjoint events A1, . . . , Ap:
P
- ∪
1≤i≤pAi
- =
p
- i=1
P (Ai) . Proposition (Union bound). Let A and B be two events. As we have seen, it holds that: P (A ∪ B) = P (A) + P (B) − P (A ∩ B) , and in particular (the following formula is referred to as the Union Bound): P (A ∪ B) ≤ P (A) + P (B) , and more generally if E1, . . . , En are events: P
- ∪
1≤i≤nEi
- ≤
n
- i=1