Independence, Variance, Bayes Theorem Russell Impagliazzo and Miles - PowerPoint PPT Presentation

Independence, Variance, Bayes’ Theorem Russell Impagliazzo and Miles Jones Thanks to Janine Tiefenbruck http://cseweb.ucsd.edu/classes/sp16/cse21-bd/ June 1, 2016

Independence Rosen p. 486 Theorem : If X and Y are independent random variables over the same sample space, then E ( X Y ) = E( X ) E( Y ) Note: This is not necessarily true if the random variables are not independent!

Concentration Rosen Section 7.4 How close (on average) will we be to the average / expected value? Let X be a random variable with E(X) = E . The unexpectedness of X is the random variable U = |X-E| The average unexpectedness of X is AU(X) = E ( |X-E| ) = E( U ) The variance of X is V(X) = E( |X – E| 2 ) = E ( U 2 ) The standard deviation of X is σ(X) = ( E( |X – E| 2 ) ) 1/2 = V(X) 1/2

Concentration How close (on average) will we be to the average / expected value? Let X be a random variable with E(X) = E . Weight all differences from mean equally The unexpectedness of X is the random variable U = |X-E| The average unexpectedness of X is AU(X) = E ( |X-E| ) = E( U ) The variance of X is V(X) = E( |X – E| 2 ) = E ( U 2 ) The standard deviation of X is σ(X) = ( E( |X – E| 2 ) ) 1/2 = V(X) 1/2 Weight large differences from mean more

Concentration How close (on average) will we be to the average / expected value? Let X be a random variable with E(X) = E . The variance of X is V(X) = E( |X – E| 2 ) = E ( U 2 ) Theorem : V(X) = E(X 2 ) – ( E(X) ) 2

Concentration How close (on average) will we be to the average / expected value? Let X be a random variable with E(X) = E . The variance of X is V(X) = E( |X – E| 2 ) = E ( U 2 ) Theorem : V(X) = E(X 2 ) – ( E(X) ) 2 Proof: V(X) = E( (X-E) 2 ) = E( X 2 – 2XE + E 2 ) = E(X 2 ) – 2E E (X) + E 2 = E(X 2 ) – 2E 2 + E 2 = E(X 2 ) – ( E(X) ) 2 J Linearity of expectation

Standard Deviation The standard deviation gives us a bound on how far off we are likely to be from the expected value. It is frequently but not always a fairly accurate bound.

𝑜 = 1 𝜀𝜁 &

Is this tight? There are actually stronger concentration bounds which say that the probability of being off from the average drops exponentially rather than polynomially. Even with ()* + , these stronger bounds, the actual number becomes Θ samples. - . If you see the results of polling, they almost always give a margin of error which is obtained by plugging in 𝜀 = 0.01 and solving for 𝜗 .

Recall: Conditional probabilities Probability of an event may change if have additional information about outcomes. Suppose E and F are events, and P(F)>0. Then, i.e. Rosen p. 456

Bayes' Theorem Rosen Section 7.3 Based on previous knowledge about how probabilities of two events relate to one another, how does knowing that one event occurred impact the probability that the other did?

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time . What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Your first guess? A. Close to 95% B. Close to 85% C. Close to 15% D. Close to 10% E. Close to 0%

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time . What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive)

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time . What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive) so let E = Tested positive F = Used steroids

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time . What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive) E = Tested positive P( E | F ) = 0.95 F = Used steroids

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time. What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids . Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive) E = Tested positive P( E | F ) = 0.95 F = Used steroids P(F) = 0.1 P( ) = 0.9

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time. What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive) E = Tested positive P( E | F ) = 0.95 P( E | ) = 0.15 F = Used steroids P(F) = 0.1 P( ) = 0.9

Bayes' Theorem: Example 1 Rosen Section 7.3 A manufacturer claims that its drug test will detect steroid use 95% of the time. What the company does not tell you is that 15% of all steroid-free individuals also test positive (the false positive rate). 10% of the Tour de France bike racers use steroids. Your favorite cyclist just tested positive. What’s the probability that he used steroids? Define events: we want P ( used steroids | tested positive) Plug in: 41% E = Tested positive P( E | F ) = 0.95 P( E | ) = 0.15 F = Used steroids P(F) = 0.1 P( ) = 0.9

Bayes' Theorem: Example 2 Rosen Section 7.3 Suppose we have found that the word “Rolex” occurs in 250 of 2000 messages known to be spam and in 5 out of 1000 messages known not to be spam. Estimate the probability that an incoming message containing the word “Rolex” is spam, assuming that it is equally likely that an incoming message is spam or not spam.

Bayes' Theorem: Example 2 Rosen Section 7.3 Suppose we have found that the word “Rolex” occurs in 250 of 2000 messages known to be spam and in 5 out of 1000 messages known not to be spam. Estimate the probability that an incoming message containing the word “Rolex” is spam, assuming that it is equally likely that an incoming message is spam or not spam. We want: P( spam | contains "Rolex" ) . So define the events E = contains "Rolex" F = spam

Bayes' Theorem: Example 2 Rosen Section 7.3 Suppose we have found that the word “Rolex” occurs in 250 of 2000 messages known to be spam and in 5 out of 1000 messages known not to be spam. Estimate the probability that an incoming message containing the word “Rolex” is spam, assuming that it is equally likely that an incoming message is spam or not spam. We want: P( spam | contains "Rolex" ) . So define the events What is P(E|F)? E = contains "Rolex" A. 0.005 F = spam B. 0.125 C. 0.5 D. Not enough info

Bayes' Theorem: Example 2 Rosen Section 7.3 Training set: establish probabilities Suppose we have found that the word “Rolex” occurs in 250 of 2000 messages known to be spam and in 5 out of 1000 messages known not to be spam. Estimate the probability that an incoming message containing the word “Rolex” is spam, assuming that it is equally likely that an incoming message is spam or not spam. We want: P( spam | contains "Rolex" ) . E = contains "Rolex" P( E | F) = 250/2000 = 0.125 P( E | ) = 5/1000 = 0.005 F = spam

Bayes' Theorem: Example 2 Rosen Section 7.3 Suppose we have found that the word “Rolex” occurs in 250 of 2000 messages known to be spam and in 5 out of 1000 messages known not to be spam. Estimate the probability that an incoming message containing the word “Rolex” is spam, assuming that it is equally likely that an incoming message is spam or not spam. We want: P( spam | contains "Rolex" ) . E = contains "Rolex" P( E | F) = 250/2000 = 0.125 P( E | ) = 5/1000 = 0.005 F = spam P( F ) = P( ) = 0.5

Independence, Variance, Bayes Theorem Russell Impagliazzo and Miles - PowerPoint PPT Presentation

Independence, Variance, Bayes Theorem Russell Impagliazzo and Miles Jones Thanks to Janine Tiefenbruck http://cseweb.ucsd.edu/classes/sp16/cse21-bd/ June 1, 2016 Independence Rosen p. 486 Theorem : If X and Y are independent random

Formal Modeling in Cognitive Science Independence Lecture 23: Conditional Probability; Bayes

Bayes Theorem Thomas Bayes (1701-1761) Simple form of Bayes Theorem, for

CS70: Jean Walrand: Lecture 23. Bayes Rule, Independence, Mutual Independence 1. Conditional

Variance Will Perkins January 22, 2013 Variance Definition The variance of a random variable X

Naive Bayes and Gaussian Bayes Classifier Ladislav Rampasek slides by Mengye Ren and others

Independence, Variance, Bayes Theorem Russell Impagliazzo and Miles Jones Thanks to Janine

Independence, Variance, Bayes Theorem Russell Impagliazzo and Miles Jones Thanks to Janine

31. Stokes Theorem Stokes theorem is to Greens theorem, for the work done, as the

Math 186: Conditional Probability and Bayes Theorem (2.4) Independence (2.5) Math 283: Ewens

The Nave Bayes Classifier Machine Learning 1 Todays lecture The nave Bayes Classifier

CS70: Lecture 28. Review: Independence Variance Definition Variance; Inequalities; WLLN X and Y

CS70: Lecture 28. Variance; Inequalities; WLLN 1. Review: Independence 2. Variance 3.

7 Modelling Uncertainty Bayes theorem 7 Modelling Uncertainty Bayes theorem

Alex Psomas: Lecture 18. Random Variables: Variance 1. Variance 2. Distributions Variance Flip

Estimating Variance under Estimating Mean . . . Interval and Fuzzy Estimating Variance . . .

Analysis of variance and regression December 4, 2007 Variance component models Variance

Approximate Conditional-mean Type Filtering for State-space Models Bernhard Spangl, Universit

Time Series Mining and Forecasting Duen Horng (Polo) Chau Georgia Tech Slides based on

Runtime analysis RLS on OneMax 10 trials of n { 1 , . . . , 200 } . 2 , 000 n ln n

Now our picture looks like Decision and Closure Context Free Languages Properties of CFLs

Efficient Determination of the Hyperparameter via L-curve in Large Scale Least Squares and Total

Further Discussions and Beyond EE630 Further Discussions and Beyond EE630 Final exam: two

BA Larder, AD Revell, D Wang, R Hamers, H Tempelman, R Barth, AMJ Wensing, C Morrow, R Wood, F

Bistability in ODE and Boolean network models Matthew Macauley Department of Mathematical