

SLIDE 1

Chapter 3: Basics from Probability Theory and Statistics

It is likely that unlikely things should happen.
-- Aristotle

The excitement that a gambler feels when making a bet is equal to the amount he might win times the probability of winning it.
-- Blaise Pascal

To understand God's thoughts we must study statistics, for these are the measure of his purpose.
-- Florence Nightingale

SLIDE 2

Outline

3.1 Probability Theory

Events, Probabilities, Bayes' Theorem, Random Variables, Distributions, Moments, Tail Bounds, Central Limit Theorem, Entropy Measures

3.2 Statistical Inference

Sampling, Parameter Estimation, Maximum Likelihood, Confidence Intervals, Hypothesis Testing, p-Values, Chi-Square Test, Linear and Logistic Regression

(mostly following L. Wasserman, Chapters 1-5)

SLIDE 3

Why All This Math?

  • Ranking search results
  • Estimating size, structure, and dynamics of the Web and social networks (from samples)
  • Inferring user intention (e.g. auto-completion)
  • Predicting the best advertisements
  • Identifying patterns (over sampled and uncertain data)
  • Explaining features/aspects of patterns
  • Characterizing trends, outliers, etc.
  • Analyzing properties of complex (uncertain) data
  • Assessing the quality of IR and DM methods
SLIDE 4

3.1 Basic Probability Theory

A probability space is a triple $(\Omega, E, P)$ with

  • a set $\Omega$ of elementary events (sample space),
  • a family $E$ of subsets of $\Omega$ with $\Omega \in E$ that is closed under $\cup$, $\cap$, and $\neg$ with a countable number of operands (for finite $\Omega$, usually $E = 2^{\Omega}$), and
  • a probability measure $P: E \to [0,1]$ with $P[\Omega] = 1$ and $P[\bigcup_i A_i] = \sum_i P[A_i]$ for countably many, pairwise disjoint $A_i$.

Properties of P:
$P[A] + P[\neg A] = 1$
$P[A \cup B] = P[A] + P[B] - P[A \cap B]$
$P[\emptyset] = 0$ (null/impossible event)
$P[\Omega] = 1$ (true/certain event)

SLIDE 5

Probability Spaces: Examples

Roll one die; events are: 1, 2, 3, 4, 5, 6.

Roll two dice; events are: (1,1), (1,2), ..., (1,6), (2,1), (2,2), ..., (6,5), (6,6).

Repeat rolling a die until the first 6; events are <6>, <o,6>, <o,o,6>, <o,o,o,6>, ..., where o denotes 1, 2, 3, 4, or 5.

Roll two dice and consider their sum; events are: sum is even, sum is odd.

Roll two dice and consider their sum; events are: sum is 2, sum is 3, sum is 4, ..., sum is 12.

SLIDE 6

Independence and Conditional Probabilities

Two events A, B of a probability space are independent if $P[A \cap B] = P[A] \cdot P[B]$.

The conditional probability $P[A \mid B]$ of A under the condition (hypothesis) B is defined as:

$P[A \mid B] = \frac{P[A \cap B]}{P[B]}$

A finite set of events $A = \{A_1, ..., A_n\}$ is independent if for every subset $S \subseteq A$ the equation

$P\left[\bigcap_{A_i \in S} A_i\right] = \prod_{A_i \in S} P[A_i]$

holds.

Event A is conditionally independent of B given C if $P[A \mid B \cap C] = P[A \mid C]$.

SLIDE 7

Total Probability and Bayes’ Theorem

Total probability theorem: For a partitioning of $\Omega$ into events $B_1, ..., B_n$:

$P[A] = \sum_{i=1}^{n} P[A \mid B_i] \cdot P[B_i]$

Bayes' theorem:

$P[A \mid B] = \frac{P[B \mid A] \cdot P[A]}{P[B]}$

$P[A \mid B]$ is called the posterior probability; $P[A]$ is called the prior probability.

SLIDE 8

Bayes’ Theorem: Example 1

Events: R = rain, $\bar{R}$ = no rain, U = umbrella, $\bar{U}$ = no umbrella

Observed data: $P[R] = 0.3$, $P[\bar{R}] = 0.7$, $P[U \mid \bar{R}] = 0.1$, $P[U \mid R] = 0.6$

Superstition deconstructed: Does carrying an umbrella prevent rain?

Bayesian inference: $P[\bar{R} \mid U] = ?$

$P[\bar{R} \mid U] = \frac{P[U \mid \bar{R}] \cdot P[\bar{R}]}{P[U]} = \frac{P[U \mid \bar{R}] \cdot P[\bar{R}]}{P[U \mid \bar{R}] \cdot P[\bar{R}] + P[U \mid R] \cdot P[R]} = \frac{0.1 \cdot 0.7}{0.1 \cdot 0.7 + 0.6 \cdot 0.3} = 7/25 = 0.28$
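As a sanity check, a minimal Python sketch (variable names are ours; the probabilities are the ones observed above) that reproduces the posterior:

```python
p_rain = 0.3             # P[R]
p_no_rain = 0.7          # P[~R]
p_u_given_no_rain = 0.1  # P[U | ~R]
p_u_given_rain = 0.6     # P[U | R]

# Total probability: P[U] = P[U|~R] P[~R] + P[U|R] P[R]
p_u = p_u_given_no_rain * p_no_rain + p_u_given_rain * p_rain

# Bayes: P[~R | U] = P[U|~R] P[~R] / P[U]
print(p_u_given_no_rain * p_no_rain / p_u)  # 0.28
```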

SLIDE 9

Bayes’ Theorem: Example 2

Showmaster shuffles three cards (the queen of hearts is the big prize):

  • You choose a card on which you bet.
  • The showmaster opens one of the other cards (never the prize).
  • The showmaster offers you to change your choice. Should you change?
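Bayes' theorem says yes: switching wins with probability 2/3. A Monte Carlo sketch of the game (our own code, assuming the showmaster always opens a non-prize card):

```python
import random

def play(trials=100_000):
    """Simulate the three-card game; the showmaster never opens the prize card."""
    stay_wins = switch_wins = 0
    for _ in range(trials):
        prize = random.randrange(3)
        choice = random.randrange(3)
        # open some card that is neither the chosen one nor the prize
        opened = next(c for c in range(3) if c != choice and c != prize)
        # switching means taking the remaining closed card
        switched = next(c for c in range(3) if c != choice and c != opened)
        stay_wins += (choice == prize)
        switch_wins += (switched == prize)
    return stay_wins / trials, switch_wins / trials

print(play())  # approximately (0.33, 0.67): switching doubles the winning chance
```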

SLIDE 10

Random Variables

A random variable (RV) X on the probability space $(\Omega, E, P)$ is a function $X: \Omega \to M$ with $M \subseteq \mathbb{R}$ such that $\{e \mid X(e) \le x\} \in E$ for all $x \in M$ (X is measurable).

Random variables with countable M are called discrete, otherwise they are called continuous.

$F_X: M \to [0,1]$ with $F_X(x) = P[X \le x]$ is the (cumulative) distribution function (cdf) of X. For countable M, the function $f_X: M \to [0,1]$ with $f_X(x) = P[X = x]$ is called the (probability) density function (pdf) of X; in general, $f_X(x)$ is $F'_X(x)$. For discrete random variables, the density function is also referred to as the probability mass function.

For a random variable X with distribution function F, the inverse function $F^{-1}(q) := \inf\{x \mid F(x) > q\}$ for $q \in [0,1]$ is called the quantile function of X. (The 0.5 quantile (50th percentile) is called the median.)
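To make the definitions concrete, a small Python sketch (ours, not from the slides) of the pmf, cdf, and quantile function of a fair die:

```python
from fractions import Fraction

# pmf of a fair die: a discrete RV with M = {1,...,6}
pmf = {k: Fraction(1, 6) for k in range(1, 7)}

def cdf(x):
    """F_X(x) = P[X <= x]"""
    return sum(p for k, p in pmf.items() if k <= x)

def quantile(q):
    """F^{-1}(q) = inf {x | F(x) > q}"""
    return min(k for k in pmf if cdf(k) > q)

print(cdf(3))                    # 1/2
print(quantile(Fraction(1, 2)))  # 4, since F(3) = 1/2 is not > 1/2
```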


SLIDE 11

Important Discrete Distributions

  • Bernoulli distribution with parameter p:
    $P[X = x] = p^x (1-p)^{1-x}$ for $x \in \{0,1\}$
  • Binomial distribution (coin toss repeated n times; X: #heads):
    $P[X = k] = f_X(k) = \binom{n}{k} p^k (1-p)^{n-k}$
  • Poisson distribution (with rate $\lambda$):
    $P[X = k] = f_X(k) = e^{-\lambda} \frac{\lambda^k}{k!}$
  • Uniform distribution over $\{1, 2, ..., m\}$:
    $P[X = k] = f_X(k) = \frac{1}{m}$ for $1 \le k \le m$
  • Geometric distribution (#coin tosses until first head):
    $P[X = k] = f_X(k) = (1-p)^{k-1} p$
  • 2-Poisson mixture (with $a_1 + a_2 = 1$):
    $P[X = k] = f_X(k) = a_1 e^{-\lambda_1} \frac{\lambda_1^k}{k!} + a_2 e^{-\lambda_2} \frac{\lambda_2^k}{k!}$
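These pmfs are straightforward to evaluate directly; a small Python sketch (our own helper names) for a few of them:

```python
import math

def binomial_pmf(k, n, p):
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def poisson_pmf(k, lam):
    return math.exp(-lam) * lam**k / math.factorial(k)

def geometric_pmf(k, p):
    # tosses until (and including) the first head, k >= 1
    return (1 - p)**(k - 1) * p

def two_poisson_mixture_pmf(k, a1, lam1, lam2):
    return a1 * poisson_pmf(k, lam1) + (1.0 - a1) * poisson_pmf(k, lam2)

print(binomial_pmf(5, 10, 0.5))   # 0.2460...
print(poisson_pmf(2, 1.0))        # 0.1839...
print(geometric_pmf(3, 0.5))      # 0.125
print(two_poisson_mixture_pmf(2, 0.3, 1.0, 4.0))
```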


SLIDE 12

Important Continuous Distributions

  • Exponential distribution (e.g. time until the next event of a Poisson process) with rate $\lambda = \lim_{\Delta t \to 0} (\#\text{events in } \Delta t) / \Delta t$:
    $f_X(x) = \lambda e^{-\lambda x}$ for $x \ge 0$, 0 otherwise
  • Uniform distribution over the interval [a,b]:
    $f_X(x) = \frac{1}{b-a}$ for $a \le x \le b$, 0 otherwise
  • Hyperexponential distribution:
    $f_X(x) = p \, \lambda_1 e^{-\lambda_1 x} + (1-p) \, \lambda_2 e^{-\lambda_2 x}$
  • Pareto distribution:
    $f_X(x) = \frac{a}{b} \left(\frac{b}{x}\right)^{a+1}$ for $x > b$, 0 otherwise
    Example of a "heavy-tailed" distribution, with $f_X(x) \sim \frac{c}{x^{1+a}}$
  • Logistic distribution:
    $F_X(x) = \frac{1}{1 + e^{-x}}$
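The quantile function from the random-variables slide gives a generic sampling recipe (inverse-transform sampling); a minimal Python sketch (ours) for the exponential distribution:

```python
import math, random

# If U ~ Uniform[0,1), then F^{-1}(U) has cdf F. For the exponential
# distribution, F(x) = 1 - e^{-lambda x}, so F^{-1}(u) = -ln(1 - u) / lambda.
def sample_exponential(lam):
    return -math.log(1.0 - random.random()) / lam

lam = 2.0
xs = [sample_exponential(lam) for _ in range(100_000)]
print(sum(xs) / len(xs))  # empirical mean, close to 1/lam = 0.5
```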


SLIDE 13

Normal Distribution (Gaussian Distribution)

  • Normal distribution $N(\mu, \sigma^2)$ (Gauss distribution; approximates sums of independent, identically distributed random variables):

$f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \, e^{-\frac{(x-\mu)^2}{2\sigma^2}}$

  • Distribution function of N(0,1):

$\Phi(z) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{z} e^{-x^2/2} \, dx$

Theorem: Let X be normally distributed with expectation $\mu$ and variance $\sigma^2$. Then $Y := \frac{X - \mu}{\sigma}$ is normally distributed with expectation 0 and variance 1.

SLIDE 14

Normal Distribution Illustrated

[Figure: pdf and cdf of Normal distributions with different parameters; for the standard Normal N(0,1), the area up to a is $\Phi(a)$ and the area within $[-a, a]$ is $2\Phi(a) - 1$.]
SLIDE 15

Multidimensional (Multivariate) Distributions

Let $X_1, ..., X_m$ be random variables over the same probability space with domains $dom(X_1), ..., dom(X_m)$.

The joint distribution of $X_1, ..., X_m$ has a density function $f_{X_1,...,X_m}(x_1, ..., x_m)$ with

$\sum_{x_1 \in dom(X_1)} \cdots \sum_{x_m \in dom(X_m)} f_{X_1,...,X_m}(x_1, ..., x_m) = 1$

or

$\int_{dom(X_1)} \cdots \int_{dom(X_m)} f_{X_1,...,X_m}(x_1, ..., x_m) \, dx_1 ... dx_m = 1$

The marginal distribution of $X_i$ in the joint distribution of $X_1, ..., X_m$ has the density function

$f_{X_i}(x_i) = \sum_{x_1} \cdots \sum_{x_{i-1}} \sum_{x_{i+1}} \cdots \sum_{x_m} f_{X_1,...,X_m}(x_1, ..., x_m)$

or

$f_{X_i}(x_i) = \int_{x_1} \cdots \int_{x_{i-1}} \int_{x_{i+1}} \cdots \int_{x_m} f_{X_1,...,X_m}(x_1, ..., x_m) \, dx_1 ... dx_{i-1} \, dx_{i+1} ... dx_m$

SLIDE 16

Important Multivariate Distributions

Multinomial distribution (n, m) (n trials with an m-sided die):

$P[X_1 = k_1 \wedge ... \wedge X_m = k_m] = f_{X_1,...,X_m}(k_1, ..., k_m) = \binom{n}{k_1 \; ... \; k_m} \, p_1^{k_1} \cdots p_m^{k_m}$

with $\binom{n}{k_1 \; ... \; k_m} := \frac{n!}{k_1! \cdots k_m!}$

Multidimensional normal distribution $N(\vec{\mu}, \Sigma)$:

$f_{X_1,...,X_m}(\vec{x}) = \frac{1}{(2\pi)^{m/2} \sqrt{|\Sigma|}} \, e^{-\frac{1}{2} (\vec{x}-\vec{\mu})^T \Sigma^{-1} (\vec{x}-\vec{\mu})}$

with covariance matrix $\Sigma$ with $\Sigma_{ij} := Cov(X_i, X_j)$ and determinant $|\Sigma|$ of $\Sigma$.

SLIDE 17

Moments

For a discrete random variable X with density $f_X$:

$E[X] = \sum_{k \in M} k \cdot f_X(k)$ is the expectation value (mean) of X

$E[X^i] = \sum_{k \in M} k^i \cdot f_X(k)$ is the i-th moment of X

$V[X] = E[(X - E[X])^2] = E[X^2] - E[X]^2$ is the variance of X

For a continuous random variable X with density $f_X$:

$E[X] = \int_{-\infty}^{+\infty} x \cdot f_X(x) \, dx$ is the expectation value of X

$E[X^i] = \int_{-\infty}^{+\infty} x^i \cdot f_X(x) \, dx$ is the i-th moment of X

$V[X] = E[(X - E[X])^2] = E[X^2] - E[X]^2$ is the variance of X

Theorem: Expectation values are additive (distributions are not):

$E[X + Y] = E[X] + E[Y]$
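A tiny Python sketch (ours) computing these moments for one fair die, straight from the pmf:

```python
from fractions import Fraction

pmf = {k: Fraction(1, 6) for k in range(1, 7)}
mean = sum(k * p for k, p in pmf.items())        # E[X] = 7/2
second = sum(k**2 * p for k, p in pmf.items())   # E[X^2] = 91/6
var = second - mean**2                           # Var[X] = E[X^2] - E[X]^2 = 35/12
print(mean, second, var)
```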


SLIDE 18

Properties of Expectation and Variance

$E[aX + b] = aE[X] + b$ for constants a, b

$E[X_1 + X_2 + ... + X_n] = E[X_1] + E[X_2] + ... + E[X_n]$
(i.e. expectation values are generally additive, but distributions are not!)

$E[XY] = E[X] \cdot E[Y]$ if X and Y are independent

$Var[aX + b] = a^2 \, Var[X]$ for constants a, b

$Var[X_1 + X_2 + ... + X_n] = Var[X_1] + Var[X_2] + ... + Var[X_n]$ if $X_1, X_2, ..., X_n$ are independent RVs

Caution: the distribution of a sum of independent RVs is given by convolution. For $Z = X + Y$ with non-negative X, Y:

$F_Z(z) = P[X + Y \le z] = \int_0^z f_X(x) \, F_Y(z - x) \, dx$ (continuous distributions)

$F_Z(z) = P[X + Y \le z] = \sum_{x \le z} f_X(x) \, F_Y(z - x)$ (discrete distributions)
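A minimal sketch (ours) of the discrete case, computing the distribution of the sum of two independent fair dice by convolving the pmfs:

```python
from fractions import Fraction

# f_Z(z) = sum_x f_X(x) * f_Y(z - x) for Z = X + Y, X and Y independent
f = {k: Fraction(1, 6) for k in range(1, 7)}
f_sum = {}
for x, px in f.items():
    for y, py in f.items():
        f_sum[x + y] = f_sum.get(x + y, Fraction(0)) + px * py

print(f_sum[2], f_sum[7], f_sum[12])  # 1/36, 1/6, 1/36
```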

SLIDE 19

Correlation of Random Variables

Covariance of random variables $X_i$ and $X_j$:

$Cov(X_i, X_j) := E[(X_i - E[X_i]) \cdot (X_j - E[X_j])]$

$Var(X_i) = Cov(X_i, X_i) = E[X_i^2] - E[X_i]^2$

Correlation coefficient of $X_i$ and $X_j$:

$\rho(X_i, X_j) := \frac{Cov(X_i, X_j)}{\sqrt{Var(X_i) \cdot Var(X_j)}}$

Examples: $X_i$: height, $X_j$: weight; $X_i$: km/day, $X_j$: weight; $X_i$: car price (€), $X_j$: income

Conditional expectation of X given Y = y:

$E[X \mid Y = y] = \sum_x x \cdot f_{X|Y}(x \mid y)$ (discrete case)

$E[X \mid Y = y] = \int x \cdot f_{X|Y}(x \mid y) \, dx$ (continuous case)

SLIDE 20

Generating Functions and Transforms

X, Y, ...: continuous random variables with non-negative real values
A, B, ...: discrete random variables with non-negative integer values

Moment-generating function of X (~ Laplace-Stieltjes transform):

$M_X(s) := \int_0^{\infty} e^{sx} f_X(x) \, dx = E[e^{sX}]$

Generating function of A (z-transform):

$G_A(z) := \sum_{i \ge 0} z^i f_A(i) = E[z^A]$

Examples:

exponential: $f_X(x) = \lambda e^{-\lambda x}$ with $M_X(s) = \frac{\lambda}{\lambda - s}$

Poisson: $f_A(k) = e^{-\lambda} \frac{\lambda^k}{k!}$ with $G_A(z) = e^{\lambda (z-1)}$

Convolution is easy with M or G: it becomes a product! Moments are easy to derive from M or G.

SLIDE 21

Inequalities and Tail Bounds

Markov inequality: $P[X \ge t] \le E[X] / t$ for $t > 0$ and non-negative RV X

Chebyshev inequality: $P[|X - E[X]| \ge t] \le Var[X] / t^2$ for $t > 0$ and RV X

Chernoff-Hoeffding bound: $P[X \ge t] \le \inf_{\lambda > 0} \, e^{-\lambda t} M_X(\lambda)$

Corollary: $P\left[\left|\frac{1}{n}\sum_i X_i - p\right| > t\right] \le 2 e^{-2nt^2}$ for Bernoulli(p) iid RVs $X_1, ..., X_n$ and any $t > 0$

Mill's inequality: $P[|Z| > t] \le \sqrt{\frac{2}{\pi}} \, \frac{e^{-t^2/2}}{t}$ for N(0,1)-distributed RV Z and $t > 0$

Jensen's inequality: $E[g(X)] \ge g(E[X])$ for convex function g; $E[g(X)] \le g(E[X])$ for concave function g
(g is convex if for all $c \in [0,1]$ and $x_1, x_2$: $g(cx_1 + (1-c)x_2) \le c \, g(x_1) + (1-c) \, g(x_2)$)

Cauchy-Schwarz inequality: $E[XY]^2 \le E[X^2] \cdot E[Y^2]$

SLIDE 22

Example: Tail Bounds

Repeat coin tosses 100 times: n = 100. Assume a fair coin: p = 0.5. Observe many heads: k = 90.

Random variable X: #heads

Markov inequality: $P[X \ge k] \le \frac{E[X]}{k} = \frac{50}{90}$

Chebyshev inequality: $P[X \ge k] \le P[|X - E[X]| \ge k - E[X]] \le \frac{Var[X]}{(k - E[X])^2} = \frac{np(1-p)}{(k - E[X])^2} = \frac{25}{1600} \approx 0.016$
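The bounds can be compared numerically against the exact binomial tail; a hedged Python sketch (ours):

```python
import math

n, p, k = 100, 0.5, 90
mean, var = n * p, n * p * (1 - p)   # E[X] = 50, Var[X] = 25

markov = mean / k                     # P[X >= 90] <= 50/90
chebyshev = var / (k - mean) ** 2     # P[X >= 90] <= 25/1600
# Chernoff-Hoeffding corollary (two-sided bound, hence also valid one-sided):
hoeffding = 2 * math.exp(-2 * n * (k / n - p) ** 2)
# exact binomial tail for comparison
exact = sum(math.comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

print(markov, chebyshev, hoeffding, exact)
# the bounds get progressively tighter; the exact tail is tiny (~1e-17)
```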

SLIDE 23

Convergence of Random Variables

Let $X_1, X_2, ...$ be a sequence of RVs with cdfs $F_1, F_2, ...$, and let X be another RV with cdf F.

  • $X_n$ converges to X in probability, $X_n \to_P X$, if for every $\epsilon > 0$: $P[|X_n - X| > \epsilon] \to 0$ as $n \to \infty$
  • $X_n$ converges to X in distribution, $X_n \to_D X$, if $\lim_{n \to \infty} F_n(x) = F(x)$ at all x for which F is continuous
  • $X_n$ converges to X almost surely, $X_n \to_{as} X$, if $P[X_n \to X] = 1$

Convergence almost surely implies convergence in probability; convergence in probability implies convergence in distribution.

SLIDE 24

Laws of Large Numbers

Let $\bar{X}_n = \sum_{i=1..n} X_i / n$.

Weak law of large numbers: if $X_1, X_2, ..., X_n, ...$ are iid RVs with mean E[X], then $\bar{X}_n \to_P E[X]$, that is:

$\lim_{n \to \infty} P[|\bar{X}_n - E[X]| > \epsilon] = 0$

Strong law of large numbers: if $X_1, X_2, ..., X_n, ...$ are iid RVs with mean E[X], then $\bar{X}_n \to_{as} E[X]$, that is:

$P[\lim_{n \to \infty} |\bar{X}_n - E[X]| > \epsilon] = 0$
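A quick empirical look (our own sketch) at the law of large numbers for a fair coin:

```python
import random

# Sample means of fair-coin tosses approach E[X] = 0.5 as n grows.
for n in (10, 100, 1_000, 10_000, 100_000):
    sample = [random.random() < 0.5 for _ in range(n)]
    print(n, sum(sample) / n)
```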

SLIDE 25

Poisson Approximates Binomial

Theorem: Let X be a random variable with binomial distribution with parameters n and $p := \lambda/n$, with large n and small constant $p \ll 1$. Then

$\lim_{n \to \infty} f_X(k) = e^{-\lambda} \frac{\lambda^k}{k!}$
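A minimal numeric check of this limit (our own sketch):

```python
import math

# Compare Binomial(n, lambda/n) with Poisson(lambda) for large n.
n, lam = 1000, 2.0
p = lam / n
for k in range(5):
    binom = math.comb(n, k) * p**k * (1 - p)**(n - k)
    poisson = math.exp(-lam) * lam**k / math.factorial(k)
    print(k, round(binom, 6), round(poisson, 6))  # the columns nearly agree
```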

SLIDE 26

Central Limit Theorem

Theorem: Let $X_1, ..., X_n$ be independent, identically distributed random variables with expectation $\mu$ and variance $\sigma^2$. The distribution function $F_n$ of the random variable $Z_n := X_1 + ... + X_n$ converges to a normal distribution $N(n\mu, n\sigma^2)$ with expectation $n\mu$ and variance $n\sigma^2$:

$\lim_{n \to \infty} P\left[ a \le \frac{Z_n - n\mu}{\sigma \sqrt{n}} \le b \right] = \Phi(b) - \Phi(a)$

Corollary: $\bar{X} := \frac{1}{n} \sum_{i=1}^{n} X_i$ converges to a normal distribution $N(\mu, \sigma^2/n)$ with expectation $\mu$ and variance $\sigma^2/n$.

SLIDE 27

Example: Use of Central Limit Theorem

$X_i$: iid Bernoulli trials with p = 0.5; $E[X_i] = p$, $Var[X_i] = p(1-p)$

$Z_n$: sum of the $X_i$, i = 1..100. $Z_n$ is approximately normally distributed with $E[Z_n] = pn = 50$ and $Var[Z_n] = p(1-p)n = 25$.

$Z := \frac{Z_n - E[Z_n]}{\sqrt{Var[Z_n]}}$ is approximately ~ N(0,1). Hence:

$P[Z_n \ge 90] = P[Z \ge 8] = 1 - \Phi(8)$
$P[Z_n \ge 60] = P[Z \ge 2] = 1 - \Phi(2)$
$P[Z_n \ge 55] = P[Z \ge 1] = 1 - \Phi(1)$
$P[Z_n \ge 56.8] = P[Z \ge 1.36] = 1 - \Phi(1.36)$
$P[40 \le Z_n \le 45] = P[-2 \le Z \le -1] = P[Z \le -1] - P[Z \le -2] = \Phi(-1) - \Phi(-2) = (1 - \Phi(1)) - (1 - \Phi(2))$
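These normal-approximation values can be evaluated via the error function; a minimal Python sketch (ours):

```python
import math

def Phi(z):
    """Standard normal cdf via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

n, p = 100, 0.5
mean, sd = n * p, math.sqrt(n * p * (1 - p))   # 50 and 5

for t in (90, 60, 55, 56.8):
    z = (t - mean) / sd
    print(f"P[Z_n >= {t}] ~ {1.0 - Phi(z):.6f}")

print(Phi(-1.0) - Phi(-2.0))  # P[40 <= Z_n <= 45] ~ 0.1359
```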

SLIDE 28

Elementary Information Theory

Let f(x) be the probability (or relative frequency) of the x-th symbol in some text d. The entropy of the text (or of the underlying probability distribution f) is:

$H(d) = \sum_x f(x) \log_2 \frac{1}{f(x)}$

H(d) is a lower bound for the bits per symbol needed with optimal coding.

For two probability distributions f(x) and g(x), the relative entropy (Kullback-Leibler divergence) of f to g is:

$D(f \| g) := \sum_x f(x) \log_2 \frac{f(x)}{g(x)}$

D is the average number of additional bits for coding events of f when using an optimal code for g. Relative entropy measures the (dis-)similarity of probability or frequency distributions.

Cross entropy of f(x) to g(x):

$H(f, g) := H(f) + D(f \| g) = -\sum_x f(x) \log_2 g(x)$

Jensen-Shannon divergence of f(x) and g(x):

$JS(f, g) := \frac{1}{2} D(f \| g) + \frac{1}{2} D(g \| f)$
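A small Python sketch (ours) computing these measures for two toy distributions:

```python
import math

def entropy(f):
    """H(f) = sum_x f(x) log2(1/f(x))"""
    return sum(p * math.log2(1.0 / p) for p in f.values() if p > 0)

def kl(f, g):
    """D(f||g); assumes g(x) > 0 wherever f(x) > 0"""
    return sum(p * math.log2(p / g[x]) for x, p in f.items() if p > 0)

f = {'A': 0.5, 'B': 0.25, 'C': 0.125, 'D': 0.125}
g = {'A': 0.25, 'B': 0.25, 'C': 0.25, 'D': 0.25}

print(entropy(f))                       # 1.75 bits
print(kl(f, g))                         # 0.25 extra bits/symbol with g's code
print(entropy(f) + kl(f, g))            # cross entropy H(f,g) = 2.0
print(0.5 * kl(f, g) + 0.5 * kl(g, f))  # Jensen-Shannon as defined above
```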

SLIDE 29

Compression

  • Text is a sequence of symbols (with specific frequencies)
  • Symbols can be
    • letters or other characters from some alphabet $\Sigma$
    • strings of fixed length (e.g. trigrams)
    • or words, bits, syllables, phrases, etc.

Limits of compression: Let $p_i$ be the probability (or relative frequency) of the i-th symbol in text d. Then the entropy of the text

$H(d) = \sum_i p_i \log_2 \frac{1}{p_i}$

is a lower bound for the average number of bits per symbol in any compression (e.g. Huffman codes).

Note: compression schemes such as Ziv-Lempel (used in zip) are better because they consider context beyond single symbols; with appropriately generalized notions of entropy, the lower-bound theorem still holds.

SLIDE 30

Example Entropy and Compression

Text over alphabet $\Sigma$ = {A, B, C, D} with
P[A] = 1/2, P[B] = 1/4, P[C] = 1/8, P[D] = 1/8

$H(d) = 1/2 \cdot 1 + 1/4 \cdot 2 + 1/8 \cdot 3 + 1/8 \cdot 3 = 7/4$

Optimal (prefix-free) code from the Huffman tree:
A → 0
B → 10
C → 110
D → 111

[Figure: Huffman tree with leaf weights A: 1/2, B: 1/4, C: 1/8, D: 1/8]
SLIDE 31

Summary of Section 3.1

  • Bayes' Theorem: very simple, very powerful
  • RVs as a fundamental, sometimes subtle concept
  • rich variety of well-studied distribution functions
  • moments and moment-generating functions capture distributions
  • tail bounds useful for non-tractable distributions
  • Normal distribution: limit of sums of iid RVs
  • Entropy measures (incl. KL divergence) capture complexity and similarity of probability distributions

SLIDE 32

Additional Literature for Section 3.1

  • A. Allen: Probability, Statistics, and Queueing Theory with Computer Science Applications, Wiley, 1978
  • R. Nelson: Probability, Stochastic Processes, and Queueing Theory, Springer, 1995
  • M. Mitzenmacher, E. Upfal: Probability and Computing, Cambridge University Press, 2005
  • R. Duda, P. Hart, D. Stork: Pattern Classification, Wiley, 2000, Appendix A
  • M. Greiner, G. Tinhofer: Stochastik für Studienanfänger der Informatik, Carl Hanser Verlag, 1996
  • G. Hübner: Stochastik: Eine anwendungsorientierte Einführung für Informatiker, Ingenieure und Mathematiker, Vieweg & Teubner, 2009

SLIDES 33-38

Reference Tables on Probability Distributions and Statistics (1)-(6)

[Tables not reproduced here. Source: Arnold O. Allen, Probability, Statistics, and Queueing Theory with Computer Science Applications, Academic Press, 1990.]