1
Data Analysis and Uncertainty
Instructor: Sargur N. Srihari
University at Buffalo The State University of New York
srihari@cedar.buffalo.edu
Srihari
Data Analysis and Uncertainty Instructor: Sargur N. Srihari - - PowerPoint PPT Presentation
Data Analysis and Uncertainty Instructor: Sargur N. Srihari University at Buffalo The State University of New York srihari@cedar.buffalo.edu 1 Srihari Topics 1. Introduction 2. Dealing with Uncertainty 3. Random Variables and Their
1
srihari@cedar.buffalo.edu
Srihari
Srihari 2
Uncertain about extent to which samples differ from each other
Srihari 3
4
Lack theoretical backbone and the wide acceptance of probability
identical situations
– An idealization since all customers are not identical
any parameters estimated from the data
Srihari 5
– Domain is integers
– Domain is set of positive real numbers
Srihari
Srihari 7
p(x1,..,xd)
p(x1)=∫∫ p(x1,x2,x3)dx2dx3
Srihari 8
is denoted f(x1|x2) and defined as
Srihari 9
p(x1 | x2) = p(x1,x2) p(x2)
Product A Product B Customer 1 1 Customer 2 1 1 Customer n=100,000 Total nA=10,000 nB=5000
Srihari 10
Probability that randomly selected customer bought A is nA/n=0.1 Probability that randomly selected customer bought B is nB/n=0.05 nAB= those who bought both A and B=10 P(B=1|A=1)=10/10,000=0.001 Probability of customer buying B reduces from 0.05 to 0.001 if we know customer bought product A
purchasing item B?
Srihari
is unaffected by whether or not butter was purchased once we know bread was purchased
Z Y X
conditionally dependent given Z
unconditional
larger samples (old B, young A) dominate
Srihari 15
A B Old 2/10 30/90 Young 48/90 10/10 A B Total 50/10 40/100
past values given the current value in the sequence
Srihari 16
f (x1,...,xn) = f (x1) f (x j | x j−1
j= 2 n
∏
)
Srihari 17
and small X with small Y
Srihari 18
correlated but causally linked only by a third variable: smoking
negatively correlated
Srihari 19
lower rates
more cases
factor
Srihari 20
Srihari 21
22
Generative Model of data allows data to be generated from the model Inference allows making statements about data
Srihari 23
i=1 n
Srihari 24
Srihari 25
Bias(θ) = E[θ
∧
]−θ
ˆ θ
Var(θ
∧
) = E[θ
∧
− E[θ
∧
]]2
Srihari 26
E[(θ
∧
−θ)2] = E[(θ
∧
− E[θ
∧
]+ E[θ
∧
]−θ)2] = (E[θ
∧
]−θ)2 + E[(θ
∧
− E[θ
∧
])2] = (Bias(θ
∧
))2 + Var(θ
∧
)
Srihari
L(θ | D) = L(θ | x(1),..., x(n)) = p(x(1),..., x(n) |θ) = f (x(i) |θ)
i=1 n
Srihari 28
l(θ | x(1),...,x(n)) = − n 2 log2π − 1 2 (x(i) −θ)2
i=1 n
Srihari 29
Binomial distribution
r=7 r milk purchases out of n customers θ is the probability that milk is purchased by random customer
Srihari 30
Normal distribution
Estimate unknown mean θ Histogram of 20 data points drawn from zero mean, unit variance Likelihood function Log-Likelihood function
Srihari 31
Histogram of 200 data points drawn from zero mean, unit variance Likelihood function Log-Likelihood function
Srihari 32
p(θ | D) = p(D |θ)p(θ) p(D) = p(D |θ)p(θ) p(D |ϕ)p(ϕ)dϕ
ϕ
Srihari 33
λ = L(θ0 | D) supϕ L(ϕ | D) X 2 = (Ek − Ok)2 Ek
k=1,t
Srihari 34
p(Hi | x) ∝ p(x | Hi)p(Hi) p(H0 | x) p(H1 | x) ∝ p(H0) p(H1) • p(x | H0) p(x | H1)
Srihari 35
p(θ | D) ∝ p(D |θ)p(θ) p(θ) ∝θα−1(1−θ)β −1 L(θ | D) = θ r(1−θ)n−r
p(θ | D) ∝ p(D |θ)p(θ) = θ r(1−θ)n−rθα−1(1−θ)β −1 = θ r+α−1(1−θ)n−r+β −1 p(x(n +1) | D) = p(x(n +1),θ | D)dθ
∫
= p(x(n +1) |
∫
θ)p(θ | D)dθ p(θ | D
1,D2) ∝ p(D2 |θ)p(D 1 |θ)p(θ)
I(θ | x) = −E[∂2 logL(θ | x) ∂θ 2 ] p(θ) ∝ I(θ | x)
Srihari 38
σ 2 n (1− n N ) (x(i) − x)/(n −1)
∑
Nk x k N
∑
1 N 2 Nk
2 var(x k)
∑
Srihari 39
xk / nk
∑ ∑
1− f ( nk
∑
)2 a 1− a ( sk
2
∑
+ r2 nk
2
∑
− 2r sk
∑
nk)
Srihari 40