On-line estimation with the multivariate Gaussian distribution
Sanjoy Dasgupta and Daniel Hsu UC San Diego
On-line estimation with the multivariate Gaussian distribution
- Nr. 1
On-line estimation with the multivariate Gaussian distribution - - PowerPoint PPT Presentation
On-line estimation with the multivariate Gaussian distribution Sanjoy Dasgupta and Daniel Hsu UC San Diego On-line estimation with the multivariate Gaussian distribution Nr. 1 Outline 1. On-line density estimation and previous work 2.
T
T = inf θ∈Θ T
T
t ) in trial t
T ≥1
T = limσ2→0 T 2 ln σ2 = −∞.
T > −∞ for all T. (Alternative: compare to best-in-hindsight
1) = (µ0, σ2 0) = (0, s2)
t
t+1 =
t
i
t+1
t = 1/t;
T ) = Ω(T).
2
0.5 1 1.5 2 2.5 x 10
4
10 20 30 40 50 60 t Rt
T
t
t ≥ s2/t (where s2 is trial-zero variance), have trivial bound of O(T 2).
t = 1
t−1
T
k=1 ∆k
k=1 ∆k
t → 0: regret can be arbitrarily
t > 0, then there is some T0 such that for all T > T0,
T
t → 0, average regret tends to zero.
T ≥1
1) ∈ R × R>0; then
t+1) = arg min (µ,σ2)
1 ∆((µ2, σ2), (µ1, σ2 1)) + t
1
1
T
t ) − T
1
1))
T +1 ∆((µ, σ2), (µT +1, σ2 T +1))
T
t+1 ∆((µt, σ2 t ), (µt+1, σ2 t+1))
t
1
T
t+1 ∆((µt, σ2 t ), (µt+1, σ2 t+1))
T
t
1
1
t
1
1
T
i=1 ai
t=1 at
T
t−1
2
T
△
T
t ) − inf (µ,σ2) T
(µ,σ2) T
(µ,σ2)
T
T > −∞
T
T =
T
t
T ≥