> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Probabilistic Fitting
Marcel Lüthi, University of Basel
Slides based on presentation by Sandro Schönborn
1
Probabilistic Fitting Marcel Lthi, University of Basel Slides - - PowerPoint PPT Presentation
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL Probabilistic Fitting Marcel Lthi, University of Basel Slides based on presentation by Sandro Schnborn 1 > DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Marcel Lüthi, University of Basel
Slides based on presentation by Sandro Schönborn
1
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
where frequentists interpretations are not applicable! Gallileo’s view on Saturn
sharp.
Image credit: McElrath, Statistical Rethinking: Figure 1.12
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Bu But t th the e patien tient t eith either has a cavi vity or
All these statements do not contradict each other, they summarize the dentist’s knowledge about the patient
6
𝑄 cavity = 0.1 𝑄 cavity toothache) = 0.8 𝑄 cavity toothache, gum problems) = 0.4
AIMA: Russell & Norvig, Artificial Intelligence. A Modern Approach, 3rd edition,
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
7
Subjectivity: There is no single, real underlying distribution. A probability distribution expresses our knowledge – It is different in different situations and for different observers since they have different knowledge.
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Marginal
Distribution of certain points only
Conditional
Distribution of points conditioned on known values of others
Probabilistic model: joint distribution of points
𝑄 𝑦1|𝑦2 = 𝑄 𝑦1, 𝑦2 𝑄 𝑦2 𝑄 𝑦1 =
𝑦2
𝑄(𝑦1, 𝑦2)
Product rule: 𝑄 𝑦1, 𝑦2 = 𝑞 𝑦1 𝑦2 𝑞(𝑦2)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
e.g. points on chin when nose is queried
𝑄(𝑌) =
𝐼
𝑄(𝑌, 𝐼)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Model Face distribution Ob Observ rvatio ion Concrete points Possibly uncertain Pos
Face distribution consistent with observation Prior belief More knowledge Posterior belief
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝑌|𝑦1 … 𝑦𝑂 = 𝑄 𝑌, 𝑦1, … , 𝑦𝑂 𝑄 𝑦1, … , 𝑦𝑂
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝑌 → 𝑄 𝑌 𝑦1 … 𝑦𝑂
𝑄 𝑌, 𝑦1, … , 𝑦𝑂 = 𝑄 𝑦1, … , 𝑦𝑂|𝑌 𝑄 𝑌
𝑄 𝑌|𝑦1 … 𝑦𝑂 = 𝑄 𝑌, 𝑦1, … , 𝑦𝑂 𝑄 𝑦1, … , 𝑦𝑂 = 𝑄 𝑦1, … , 𝑦𝑂|𝑌 𝑄 𝑌 𝑄 𝑦1, … , 𝑦𝑂
𝑄 𝑅|𝐹 = 𝑄 𝑅, 𝐹 𝑄 𝐹 = 𝑄 𝐹|𝑅 𝑄 𝑅 𝑄 𝐹
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Model needs to describe how observations are distributed with joint distribution 𝑄 𝑅, 𝐹
But joint distribution is more complex
𝑄 𝑅, 𝐹 = 𝑄 𝐹|𝑅 𝑄 𝑅
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝑅, 𝐹 = 𝑄 𝐹|𝑅 𝑄 𝑅
Prio rior Lik Likelih ihood Join Joint
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝑅 → 𝑄 𝑅 𝐹 → 𝑄 𝑅 𝐹, 𝐺 → 𝑄 𝑅 𝐹, 𝐺, 𝐻 → ⋯
Prio rior Lik Likelih ihood Pos
Mar argin inal l Lik Likelih ihood
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝜄|𝐸 = 𝑄 𝐸|𝜄 𝑄 𝜄 𝑄 𝐸 = 𝑄 𝐸|𝜄 𝑄 𝜄 ∫ 𝑄 𝐸|𝜄 𝑄 𝜄 𝑒𝜄 𝑄 𝜄|𝐸 ∝ 𝑄 𝐸|𝜄 𝑄 𝜄
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
approach?
distribution?
20
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Post sterio ior Dis istributio ion 𝑞(θ|image) = 𝑞 𝜄 𝑞(image|𝜄)
𝑞 image
MAP Solution 𝛽∗ = arg max
𝜄
𝑞 𝜄 𝑞(image|𝜄) Local Maxima
We need approximate inference!
Infeasible to compute: p(image)= ∫ 𝑞 𝜄 𝑞 image 𝜄 𝑒𝜄
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Variati tional meth thods
arg max
𝑟
KL(𝑟(𝜄)|𝑞(𝜄|𝐸))
Samplin ing methods
KL: Kullback- Leibler divergence
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝐹 𝑔 𝑦 = න 𝑔 𝑦 𝑞 𝑦 𝑒𝑦 𝐹 𝑔 𝑦 ≈ መ 𝑔 = 1 𝑂
𝑗 𝑂
𝑔 𝑦𝑗 , 𝑦𝑗 ~ 𝑞 𝑦
𝑊 መ 𝑔 ~ 𝑃 1 𝑂
This is s dif diffic icult!
24
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
unnormalized, expensive point-wise evaluation
25
Random.nextDouble() Random.nextGaussian()
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Design a Markov Chain such that samples 𝑦 obey the target distribution 𝑞 Concept: “Use an already existing sample to produce the next one”
26
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
1. Draw a sample 𝒚′ from 𝑅(𝒚′|𝒚) (“proposal”) 2. With probability 𝛽 = min
𝑄 𝒚′ 𝑄 𝒚 , 1
accept 𝒚′ as new state 𝒚 3. Emit current state 𝒚 as sample
28
Requirements:
Result:
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝒚 =
1 2𝜌 Σ 𝑓−1
2 𝒚−𝝂 𝑈Σ−1(𝒚−𝝂)
𝑅 𝒚′ 𝒚 = 𝒪(𝒚′|𝒚, 𝜏2𝐽2)
29
Random walk Ƹ 𝜈 = 1.56 1.68 Σ = 1.09 0.63 0.63 1.07 𝜈 = 1.5 1.5 Σ = 1.25 0.75 0.75 1.25 Sampled Estimate Target
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
30
𝜏 = 0.2 𝜏 = 1.0
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
1. Draw a sample 𝒚′ from 𝑅(𝒚′|𝒚) (“proposal”) 2. With probability 𝛽 = min
𝑄 𝑦′ 𝑄 𝑦 𝑅 𝑦|𝑦′ 𝑅 𝑦′|𝑦 , 1 accept 𝒚′ as new state 𝒚
3. Emit current state 𝒚 as sample
31
𝑅 𝒚′ 𝒚 ≠ 𝑅 𝒚 𝒚′ 𝑅 𝒚′ 𝒚 > 0 ⇔ 𝑅 𝒚 𝒚′ > 0
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Unbiased but correlated (not i.i.d.)
Algorithm only considers ratios 𝑄(𝑦′)/𝑄(𝑦)
Dependent Proposals ls: 𝑅 𝑦′ 𝑦 depends on current sample 𝑦
Algorithm adapts to target with simple 1-step memory
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Proposal should match target to avoid too many rejections
and too small stepping
33
2006
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
𝑄 𝑦′ 𝑄 𝑦 𝑅 𝑦|𝑦′ 𝑅 𝑦′|𝑦 , 1
34
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
unnormalized distributions?
can the MH-Algorithm avoid getting stuck in local optima?
algorithm.
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Type into the codepane: goto(“http://shapemodelling.cs.unibas.ch/exercises/Exercise15.html”) Scalismo 0.16: Check examples in https://github.com/unibas-gravis/pmm2018
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
3D Alignment with Shape and Pose
38
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
39
right.eye.corner_outer left.eye.corner_outer right.lips.corner left.lips.corner
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Pose, Positioning in space
1, … , 𝑚𝑈 𝑜
1, … , 𝑚𝑆 𝑜
𝑄 𝜄 𝑚𝑈
1, … , 𝑚𝑈 𝑜 ∝ 𝑞 𝑚𝑈 1, … , 𝑚𝑈 𝑆|𝜄 𝑄(𝜄)
Parameters 𝜄 = (𝛽, 𝜒, 𝜔, 𝜘, 𝑢) Shape transformation 𝜒𝑡 𝛽 = 𝜈 𝑦 +
𝑗=1 𝑠
𝛽𝑗 𝜇𝑗𝛸𝑗(𝑦) Rigid transformation
𝜒𝑆 𝜒, 𝜔, 𝜘, 𝑢 = 𝑆𝜘𝑆𝜔𝑆𝜒 𝒚 + 𝑢
Full transformation 𝜒 𝜄 (𝑦) = (𝜒𝑆∘ 𝜒𝑇)[𝜄](𝑦)
40
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
"𝑅 𝜄′|𝜄 = 𝑂(𝜄′|𝜄, Σ𝜄)"
𝑂(𝜷′|𝜷, 𝜏𝑇
2𝐽𝑛× 𝑛 )
𝑂 𝜒′ 𝜒, 𝜏𝜒
2 , 𝑂 𝜔′ 𝜔, 𝜏𝜔 2 , 𝑂 𝜘′ 𝜘, 𝜏𝜘 2
𝑂 𝒖′ 𝒖, 𝜏𝑢
2𝐽3×3
𝑅 𝜄′|𝜄 = ∑𝑑𝑗𝑅𝑗(𝜄′|𝜄)
41
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
Simple models: In Independent Gau Gaussians Observation of 𝑀 landmark locations 𝑚𝑈
𝑗 in image
𝑞 𝑚𝑈 𝜄, 𝑚𝑆 = 𝑂 𝜒 𝜄 𝑚𝑆 , 𝐽3×3𝜏2
𝑞 𝑚𝑈
1, … , 𝑚𝑈 𝑜|𝜄 = ෑ 𝑗=1 𝑀
𝑞𝑗 𝑚𝑈
𝑗 |𝜄
42
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
final posterior?
43
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
44
Ƹ 𝜈yaw = 0.511 ො 𝜏yaw = 0.073 (4°) Ƹ 𝜈tx = −1 mm ො 𝜏tx = 4 mm Ƹ 𝜈𝛽1 = 0.4 ො 𝜏𝛽1 = 0.6 (Estimation from samples)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
45
Posterior values (log, unnormalized!)
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
46
Ƹ 𝜈yaw = 0.50 ො 𝜏yaw = 0.041 (2.4°) Ƹ 𝜈tx = −2 mm ො 𝜏tx = 0.8 mm Ƹ 𝜈𝛽1 = 1.5 ො 𝜏𝛽1 = 0.35
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
47
Ƹ 𝜈yaw = 0.49 ො 𝜏yaw = 0.11 (7°) Ƹ 𝜈tx = −5 mm ො 𝜏tx = 10 mm Ƹ 𝜈𝛽1 = 0 ො 𝜏𝛽1 = 0.6
> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2018 | BASEL
48