A-NICE-MC
Adversarial Training for MCMC
Jiaming Song Shengjia Zhao Stefano Ermon March 7, 2018
Stanford University
A-NICE-MC Jiaming Song 1. Motivation 2. Notations and Problem - - PowerPoint PPT Presentation
Adversarial Training for MCMC Shengjia Zhao Stefano Ermon March 7, 2018 Stanford University A-NICE-MC Jiaming Song 1. Motivation 2. Notations and Problem Setup 3. Adversarial Training for Markov Chains 4. Adversarial Training for MCMC 5.
Stanford University
1
2
3
4
5
6
t=0 is drawn through
θ: state distribution at time t.
7
θ}∞ t=0 converges quickly (minimize t such that
θ − pd| < δ).
t=0 should be as
t=0). 8
9
10
θ(x) at time t is also intractable
11
G
D
G
D
12
13
14
θ
D
x∼πb
θ[D(¯
x∼Tm
θ (¯
x|xd)[D(¯
θ (x|xd) denotes the distribution of x when the transition kernel
15
θ
D
x∼πb
θ[D(¯
x∼Tm
θ (¯
x|xd)[D(¯
16
n as the probability distribution at time step t for the n-th
n}∞ n=1 converges to pd in total
m − pd∥TV< ϵ, then
m
m − pd∥TV ;
n=1 converges to
17
n − pd∥TV< δ.
n − pd∥TV< ϵ (Assumption 1).
n − pd∥TV< δ (Assumption 2).
n=1 converges to pd in total variation. 18
19
20
21
22
θ}∞ t=0 converges quickly. (need reasonable
t=0 should be as
23
24
25
26
∂x
27
∂h | = |det ∂f−1(x) ∂x
θ ! 28
θ ,
θ (x, v).
29
30
f f −1 High “high” acceptance “low” acceptance
U(x, v)
Low
U(x, v)
θ . Outside the high probability regions fθ will guide x towards pd(x),
θ . Inside high probability regions both
31
t=0 should be as
i=1 xi/N] be the variance of the mean estimate
1 ) is the number of independent samples from p(x)
j=1 xj/M] = V 32
t=0 should be as
1 ) =
s=1 (1 − s N)ρs
33
θ.
θ (·|x1)
34
i=1 through MCMC with gθ as proposal
35
4 2 2 4 4 3 2 1 1 2 3 4 6 4 2 2 4 6 6 4 2 2 4 6 6 4 2 2 4 6 6 4 2 2 4 6 6 4 2 2 4 6 6 4 2 2 4 6
36
37
38
39
1 + x2 2]
1 + x2 2]
40
41
42
43
44
44
fc 400, relu fc 400, relu fc 400, relu sum sum sum
v ∼ N(0, I)
identity identity identity
fc 400, relu fc 400, relu fc 400, relu sum sum sum
v ∼ N(0, I)
identity identity identity fc 400, relu