- weighted
- Sleep
Xiong yi
Zhang
Jered McInerney
weighted Methods Importance 20 : - Sleep Planning Inference - - PowerPoint PPT Presentation
Lecture weighted Methods Importance 20 : - Sleep Planning Inference Wake as - , Scribes Xiong yi Zhang : McInerney Jered Auto Variational Summary encoders : Use Idea neural to parameterize I netwonus : model , 2- ) (
Xiong yi
Zhang
Jered McInerneySummary
Idea I : Use neural netwonus to parameterize a deep generative model pg ( x , 2- ) Idea 3+4 : Use auto encoding variational inference to train an inference model go, ( Z Ix ) that approximates the posterior pg CZIX ) Idea 2 : Use samples from go , ( 7 K ) to approximate the gradient To log pok ) Idea 5 : Use me parameterization to computeDeff
Any
method that produces ' a valid importance weight defines a lower bound£1943
To
L ( O , G) =ftp.#qopaxil0ologpocx.ZD
) (Always the some )Tq L ( O
, 4) =ftp.#qopxxif9ologqq1zixsflegwlx,zstb
)) (REINFORCE
)Tg
L I 0,10) = Epa, # pce , [ Og log w ( x , 2- aces) )) ( Repwaweteiud )£194
ftp.#qzixsleogwDsEf..llogpdxyw=pofx.zy9ql7IX
) n↳
Wakelower
band with upper boundlog
poles ^1-
Planning as inference ! Replace target pot x. 7) with a density exp [ RIZ ) T pgcz )£194 )
II.
leg
wo.aceb.h.xbsxbnlln.it/sxYiii)qbih~pCE
) ImportanceI
E Wh U I £ ( 99 ) =Epa ,
# go , , likeflog
I )Eof
log E)slog
Egli )leg ↳ £
. Wo ,¢lEb' h ,xb )) TLDR : Move Sum¥
E Wh U I L ( 0,0) =Epa ,
# go , cziikix , [log I ) = is E. leg ( II. PH3z.net#qf2-oCEb.h)/Xb )) Note : Any unbiased estimation E [ I ] = po Cx ) defines a lower boundEp
Egl by
I ) sEpl log EGEE ) )
= Ep flog pok ))Epa ,
# punk , flog 'EEEpix,
# pceiikgf.gg#If0o.$wgfEk ) ) =Epix .
# peeingI
f
wqY.l.IT?7.,0o.oibgw..pld.xY
9 SelfMinimizing
by
minimizing inclusive KL rather than exclusive KLEpa , # p.com/
leg
]
Note : Mt " 9 ) instead!
])
= EpI log
Poet ) ] tEpc ,→ I
KL ( pottksllqotfks)) 7, EpI log
Po CA ) Making UH ) smaller also reduces the KL divergenceMinimizing
an Upper Bound Idea ; Make go , Cz it ) similar to poczlx )by
minimizing inclusive KL rather than exclusive KL UH =Epa , # p.com/bgPgIYIIT
, ]I
No dependence,U4
) = # peal # pg*× , IDeploy
94171×3 )T
Need samplesfrom
potfk
) ( can use any Monte Carlo method )Minimizing
by
minimizing inclusive KL rather than exclusive KL,U4
) = # peal # pg*× , IDeploy
94171×3 )ftp.xsf#g.czxs/P:YIT
, Oo ,bgqattixs ))
I§
?
, , Woio Czbih , ×b , EE . Wo,¢Czb4×b ,4 leg 991744×6 )
[ Sometimes referred to as " reweighed " wakeReweighed
Wakeself
qgzlxb )
b , h To Epa , I log poem ) =LEE IT
me topocxb.tk
)Epcxilkllpottix
) " 9ohm )) =from
the generative model and computegradient
( often
shipped )ftp.cnn/logPg::ITIl/=fE4bggo,czbixbs
b