Statistical Neurodynamics
- f Deep Networks
Statistical Neurodynamics of Deep Networks Shun ichi Amari RIKEN - - PowerPoint PPT Presentation
Statistical Neurodynamics of Deep Networks Shun ichi Amari RIKEN Brain Science Institute Statistical Neurodynamics Rozonoer (1969 Amari (1971; 197 Amari et al (2013) Toyoizumi et al (2015) Poole, , Ganguli (2016) ~ (0, 1) w N ij
Rozonoer (1969) Amari (1971; 197 Amari et al (2013) Toyoizumi et al (2015) Poole, …, Ganguli (2016)
ij
2 1 1
i l l l l
1 2 1
( ) 1 ( )
i ij i i l l i l l l l l
x w x w A x n A F A
2
~ (0, 1/ ) 0, 1 '(0) =const
ij i
w N n w
2
l a b ab l
ab a b l
Poole et al (2016) Deep neural networks
2 2 1 2
( ) ( ) ~ (0, ) 1 ( ) [ ( ) ] ( ) ( ) ( ) ~ (0,1)
k k l
y w y u u N A A y E u A n A Av Dv v N
i
2 2 2 1
( ) ( '( ) ) E[ '( )) ] E[ '( )) ]E[ ] mean field approximation ( ) '( )
k k a a k k ab k j kj k j k j
dy B dy B B B u w g B B g u w w u w w A Av Dv
e e
1 1 1 1
ab ab ab l l ab ab
2 2
ab a b a b a b a b ab ab ab ab ab
2 2 2 2 1 2 1 1 1 2 2 1
l l l l l ab ab ab l l ab ab
2
1 ( , ') ( ') 1 ( , ') ' ' ' 2 ~N(0, V) ' ' V= ( ') E[ (
i i i i k k k k
D x x x x n C x x x x x x n D A A C u w y u w y A C C A C A
) ( ' )] C C A C C
1 1
l l
Poole et al (2016) Deep neural networks
l l
Multilayer Perceptrons
i i
y v
w x
,
i i
f v
x w x
1 2
( , ,..., )
n
x x x x
1 1
( ,..., ; ,..., )
m m
w w v v
1
w x
y
Multilayer Perceptron
1 1,
, , ; ,
i i m m
y f v v v
x θ w x θ w w
neuromanifold
( ) x
space of functions S
Geometry of singular model
W
1
, , :
t t t t
G y G l l
Fisher Natural Gradient Stochastic Descent Information Matrix invarint; steepest descent x
2
1 1 2 2 2
, , 1 2
t u
f w w y f u e dt
x J x J x x
1 2 1 2 1 2 2 1 2 1 1 1 2 2
, , 0, , , 0, , R w w w w w w w w w w f w w J J J J J J J J x J x J x
1 1 2 2 1 2 1 2 2 1 2 1 1 2
, , , , , , w w w w w w w w w z w w w z J J v u J J v u
Milnor attractor
2 2 1 2 3 2
blow-down coordinates , , 1 , 1 , , 1
n
c z u u c z z u S : = e u u e e u
Dynamic vector fields: Redundant case