- G. Cauwenberghs
520.776 Learning on Silicon
Model-Free Stochastic Perturbative Adaptation and Optimization
Gert Cauwenberghs
Johns Hopkins University gert@jhu.edu 520.776 Learning on Silicon
http://bach.ece.jhu.edu/gert/courses/776
Model-Free Stochastic Perturbative Adaptation and Optimization Gert - - PowerPoint PPT Presentation
Model-Free Stochastic Perturbative Adaptation and Optimization Gert Cauwenberghs Johns Hopkins University gert@jhu.edu 520.776 Learning on Silicon http://bach.ece.jhu.edu/gert/courses/776 G. Cauwenberghs 520.776 Learning on Silicon
520.776 Learning on Silicon
http://bach.ece.jhu.edu/gert/courses/776
520.776 Learning on Silicon
– Model Complexity – Compensation of Analog VLSI Mismatch
– Algorithmic Properties – Mixed-Signal Architecture – VLSI Implementation
– Learning of Continuous-Time Dynamics – Reinforcement Learning
– AdOpt VLSI Controller – Adaptive Optics “Quality” Metrics – Applications to Laser Communication and Imaging
520.776 Learning on Silicon
520.776 Learning on Silicon
REFERENCE
p i INPUTS OUTPUTS
SYSTEM
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon
2 ε(p(k )+π(k )) – ε(p(k )–π(k ))
(k ) = ±σ ; E(πi (k )πj (l)) ≈ σ2 δijδkl
520.776 Learning on Silicon
GLOBAL
NETWORK
pi(t) + φ(t) πi(t) pi(t)
z–1
πi(t) –η ε(t)
^
φ(t) LOCAL φ(t) –η ε(t)
^
2 ε(p(k )+π(k )) – ε(p(k )–π(k ))
520.776 Learning on Silicon
POL
^
520.776 Learning on Silicon
0.1 0.2 0.3 0.4 0.5 0.6 10 10 10 10 10 10
Gate Voltage Vbn (V) Voltage Decrement ²V stored (V)
∆t = 40 msec 1 msec 23 µsec ∆t = 0
(a)
0.1 0.2 0.3 0.4 0.5 0.6 10 10 10 10 10 10
Gate Voltage Vbp (V) Voltage Increment ²V stored (V)
∆t = 40 msec 1 msec 23 µsec
(b)
C V stored Iadapt ∆Qadapt EN p EN n POL V bp V bn
520.776 Learning on Silicon
Iref
+
π1
H
π2
H
π3
H
π4
H
π5
H
π6
H
π0
H
π6
V
π5
V
π4
V
π3
V
π2
V
π1
V
UPDATE ACTIVATION AND PROBE MULTIPLEXING
Q(.) Q(.) Q(.) Q(.) Q(.) Q(.)
BINARY QUANTIZATION
x1(t) x1
T(t)
x2(t) x2
T(t)
x1 x2 x3 x4 x5 x6 Iref
–
W 11 W 12 W 13 W 14 W 15 W 16 W 21 W 31 W 41 W 51 W 61 W 22 W 66 W 56 W 65
θ1 θ2 θ3 θ4 θ5 θ6 W off W off W off W off W off W off
TEACHER FORCING
dx dt = (
)
F p, x, y z =
( )
G x x(t) y(t) z(t) DYNAMICAL SYSTEM
520.776 Learning on Silicon
ADAPTIVE CRITIC SYSTEM
pi INPUTS OUTPUTS
520.776 Learning on Silicon
520.776 Learning on Silicon
SELhor SELvert Vδ Vbp UPD UPD Vbn
Vαp Vbn SELhor
Vbn SELhor
UPD UPD Vbp Vbp Vαp Vαn
LOCK LOCK
HYST HYST
Adaptive Critic Action Network Neuron Select (State) State Eligibility
u(t) y = –1 y = 1 y(t) x1(t) x2(t)
State (Quantized) Action (Binary) 64 Reinforcement Learning Neurons
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon
nonlinear filter; etc.)
520.776 Learning on Silicon
Wavefront corrector with N elements: u1 ,…,un ,…,uN Incoming wavefront
520.776 Learning on Silicon
image
J(u) wavefront corrector performance metric sensor
AdOpt VLSI wavefront controller
520.776 Learning on Silicon
image
J(u+δu) J(u) wavefront corrector performance metric sensor
AdOpt VLSI wavefront controller
520.776 Learning on Silicon
image
J(u+δu)
wavefront corrector performance metric sensor AdOpt VLSI wavefront controller J(u) δJ J(u-δu)
(k+1) = uj (k) + γ δJ(k) δuj (k)
520.776 Learning on Silicon
z–1
( ) ( )
( ) ( )
(k) (k) (k) (k) (k) 1 N 1 N k k k k 1 N 1 N
520.776 Learning on Silicon
z–1
analog digital
520.776 Learning on Silicon
Edwards, Cohen, Cauwenberghs, Vorontsov & Carhart (1999)
( )
( ) ( )
( )
k j k k k j j j (k) (k) (k) (k 1) (k) (k) (k) j j j
+
520.776 Learning on Silicon
520.776 Learning on Silicon
HEX-127 LC SLM
520.776 Learning on Silicon
520.776 Learning on Silicon
Weyrauch, Vorontsov, Bifano, Hammer, Cohen & Cauwenberghs
520.776 Learning on Silicon
520.776 Learning on Silicon
2
υ
r
Horn(1968), Delbruck (1999)
Muller et al.(1974) Vorontsov et al.(1996)
520.776 Learning on Silicon
Cohen, Cauwenberghs, Vorontsov & Carhart (2001)
N,M N,M i,j i,j i,j i,j
ij
ij
520.776 Learning on Silicon
+4
520.776 Learning on Silicon
520.776 Learning on Silicon
uniformly illuminated dynamic range ~ 20
520.776 Learning on Silicon
520.776 Learning on Silicon
Cohen, Cauwenberghs, Vorontsov & Carhart (2001)
2 N,M N,M 2 i,j i,j i,j i,j
70µm
perimeter of dummy pixels
520.776 Learning on Silicon
( )
N,M 1+κ 2 κ u i,j u i,j BVM 2 N,M i,j i,j
520.776 Learning on Silicon
520.776 Learning on Silicon
dynamic range ~ 5
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon
520.776 Learning on Silicon