Mathematical understanding of violation of detailed balance - - PowerPoint PPT Presentation

mathematical understanding of violation of detailed
SMART_READER_LITE
LIVE PREVIEW

Mathematical understanding of violation of detailed balance - - PowerPoint PPT Presentation

. Mathematical understanding of violation of detailed balance condition and its application to Langevin dynamics . . . Masayuki Ohzeki 1 Akihisa Ichiki 2 1 Department of Systems Science, Kyoto University 2 Green Mobility Collaborative Research


slide-1
SLIDE 1

. . . . . .

. . . .

Mathematical understanding

  • f violation of detailed balance condition

and its application to Langevin dynamics

Masayuki Ohzeki1 Akihisa Ichiki 2

1Department of Systems Science, Kyoto University 2Green Mobility Collaborative Research Center, Nagoya University

2014/12/01 References: Phys. Rev. E 88, 020101(R) (2013) and cond-mat/1307.0434

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 1 / 23

slide-2
SLIDE 2

. . . . . .

.

Self Introduction

. . . . . Ohzeki Masayuki (大関 真之 in Chinese characters) Meaning of the family name “Ohzeki” A famous company of Sake The second grade of Sumo wrestler Just below “Yokozuna” in the ranking. Until the Yokozuna rank was introduced, Ohzeki was the highest rank attainable. Great guard in front of castle My ancestor was indeed a strong samurai But!! He attacked to Edo castle! In order to make revolution!

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 2 / 23

slide-3
SLIDE 3

. . . . . .

. . .

1

Review of sampling of the desired distribution Aim Markov-Chain Monte-Carlo and detailed balance condition Violation of DBC . . .

2

Langevin dynamics Langevin equation and its corresponding Fokker-Planck equation Violation of DBC: introducing an additional force Example: washboard Example: XY model . . .

3

Mathematical assurance Rough sketch of proof of accelerated relaxation Asymmetric matrix/operator . . .

4

Conclusion

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 3 / 23

slide-4
SLIDE 4

. . . . . .

.

Aim

. . . . . We would like to generate the desired distribution in order to investigate the property of the equilibrium of many-body system How to generate? Markov-Chain Monte-Carlo method (MCMC) Mainly for discrete and continuous variables and discrete time Langevin dynamics Mainly for continuous variables and continuous time We demand faster convergence to a desired distribution.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 4 / 23

slide-5
SLIDE 5

. . . . . .

.

Markov-Chain Monte-Carlo method

. . . . . We generate a sequence of the stochastic dynamics following the master equation. Pt+1(x) = ∑

y

P(x|y)Pt(y), (1) where Pt(x) is an instantaneous distribution. We use the transition matrix P(x|y) to imitate the stochastic dynamics: P(xt|xt−1)P(xt−1|xt−2) · · · P(x2|x1)P(x1) → P(ss)(xt) (2) After a sufficient iteration, we obtain the desired distribution.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 5 / 23

slide-6
SLIDE 6

. . . . . .

.

Detailed balance condition

. . . . . In order to obtain desired distribution, we often offer that the transition matrix satisfies P(y|x) P(x|y) = P(eq)(y) P(eq)(x), (3) where P(eq)(x), namely exp(−βE(x) − βF). .

Flexibility

. . . . . If we design the energy function E(x), we generate the desired distribution. .

Why use DBC?

. . . . . It is simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 6 / 23

slide-7
SLIDE 7

. . . . . .

.

Detailed balance condition

. . . . . In order to obtain desired distribution, we often offer that the transition matrix satisfies P(y|x) P(x|y) = P(eq)(y) P(eq)(x), (3) where P(eq)(x), namely exp(−βE(x) − βF). .

Flexibility

. . . . . If we design the energy function E(x), we generate the desired distribution. .

Why use DBC?

. . . . . It is simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 6 / 23

slide-8
SLIDE 8

. . . . . .

.

Detailed balance condition

. . . . . In order to obtain desired distribution, we often offer that the transition matrix satisfies P(y|x) P(x|y) = P(eq)(y) P(eq)(x), (3) where P(eq)(x), namely exp(−βE(x) − βF). .

Flexibility

. . . . . If we design the energy function E(x), we generate the desired distribution. .

Why use DBC?

. . . . . It is simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 6 / 23

slide-9
SLIDE 9

. . . . . .

.

Balanced condition

. . . . . In order to assure to generate the desired distribution in the steady state, we must demand the balanced condition. ∑

x

P(y|x)P(ss)(x) = P(ss)(y) (4) .

Flexibility

. . . . . If we set P(ss)(x) ∝ exp(−βE(x)), we can offer the desired distribution. .

Why not use BC (Violation of DBC)?

. . . . . It is “not” simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 7 / 23

slide-10
SLIDE 10

. . . . . .

.

Balanced condition

. . . . . In order to assure to generate the desired distribution in the steady state, we must demand the balanced condition. ∑

x

P(y|x)P(ss)(x) = P(ss)(y) (4) .

Flexibility

. . . . . If we set P(ss)(x) ∝ exp(−βE(x)), we can offer the desired distribution. .

Why not use BC (Violation of DBC)?

. . . . . It is “not” simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 7 / 23

slide-11
SLIDE 11

. . . . . .

.

Balanced condition

. . . . . In order to assure to generate the desired distribution in the steady state, we must demand the balanced condition. ∑

x

P(y|x)P(ss)(x) = P(ss)(y) (4) .

Flexibility

. . . . . If we set P(ss)(x) ∝ exp(−βE(x)), we can offer the desired distribution. .

Why not use BC (Violation of DBC)?

. . . . . It is “not” simple to construct the transition matrix.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 7 / 23

slide-12
SLIDE 12

. . . . . .

.

Recent development of violation of DBC

. . . . . Several types without DBC are proposed. .

Skewed DBC (2011)

. . . . . DBC in the replicated system. We violate DBC for each individual system. The composite system does not violate BC. [K. S. Turitsyn, M. Chertkov and M. Vucelja: Physica D 240 (2011) 410.] [Sakai and Hukushima: JPSJ 82 (2013) 064003.]

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 8 / 23

slide-13
SLIDE 13

. . . . . .

.

Recent development of violation of DBC

. . . . . Several types without DBC are proposed. .

Suwa-Todo method (2010)

. . . . . A tricky algorithm based on BC. While satisfying BC, they allocate the weight neglecting DBC.

Metropolis heat bath present

100 101 102 103 0.86 0.9 0.94 0.98

τint

q = 4 q = 8 0.74 0.76 0.78

T

[H.Suwa and S.Todo: Phys. Rev. Lett. 105 (2010) 120603.]

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 8 / 23

slide-14
SLIDE 14

. . . . . .

Similarly, is there a modification of the Langevin dynamics, which can accelerate the convergence to the desired distribution?

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 9 / 23

slide-15
SLIDE 15

. . . . . .

.

Langevin equation

. . . . . The N-dimensional Langevin equation in an inverse temperature β = 1/T is dx = A(x, t)dt + √ 2 β dW (5) where A(x, t) is a force and W is the Wiener process. .

corresponding Fokker Planck equation

. . . . . The corresponding Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (6) where J(x, t) is the probability flow defined as J(x, t) = A(x, t)P(x, t) − TgradP(x, t) (7) The divergence is div = ∑

i ∂/∂xi and the gradient is [grad]i = ∂/∂xi.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 10 / 23

slide-16
SLIDE 16

. . . . . .

.

Langevin equation

. . . . . The N-dimensional Langevin equation in an inverse temperature β = 1/T is dx = A(x, t)dt + √ 2 β dW (5) where A(x, t) is a force and W is the Wiener process. .

corresponding Fokker Planck equation

. . . . . The corresponding Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (6) where J(x, t) is the probability flow defined as J(x, t) = A(x, t)P(x, t) − TgradP(x, t) (7) The divergence is div = ∑

i ∂/∂xi and the gradient is [grad]i = ∂/∂xi.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 10 / 23

slide-17
SLIDE 17

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (8) .

Equilibrium system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = 0. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) (9) No flow appears in the equilibrium state.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 11 / 23

slide-18
SLIDE 18

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (8) .

Equilibrium system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = 0. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) (9) No flow appears in the equilibrium state.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 11 / 23

slide-19
SLIDE 19

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (8) .

Equilibrium system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = 0. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) (9) No flow appears in the equilibrium state.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 11 / 23

slide-20
SLIDE 20

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (10) .

Nonequilibrium system as a lesson

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = γ1. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) + γ1 exp (βE(x)) (11) A constant flow appears in the steady state. However the force is unidirectional and exponentially large.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 12 / 23

slide-21
SLIDE 21

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (10) .

Nonequilibrium system as a lesson

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = γ1. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) + γ1 exp (βE(x)) (11) A constant flow appears in the steady state. However the force is unidirectional and exponentially large.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 12 / 23

slide-22
SLIDE 22

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (10) .

Nonequilibrium system as a lesson

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = γ1. .

Solution

. . . . . The force is given as A(x, t) = −gradE(x) + γ1 exp (βE(x)) (11) A constant flow appears in the steady state. However the force is unidirectional and exponentially large.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 12 / 23

slide-23
SLIDE 23

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (12) .

Nonequilibrium divergence-free system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = B(x, t)P(x, t). .

Solution

. . . . . When we define [B(x, t)]i = γ (∂E(x) ∂xi−1 − ∂E(x) ∂xi+1 ) , xN+1 = x1 and x0 = xN. (13) Then divJ(x, t) = 0 and [A(x)]i = −∂E(x)

∂xi

+ [B(x, t)]i. The force is free from the exponentially large term. (Similar to Suwa-Todo method??).

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 13 / 23

slide-24
SLIDE 24

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (12) .

Nonequilibrium divergence-free system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = B(x, t)P(x, t). .

Solution

. . . . . When we define [B(x, t)]i = γ (∂E(x) ∂xi−1 − ∂E(x) ∂xi+1 ) , xN+1 = x1 and x0 = xN. (13) Then divJ(x, t) = 0 and [A(x)]i = −∂E(x)

∂xi

+ [B(x, t)]i. The force is free from the exponentially large term. (Similar to Suwa-Todo method??).

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 13 / 23

slide-25
SLIDE 25

. . . . . .

.

Reference equation

. . . . . The Fokker-Planck equation is ∂P(x, t) ∂t = −divJ(x, t), (12) .

Nonequilibrium divergence-free system

. . . . . We impose P(x, t) ∝ exp(−βE(x)) and J(x, t) = B(x, t)P(x, t). .

Solution

. . . . . When we define [B(x, t)]i = γ (∂E(x) ∂xi−1 − ∂E(x) ∂xi+1 ) , xN+1 = x1 and x0 = xN. (13) Then divJ(x, t) = 0 and [A(x)]i = −∂E(x)

∂xi

+ [B(x, t)]i. The force is free from the exponentially large term. (Similar to Suwa-Todo method??).

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 13 / 23

slide-26
SLIDE 26

. . . . . .

.

Divergence free replicated system

. . . . . We consider a composite system as P(x1, x2, t). The Langevin equations are dxi = Ai(x1, x2, t) + √ 2 β dWi, (14) and the corresponding Fokker-Planck equations are ∂P(x1, x2, t) ∂t = −

2

i=1

diviJi(x1, x2, t), (15)

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 14 / 23

slide-27
SLIDE 27

. . . . . .

.

Additional bidirectional forces

. . . . . We impose the forces as A1(x1, x2, t) = −grad1E(x1) + γgrad2E(x2) (16) A2(x1, x2, t) = −grad2E(x2) − γgrad1E(x1) (17) Each flow is given by Ji(x1, x2, t) = ±γ ( gradjE(xj) ) P(x1, x2, t) (i = j) but ∑

i diviJi = 0 (In this sense, similar to the skewed DBC.)

The steady state is P(ss)(x1, x2) ∝ exp (−βE(x1) − βE(x2)). Implimentation is very simple. We remove exponentially large force.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 15 / 23

slide-28
SLIDE 28

. . . . . .

.

Example: washboard

. . . . . We set N = 10, 000 particles in the potential energy, which has a washboard shape characterized by a function, E(x) = 1 2

N

i=1

(1 + sin(2πxi)) (18) for xi ∈ [0, 10). at t = 2.5 in β = 10.

2 4 6 8 10

1500 3000

N(x)

x

2 4 6 8 10

1500 3000

2 4 6 8 10

1500 3000 γ=1.0 γ=0.1 γ=0.0

−5 −4 −3 −2 −1 1 0.5 1 1.5 2 2.5

t

Δ

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 16 / 23

slide-29
SLIDE 29

. . . . . .

.

Example: XY model

. . . . . We employ the XY model as an interacting many-body system E(x) = − ∑

j∈∂i

cos (xi − xj) , (19) Note that xi here denotes the spin direction such that xi ∈ [0, 2π). We set N = 50 × 50 spins and γ = 0 and 5 from top to bottom at T = 1.0 above TKT (left) and T = 0.5 below TKT (right).

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 17 / 23

slide-30
SLIDE 30

. . . . . .

Why does the convergence seem to be fastened?

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 18 / 23

slide-31
SLIDE 31

. . . . . .

.

Operator form of the Fokker-Planck eq.

. . . . . Let us consider the operator form of the Fokker-Planck equation as ∂ ˜ P(x1, x2, t) ∂t = L({ai}, {ˆ ai})˜ P(x1, x2, t) (20) where L({ai}, {ˆ ai}) = −

2

i=1

ˆ aiai − γ (ˆ a1a2 − a1ˆ a2) , (21) ai and ˆ ai are the operator satisfying [ai, ˆ ai] = −divigradiE(xi), where ˆ P(x1, x2, t) = P(x1, x2, t)/ √ P(ss)(x1, x2). The first eigenstate for ∑2

i=1 ˆ

aiai has a vanishing eigenvalue. Another term holds the first eigenstate with a vanishing eigenvalue When γ = 0, L({ai}, {ˆ ai}) is symmetric, otherwise asymmetric.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 19 / 23

slide-32
SLIDE 32

. . . . . .

.

Rough sketch of proof

. . . . . The eigenvalue of L characterizes the relaxation speed as P(x1, x2, t) ∼ exp(λLt)P(x1, x2, 0). Let us decompose L = S + Γ, where S is symmetric and Γ is anti-symmetric. We prove that Re(λL) − λS ≤ 0 for a fixed S. .

Ostrowski-Taussky inequality

. . . . . If A + A† is positive-definite, | det A| ≥ det A + A† 2 (22) Generalization of |z| ≥ |Rez| for a complex number z to matrices.

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 20 / 23

slide-33
SLIDE 33

. . . . . .

det

W W

.

Slope of the characteristic polynomial

. . . . . By use of the OT inequality, we find d dλ det (λ − L)

  • λ=0

≥ d dλ det (λ − S)

  • λ=0

(23)

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 21 / 23

slide-34
SLIDE 34

. . . . . .

det

L L

.

Slope of the characteristic polynomial

. . . . . By use of the OT inequality, we find d dλ det (λ − L)

  • λ=0

≥ d dλ det (λ − S)

  • λ=0

(23)

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 21 / 23

slide-35
SLIDE 35

. . . . . .

det

L L

.

λL

2 ≤ Re(λ) ≤ λL 1 = 0

. . . . . Division by λ removes the negative sign of the polynomial and OT inequality leads

  • 1

λ det (λ − L)

  • ≥ 1

λ det (Re(λ) − S) (24)

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 22 / 23

slide-36
SLIDE 36

. . . . . .

.

Conclusion

. . . . . We propose an additional force to the Langevin dynamical system Replicated divergence-free system (Skewed DBC?) Implementation is very simple. Confirmed in XY model. Single divergence-free system (Suwa-Todo method?) Implementation is very simple. Not yet confirmed. Mathematical assurance Nonzero eigenvalues are less than the ordinary Langevin dynamics. Applications Glassy dynamics and molecule dynamics (Physics and Chemistry). Stochastic gradient method to infer the inherent parameter generating huge number of data (Machine Learning).

  • M. Ohzeki and A. Ichiki (KU and NU)

StatPhys-Kolkata VIII 2014/12/01 23 / 23