Why do irreversible processes converge faster to equilibrium than - - PowerPoint PPT Presentation

why do irreversible processes converge faster to
SMART_READER_LITE
LIVE PREVIEW

Why do irreversible processes converge faster to equilibrium than - - PowerPoint PPT Presentation

Why do irreversible processes converge faster to equilibrium than reversible ones? Marcus Kaiser SAMBa Summer Conference 2017 University of Bath joint work with R. L. Jack and J. Zimmer Marcus Kaiser Claim: Irreversible systems converge faster


slide-1
SLIDE 1

Why do irreversible processes converge faster to equilibrium than reversible ones?

Marcus Kaiser

SAMBa Summer Conference 2017

University of Bath

joint work with R. L. Jack and J. Zimmer

Marcus Kaiser

slide-2
SLIDE 2

Claim: Irreversible systems converge faster to equilibrium [Hwang et al. 2005][Pavliotis2013][ReyBellet-Spiliopoulos2015,2016] [Bierkens2015]

Marcus Kaiser

slide-3
SLIDE 3

Claim: Irreversible systems converge faster to equilibrium [Hwang et al. 2005][Pavliotis2013][ReyBellet-Spiliopoulos2015,2016] [Bierkens2015] Interesting for two reasons:

  • Understanding the physics of non-equilibrium systems
  • Acceleration of sampling methods like MCMC

Marcus Kaiser

slide-4
SLIDE 4

Claim: Irreversible systems converge faster to equilibrium [Hwang et al. 2005][Pavliotis2013][ReyBellet-Spiliopoulos2015,2016] [Bierkens2015] Interesting for two reasons:

  • Understanding the physics of non-equilibrium systems
  • Acceleration of sampling methods like MCMC

We investigate the effect of breaking detailed balance on the convergence to the steady state. We will consider (interacting) particle systems and their hydrodynamic scaling limits.

Marcus Kaiser

slide-5
SLIDE 5

We consider systems on two scales:

(1) Microscopic systems

finite state, ergodic and irreducible continuous time Markov processes

with unique steady state π and dynamics given by ˙ µt(x) =

  • y

µt(y)c(y → x) − µt(x)c(x → y) = L†µt(x).

(2) Macroscopic systems

drift diffusive systems of the form ∂tρ = ∇ ·

  • D(ρ)∇ρ
  • − ∇ ·
  • χ(ρ)E
  • .

Marcus Kaiser

slide-6
SLIDE 6

Microscopic systems

Marcus Kaiser

slide-7
SLIDE 7

Microscopic systems = particle systems

We consider a system of indistinguishable particles which hop between sites i i+1 leading to a transition from state x to state y

x y c(x, y)

Marcus Kaiser

slide-8
SLIDE 8

Microscopic systems

Relations to physics: Equilibrium systems are characterised by ‘detailed balance’ π(x)c(x→y) = π(y)c(y→x), which correspond to vanishing currents in the steady state, whereas non-equilibrium systems are characterised by a non-zero current in the steady state. The microscopic current for a measure µ is given by Jx,y(µ) = µ(x)c(x→y) − µ(y)c(y→x). Jx,y(π) = 0 (for all x, y) if and only if the system is an equilibrium system (i.e. satisfies detailed balance).

Marcus Kaiser

slide-9
SLIDE 9

Microscopic systems

Alternative characterisation in terms of the generator L: The process is reversible (satisfies detailed balance) if L is symmetric w.r.t. the inner product in L2(π). In general, we can write any generator L as L = LS + LA, where LS is symmetric and LA is anti-symmetric (w.r.t. L2(π)). LS is again a generator with unique stationary measure π.

Marcus Kaiser

slide-10
SLIDE 10

Example

We consider a system of independent particles in a potential U in 2d.

20 40 60 80 100 120 140 x1

  • 1

1 2 3 4 5 V(x1,1/2)

  • 0.5

. 5 0.5 1 1 1 1.5 1 . 5 1.5 2 2 2 2 2 2.5 2.5 2.5 2.5 2.5 3 3 3 3 3 3 3 . 5 3.5 3.5 3.5 3.5 3.5 4 4 4 4 4 4 4.5 4.5 4.5 4 . 5 4.5 4 . 5 5 5 5 5 5 5 20 40 60 80 100 120 140 x1 20 40 60 80 100 120 140 x2

We can think here of a Monte Carlo sampling with many (≈ 150000)

  • samples. Sampling from π ∝ e−U. Lattice size L2 = 140 × 140.

Marcus Kaiser

slide-11
SLIDE 11

Example - Test observable

t

5 10 15 20

S(t)

7.5 8 8.5 9

reversible irreversible

t

5 10 15 20

D(t)

0.5 1 1.5 2

reversible irreversible

5 10 15 20 0.5 1 1.5 2

reversible irreversible asymptotic value

t

5 10 15 20

average x1 pos

  • 0.2
  • 0.1

0.1 0.2 0.3

reversible irreversible

[K., Jack, Zimmer, J Stat Phys 2017]

Marcus Kaiser

slide-12
SLIDE 12

Acceleration of convergence

⇒ The Markov chain with generator L = LS + LA converges faster to π than the process with generator LS. This convergence can be checked in different ways: (e.g.)

  • The spectral gap of the generator

(the largest non-zero eigenvalue of L).

  • The large deviation rate functional

Marcus Kaiser

slide-13
SLIDE 13

Spectral gap

The spectrum σ(L) is contained in C− := {z ∈ C| Re(z) ≤ 0} and 0 ∈ σ(L). We denote with α(L) the modulus of the real part of the non-zero eigenvalue with largest real part.

reversible irreversible

σ(LS) σ(L)

α(LS ) α(L) Marcus Kaiser

slide-14
SLIDE 14

Spectral gap

We assume that L is diagonalisable such that we can write any distribution at time t ∈ [0, ∞) as µt(x) = π(x) + e−tα(L)γ(t, x) for a (in t) bounded function γ(t, x). Therefore µt − π ≤ Ce−tα(L).

(The initial distribution is here given by µ0 = π + γ(0, ·))

Hence Theorem (Spectral gap) α(L) ≥ α(LS).

Marcus Kaiser

slide-15
SLIDE 15

Large deviations

Large deviations characterise asymptotic probabilities (here as t → ∞) in terms of a rate functional I(µ). In this case, we consider the empirical average Θt := 1

t

t

0 δXudu, which satisfies

P[Θt ≈ µ] ≍ e−tI(µ).

This notation stands for the following two inequalities: For all closed sets A and open sets O, we have lim sup

t→∞

1 t log P [Θt ∈ A] ≤ − inf

µ∈A I(µ)

and lim inf

t→∞

1 t log P [Θt ∈ O] ≥ − inf

µ∈O I(µ).

Marcus Kaiser

slide-16
SLIDE 16

Large deviations

We compare P[Θt(LS) ≈ µ] ≍ e−tIS(µ) and P[Θt(L) ≈ µ] ≍ e−tI(µ). Consistently with the above result, we have Theorem (Rate functional) IS(µ) ≤ I(µ) Informally this implies that asymptotically as t → ∞ P[Θt(LS) ≈ µ] ≥ P[Θt(L) ≈ µ] for µ = π.

Marcus Kaiser

slide-17
SLIDE 17

Macroscopic systems

Marcus Kaiser

slide-18
SLIDE 18

Macroscopic systems

With the appropriate rescaling of the rates, the systems becomes on large enough scales (for large L) ‘independent’ of the lattice size.

0.5 1 1.5 2

t

0.5 1 1.5

D(t)

L=150, rev. L=300, rev. L=450, rev. L=150, irrev. L=300, irrev. L=450, irrev.

Plot of 1d system with L = 150, 300, 450.

The system then can be approximately described by a deterministic mass evolution.

Marcus Kaiser

slide-19
SLIDE 19

Macroscopic systems

The macroscopic behaviour can be described in terms of a conservation law of the form ∂tρt = −∇ · jt (1) for some current jt on a given domain Λ with a suitable boundary condition on ∂Λ.

Marcus Kaiser

slide-20
SLIDE 20

Macroscopic systems

The macroscopic behaviour can be described in terms of a conservation law of the form ∂tρt = −∇ · jt (1) for some current jt on a given domain Λ with a suitable boundary condition on ∂Λ. E.g. a box with periodic boundary condition

∂tρ = −∇ · jt

  • n Λ

Marcus Kaiser

slide-21
SLIDE 21

Macroscopic systems

The macroscopic behaviour can be described in terms of a conservation law of the form ∂tρt = −∇ · jt (1) for some current jt on a given domain Λ with a suitable boundary condition on ∂Λ. E.g. a box with periodic boundary condition

∂tρ = −∇ · jt

  • n Λ

For the hydrodynamic limit, the associated hydrodynamic current J(ρt) is given by J(ρt) = −D(ρt)∇ρt + χ(ρt)E. (2) We assume that equation (1) with jt = J(ρt) as in (2) has a unique steady state ¯ ρ.

Marcus Kaiser

slide-22
SLIDE 22

Splitting the current

A fundamental result from the Macroscopic Fluctuation Theory (MFT) is that one can split the current in the sum of a symmetric and an anti-symmetric term: J = JS + JA which satisfies an orthogonality condition JS(ρ), JA(ρ)χ(ρ)−1 :=

  • Λ

JS(ρ) · χ(ρ)−1JA(ρ)dx = 0.

JS and JA can be obtained from the current of the adjoint process as JS = (J + J∗)/2 and JA = (J − J∗)/2.

Note: In general JS(ρt) is not −D(ρt)∇ρt.

Marcus Kaiser

slide-23
SLIDE 23

Non-equilibrium systems

Non-equilibrium systems correspond to the case when JA does not vanish, whereas equilibrium systems are characterised by J = JS.

Similar to the microscopic case, where L = LS for reversible systems. Marcus Kaiser

slide-24
SLIDE 24

Non-equilibrium systems

Non-equilibrium systems correspond to the case when JA does not vanish, whereas equilibrium systems are characterised by J = JS.

Similar to the microscopic case, where L = LS for reversible systems.

In general, we can write the symmetric part of the current as JS(ρt) = −χ(ρt)∇ δV δρt . where V is the so called quasipotential.

  • V(ρ) ≥ 0 with equality if and only if ρ = ¯

ρ.

  • V(ρt) is monotonic decreasing.
  • V can be thought of as a (non-equilibrium) free energy that drives

the system to the unique and globally attractive steady state ¯ ρ.

Marcus Kaiser

slide-25
SLIDE 25

Symmetric current

In the case that JA(ρ) = 0, we can write the dynamics as the gradient flow (or steepest descent) ∂tρt = ∇ · χ(ρt)∇ δV δρt .

¯ ρ ρ0

V

Recall that a gradient flow consists of a metric M and an energy V, such that ∂tρt = −M(ρt) δV

δρt .

(Here M(ρt) = −∇ · χ(ρt)∇). Marcus Kaiser

slide-26
SLIDE 26

Example of a symmetric process

Consider the linear equation ∂tρt = ∆ρt + ∇ · (ρt∇U). This is the linear case (where χ(ρt) = ρt) and the external force is of gradient type (E = −∇U).

Marcus Kaiser

slide-27
SLIDE 27

Example of a symmetric process

Consider the linear equation ∂tρt = ∆ρt + ∇ · (ρt∇U). This is the linear case (where χ(ρt) = ρt) and the external force is of gradient type (E = −∇U). It can be restated as ∂tρt = ∇ ·

  • ρt∇ log

ρt e−U

  • ,

and thus V is δV δρt = log ρt e−U

  • .

Note that the steady state ¯ ρ is proportional to e−U. The quantity ∇ δV

δρt

is the force which drives the process to the steady state ¯ ρ.

Marcus Kaiser

slide-28
SLIDE 28

Anti-symmetric current

We assume that the external field is given by E = −∇U + ˜ E (where ˜ E is not a gradient) and consider the Poisson equation ∇ ·

  • χ(ρ)∇ψ
  • = −∇ ·
  • χ(ρ) ˜

E), which has (under some regularity assumptions on the rhs) a unique ρ dependent solution ψ = ψρ.

Marcus Kaiser

slide-29
SLIDE 29

Anti-symmetric current

We assume that the external field is given by E = −∇U + ˜ E (where ˜ E is not a gradient) and consider the Poisson equation ∇ ·

  • χ(ρ)∇ψ
  • = −∇ ·
  • χ(ρ) ˜

E), which has (under some regularity assumptions on the rhs) a unique ρ dependent solution ψ = ψρ. With this, the flux can be written as J(ρt) = −χ(ρ)∇δV δρ − χ(ρt)∇ψρ + JF (ρ) for the divergence free flux JF (ρ) := JA(ρ) + χ(ρ)∇ψρ.

Marcus Kaiser

slide-30
SLIDE 30

Anti-symmetric current

We assume that the external field is given by E = −∇U + ˜ E (where ˜ E is not a gradient) and consider the Poisson equation ∇ ·

  • χ(ρ)∇ψ
  • = −∇ ·
  • χ(ρ) ˜

E), which has (under some regularity assumptions on the rhs) a unique ρ dependent solution ψ = ψρ. With this, the flux can be written as J(ρt) = −χ(ρ)∇δV δρ − χ(ρt)∇ψρ + JF (ρ) for the divergence free flux JF (ρ) := JA(ρ) + χ(ρ)∇ψρ. All of these three terms are orthogonal w.r.t. the inner product ·, ·χ(ρ)−1, and ∂tρt = ∇ · χ(ρt)∇ δV δρt + ∇ · χ(ρt)∇ψρt.

Marcus Kaiser

slide-31
SLIDE 31

Large deviation rate functional

Under typical assumptions of no dynamical phase transition, the rate functional for the empirical density is given by I2(ρ) = 1 4

  • Λ

∇δV δρ · χ(ρ)∇δV δρ dx + 1 4

  • Λ

∇ψρ · χ(ρ)∇ψρ dx.

Marcus Kaiser

slide-32
SLIDE 32

Large deviation rate functional

Under typical assumptions of no dynamical phase transition, the rate functional for the empirical density is given by I2(ρ) = 1 4

  • Λ

∇δV δρ · χ(ρ)∇δV δρ dx + 1 4

  • Λ

∇ψρ · χ(ρ)∇ψρ dx. Note that the first term corresponds to the contribution of the symmetric current and the second summand corresponds to the first part of the anti-symmetric current which is not divergence free. The divergence free part is not contributing to the rate functional.

Marcus Kaiser

slide-33
SLIDE 33

Large deviation rate functional

The rate functional for the reversible process has ∇ψ = 0, such that Theorem (Rate functional) IS

2 (ρ) ≤ I2(ρ).

This implies again that asymptotically, as L → ∞, for ρ = ¯ ρ P[ΘL

t (JS) ≈ ρ] ≥ P[ΘL t (J) ≈ ρ].

Marcus Kaiser

slide-34
SLIDE 34

Intuition for the convergence

The orthogonality condition 0 = −

  • Λ

JS(ρ) · χ(ρ)−1JA(ρ)dx =

  • Λ

δV δρ ∇ · JA(ρ)dx implies that JA has no effect on the value of V. That is, the current JA(ρ) acts on the level sets of V.

¯ ρ ρ0

V

Marcus Kaiser

slide-35
SLIDE 35

Thanks

Marcus Kaiser