Simulating Biomolecules with Variable Protonation State: Constant-pH - - PowerPoint PPT Presentation

simulating biomolecules with variable protonation state
SMART_READER_LITE
LIVE PREVIEW

Simulating Biomolecules with Variable Protonation State: Constant-pH - - PowerPoint PPT Presentation

Simulating Biomolecules with Variable Protonation State: Constant-pH Molecular Dynamics Simulations with NAMD Brian Radak Leadership Computing Facility, Argonne National Laboratory Computational Biophysics Workshop Enhanced Sampling and


slide-1
SLIDE 1

Simulating Biomolecules with Variable Protonation State: Constant-pH Molecular Dynamics Simulations with NAMD

Brian Radak

Leadership Computing Facility, Argonne National Laboratory

Computational Biophysics Workshop – Enhanced Sampling and Free Energy Calculations University of Illinois, Urbana-Champaign September 26, 2017

slide-2
SLIDE 2

Acknowledgements (other people to blame)

Univ of Chicago

◮ Benoˆ

ıt Roux

◮ Donghyuk Suh ◮ Huan Rui ◮ Yunjie Chen

ALCF

◮ Sunhwan Jo (SilcsBio) ◮ Wei Jiang

UIUC

◮ Jim Phillips ◮ Chris Chipot ◮ Abhi Singharoy

Theta Early Science Program

slide-3
SLIDE 3

pH Effects in Biochemistry

7.2 6.7 8.0 nucleus cytosol golgi mitochondria lysosome 4.7 endosome 5.5-6.5 6.0 endoplasmic reticulum

Casey, et al Nat Rev Mol Cell Biol, 2010

variability of pH by region enzyme rate vs. pH

pKa,base general base pKa,acid general acid pKa,opt kopt fractional population rate constant pH

7.5 7.2 6.9 7.5 normal cancerous pHe pHi + +

Webb, et al Nat Rev Cancer, 2011

pH gradients at cell surfaces

slide-4
SLIDE 4

Constant pH and the semi-grand canonical ensemble

◮ Conventional MD samples a canonical ensemble:

Q =

  • dx e−βU(x)

◮ Constant-pH MD samples a semi-grand canonical ensemble:

Ξ(pH) =

  • λ∈S

Qλ10−nλpH The added interaction is between the number of protons, nλ, and a pH bath. λ is a new variable designating the protonation state.

slide-5
SLIDE 5

Networks of protonation states

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

conventional MD

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

constant pH MD

slide-6
SLIDE 6

Networks of protonation states

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

conventional MD

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

constant pH MD

N = 2 3 4 1 2 N

slide-7
SLIDE 7

pH as a thermodynamic force

◮ Classical MD utilizes mechanical forces

F = −∇U[x(t)] = m∂v ∂t ; v = ∂x ∂t

◮ pH may be regarded as a thermodynamic force

ln 10pH = −∂ ln Ξ ∂nλ Mechanical forces – deterministic/stochastic dynamics Thermodynamic forces – probabilistic “dynamics” Pλ(pH) ∝ Qλ10−nλpH

slide-8
SLIDE 8

How do we define nodes in the network?

Consider a system with m sites:

slide-9
SLIDE 9

Protonation state probabilities/populations

A(x, λ)pH =

  • λ∈S
  • dx A(x, λ)e−βU(x;λ)10−nλpH

Ξ(pH) Pλs = λspH – the probability that site s is occupied There are two kinds of terms in the summation, λs = 0/1 Ξ(pH) = Ξ0(pH) + Ξ1(pH)10−pH thus, λspH = Ξ1(pH)10−pH Ξ0(pH) + Ξ1(pH)10−pH = 1 1 + Ξ0(pH)

Ξ1(pH)10pH

slide-10
SLIDE 10

Connection to thermodynamics

λspH = 1 1 + Ξ0(pH)

Ξ1(pH)10pH

compares to the Henderson-Hasselbalch equation such that pKa(pH) = − log Ξ0(pH) Ξ1(pH), except that now pKa(pH) is pH dependent. One often uses the approximation: pKa(pH) ≈ pK (a)

a

+ (1 − n)

  • pH − pK (a)

a

  • ,

where n is the Hill coefficient and pK (a)

a

is the “apparent” pKa.

slide-11
SLIDE 11

Networks of protonation states

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

conventional MD

E-R1H:R2H −H+ E-R−

1 :R2H

−H+ E-R−

1 :R− 2

−H+ E-R1H:R−

2

−H+

constant pH MD We can now see that the fraction of simulation time spent in a given protonation state is directly impacted by the difference of the pKa of a residue/site and the pH.

slide-12
SLIDE 12

That’s great – how do we sample the states?

  • 1. Sample the configuration space of a given state

(i.e., sample x for a given Qλ)

  • 2. Change between protonation states according to the number
  • f protons and the given pH

(i.e., sample λ and choose a new Qλ) This may be regarded as a Gibbs sampling, whereby the configuration and state are sampled in an alternating fashion.

slide-13
SLIDE 13

A problem! Environmental response

◮ (De)Protonation is a significant electrostatic event. ◮ Non-trivial reorganization of solvent, possibly solute. ◮ Naive sudden changes in protonation are likely to cause high

energy configurations and/or steric clashes.

slide-14
SLIDE 14

Possible solutions to the solvent clash problem

auxillary implicit solvent continuous fractional proton discrete copy fractional proton

slide-15
SLIDE 15

“Fast” alchemical growth

◮ Swap the protonation state by using time-dependent

interactions.

◮ Gradually stronger interactions will induce solvent response. ◮ Clashes are avoided by using the natural dynamics of the

model.

slide-16
SLIDE 16

The neMD/MC constant pH paradigm

◮ Drive alchemical growth with nonequilibrium work ◮ Accept/reject with a generalized Metropolis criterion

Stern J Chem Phys, 2007; Chen & Roux J Chem Theory Comput, 2015

slide-17
SLIDE 17

The neMD/MC constant pH paradigm

!=1/3 !=2/3 !=0 !=1

MC sample of auxiliary coordinates removal of auxiliary coordinates neMD alchemical growth

◮ Drive alchemical growth with nonequilibrium work ◮ Accept/reject with a generalized Metropolis criterion

Stern J Chem Phys, 2007; Chen & Roux J Chem Theory Comput, 2015

slide-18
SLIDE 18

Beyond Gibbs sampling: Hybrid MD and neMD/MC

We now alternate conventional sampling with MD (x) and Metropolis Monte Carlo sampling (x and λ): ρ(x, λ)T(x, λ → x′, λ′) = ρ(x′, λ′)T(x′, λ′ → x, λ) such that the neMD/MC transition probability is: T(x, λ → x′, λ′) = min

  • 1, ρ(x′, λ′)

ρ(x, λ)

  • = min
  • 1, e−βW 10−∆npH

(If you’d like, MD uses the probability T(x → x′) = 1.)

slide-19
SLIDE 19

Important considerations

◮ How long should I sample the equilibrium stage? ◮ How long should I sample the nonequilibrium stage

(the “switch time,” τswitch)

◮ Rejecting a nonequilibrum trajectory is expensive, how can we

avoid doing that so much?

slide-20
SLIDE 20

The two-step “inherent” pKa algorithm

T(x, λ → x′, λ′) = T (i)(λ → λ′)T (s)(x → x′|λ → λ′) T (i)(λ → λ′) = min

  • 1, 10pK (i)

a (λ,λ′)−∆npH

◮ neMD/MC can be split into two parts

  • 1. T (i) – only depends on λ and the pH – CHEAP
  • 2. T (s) – depends on the switch (W ) – COSTLY

◮ Effort is shifted by estimating a parameter, pK (i) a ◮ Optimal efficiency achieved for exact pKa ◮ Dramatically improved performance on wide pH ranges! ◮ Can do even better for systems with more than two states.

Chen & Roux J Chem Theory Comput, 2015; Radak, et al submitted

slide-21
SLIDE 21

Let’s look at this graphically

0.0 0.2 0.4 0.6 0.8 1.0 pKa - 2 pKa - 1 pKa pKa + 1 pKa + 2 add if unoccupied remove if occupied be occupied probability pH

◮ It’s silly to try to add/remove protons to/from acidic/basic

residues at high/low pH

◮ Transitions are proposed in proportion to the estimated

population.

slide-22
SLIDE 22

What about after we’ve proposed a switch?

◮ A short switch will not change much and likely be rejected. ◮ A long switch is expensive (limit of a single switch – BAD). ◮ Since the switch success depends on the work, let’s analyze

that.

slide-23
SLIDE 23

Work and force fluctuations – a typical neMD/MC cycle

50 100 150 200 τmol τswitch σ0

2

force time (/ps) σ0

2

5 10 15 20 force correlation work ∆F

Radak & Roux J Chem Phys, 2016

slide-24
SLIDE 24

Theoretical and Empirical Performance Analysis

5 10 15 20 1400 1600 1800 2000 0 10 20 30 40 50 60 0.0 0.2 0.4 0.6 0.8 1.0 near maximal acceptance probability near minimal transition rate transition rate (/ns-1) mean acceptance probability switch time (/ps) τopt ≈ 11 ps

◮ Well-defined criteria for

  • ptimization.

◮ Cost is quite tractable.

Radak & Roux J Chem Phys, 2017

10 20 30 40 50 60 70 80 40 80 120 160 200 error ~10% (noramlized) optimal switch time exact approx. 0.0 0.2 0.4 0.6 0.8 1.0 10 20 30 40 50 60 23.3%

  • ptimal mean

acceptance probability equilibrium work variance [/(kBT)2]

slide-25
SLIDE 25

NAMD Constant pH features

◮ Flexible Tcl interface source

...lib/namdcph/namdcph.tcl

◮ PSF build procedure is unchanged (automated psfgen) ◮ Implemented with PME and full electrostatics ◮ Normal CPU scaling (no GPU yet) - depends on alchemy ◮ Companion analysis script cphanalyze

pH 7.0 cphNumstepsPerSwitch 7500 ;# run 7500 steps per switch cphRun 5000 10 ;# run 10 cycles of 5000 MD steps

slide-26
SLIDE 26

Output and Analysis

0.0 0.2 0.4 0.6 0.8 1.0 2 3 4 5 6 7 protonated fraction pH

GLU10/43/52 ASP19/21/83 LYS24 HIS8

◮ Normal usage requires

multiple pH values (“titration curves”)

◮ New output cphlog and

cphrst

◮ New checkpoint files

psf/pdb

◮ Can boost performance with

WHAM (cphanalyze)

◮ Can also analyze residue

correlations

slide-27
SLIDE 27

Reference amino acids are well-reproduced

0.0 0.2 0.4 0.6 0.8 1.0 4 5 6 7 8 9 10 11 ASP: 4.1 +/- 0.1 GLU: 4.4 +/- 0.1 HIS-δ: 6.6 +/- 0.2 HIS-ε: 7.1 +/- 0.2 LYS: 10.4 +/- 0.3 CYS: 9.5 +/- 0.1 protonated fraction/state population pH

◮ Adjustments to force field

enforce empirical reference values

◮ Implicitly model solvated

proton and bond energy effects

◮ Bonus - accurate

reproduction of tautomeric ratios!

slide-28
SLIDE 28

What about challenging environments?

◮ Single titratable peptide

(AADAA)

◮ Lipid relaxation is slow

(slower than water)

◮ Low dielectric region should

perturb pKa in obvious way

slide-29
SLIDE 29

A B

z = 50 Å z = 20 Å z = 0 bulk water

◮ Significant shifts due to low dielectric region. ◮ Switch time of 10 ps is sufficient.

Radak, et al submitted Teixeira, et al J Chem Theory Comput, 2016

slide-30
SLIDE 30

Staph nuclease (SNase) - A constant pH benchmark

slide-31
SLIDE 31

Benchmarking of SNase pKa values

2 3 4 5 6 7 2 3 4 5 6 7 pKa,calc pKa,expt this work - GLU this work - ASP Huang, et al. - GLU Huang, et al. - ASP

◮ Good correlation with

measured values for carboxylates

◮ Bonus - estimates for

HIS and LYS residue this work LYS 24 8.43 (0.45) HIS 8 6.66 (0.56) 121 5.36 (0.50)

slide-32
SLIDE 32

Concluding Remarks/Future Directions

◮ Things we are working on:

◮ Performance improvements in alchemy – CUDA support ◮ Titratable lipids and phosphates

◮ Things we would like to work on:

◮ psfgen improvements – support for Drude ◮ Better visualization support in VMD ◮ More powerful interface for analysis (PyNAMD)

slide-33
SLIDE 33

Acknowledgements (other people to blame)

Univ of Chicago

◮ Benoˆ

ıt Roux

◮ Donghyuk Suh ◮ Huan Rui ◮ Yunjie Chen

ALCF

◮ Sunhwan Jo (SilcsBio) ◮ Wei Jiang

UIUC

◮ Jim Phillips ◮ Chris Chipot ◮ Abhi Singharoy

Theta Early Science Program