Predicting Protein Folding Paths S.Will, 18.417, Fall 2011 Protein - - PowerPoint PPT Presentation

predicting protein folding paths
SMART_READER_LITE
LIVE PREVIEW

Predicting Protein Folding Paths S.Will, 18.417, Fall 2011 Protein - - PowerPoint PPT Presentation

Predicting Protein Folding Paths S.Will, 18.417, Fall 2011 Protein Folding by Robotics S.Will, 18.417, Fall 2011 Probabilistic Roadmap Planning (PRM): Thomas, Song, Amato. Protein folding by motion planning . Phys. Biol., 2005 Aims Find good


slide-1
SLIDE 1

S.Will, 18.417, Fall 2011

Predicting Protein Folding Paths

slide-2
SLIDE 2

S.Will, 18.417, Fall 2011

Protein Folding by Robotics

Probabilistic Roadmap Planning (PRM): Thomas, Song, Amato. Protein folding by motion planning.

  • Phys. Biol., 2005
slide-3
SLIDE 3

S.Will, 18.417, Fall 2011

Aims

Find good quality folding paths (into given native structure)

no structure prediction!

Predict formation orders (of secondary structure)

slide-4
SLIDE 4

S.Will, 18.417, Fall 2011

Motion planning

Motion planning Probabilistic roadmap planing

Sampling of configuration space Q Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

slide-5
SLIDE 5

S.Will, 18.417, Fall 2011

Motion planning

Motion planning Probabilistic roadmap planing

Sampling of configuration space Q Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

slide-6
SLIDE 6

S.Will, 18.417, Fall 2011

Motion planning

Motion planning Probabilistic roadmap planing

Sampling of configuration space Q Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

slide-7
SLIDE 7

S.Will, 18.417, Fall 2011

Motion planning

Motion planning Probabilistic roadmap planing

Sampling of configuration space Q Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

slide-8
SLIDE 8

S.Will, 18.417, Fall 2011

Motion planning

Motion planning Probabilistic roadmap planing

Sampling of configuration space Q Connect nearest configurations by (simple) local planner Apply graph algorithms to “roadmap”: Find shortest path

slide-9
SLIDE 9

S.Will, 18.417, Fall 2011

More on PRM for motion planning

tree-like robots (articulated robots)

Articulated Joint

configuration = vector of angles configuration space Q = {q | q ∈ Sn}

S — set of angles n — number of angles = degrees of freedom (dof)

slide-10
SLIDE 10

S.Will, 18.417, Fall 2011

More on PRM for motion planning

tree-like robots (articulated robots) configuration = vector of angles configuration space Q = {q | q ∈ Sn}

S — set of angles n — number of angles = degrees of freedom (dof)

slide-11
SLIDE 11

S.Will, 18.417, Fall 2011

Proteins are Robots (aren’t they?)

Obvious similarity ;-) ==? Our model

Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

slide-12
SLIDE 12

S.Will, 18.417, Fall 2011

Proteins are Robots (aren’t they?)

Obvious similarity ;-) ==? Our model

Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

slide-13
SLIDE 13

S.Will, 18.417, Fall 2011

Proteins are Robots (aren’t they?)

Obvious similarity ;-) ==? Our model

C C C C C C N N N O O O

Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

slide-14
SLIDE 14

S.Will, 18.417, Fall 2011

Proteins are Robots (aren’t they?)

Obvious similarity ;-) ==? Our model

C C C C C C N N N O O O phi psi

Protein == vector of phi and psi angles (treelike robot with 2n dof) possible models range from only backbone up to full atom

slide-15
SLIDE 15

S.Will, 18.417, Fall 2011

Differences to usual PRM

no external obstacles, but

self-avoidingness torsion angles

quality of paths

low energy intermediate states kinetically prefered paths highly probable paths

slide-16
SLIDE 16

S.Will, 18.417, Fall 2011

Energy Function

method can use any potential Our coarse potential [Levitt. J.Mol.Biol., 1983. ]

each sidechain by only one “atom” (zero dof) Utot =

  • restraints

Kd{[(di − d0)2 + d2

c ]

1 2 − dc} + Ehp

first term favors known secondary structure through main chain hydrogen bonds and disulphide bonds second term hydrophobic effect Van der Waals interaction modeled by step function

All-atom potential: EEF1 [Lazaridis, Karplus. Proteins, 1999. ]

slide-17
SLIDE 17

S.Will, 18.417, Fall 2011

PRM method for Proteins

Sampling Connecting Extracting

slide-18
SLIDE 18

S.Will, 18.417, Fall 2011

Sampling — Node Generation

Sampling

Connecting Extracting

slide-19
SLIDE 19

S.Will, 18.417, Fall 2011

Node Generation

No uniform sampling

configuration space too large ⇒ need biased sampling strategy

Gaussian sampling

centered around native conformation with different STDs 5◦, 10◦, . . . , 160◦ ensure representants for different numbers of native contacts

Selection by energy P(accept q) =      1 if E(q) < Emin

Emax−E(q) Emax−Emin

if Emin ≤ E(q) ≤ Emax if E(q) > Emax

slide-20
SLIDE 20

S.Will, 18.417, Fall 2011

More on Node Generation

Visualization of Sampling Strategy Distribution Psi and Phi angles RMSD vs. Energy

slide-21
SLIDE 21

S.Will, 18.417, Fall 2011

Node Connection

Sampling

Connecting

Extracting

slide-22
SLIDE 22

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-23
SLIDE 23

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-24
SLIDE 24

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-25
SLIDE 25

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-26
SLIDE 26

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-27
SLIDE 27

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner

P1 P2 P3 P5 P4

assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-28
SLIDE 28

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner

Weight

assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-29
SLIDE 29

S.Will, 18.417, Fall 2011

Connecting Nodes by Local Planner

connect configurations in close distance generate N intermediary nodes by local planner assign weights to edges Pi =

  • e− ∆E

kT

if ∆E > 0 1 if ∆E ≤ 0 Weight =

N

  • i=0

−log(Pi)

slide-30
SLIDE 30

S.Will, 18.417, Fall 2011

Extracting Paths

Sampling Connecting

Extracting

slide-31
SLIDE 31

S.Will, 18.417, Fall 2011

Extracting Paths

Shortest Path

extract one shortest path from some starting conformation, one path at a time

Single Source Shortest Paths (SSSP)

extract shortest paths from all starting conformation compute paths simultaneously generate tree of shortest paths (SSSP tree)

slide-32
SLIDE 32

S.Will, 18.417, Fall 2011

Big Picture

Sampling Connecting Extracting

slide-33
SLIDE 33

S.Will, 18.417, Fall 2011

Studied Proteins

Overview of studied proteins, roadmap size, and construction times

slide-34
SLIDE 34

S.Will, 18.417, Fall 2011

Formation orders

formation order of secondary structure for verifying method formation orders can be determined experimentally [ Li, Woodward. Protein Science, 1999. ]

Pulse labeling Out-exchange

prediction of formation orders

single paths averaging over multiple paths (SSSP-tree)

slide-35
SLIDE 35

S.Will, 18.417, Fall 2011

Timed Contact Maps

slide-36
SLIDE 36

S.Will, 18.417, Fall 2011

Formation Order

no (reported) contradictions between prediction and validation different kind of information from experiment and prediction

slide-37
SLIDE 37

S.Will, 18.417, Fall 2011

The Proteins G and L

Studied in more detail good test case structurally similar: 1α + 4β fold differently

Protein G: β-turn 2 forms first Protein L: β-turn 1 forms first

slide-38
SLIDE 38

S.Will, 18.417, Fall 2011

Comparison of Analysis Techniques β-Turn Formation

slide-39
SLIDE 39

S.Will, 18.417, Fall 2011

Conclusion

PRM can be applied to “realistic” protein models Introduced method makes verifiable prediction Coarse potential is sufficient Predictions in good accordance to experimental data