Shake-and-Bake: Applications and Advances Russ Miller & Charles - - PowerPoint PPT Presentation

shake and bake applications and advances russ miller
SMART_READER_LITE
LIVE PREVIEW

Shake-and-Bake: Applications and Advances Russ Miller & Charles - - PowerPoint PPT Presentation

Shake-and-Bake: Applications and Advances Russ Miller & Charles M. Weeks Hauptman-Woodward Med. Res. Inst. Principal Contributors: C.-S. Chang G.T. DeTitta S.M. Gallo H.A. Hauptman H.G. Khalak D.A. Langs R. Miller S. Potter C.M.


slide-1
SLIDE 1

Shake-and-Bake: Applications and Advances Russ Miller & Charles M. Weeks

Hauptman-Woodward Med. Res. Inst.

Principal Contributors: C.-S. Chang G.T. DeTitta S.M. Gallo H.A. Hauptman H.G. Khalak D.A. Langs

  • R. Miller
  • S. Potter

C.M. Weeks

Partial funding from NIH and NSF.

slide-2
SLIDE 2

Outline of Talk

◆ Shake-and-Bake

❏ The Minimal Function

◆ SnB

❏ Results

◆ SnB v2.0

❑Rationale ❑Results

◆ Summary

slide-3
SLIDE 3

◆ Direct Methods use probabilistic theories to

exploit linear relationships among phases.

◆ Resolution of 1.2Å or better. ◆ Routinely applied to structures with 150 or

fewer atoms.

◆ Standard packages:

❏ SHELX ❏ teXsan ❏ SIR92/96

Direct Methods

slide-4
SLIDE 4

Conventional Direct Methods

Phase Refinement Peak Selection

FFT

{Trial Phases}

Solutions?

Reciprocal Space Real Space

Struct Factor

slide-5
SLIDE 5

Shake-and-Bake

Phase Refinement Peak Selection

{Trial Structures}

Struct Factor

FFT-1

slide-6
SLIDE 6

The Minimal Function

( )

R W est W W N E E E est

T T T T T T T h k h k T h k h k T T

= − ∑ ∑ = + + =      

− − − −

cos | | cos

/

φ φ φ φ φ φ

2 1 2

2 Triple: is the known expected value of

slide-7
SLIDE 7

◆ Direct Methods Optimization Technique ◆ Multiple Random-Atom Trial Structures ◆ Real/Reciprocal Space Cycling ◆ Phase Refinement Techniques:

❏ Parameter Shift ❏ Tangent Formula

◆ Minimal Function as FOM

Shake-and-Bake

slide-8
SLIDE 8

Structure Factors

Shake

Phase Refinement

Shake-and-Bake

Bake

Map Interpretation

Reciprocal Space Real Space

FFT FFT-1

T1 T2 T3 T1 T2 T3

slide-9
SLIDE 9

SnB: Random Start

slide-10
SLIDE 10

SnB: Final Structure

slide-11
SLIDE 11

Structure of SnB

  • (Shake-and-Bake)

(Shake-and-Bake)

slide-12
SLIDE 12

Shake-and-Bake

Preprocess Data Preprocess Data (Invariants) (Invariants) Structure Factors Structure Factors Compute Phases Compute Phases Minimal Minimal Function Function Perturb Perturb Phases Phases Phase Refinement Phase Refinement FFT FFT Peak Pick Peak Pick Find Atoms Find Atoms Process Process Trials Trials Output Output Trials Trials Shake-and-Bake Shake-and-Bake

slide-13
SLIDE 13

SnB Parameters

Default Ph8755 ToxII Atoms (asu) n 74 508 Phases 8n - 10n 740 5,000 Triples 70n - 100n 7,400 50,000 Cycles (PS) n/2 40 255 Peaks recycled 0.8n - n 74 400 E-Fourier Steps 2 2 5

slide-14
SLIDE 14

Ph8755: SnB Histogram

Atoms: 74 Phases: 740 Space Group: P1 Triples: 7,400

10 20 30 40 50 60

0.252 0.272 0.292 0.312 0.332 0.352 0.372 0.392 0.412 0.432

Trials: 100 Cycles: 40 Rmin range: 0.243 - 0.429

slide-15
SLIDE 15

Ph8755: Trace of SnB Solution

0.2 0.4 0.6 0.8 10 20 30 40

Atoms: 74 Space Group: P1 SnB Cycles: 40

slide-16
SLIDE 16

2 25 639 390 386 135 41

1

100 200 300 400 500 600 700

0.467 0.475 0.483 0.491 0.499 0.507 0.515 0.523 0.531 0.539

Rmin Trials

ToxII: SnB Histogram

Atoms: 500 Phases: 5,000 Space Group: P212121 Triples: 50,000 Trials: 1619 Cycles: 255 Rmin range: [0.467,0.532]

slide-17
SLIDE 17

Tox II: Trace of SnB Solution

0.44 0.49 0.54 100 200 Cycle Rmin

Atoms: 500 Space Group: P212121 SnB Cycles: 255

Solution

slide-18
SLIDE 18

Visualization in SnB (Ph8755)

Geomview: Geometry Center, U. Minn.

slide-19
SLIDE 19

Some SnB Applications

STRUCTURE LOCATION ATOMS SP GRP RES

Vancomycin Penn 258 P43212 0.9Å I4 Peptide HWI 289 I4 1.1 Microlide France 296 P21 1.1 Gramicidin A HWI 317 P212121 0.86 Er-1 pheromone UCLA 328 C2 1.0 Crambin HWI ~400 P21 0.83 Alpha-1 peptide OCI/U. of T. 471 P1 0.92 Rubredoxin HWI 497 P21 1.0 Scorpion Toxin II HWI 624 P212121 0.96

slide-20
SLIDE 20

Factors Determining Success Rate

◆ Data quality ◆ Resolution ◆ Complexity and connectivity of structure ◆ Space group ◆ Presence of heavy atoms

slide-21
SLIDE 21

An Interesting I4 Structure

◆ Structure:

❑Peptide with 10 Sulfurs ❑289 nonH atoms total ❑1.1Å resolution data

◆ Bugs: Special Positions & Refinement ◆ Results (SnB 2.0)

❑PS:

53% success rate

❑PS/Rest:

44% success rate

❑Tan:

25% success rate

slide-22
SLIDE 22

Extending Resolution: the I4 Structure

◆ Truncate to 1.2Å - 1.5Å ◆ Solutions at all resolutions

❑1.2Å: Standard bimodal distribution ❑1.3Å: Standard bimodal distribution ❑1.4Å: Good, but some mixing of solutions and

nonsolutions

❑1.5Å: Solutions (~50 deg. phase error) but not

recognizable by FOM

◆ Recognize low-resolution solutions??

slide-23
SLIDE 23

SnB 2.0: Rationale

◆ Improve running time

❑Build from ground up

◆ Provide additional features

❑Inverse Fourier ❑Density modification ❑Grid size ❑“Twice Baking” ❑Peaks at special positions

slide-24
SLIDE 24

SnB v2.0 Parameters/Proteins

Structure Atoms (n) Heavy Atoms Phases Cycles Max Succ Rate

Vancomycin 202 Cl 8 2000 200 0.8% I4 Peptide 248 S10 1900 250 53.0 Gramicidin A 272

  • 3000

275 1.1 Crambin 327 S6 3000 300 4.8 Rubredoxin 395 FeS6 4000 400 6.2 Scorpion Toxin II 508 S8 5000 500 1.4

Note: n = independent protein atoms

slide-25
SLIDE 25

SnB 2.0: Varying Peaks

Structure 50 100 200 300 400 Vancomycin 0.4% 0.6% 0.2%

  • I4 Peptide

53.0 52.0 45.0

  • Gramicidin A

0.0 0.3 1.1 0.7%

  • Crambin

4.3 4.8 3.3 3.4

  • Rubredoxin

5.7 6.2 5.4 3.9 3.4% Scorpion Toxin II

  • 1.0

1.4 0.4 0.1

slide-26
SLIDE 26

SnB 2.0: Varying Cycles

Structure 0.25n 0.5n 0.75n n 1.25n 1.5n

Vancomycin 0.1% 0.4% 0.4% 0.6% 0.7%

  • I4 Peptide

27.0 40.0 48.0 53.0

  • Gramicidin A

0.0 0.4 0.6 0.9 1.2 2.0% Crambin 3.1 4.1 4.6 4.8

  • Rubredoxin

4.6 5.5 5.9 6.0

  • Scorpion Toxin II

0.05 0.5 1.0 1.4

  • Success Rates while varying

SnB Phase Refinement Cycles

slide-27
SLIDE 27

SnB 2.0: Phase Refinement

Structure Peaks PS Standard PS Restricted Tangent Vancomycin 100 0.6% 0.4% 0.3% I4 Peptide 50 53.0 44.0 25.0 Gramicidin A 200 1.1 0.5 0.0 Crambin 100 4.8 3.7 2.2 Rubredoxin 150 6.0 5.2 4.0 Scorpion Toxin II 200 1.4 1.0 0.7

slide-28
SLIDE 28

SnB 2.0 Parameters (>1.1Å)

◆ Peaks

❑0.4n if “heavy” atoms present ❑0.8n if all C,N,O

◆ Phase Refinement

❑Unrestricted Parameter Shift

◆ Cycles

❑n/2 if n<400 and “heavy” atoms present ❑n otherwise

slide-29
SLIDE 29

SnB 2.0: Timings

(SGI R10000 Workstation)

Structure non-H Atoms Space Group n/2 Cycles Trials/ Day Solns/ Day Vancomycin 258 P43212 100 391 1.5 I4 Peptide 289 I4 125 274 110 Gramicidin A 317 P212121 135 572 2 Crambin ~400 P21 150 1029 42 Rubredoxin 497 P21 200 294 16 Scorpion Toxin II 624 P212121 250 109 0.5

Note: For each structure, optimum no. of peaks used.

slide-30
SLIDE 30

Computing Platforms

◆ Unix Workstations ❑ SGI, Sun, DEC/Alpha ❑ Wintel/Linux ◆ Parallel Computers ❑ Cray T3D/E, TMC CM-5, IBM SP2 ❑ SGI Origin 2000 ❑ HP-Convex Exemplar ◆ Cray C90

slide-31
SLIDE 31

Summary

◆ Shake-and-Bake: Dual-Space Direct Methods ◆ Targeted at 100-800 atom structures ◆ SnB version 2.0

❑ Optimized code with Inverse FFT ❑ Additional Density Modification Options ❑ Improved Fourier Recycling: “Twice Baking” ❑ I/O: |E| calculation and visualization interface ❑ (SIR/SAS/MAD Invariants with estimated values)

◆ http://www.hwi.buffalo.edu/SnB/

slide-32
SLIDE 32