Rigid body refinement (basics)
D.Svergun, EMBL-Hamburg
Rigid body refinement (basics) D.Svergun, EMBL-Hamburg Shapes from - - PowerPoint PPT Presentation
Rigid body refinement (basics) D.Svergun, EMBL-Hamburg Shapes from recent projects at EMBL-HH Complexes and assemblies Complexes and assemblies Domain and quaternary structure Domain and quaternary structure S-layer proteins Toxin B
D.Svergun, EMBL-Hamburg
Domain and quaternary structure Complexes and assemblies Domain and quaternary structure Complexes and assemblies
Dcp1/Dcp2 complex S-layer proteins Toxin B α-synuclein oligomers She et al, Mol Cell (2008) Fagan et al Mol. Microbiol (2009) Albesa-Jové et al JMB (2010) Giehm et al PNAS USA (2011)
In most cases, high resolution models are drawn inside
Structural transitions Flexible/transient systems
Src kinase Cytochrome/adrenodoxin Microbiol (2009) Complement factor H
are drawn inside the shapes
Bernado et al JMB (2008) Xu et al JACS (2008) Morgan et al NSMB (2011)
Th i l d l l Theoretical model or complete crystal structure available Validation in solution Incomplete structure available Addition of missing loops/domains Structure of subunits available Rigid body model of the complex Structure of domains and multiple curves available Model of the domain structure structure
♦ To obtain scattering from the particles, solvent
♦ Further, the bound solvent density may differ from
Ω Ω
2 b b s s a 2
♦ Aa(s): atomic scattering in vacuum
♦ As(s): scattering from the excluded volume ♦ Ab(s): scattering from the hydration ♦ Ab(s): scattering from the hydration shell CRYSOL (X ) CRYSOL (X-rays):
Svergun et al. (1995). J. Appl. Cryst. 28, 768
CRYSON (neutrons): Svergun et al. (1998) P.N.A.S. USA, 95, 2267
Ω Ω
2 b s a 2
2 2
lm l l m l
− = ∞ =
L l
= − =
L l l l m lm lm lm
2 2
L l
2 2
The
The programs programs: :
= − =
l l m lm lm lm
either
either fit fit the the experimental experimental data data by by varying varying the the density density
the hydration hydration layer layer δρ δρ (affects (affects the the third third term) term) and and y y ρ ( ) the the total total excluded excluded volume volume (affects (affects the the second second term) term)
predict the the scattering scattering from from the the atomic atomic structure structure using using default default parameters parameters (theoretical (theoretical excluded excluded volume volume using using default default parameters parameters (theoretical (theoretical excluded excluded volume volume and and bound bound solvent solvent density density of
1. .1 1 g/cm g/cm3 )
provide
provide output
files (scattering (scattering amplitudes) amplitudes) for for rigid rigid b d b d fi t fi t ti ti body body refinement refinement routines routines
compute
compute particle particle envelope envelope function function F( F(ω)
1) 1)
1) 1)
2) 2)
3) 3)
3) 3)
4) 4)
lg I, relative
3
Experimental data Fit with shell Fit without shell
2 3
Lysozyme
1 2
Hexokinase EPT
PPase
s, nm-1
1 2 3 4
12 Scattering length density, 1010cm-2
♦ For X-rays: both lead to
10 12 floppy side chains denser solvent layer solvent density
similar effect (particle appears larger) ♦ Floppy chains would in
6 8 protein density
s
♦ Floppy chains would in all cases increase the apparent particle size ♦ Neutrons in H O (shell):
4 6
♦ Neutrons in H2O (shell): particle would appear nearly unchanged
2
♦ Neutrons in D2O (shell): particle would appear smaller than the atomic model
SAXS SANS in H2O SANS in D2O
model
lg I, relative
lg I, relative
1
1
Neutrons, D2O Neutrons, H2O X-rays
X-rays
Neutrons, D2O Neutrons, H2O
2 4
s, nm-1
1 2 3
s, nm-1
Lysozyme: appears larger for X-rays Thioredoxine reductase : CRYSOL and smaller for neutrons in D2O and CRYSON fits with denser shell
The
The ‘cube ‘cube method’ method’ (Luzzati Luzzati et et al, al, 1972 1972; ; Fedorov Fedorov and and Pavlov, Pavlov, 1983 1983; ; M Mü üller ller, , 1983 1983) ) ensures ensures uniform uniform filling filling of
the excluded excluded volume volume. . Could/should/must Could/should/must be be superior superior over
the the effective effective atomic atomic factors factors method method at at higher higher angles angles. .
CRYDAM
CRYDAM (still unpublished) (still unpublished)
lg I, relative
♦ Represents hydration shell by dummy water atoms
2
dummy water atoms ♦ Handles proteins, carbohydrates, nucleic acids and their complexes ♦ Is applicable for wide angle
1 X-ray data, lysozyme Fit by CRYSOL Fit by CRYDAM
♦ Is applicable for wide angle scattering range Malfois, M. & Svergun, D.I. (2001), to be submitted
CRYSOL 3.0
CRYSOL 3.0 (is coming) (is coming)
s, nm-1 5 10
ed
. Bardhan Bardhan, , S S. . Park Park and and L L. . Makowski Makowski ( (2009 2009) ) SoftWAXS SoftWAXS: : a a computational computational tool tool for for modeling modeling wide wide-
angle X X-
ray solution solution scattering scattering from from biomolecules biomolecules J. . Appl Appl. . Cryst
42, 932 932-
943 - A program program to to compute compute WAXS WAXS y , p g p g p
Schneidman-
Duhovny D, D, Hammel Hammel M, M, Sali Sali A A. . ( (2010 2010) ) FoXS FoXS: : a a web web server server for for rapid rapid computation computation and and fitting fitting of
SAXS profiles profiles. . Nucleic Nucleic Acids Acids Res Res. . 38 38 Suppl Suppl: :W W540 540-
4. . - Debye Debye-
like computations, computations, Web Web server server Grishaev Grishaev A Guo Guo L Irving Irving T Bax Bax A (2010 2010) Improved Improved Fitting Fitting of
Solution X
Grishaev A, A, Guo Guo L, L, Irving Irving T, T, Bax Bax A. (2010 2010) Improved Improved Fitting Fitting of
Solution X- ray ray Scattering Scattering Data Data to to Macromolecular Macromolecular Structures Structures and and Structural Structural Ensembles Ensembles by by Explicit Explicit Water Water Modeling Modeling. . J Am Am Chem Chem Soc Soc. . 132 132, , 15484 15484-
6. . - Generate Generate bulk bulk and and bound bound waters waters with with MD, MD, do do fit fit the the data data to to the the model model
Poitevin F, F, Orland Orland H, H, Doniach Doniach S, S, Koehl Koehl P, P, Delarue Delarue M M ( (2011 2011) ). . AquaSAXS AquaSAXS: : a a web web server server for for computation computation and and fitting fitting of
SAXS profiles profiles with with non non-
uniformally hydrated hydrated atomic atomic models models. . Nucleic Nucleic Acids Acids. . Res Res. . 39 39, , W W184 184-
189 - Generate Generate waters waters around around proteins proteins using using MD MD (AquaSol AquaSol program) program) W189 189 - Generate Generate waters waters around around proteins proteins using using MD MD (AquaSol AquaSol program) program)
Virtanen JJ, JJ, Makowski Makowski L, L, Sosnick Sosnick TR, TR, Freed Freed KF KF. . ( (2011 2011) ) Modeling Modeling the the hydration hydration layer layer around around proteins proteins: : applications applications to to small small-
and wide wide-
angle x x-
ray scattering scattering. . Biophys Biophys J J. . 101 101, 2061 2061-
9. . - Use Use a a “ “HyPred HyPred solvation solvation” ” model model to to generate generate the the shell, shell, geared geared towards towards WAXS WAXS.
http://dara.embl-hamburg.de/ About 15000 atomic models of biologically active molecules are generated from the entries are generated from the entries in Protein Data Bank and the scattering patterns computed by CRYSOL Rapidly identifies proteins with similar shape (from low similar shape (from low resolution data) and neighbors in structural organization (from higher resolution data) higher resolution data)
Sokolova, A.V., Volkov, V.V. & Svergun, D.I. (2003) J. Appl. Crystallogr. 36, 865-868
Crystallographic packing forces are Packing forces in the crystal restrict the Crystallographic packing forces are comparable with the intersubunit
multisubunit macromolecules Packing forces in the crystal restrict the allosteric transition in aspartate transcarbamylase
multisubunit macromolecules could be significantly different from those in the crystal
Svergun, D.I., Barberato, C., Koch, M.H.J., Fetler, L. & Vachette, P. (1997). Proteins, 27, 110-117
lg I, relative
2 1 SAXS experiment Fit by 1yzb Fit by 2aga 0.0 0.2 0.4 0.6 0.8
NMR models of the Josephin domain of ataxin-3: red curve and chain: 1yzb, Nicastro et al. (2005) PNAS USA 102, 10493; blue curve and chain: 2aga, Mao et al (2005) PNAS USA 102 12700
s, A-1
(2006) J. Biomol. NMR, 36, 267.
Mao et al. (2005) PNAS USA 102, 12700.
Closure of the two domains of PGK upon substrate binding is essential for the enzyme Closure of the two domains of PGK upon substrate binding is essential for the enzyme
are required for the closure A SAXS fingerprint of A SAXS fingerprint of
(human PGK) SAXS proves that binding of both substrates induces the closure
Pig PGK Bs PGK Pig PGK Tm PGK Tb PGK Ligands/ Parameters Substr. free MgADP binary MgATP binary 3-PG binary
atern1 atern2 atern1 atern2
N 2 746 4 332 3 524 3 158 3 664 4 767 9 135 9 560
the closure
No 2.746 4.332 3.524 3.158 3.664 4.767 9.135 9.560 3-PG 2.678 5.329 3.297 1.958 3.655 4.234 6.052 6.125 MgATP 3.855 2.848 2.409 3.389 7.827 7.766 3.179 3.910 MgADP 1.486 3.235 1.627 1.140 1.780 2.463 5.151 6.193 MgATP*3-PG 6.140 6.044 4.656 5.307 5.146 4.805 2.247 1.611 MgADP*3-PG 2.303 3.522 2.795 2.049 2.712 2.810 2.018 2.922
Varga, A., Flachner, B., Konarev, P., Gráczer, E., Szabó, J., Svergun, D., Závodszky, P. & Vas, M. (2006) FEBS Lett. 580, 2698-2706.
Rg (theor), A 24.25 24.34 24.02 23.97 24.24 24.16 23.26 22.64
Biologically active dimer of myomesin-1 Experiment started: 24-07-2004 at 21:09 Final result obtained: 24-07-2004 at 21:48
Pinotsis, N., Lange, S., Perriard, J.-C., Svergun, D.I. & Wilmanns, M. (2008) EMBO J . 27, 253-264
In eukaryotes, DNA rereplication is prevented by control of the assembly f li i l ( RC )
Lee et al (2004), Nature, 430, 913
component of the pre-RC assembly. Timely inhibition of Cdt1 by Geminin is
tGeminin dimer + tCdt monomer
Timely inhibition of Cdt1 by Geminin is essential to this DNA replication licensing. SAXS identifies crystallization conditions for the complex
van Gerwen, A. Fish, M. V. Petoukhov, M. S. Iliou, Z. Lygerou, R. H. Medema, J. J. Blow, D. I. Svergun, S. Taraviras & A. Perrakis (2009) PNAS USA, 106, 19807
In eukaryotes, DNA rereplication is prevented by control of the assembly f li i l ( RC )
component of the pre-RC assembly. Timely inhibition of Cdt1 by Geminin is Timely inhibition of Cdt1 by Geminin is essential to this DNA replication licensing. The mechanism of DNA licensing inhibition by Geminin, is analyzed by combining MX, SAXS and functional
can e ist in t o distinct forms a can exist in two distinct forms, a ‘‘permissive’’ heterotrimer and an ‘‘inhibitory’’ heterohexamer.
van Gerwen, A. Fish, M. V. Petoukhov, M. S. Iliou, Z. Lygerou, R. H. Medema, J. J. Blow, D. I. Svergun, S. Taraviras & A. Perrakis (2009) PNAS USA, 106, 19807
in reference positions are known known.
complex can be constr cted b mo ing and constructed by moving and rotating the second subunit.
depends
three Euler rotation angles and three Cartesian shifts. three Cartesian shifts.
in reference positions are known known.
complex can be constr cted b mo ing and constructed by moving and rotating the second subunit.
depends
three Euler rotation angles and three Cartesian shifts. three Cartesian shifts.
Rotation: A Shift: x, y, z C B
Using spherical harmonics the amplitude(s) of arbitrarily Using spherical harmonics the amplitude(s) of arbitrarily
Rotation: α, β, γ
Using spherical harmonics, the amplitude(s) of arbitrarily Using spherical harmonics, the amplitude(s) of arbitrarily rotated and displaced subunit(s) are analytically expressed rotated and displaced subunit(s) are analytically expressed via via the initial amplitude and the six positional parameters: the initial amplitude and the six positional parameters: Clm
lm(s) =
(s) = C (B (B β ) Clm
lm(B
(Blm
lm,
, α, , β, , γ, x, y, z). , x, y, z). The scattering from the complex is then rapidly calculated as The scattering from the complex is then rapidly calculated as
∞
* 2
l l lm lm B A
− l
Svergun, D.I. (1991). J. Appl. Cryst. 24, 485-492
Interconnectivity Interconnectivity Absence of steric clashes Absence of steric clashes Symmetry Symmetry Symmetry Symmetry Intersubunit contacts Intersubunit contacts (from chemical shifts by (from chemical shifts by NMR or mutagenesis) NMR or mutagenesis) NMR or mutagenesis) NMR or mutagenesis) Distances between Distances between residues (FRET or residues (FRET or mutagenesis) mutagenesis) g ) g ) Relative orientation of Relative orientation of subunits (RDC by NMR) subunits (RDC by NMR) Scattering data from Scattering data from subcomplexes subcomplexes
Petoukhov & Svergun Petoukhov & Svergun (2005) (2005) Biophys J Biophys J 89 89 1237; 1237; (2005) (2005) Biophys J. Biophys J. 89 89, 1237; , 1237; (2006) (2006) Eur. Biophys. J
. 35 35, , 567. 567.
♦ ASSA (SUN/SGI/DEC) ♦ MASSHA (Win9x/NT/2000)
Kozin & Svergun (2000). J. Appl.
Konarev, Petoukhov & Svergun (2001).
EPSPS
Feil, I.K., Malfois, M., Hendle, J., van der Zandt, H. & Svergun, D.I. (2001) J. Biol. Chem. 276, 12024-12029
Fit ( lti l X Fit ( lti l X d t ) tt i ( ) f ti l d t ) tt i ( ) f ti l Fits (multiple X Fits (multiple X-ray and neutron) scattering curve(s) from partial ray and neutron) scattering curve(s) from partial constructs or contrast variation using simulated annealing constructs or contrast variation using simulated annealing Requires models of subunits, builds interconnected models without Requires models of subunits, builds interconnected models without steric clashes steric clashes steric clashes steric clashes Uses constraints: symmetry, distance (FRET or mutagenesis) Uses constraints: symmetry, distance (FRET or mutagenesis) relative orientation (RDC from NMR), if applicable relative orientation (RDC from NMR), if applicable
lg I, relative
Petoukhov & Svergun (2005) Petoukhov & Svergun (2005) Biophys J. Biophys J. 89 89, 1237; , 1237; (2006) (2006) Eur. Biophys. J
. 35 35, 567. , 567.
10 11 9 10
s, nm-1
0.5 1.0 1.5 2.0 8
A tyrosine kinase MET (118 kDa) consisting of five domains Single curve fitting with Program SASREF distance constraints: C to N C to N termini contacts
Gherardi, E., Sandin, S., Petoukhov, M.V., Finch, J., Youles, M.E., Ofverstedt, L.G., Miguel, R.N., Blundell, T.L., Vande Woude, G.F., Skoglund, U. & Svergun, D.I. (2006) PNAS USA, 103, 4046.
Receptor binding Monomeric fraction Receptor binding H(C) domain reveals concentraton- dependent fraction Dimeric fraction 100 : 0 0 : 100 dependent
Polydisperse fractions 64 : 36 43 : 57 21 : 79 14 : 86 Mon:Dim Ab initio and rigid body analysis of the dimeric H(C) domain using the structure of the monomer in the crystal (1FV2) and accounting that the mutant Cys869Ala remains Mon:Dim Qazi, O., Bolgiano, B., Crane, D., Svergun, D.I., Konarev, P.V., Yao, Z.P., Robinson, C.V., Brown, K.A. & Fairweather N. (2007) J Mol Biol. 365, 123–134. always monomeric yield a unique model of the dimer
Eleven X-ray and neutron curves Atomic and homology Atomic and homology gy gy models models Distance restrains from tRNA Distance restrains from tRNA footprinting (Arts et al footprinting (Arts et al footprinting (Arts et al. footprinting (Arts et al. (1998) (1998) EMBO J. EMBO J. 17, 17, 7430) 7430)
Fukuhara et al. (in preparation)
Flexible loops or domains Flexible loops or domains are often not resolved in are often not resolved in high resolution models or high resolution models or genetically removed to genetically removed to facilitate crystallization facilitate crystallization Tentative configuration of Tentative configuration of g such fragments are such fragments are reconstructed by fixing the reconstructed by fixing the known portion and adding known portion and adding th i i t t fit th th i i t t fit th the missing parts to fit the the missing parts to fit the scattering from the full scattering from the full-
length macromolecule.
Using DR-type models and protein type models and protein-
specific penalty functions g yp p yp p p p y p p y Primary sequence Secondary structure Excluded volume
Number of neighbours 6 2 3 4 5 6 Shell radius, nm 0.2 0.4 0.6 0.8 1.0 1
Neighbors d b Knowledge-based t ti l Bond angles & dihed als dist ib tion distribution potentials dihedrals distribution
Petoukhov, M.V., Eady, N.A.J., Brown, K.A. & Svergun, D.I. (2002) Biophys. J. 83, 3113
BUNCH combines rigid body and ab initio ab initio modelling to find the positions and orientations modelling to find the positions and orientations modelling to find the positions and orientations modelling to find the positions and orientations
flexible linkers represented as “dummy residues” flexible linkers represented as “dummy residues” chains chains
Multiple experimental scattering data sets from partial constructs (e.g. deletion mutants) can be partial constructs (e.g. deletion mutants) can be pa t a co st ucts (e g de et o uta ts) ca be pa t a co st ucts (e g de et o uta ts) ca be fitted simultaneously with the data of the full fitted simultaneously with the data of the full-
length protein.
BUNCH accounts for symmetry, allows one to fix some domains and to restrain the model by some domains and to restrain the model by contacts between specific residues contacts between specific residues
Petoukhov, M. V. & Svergun, D. I. (2005). Biophys. J. 89, 1237-1250
The dimeric sensor histidine-kinase PrrB from Mycobacterium tuberculosis contains ATP y binding and dimerization domains and a 59 aas long (flexible) HAMP linker
Tentative homology model based on Thermotoga maritima CheA Three domain Two domain
PrrB model after rigid body refinement and addition of HAMP linker Nowak, E., Panjikar, S., Morth, J. P., Jordanova R., Svergun, D. I. & Tucker, P. A. (2006) Structure, 14, 275
The dimeric sensor histidine-kinase PrrB from Mycobacterium tuberculosis contains ATP
Tentative homology model
y binding and dimerization domains and a 59 aas long (flexible) HAMP linker
based on Thermotoga maritima CheA Three domain Two domain
Superposition with the independently determined sensor histidine-kinase from PrrB model after rigid body refinement and addition of HAMP linker Superposition with the independently determined sensor histidine kinase from Thermotoga maritima (Marina A. et al. (2005) Embo J. 24, 4247) Nowak, E., Panjikar, S., Morth, J. P., Jordanova R., Svergun, D. I. & Tucker, P. A. (2006) Structure, 14, 275
The homotetrameric p53 plays a central role in the cell cycle and maintaining genomic The homotetrameric p53 plays a central role in the cell cycle and maintaining genomic
intrinsically disordered segments. Cross-shaped extended p53 from SAXS and NMR Compact and NMR p53/DNA from SAXS and an independent EM reconstruction Tidow, H., Melero, R., Mylonas, E., Freund, S.M., Grossmann, J.G., Carazo, J.M., Svergun, D.I., Valle, M. & Fersht, A.R. (2007) Proc Natl Acad Sci USA, 104, 12324
A merger of SASREF and BUNCH: advanced methods to account for missing loops in multi missing loops in multi-subunit protein structures (RANLOGS CORAL) subunit protein structures (RANLOGS CORAL) missing loops in multi missing loops in multi subunit protein structures (RANLOGS, CORAL) subunit protein structures (RANLOGS, CORAL)
M.V. Petoukhov, D. Franke, A. Shkumatov, G. Tria, A.G. Kikhney, M. Gajda, C. Gorba, H.D.T. Mertens, P.V. Konarev, D.I. Svergun (2012). J. Appl. Cryst. 45, 342-350.
∞
⎥ ⎦ ⎤ ⎢ ⎣ ⎡ + + − − − = ) ( ) ( sin ) ( ) ( sin ) ( ) (
k k k k
s s D s s D s s D s s D s I s s I
A solution scattering curve f ti l ith i
=
⎦ ⎣ + −
1
) ( ) (
k k k
s s D s s D
from a particle with maximum size D can be represented by its values taken at discrete points (Shannon channels)
10
2
I(s)
2 4 6 8 10 12
Ns
points (Shannon channels) sk = kπ/ D I t i l SAS i t
10
1
In a typical SAS experiment, Ns ≈ 5-15
10
0.00 0.05 0.10 0.15 0.20
s, A
( ) The mathematical theory of
Press, Urbana.
Shape determination: M≈ 103 variables (e.g. 0 or 1 bead assignments in DAMMIN Rigid bod methods M 101 a iables (positional and otational Rigid body methods: M≈ 101 variables (positional and rotational parameters of the subunits) From the informational point of view, rigid body modeling should provide unique or at least much less ambiguous models than shape determination NO WAY NO WAY As all the problems are non-linear, the number of Shannon channels does t i t b f t hi h i ibl t t t not give you exact number of parameters, which is possible to extract from the scattering data (depending on accuracy, this number varies between zero and infinity). Further, uniqueness of reconstruction depends largely on the complexity
A synthetic example: two different orientations of
A synthetic example: two different orientations of
A synthetic example: two different orientations of
A synthetic example: two different orientations of tRNA in a dimeric complex with aspartyl tRNA in a dimeric complex with aspartyl-
tRNA synthetase obtained by rigid body modelling and synthetase obtained by rigid body modelling and compatible with X compatible with X-
ray and contrast variation neutron
Petoukhov, M.V. & Svergun, D. I. (2006) Eur. Biophys. J. 35, 567-576
scattering data scattering data
Information
Information
Information
Avoiding
For
For
R d d l l f f t i ti l t i f t k f ti Reduced levels of frataxin, an essential protein of yet unknown function, cause neurodegenerative pathology. Its bacterial orthologue (CyaY) forms functional complexes with the two central components to iron–sulphur cluster assembly: desulphurase Nfs1/IscS SAXS: free IscS is p scaffold protein Isu/IscU. IscS IscS dimeric, free IscU and CyaY are monomeric IscS IscU CyaY IscS IscU IscS/IscU Ab initio and rigid body models of complexes: I U bi d th (sol) IscU IscS/CyaY IscS/CyaY IscS/IscU IscU binds on the periphery of IscS dimer, CyaY binds close to the dimerization interface IscU (MX) C Y IscS/CyaY/IscU Prischi F, Konarev PV, Iannuzzi C, Pastore C, Adinolfi S, Martin SR, Svergun DI & Pastore A. (2010) Nat Commun. 1, 95-104 CyaY IscS/CyaY/IscU
Nuclear receptors Complement factor H Flt3 signaling complex PDH complex E2 core Rochel et al NSMB (2011) Morgan et al NSMB (2011) Verstraete et al Blood (2011) Marrott et al FEBS J (2011) ( ) ( ) HCV NS3/4A inhibitor Colonization factor GbpA Myomesin Nanocomposites FEBS J (2011) Schiering et al PNAS USA (2011) Wong et al Plos Pathog (2012) Pinotsis et al Plos Biol (2012) Shtykova et al JPC (2012)
Lysozyme and its near
Lysozyme and its near
7.5 No scale LYZ23.FIT LYZ23.FIT LYZ58.FIT FOOL01.FIT FOOL03.FIT 17-Oct-2001 04:24:12 Close window to continue 001 002 003 6 6.5 7 Y 003 004 005 5 5.5 6 0.2 0.4 0.6 0.8 1 1.2 1.4 X Scales : 1.00 1.00 1.00 1.00 1.00