Mixtures, assemblies & flexible systems
Haydyn Mertens (EMBL-Hamburg)
Mixtures, assemblies & flexible systems Haydyn Mertens - - PowerPoint PPT Presentation
Mixtures, assemblies & flexible systems Haydyn Mertens (EMBL-Hamburg) DOGMA: No function without structure! Not true! Flexibility enables more function Hub proteins in networks Interaction specialists Tompa, Peter, et al.
Haydyn Mertens (EMBL-Hamburg)
Hub proteins in networks Interaction specialists
Tompa, Peter, et al. “Intrinsically disordered proteins: emerging interaction specialists." Current Opinion in Structural Biology 35 (2015): 49-59.
Chemometric
“Model-free” deconvolution
Ensembles
Flexible systems
Combinations
Known components
Basic SAXS/SANS parameters provide initial hints!
Monodisperse non-interacting systems
I(s) = 4π p(r)sin(sr) sr
Dmax
dr
(averaged over all orientations)
investigation (at low resolution)
Polydisperse and interacting systems
I(s) = 4π p(r)sin(sr) sr
Dmax
∫
dr
(averaged over all orientations)
k
× vk
I(s) = 4π p(r)sin(sr) sr
Dmax
∫
dr
× vk
× vk
I(s) = 4π p(r)sin(sr) sr
Dmax
∫
dr
× vk
I(s) = 4π p(r)sin(sr) sr
Dmax
∫
dr
× vk × vk × vk
parameters
Sample characterisation
Mixture? Flexible?
Mertens & Svergun, J. Struct. Biol. (2009)
0.0005 0.001 0.0015 0.002 q
2 Åln[I(q)] y = -2.8441 - 284.09 * x y = -5.8408 - 199.89 * x Rg = 29.2 Rg = 24.5 0.001 0.01 0.1 1 q, Å
10
10
10
10
10
I(q), cm
Protein A (dimer) Protein A (monomer)
0.001 0.01 0.1 1 q, Å
10
10
10
10
10
I(q), cm
Protein A (dimer) Protein A (monomer) 20 40 60 80 100 r, Å 10
10
10
10
p(r), rel. units Protein A (dimer) Protein A (monomer)
Vp = 2π 2 I(0) Q
Q = q2[I(q)− A]dq
∞
∫
Via p(r) (reg. curve)
I(s) = vkIk(s)
k
∑
+
I(s) =∑kvkIk(s)
Konarev et al., J Appl Cryst. 36, 1277-1282.
I(s) = vkIk(s)
k
∑
+
× vk × vk
I(s) =∑kvkIk(s)
Konarev et al., J Appl Cryst. 36, 1277-1282.
Keown, J, Griffin, M, Mertens H, Pearce, G. J. Biol. Chem. (2013)
I(s) = vkIk(s)
k
∑
An encounter adrenodoxin/cytochrome C complex
I(s) = vkIk(s)
k
∑
Volume fractions Mon:Dim:Tri:Tet 0 : 0 : 50 : 50 0 : 8 : 47 : 45 5 : 25 : 55 : 15 25 : 25 : 50 : 0 0 : 100 : 0 : 0
SAXS model
hetero-tetrameric native complex at high salt and high solute concentration
24 mg/ml 12 6 2.4 Cross-linked
describe the data
Flexibility & conformational polydispersity
I(s) = vkIk(s)
k
∑
Mertens & Svergun,JSB, 2010, 172(1)
Mertens et al., JBC. (2012), 287:41
Unfolded Partially folded Folded (1.73,1.104)
20 residue linkers
I(s) = vkIk(s)
k
∑
Pool Ik(s) Ensembles Best fitting ensemble Analysis
(Bernado et al. JACS, 2007; Tria et al. IUCRJ, 2014)
I(s) = vkIk(s)
k
∑
Bernado & Svergun Mol. Biosystems. (2012) 8:151-167
I(s) = vkIk(s)
k
∑
LRCMQCKTNGDCRVEECALGQDLCRTTIVRLWEEGEELELVEKS CTCSEKTNRTLSYRTGLKITSLTEVVCGLDLCNQGNSGRAVTYS RSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVVTHWIQE GEEGRPKDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKC NEGPILELENLPQNGRQCYSCKGNSTHGCSSEETFLIDCRGPMN QCLVATGTHEPKNQSYMVRGCATASMCQHAHLGDAFSMCHIDVS CCTKSGCNHPDLDVQYRSG Rigid body 1 (PDB) Rigid body 2 (PDB) Rigid body 3 (PDB)
I(s) = vkIk(s)
k
∑
Rigid body 1 (PDB) Rigid body 2 (PDB) Rigid body 3 (PDB) LRCMQCKTNGDCRVEECALGQDLCRTTIVRLWEEGEELELVEKS CTCSEKTNRTLSYRTGLKITSLTEVVCGLDLCNQGNSGRAVTYS RSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVVTHWIQE GEEGRPKDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKC NEGPILELENLPQNGRQCYSCKGNSTHGCSSEETFLIDCRGPMN QCLVATGTHEPKNQSYMVRGCATASMCQHAHLGDAFSMCHIDVS CCTKSGCNHPDLDVQYRSG
I(s) = vkIk(s)
k
∑
❏Symmetric core ❏Symmetric linkers/termini ❏Symmetric core ❏Asymmetric linkers/termini
I(s) = vkIk(s)
k
∑
Experimental data (*.dat) Models (*.pdb) EOM χ2(fit) Sequence (*.txt) (rigid bodies) Rg dist. Dmax dist. Symmetry
I(s) = vkIk(s)
k
∑
Mylonas et al. (2008), Biochemistry 47:10345-10353
* * * * * * * * * * * *
Pool Sel
I(s) = vkIk(s)
k
∑
Mertens et al., JBC, 2012, 287(41), 34304-34315
❏ uPAR WT ❏ uPAR mutant
I(s) = vkIk(s)
k
∑
Hb(S)=-Σp(xi)logb[p(xi)] Rflex = -Hb(S)
(high uncertainty) (low uncertainty)
Rσ = σs / σp 0% ← Rflex→ 100% Rigid Flexible 0 ← Rσ→ 1
Tria, Mertens et al., 2014
I(s) = vkIk(s)
k
∑
Hb(S)=-Σp(xi)logb[p(xi)]
Tria, Mertens et al., 2014
I(s) = vkIk(s)
k
∑
Panjkovich & Svergun, PCCP, 2016
I(s) = vkIk(s)
k
∑
I(s) = vkIk(s)
k
∑
36
PRIMUS GNOM DATtools SHANUM
Data collection No apriori information Hi-res model available Partial hi-res model available
CRYSOL CRYSON DAMMIN GASBOR Ab initio modeling Ambimeter Hybrid modeling SREFLEX EOM SASREF CORAL
flexible system “rigid” system
CORMAP
MONODISPERSE MIXTURE
MIXTURE OLIGOMER SVD
VALIDATE
SUPALM/DAMAVER/SASRES
MEMPROT
Chemometrics is the use of mathematical and statistical methods to improve the understanding of chemical information and to correlate quality parameters or physical properties to analytical instrument data.
Model-free deconvolution approach
SVD Factorization of a matrix: Y = USV Matrix = rotate-stretch-rotate
Singular value decomposition of data matrix Y (I(s) vs s): Y (S x W) = USV U (S x N) formed by significant eigenvectors of YYt V (N x W) formed by significant eigenvectors of YtY Columns of U and rows of V orthogonal: UtU = VVt = I (identity matrix) S (N x N) diagonal matrix, elements are positive square roots of the significant eigenvalues of YYt or YtY in descending order
Right singular vectors Left singular vectors SAXS profiles
(elution peaks)
Rotation (A=VR) Rotation (C=UR)
SVD using order from eg. SEC-SAXS
SVD with direction
component
Model Concentration Profile EFA Plot 4 components
detector associated with increase of rank by 1
rank
find point of disappearance
❏ Concentration of a component is zero outside its “window” ❏ EFA forward/backward defines limits
❏ Component i appears where rank Y rises to i (in forward direction) ❏ Disappears when rank rises to N + 1 - i (in backward direction)
Mixtures
Be aware! Be careful!
fibrillation)
I(s) = vkIk(s)
k
∑
Giehm, L., Svergun, D.I., Otzen, D.E. & Vestergaard, B. (2011) PNAS USA, 108, 3246
The oligomer is shown to disrupt liposomes, i.e. it is potentially cytotoxic