SAS data reduction Haydyn Mertens (EMBL-Hamburg) Data reduction - - PowerPoint PPT Presentation

sas data reduction
SMART_READER_LITE
LIVE PREVIEW

SAS data reduction Haydyn Mertens (EMBL-Hamburg) Data reduction - - PowerPoint PPT Presentation

SAS data reduction Haydyn Mertens (EMBL-Hamburg) Data reduction steps Acquisition Reduction Parameters SAXS instrumentation 2D images to 1D profile SAXS invariants - Sample - Integration - Rg - Buffer - Normalisation - I0 (MM) -


slide-1
SLIDE 1

SAS data reduction

Haydyn Mertens (EMBL-Hamburg)

slide-2
SLIDE 2

Data reduction steps

Parameters

SAXS invariants

  • Rg
  • I0 (MM)
  • Vp

Acquisition

SAXS instrumentation

  • Sample
  • Buffer
  • Background

Reduction

2D images to 1D profile

  • Integration
  • Normalisation
  • Averaging/Subtract
slide-3
SLIDE 3

Data Acquisition

Detection

X-rays, neutrons Detectors

slide-4
SLIDE 4 YEARS I 1974–2014

Instrumentation – X-rays

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Monochromatic and collimated X-ray radiation
  • Reduced parasitic scattering
  • Calibrated detector with low background

slits slits detector Scattered X-rays

primary beam beamstop

Wavelengths ~ 0.06 nm - 0.15 nm

slide-5
SLIDE 5 YEARS I 1974–2014

Haydyn Mertens, EMBO 2017 (Singapore)

  • 6. December 2017
  • Cf. Al Kikhney (EMBL-Hamburg)

Instrumentation – X-rays

slide-6
SLIDE 6 YEARS I 1974–2014

Instrumentation – Neutrons

Haydyn Mertens, EMBO 2017 (Singapore)

  • 6. December 2017
  • Monochromatic and collimated radiation
  • Reduced parasitic scattering
  • Calibrated detector with low background

collimator collimator detector Scattered Neutrons

primary beam beamstop

Wavelengths ~ 0.2 nm – 1.0 nm Velocity selector

slide-7
SLIDE 7 YEARS I 1974–2014

The small-angle scattering experiment

  • X-rays are scattered mostly by electrons
  • Thermal neutrons are scattered mostly by nuclei
  • Scattering amplitude from an ensemble of atoms A(s) is the

Fourier transform of the scattering length density distribution in the sample r(r)

  • Experimentally, scattering intensity I(s) = [A(s)]2 is measured.

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

k0 = 2π/λ

k1 s = k1-k0 =(4πsinθ)/λ

Radiation sources: X-raylaboratory λ = 0.1-0.2 nm X-raysynchrotron λ = 0.03-0.35 nm Neutrontherm λ = 0.2-1 nm

slide-8
SLIDE 8 YEARS I 1974–2014

The small-angle scattering experiment

  • 2D pattern collected
  • Exposure time (ms- sec)
  • Transmitted beam intensity (beam-stop)
  • Radially averaged (for isotropic scattering) à 1D profile
  • Normalisation
  • Frames checked for radiation damage and averaged
  • Subtraction of buffer/background

Haydyn Mertens, EMBO 2017 (Singapore)

  • 5. December 2017
1 2 3 4 5 s, nm
  • 1
I(s), rel. units. 5 10 15 r, nm 0.0 0.5 1.0 p(r), rel. units.
slide-9
SLIDE 9 YEARS I 1974–2014

The small-angle scattering experiment

Haydyn Mertens, EMBO 2017 (Singapore)

  • 6. December 2017
  • MASKING
  • S-AXIS calibration
  • AgBeh (d = 5.38 nm)
  • Intensity scale calibration
  • H2O 0.0163 cm-1
slide-10
SLIDE 10

SAS sample measurement

  • Subtract scattering from matrix/solvent (also reduces contribution of

background, eg. slits & sample holder)

  • Contrast Dr

Dr = <r(r) - rs>, where rs is the scattering density of the matrix, may be very small for biological samples To obtain scattering from particles of interest:

Particle+matrix matrix Difference

slide-11
SLIDE 11 YEARS I 1974–2014

What are we really measuring?

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Contrast and the other things ...
  • SAXS intensity profile of “difference” between particle and

matrix/background.

I(q) = ξ dσ (q) dΩ = ξ nΔρ2V 2P(q)S(q)

Number of particles Contrast Particle volume Form factor Structure factor Instrument constant

slide-12
SLIDE 12 YEARS I 1974–2014

What are we really measuring?

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Contrast and the other things ...
  • SAXS intensity profile of “difference” between particle and

matrix/background.

I(q) = ξ dσ (q) dΩ = ξ nΔρ2V 2P(q)S(q)

Number of particles Contrast Particle volume Form factor Structure factor Instrument constant

Solute Buffer

Scattering Density Δρ

slide-13
SLIDE 13 YEARS I 1974–2014

What are we really measuring?

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • SAXS intensity profile of “difference” between particle and

matrix/background.

  • If dilute enough, S(q) = 1.0 (can neglect)
  • If n high enough (concentrated sample), we have signal!
  • If V is big (eg. large protein), strong signal (even if n is small)
  • P(q) defines the shape
  • If contrast (Δρ = ρparticle – ρsolvent) is small à see close to nothing!

I(q) = ξ dσ (q) dΩ = ξ nΔρ2V 2P(q)S(q)

slide-14
SLIDE 14 YEARS I 1974–2014

Contrast

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • X-ray scattering densities of solvents and macromolecules
  • Possible to contrast match in an X-ray experiment but tricky!

Scattering species Scattering density eÅ-3 H2O 0.334 D2O 0.334 50 % Sucrose in H2O 0.40 Protein 0.42 RNA 0.46 DNA 0.55

Adapted from Svergun & Feigin, Structure Analysis by Small-Angle X-Ray and Neutron Scattering, Plenum, 1987

slide-15
SLIDE 15 YEARS I 1974–2014

Contrast for X-ray

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Matching electron density of sub-complex (eg. Protein:DNA)

Standard buffer (aqueous) ~ 50% sucrose

I = IDNA + IDNAIPROT + IPROT I = IDNA + IDNAIPROT + IPROT I = IDNA + IDNAIPROT + IPROT

slide-16
SLIDE 16

Characterise your sample & Look at your data!

PAGE/SEC-MALLS/SAXS DATA

slide-17
SLIDE 17

Standard situation

Monodisperse non-interacting systems

I(s) = 4π p(r)sin(sr) sr

Dmax

dr

  • Observed scattering proportional to

(averaged over all orientations)

  • Facilitates size, shape internal structure

investigation (at low resolution)

slide-18
SLIDE 18

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

ideal

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

attractive

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

repulsive

  • Form factor of each particle in the solution summed
  • Monodisperse
  • Dilute

Experimental SAS profile

slide-19
SLIDE 19

Experimental SAS profile

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

ideal

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

attractive

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

repulsive

Inter-particle distances begin to be of the same order as the intra- particle distances.

  • Form factor and structure factor
  • Interparticle interference
  • Attractive (eg. aggregation)
slide-20
SLIDE 20

Experimental SAS profile

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

ideal

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

attractive

0.5 1 1.5 2 2.5 3 s, nm

  • 1

2 4

repulsive

Start to see a set of dominant average inter-particle distances (usually only at high conc.).

  • Form factor and structure factor
  • Interparticle interference
  • Repulsive (eg. ordering)
slide-21
SLIDE 21 YEARS I 1974–2014

1 2 3 4 5 s, nm

  • 1

I(s), rel. units. 5 10 15 r, nm 0.0 0.5 1.0 p(r), rel. units.

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

) s R ) I( I(s)

g 2 2

3 1 exp(

  • @

Radius of gyration Rg (Guinier, 1939)

Maximum size Dmax: p(r)=0 for r> Dmax

Dmax

Excluded particle volume (Porod, 1952)

ò

¥

= =

2 2

) ( I(0)/Q; 2 V ds s I s Q p

Parameters from SAS:

slide-22
SLIDE 22 YEARS I 1974–2014

Big vs small objects and the scattering angle!

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Intensity drops off more rapidly for larger particles!

Adapted from: Kratky, O. (1963) Prog. Biophys. Mol. Biol. 13.

2θ Δb Δb = Δa à signal cancellation at lower angles Δa = 1λ àall phases uniformly represented (cancellation of signal) 2θ Δa

slide-23
SLIDE 23 YEARS I 1974–2014

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

① SAMPLE OPTIMISATION ② DATA COLLECTION ③ DATA REDUCTION ④ PRIMARY ANALYSIS ⑤ MODELING ⑥ VALIDATION

slide-24
SLIDE 24 YEARS I 1974–2014

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

① SAMPLE OPTIMISATION ② DATA COLLECTION ③ DATA REDUCTION ④ PRIMARY ANLAYSIS ⑤ MODELING ⑥ VALIDATION

slide-25
SLIDE 25 YEARS I 1974–2014

Data reduction

1D profiles

Averaging/Subtraction Merging

slide-26
SLIDE 26 YEARS I 1974–2014

Data flow

19/06/12

Obtain 1D profiles of frames (sample & buffer) Average frames

(in PRIMUS)

Subtract background/buffer scattering

(in PRIMUS)

Guinier analysis Concentration dependent behaviour? average Merge/extrapolate No Yes IFT (eg. GNOM) Modeling Fitting (eg. CRYSOL)

slide-27
SLIDE 27 YEARS I 1974–2014

Data reduction: Averaging of frames in PRIMUS

19/06/12

Concentration series loaded (including buffers)

lin - lin log - lin

slide-28
SLIDE 28 YEARS I 1974–2014

Data reduction

19/06/12

Averaging data sets

Sample and two buffer sets Sample VS buffer

slide-29
SLIDE 29 YEARS I 1974–2014

Data reduction

19/06/12

Averaging data sets

Average buffers

slide-30
SLIDE 30 YEARS I 1974–2014

Data reduction

19/06/12

Subtraction of the background (averaged buffer)

Subtract average buffer from sample

slide-31
SLIDE 31

Radiation damage

Frames change

Following exposure

  • Intensity increase
  • SAS parameters
slide-32
SLIDE 32

Concentration dependence!

slide-33
SLIDE 33 YEARS I 1974–2014

Concentration dependence

19/06/12

  • Difference of low and higher concentration data

Low conc. High conc.

slide-34
SLIDE 34 YEARS I 1974–2014

Merging data sets

19/06/12

  • Option 1:
  • Use lowest conc data for low-s and merge with high-s from high conc

data.

  • Option 2:
  • If Rg and I(0) change in a linear way with concentration, extrapolate

to infinite dilution (remove structure factor) Low-s region often contains significant structure factor

slide-35
SLIDE 35 YEARS I 1974–2014

Option 1: Merging data sets

19/06/12

Rg changes linearly (or close enough) with concentration

slide-36
SLIDE 36 YEARS I 1974–2014

Option 1: Merging data sets

19/06/12

  • Select low-s region of low concentration data

Reduce influence of interparticle interference

Merge with data

slide-37
SLIDE 37 YEARS I 1974–2014

Option 1: Merging data sets

19/06/12

  • Select overlapping region with high concentration data set

(should be no influence of structure factor at these “atomic distances”)

Reduce influence of interparticle interference

Merge data

slide-38
SLIDE 38 YEARS I 1974–2014
  • Merged and scaled to lowest concentration data set

Option 1: Merging data sets

19/06/12

Reduce influence of inter-particle interference

New 1D SAXS profile

slide-39
SLIDE 39 YEARS I 1974–2014

Option 2: extrapolation to infinite dilution

19/06/12

  • Find linear relationship between Rg and/or I(0) and

concentration

  • Simple extrapolation to zero concentration à ideal SAXS

Reduce influence of interparticle interference

  • Works well for repulsive

interference

  • Extrapolate:
  • I(q) – (a) and (b)
  • I(0) or Rg (c) directly
  • Automated method available
  • ALMERGE (ATSAS)

Glatter, O., and O. Kratky. 1982. Small Angle X-ray Scattering. Vol. 102. Academic Press London.

slide-40
SLIDE 40 YEARS I 1974–2014

Option 2: extrapolation to infinite dilution

19/06/12

Extrapolation procedure in PRIMUS

Extrapolate data

slide-41
SLIDE 41

The importance of buffer matching!

slide-42
SLIDE 42 YEARS I 1974–2014

Sample preparation

  • Buffer matching!
  • Must be almost (if not actually) PERFECT
  • If not you will over/under subtract the data and not have anything

meaningful to work with

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

sample unmatched buffer matched buffer

  • versubtracted

subtracted

slide-43
SLIDE 43 YEARS I 1974–2014

Data flow

19/06/12

Obtain 1D profiles of frames (sample & buffer) Average frames

(in PRIMUS)

Subtract background/buffer scattering

(in PRIMUS)

Guinier analysis Concentration dependent behaviour? average Merge/extrapolate No Yes IFT (eg. GNOM) Modeling Fitting (eg. CRYSOL)

slide-44
SLIDE 44 YEARS I 1974–2014

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017

① SAMPLE OPTIMISATION ② DATA COLLECTION ③ DATA REDUCTION ④ PRIMARY ANLAYSIS ⑤ MODELING ⑥ VALIDATION

slide-45
SLIDE 45 YEARS I 1974–2014

Overall Parameters

Size and Shape

Rg, Dmax I0, Vp

slide-46
SLIDE 46
  • Scattering at low angles (range limited)
  • Convenient plot to extract parameters

Guinier approximation

Description by Guinier & Fournet (1950s) to describe:

I q

( ) ≅ I 0 ( )exp −1

3q2Rg

2

# $ % & ' (

ln[I(q)]= ln[I(0)]+ −q2Rg

2

3 " # $ % & '

slide-47
SLIDE 47 YEARS I 1974–2014

Guinier approximation

  • 4. December 2017

The Guinier plot

ln[I(q)]= ln[I(0)]+ −q2Rg

2

3 " # $ % & '

y = c + mx (x = q2) (m = - Rg

2 / 3)

ln[(q)] q2,Å-2 Rg = 10 Å Rg = 100 Å I(0) = 10 I(0) = 100 Extract Rg and I(0) from the linear region of the plot

Haydyn Mertens, EMBO 2017 (Singapore)

slide-48
SLIDE 48 YEARS I 1974–2014

Guinier approximation

  • 7. December 2017

The Guinier plot

ln[I(q)]= ln[I(0)]+ −q2Rg

2

3 " # $ % & '

y = c + mx (x = q2) (m = - Rg

2 / 3)

ln[(q)] Extract Rg and I(0) from the linear region of the plot

Haydyn Mertens, EMBO 2017 (Singapore)

slide-49
SLIDE 49 YEARS I 1974–2014

Radius of gyration, Rg

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • A SAXS parameter for “size”
  • Distribution of components around an axis (or center of mass)
  • “the root-mean-square distance of all elemental scattering volumes

from their center of mass weighted by their scattering densities”

Rg solid sphere < Rg hollow sphere

slide-50
SLIDE 50 YEARS I 1974–2014

Radius of gyration, Rg

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Proteins of different Rg
  • Distribution of components around an axis (or center of mass)

Rg 2.8 nm Rg 3.9 nm

BSA monomer BSA dimer

slide-51
SLIDE 51 YEARS I 1974–2014

I(0) à MM

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Forward scattering intensity, I(0)
  • Relates to the “volume of electrons/scattering centers” in the particle(s)

I(q) = ξ dσ (q) dΩ = ξ nΔρ2V 2P(q)S(q)

I(0) = nΔ⍴ 2V2

slide-52
SLIDE 52 YEARS I 1974–2014

I(0) à MM

Haydyn Mertens, EMBO 2017 (Singapore)

  • 4. December 2017
  • Molecular mass calculation
  • From standard (eg. BSA, glucose isomerase, lysozyme)
  • Absolute calibration using water ( I_water = 0.0163 cm-1)

I(0) = nΔ⍴ 2V2

6.02E23 mol-1 Concentration (g.cm-3) Contrast (cm-2) Partial specific volume of protein (eg. 0.7425 cm3.g-1) (cm-1)

slide-53
SLIDE 53 YEARS I 1974–2014

I(0) à MM

Haydyn Mertens, EMBO 2017 (Singapore)

  • 7. December 2017
  • Molecular mass calculation
  • From standard (eg. BSA, glucose isomerase, lysozyme)
  • Absolute calibration using water ( I_water = 0.0163 cm-1)

I(0) = nΔ⍴ 2V2

MMsample = [(I0/c)sample / (I0/c)standard] X MMstandard

slide-54
SLIDE 54 YEARS I 1974–2014

Real space distance distribution, P(r)

Haydyn Mertens, EMBO 2017 (Singapore)

  • 7. December 2017

Shapes

I 0

( ) = 4π

p(r).dr

Dmax

Rg

2 = 1

2 r

2.dr Dmax

∫ I(q) = 4π p(r)sin(qr) qr .dr

Dmax

p(r) = r2 2π 2 q2I(q)sin(qr) qr .dq

∫ FT-1 FT Proteins

slide-55
SLIDE 55 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Solution is Indirect Fourier Transformation (IFT), (Glatter, 1977)
  • Fit a function to the SAXS data and transform à p(r)
  • Regularisation parameter (α) helps balance between the fit and the FT.

Indirect Fourier Transformation (IFT) of SAXS data

p(r) = ckφk(si)

k=1 K

Φ = χ 2 +αP(p) χ 2 = 1 N −1 Iexp(sj)−cIcalc(sj) σ (sj) " # $ $ % & ' '

j=1 N

2

P(p) = p'

[ ]

Dmax

2

dr

slide-56
SLIDE 56 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Dmax estimate (3.5 nm) poor solution – too small

So, what is a good p(r)? How do I know a good solution?

OK fit Truncated

slide-57
SLIDE 57 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Dmax estimate (4.0 nm) poor solution – still too small

So, what is a good p(r)? How do I know a good solution?

OK fit Truncated

slide-58
SLIDE 58 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Dmax estimate (4.5 nm) - good solution

So, what is a good p(r)? How do I know a good solution?

Good fit Smooth

slide-59
SLIDE 59 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Dmax estimate (5.0 nm) poor solution – too long
  • And exceeding confidence limits of method (Dmax*qmin > π)

So, what is a good p(r)? How do I know a good solution?

Good fit Smooth but “tailing”

slide-60
SLIDE 60 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Dmax estimate (5.5 nm) poor solution – definitely too long
  • And exceeding confidence limits of method (Dmax*qmin > π)

So, what is a good p(r)? How do I know a good solution?

Good fit Oscillating

slide-61
SLIDE 61 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Balance between fit and p(r) smoothness

Playing with alpha (regularisation parameter)

α = 212 α = 0.1

slide-62
SLIDE 62 YEARS I 1974–2014

Relation to “real-space”

19/06/12

  • Balance between fit and p(r) smoothness

Playing with alpha (regularisation parameter)

α = 212 α = 212,000

slide-63
SLIDE 63 YEARS I 1974–2014

19/06/12

Particle volume

  • “Porod” volume – excluded volume of hydrated particle
  • Can be related (empirically) to particle mass
  • Force decay of scattering intensity by s-4 (Constant K)
  • Follows Porod law for homogenous particles

Vol = 23 nm3 (MM = 14 kDa) MM ~ Vp/1.6

slide-64
SLIDE 64 YEARS I 1974–2014

19/06/12

Particle volume

  • “Porod” volume – excluded volume of hydrated particle
  • Can be related (empirically) to particle mass

Vp = 23 nm3 MM ~ 23 / 1.6 = 15 kDa

slide-65
SLIDE 65

Summary

Data obtained from 2D images à 1D profiles Buffer (and background) subtraction Profiles tell you a lot about the sample Parameters (Rg, I0, Dmax, Vp) extracted

slide-66
SLIDE 66

Data reduction steps

Parameters

SAXS invariants

  • Rg
  • I0 (MM)
  • Vp

Acquisition

SAXS instrumentation

  • Sample
  • Buffer
  • Background

Reduction

2D images to 1D profile

  • Integration
  • Normalisation
  • Averaging/Subtract
slide-67
SLIDE 67

Important

Steps!

  • cf. Al Kikhney (EMBL-Hamburg)