Standards for hybrid and integrative methods
Jill Trewhella, The University of Sydney
Structural & biophysical methods for biological macromolecules in solution Sungkyunkwan University, Suwon Korea, June 19-26, 2016
Standards for hybrid and integrative methods Jill Trewhella, The - - PowerPoint PPT Presentation
Standards for hybrid and integrative methods Jill Trewhella, The University of Sydney Structural & biophysical methods for biological macromolecules in solution Sungkyunkwan University, Suwon Korea, June 19-26, 2016 SAS as a constraint in
Structural & biophysical methods for biological macromolecules in solution Sungkyunkwan University, Suwon Korea, June 19-26, 2016
wwPDB SAStf Meeting report, Trewhella et al, Structure 21, 875, 2013 3D modeling from SAS (especially SAXS) data is becoming automated
standard format SAS data that is searchable and freely accessible for download;
assessment of the quality of deposited SAS data and the accuracy of SAS-derived models, as well as the extent to which a given model fits the SAS data;
data and models calls for archiving
data; and
the PDB and what complementary archives might be needed (taking into account both scientific needs and funding). wwPDB SAS task force
Outcome of the First wwPDB Hybrid / Integrative Methods Task Force Workshop
(at EBI UK, Oct. 2014)
Kleywegt, S. K. Burley, J. Markley, H. Nakamura, P. Adams, A.M.J.J. Bonvin, W. Chiu, M. Dal Peraro, F. Di Maio, T.E. Ferrin, K. Grünewald, A. Gutmanas, R. Henderson, G. Hummer, K. Iwasaki, G. Johnson, C.L. Lawson, J. Meiler, M.A. Marti-Renom, G.T. Montelione,
R.J. Read, H. Saibil, G.F. Schröder, C. Schwieters, C.A.M. Seidel, D. Svergun, M. Topf, E.L. Ulrich, S. Velankar, and J.D. Westbrook Structure 23, 1156-1167, 2015
computational protocols should be archived; inclusivity is key.
models, multi-state models, ensembles of models, and models related by time or
validated, and adopted.
Structural information Method Atomic structures of components X-ray/neutron Xtalography, NMR, 3DEM, comparative modeling, molecular docking 3D maps and 2D images Electron microscopy and tomography Atomic and protein distances NMR, FRET, DEER, EPR, other spectroscopic techniques; chemical crosslinks/mass spectrometry, disulfide bonds/ gel electrophoresis Binding site mapping NMR spectroscopy, mutagenesis, FRET Size, shape, P(r) SAS Shape and size Atomic force microscopy, ion mobility mass spec., fluorescence correlation spectroscopy and fluorescence anisotropy Component positions Super-resolution optical microscopy, FRET imaging Physical proximity Co-purification, native mass spectrometry, genetic methods, and gene/protein sequence covariance Solvent accessibility Footprinting methods, e.g. H/D exchange by mass spec. or NMR, functional consequences of point mutations Proximity of genome segments Chromosome Conformation Capture and other data Propensities for different interaction modes Molecular mechanics force fields, potentials of mean force, statistical potentials, and sequence co-variation
to the crystal structure from 4.5 to 3.3 Å, largely due to more accurate translational positioning of domains
Grishaev et al.J.Biomol. NMR 376, 95, 2008 NMR only SAXS-NMR
3.05 NMR 1.01 Xtal 0.97 NMR/SAXS
Schwieters and Clore (2014) Using small-
Red lines depict backbone coordinates of the lowest energy 10 structures calculated omitting SAXS/WAXS data (Panel A) and including SAXS/WAXS data (panel B). Blue cartoon is representation of the X-ray structure from PDB ID 193L.
Figure 2: Schwieters and Clore (2014) Prog. Nucl. Magn. Reson. Spectrosc. 80, 1
Figure 3 from Schwieters and Clore (2014) Prog. Nucl. Magn. Reson.
Comparison of SAXS/WAXS curves for lysozyme. Panels depicting the agreement to experiment of the SAXS/WAXS curves associated with the 10 lowest energy structures calculated without (panel A) and with (panel B) SAXS/WAXS data, respectively. The experimental data is shown in black with gray vertical bars equal to 1 SD; the curves calculated from the simulated annealing structures are shown in
plotted above each panel.
Structure statistics for 10 lowest energy lysozyme structures w and w/o SAXS
all NMR data deposited NMR structures 1E8L (model 49) X-ray structure without SAXS with SAXS NOE violations 4.3 ±2.5 0.2 ±0.4 0.0±0.0 RDC R-factor, medium 1% 9.9 ±1.5 5.9 ±0.3 5.9±0.3 13.3 RDC R-factor, medium 2% 13.8 ±2.4 9.2 ±0.8 5.7±0.4 15.2 dihedral violations 4.4 ±1.2 0.1 ±0.3 0.0±0.2 SAXS χ2 2.3 ±1.4 0.4±0.1 1.7±0.6 0.86 HBDB energy (kcal/mol) 116.4 ±18.8 −160.7 ±13.3 −45.6±9.1 (−36.2) −255.12 torsion DB violations 3.8 ±2.6 1.4 ±1.5 1.7±0.9 (2) bond violations 6.5 ±5.2 0.4 ±0.8 0.0±0.0 angle violations 8.6 ±5.9 0.1 ±0.3 10.3±0.8 (10) 48 improper violations 3.1 ±2.7 0.0±0.0 0.0±0.0 24 bad non-bonded contacts 21.2 ±7.5 2.7 ±1.8 166.5±7.8 (176) 48 precision to mean (Å) 2.30±0.58 0.84±0.14 0.50±0.13 − precision to mean (Å) 2.78±0.56 1.50±0.14 0.52±0.20 1.46 C rmsd to X-ray struct. (Å) 2.82±0.59 1.32±0.19 1.48±0.10 (1.46) −
pyDock, Jimenez-Garciaet al. (2015) Nucleic Acids Res 2015 43, W356. FoXSDock, Schneidman-Duhovny et al. (2013) Biophys J 105, 962 HADDOCK, Karaca & Bonvin (2013) Acta Crystallogr D69, 683
43% for pyDOCK, 48% for HADDOCK 63% for FoXSDock
Schneidman-Duhovny et al. (2012) BMC Struct Biol 12, 17
Size shape surface
LogI(q)
Guinier Plot Rg and I(0) shape and size Kratky Plot foldedness Porod Plot volume
q2 (Å-2) q (Å-1) q (Å-1) q2I(q) P(r) r (Å) q4I(q)
Jacques & Trewhella (2010) Protein Science 19, 642
*Orthaber et al. (2000) J. Appl. Cryst. 33, 218
→ NCVWeb Home → Contrast → Rg → Compost NCVWeb MULCh: ModULes for the analysis of Contrast variation data The purpose of this set of programs is to allow the extraction of structural parameters from Neutron Contrast Variation data for two component systems. There are three modules to facilitate this: 1.Contrast: This module determines the contrast ("scattering power" relative to the solvent) for each subunit in the complex for various proportions of D2O in the solvent. The contrast is important for the subsequent modules, but also helpful in planning and experiment. 2.Rg: Analyses the dependence of the radius of gyration upon contrast. From this analysis information can be extracted relating to the radii of gyration of each subunit and their separation. 3.Compost: Decomposes the scattering profiles down to scattering from each subunit, and a cross term, related to the disposition of each. Testing and basic description of the work has been published: A.E. Whitten, S. Cai and J. Trewhella, J. Appl. Crystallogr. If this page is used we ask that you cite that article. The MULCh manual can be found here The source code for the modules can be found here Test data and input files for MULCh can be found here
http://smb-research.smb.usyd.edu.au/NCVWeb/ Δ𝜍, 𝜑, and dry volume from sequence+ligands and solvent constituents
Your spectrophotomoter is calibrated and you
You avoid agents in your buffer that absorb in the UV,
You check the absorbance for the folded and
You perform multiple measurements.
*Krigbaum and Kugler (1970) Biochemistry 9, 1216
𝑅
Forces q-4 dependence for the high-q data Uses the smoothed I(q) profile generated in GNOM to cope with the
noisy high-q data to determine the correction.
Scattering data range set to 𝑟𝑛𝑏𝑦 =
8 𝑆
Petoukhov et al (2012) J. Appl. Cryst. 45, 342 See also: Fischer et al. (2010) J. Appl. Cryst. 43,101 (DATMOW) Rambo and Tainer (2013) Nature 496, 477 (DATVC) This method is subject significance uncertainties in the high-q dependence.
∞
𝑊
𝑠
𝑟2𝐽 𝑟 𝑟2𝐽 𝑟 𝑟2𝐽 𝑟
𝑟 (Å-1) 𝑟 (Å-1) 𝑟 (Å-1)
Acta Crystallographica – notes for authors 11.3. Small-angle scattering data Guidelines for articles reporting structural modelling of small angle scattering may be found at http://journals.iucr.org/services/sas/. For articles that present experimental SAS data, the deposition of an ASCII file representing the background-corrected scattering profile(s) with errors is recommended.
draft requirements for presenting biological macromolecule small-angle scattering data It is not the intention of this document to define a quality requirement for SAS experiments that would be acceptable for publication. Rather, the purpose is to outline the way in which SAS experiments should be presented in order to enable the reader to independently assess the quality of any interpretations made by the authors.
2 versus the reciprocal of contrast and extracted
component scattering functions (including cross-term) where appropriate;
Data collection parameters Structural parameters Instrument Rg (Å) from Guinier Beam geometry I(0) (cm-1) from Guinier Wavelength (Å) Molecular mass Mr (from I(0)) q range (Å-1) Rg (Å) from P(r) Exposure time (s) I(0) (cm-1) from P(r) Sample concentration range (mg mL-1) dmax (Å) Temperature (°C) Porod volume estimate (Å) Molecular mass determination Software employed Partial specific volume Primary data reduction Contrast (Δρx1010 cm-2) Data processing Mr from sequence (Da) ab initio analysis dry/hydrated volumes from sequence (Å3) Validation and averaging Computation of model intensities Rigid body modelling Generic Table
C5C6C7 C5C6*C7 tC5C6C7 tC5*C6C7 C6*C7 C7 Guinier results
Rg (Å) 37(1) 25(2) 15.3(1) qRg-range (Å-1) 0.65 -1.3 0.72-1.22 0.57-1.3
Rg (Å) 37(2) 38(2) 40(2) 44(2) qRg-range (Å-1) 0.54-1.3 0.54-1.3 0.56-1.16 0.56-1.16 P(r) results
Rg (Å) 39.38(9) 25.58(4) 15.53(3) dmax 145 90 60 q-range (Å-1) 0.017-1.189 0.028- 0.305 0.039- 0.251
Rg (Å) 39.38(7) 40.3(1) 42.6(5) 45.6(9) dmax 141 150 169 200 q-range (Å-1) 0.014- 0.193 0.015-0.193 0.018-0.193 0.02-0.193 Protein mg mL-1 range for extrapolation to infinite dilution
0.9-3.1 0.6-5.1 5.7-11.7
0.7-3.4 0.5-1.9 0.3-1.4 0.2-1.2 Molecular weight, MW from I(0) values*
37,427 23,249 11,803
38,204 35,416 39,708 37,055 Ratio of I(0) derived to expected MW 1.07 0.99 1.06 0.99 1.10 1.10
Nadvi et al. “Clinically-linked Mutations in the Central Domains of the Cardiac Myosin-binding Protein C Associated with Distinct Phenotypes show Differential Structural Effects,” Structure 24, 105-115, 2016
Data collection parameters
Instrumentation Australian Synchrotron SAXS beam-line Expt 1: 30 June 2014 Expt 2: 18 April, 2015 Beam geometry 250 µm × 150 µm 250 m x 150 m q-range meas (Å-1) 0.01-0.476 0.006-0.344 Exposure time (secs) 1 s x 24 1 s x 44 frames Temperature (oC) 22 22 Sample details C5C6*C7 C6*C7 C7 C5C6C7 C5C6*C7 tC5C6C7 tC5C6*C7 Partial specific volume (ν, cm3 g-1) 0.735 0.736 0.735 0.735 0.735 0.735 0.735 Contrast (Δρ, 1010 cm-2) 2.841 2.826 2.794 2.841 2.841 2.839 2.820 Molecular weight (MW) from seq. Da 35,709 21,498 10,734 35,738 35,709 37,576 37,519
Software employed for data reduction, analysis and interpretation
SAXS data reduction ScatterBrain Calculation of expected MW, Δρ and ν values MULCh SAXS data analysis ATSAS 2.6.0 SAS Data Analysis ab initio bead modelling DAMMIF (via ATSAS on-line) Atomic structure modelling SASREF (via ATSAS on-line) 3D graphic model reps PYMOL
Trewhella, J. “Small-angle scattering and 3D structure interpretation,”
http://www.sasbdb.org/
Whitten et al (2007) J. Mol. Biol. 368, 407
Crystal structure of a small histidine kinase inhibitor, Sda. Molecules A, B and C constitute the asymmetric unit. Molecules A*, B* and C* are related to the asymmetric unit by rotation about a 2-fold crystal symmetry axis (perpendicular to the middle of the image). Overlay of the 3 molecules in the asymmetric unit.
Jacques et al (2009) Acta D65, 574
Trimer from crystal structure is missing about 25% of the mass. Trimers have largest buried surface areas 1112 & 1081 Å2 for A-B-C and B-C-A* trimers, respectively.
Jacques et al (2009) Acta D65, 574
subunit A A-B-C-A*-B*-C* hexamer B-C dimer A-B* dimer A-B-C trimer B-C-A* trimer 0.85 2.0 19.3 1.1 5.6 2.2
CORAL 2 = 1.49 Crystal trimer + missing residues SASREF 2 = 0.88 (severe steric clashes) 3 NMR monomers SASREF 2 = 0.85 2 NMR monomers
Dimer pair 2 A-B 2.7 B-C 1.2 A-C 1.0 CRYSOL fits to SAXS data with dimer models created by aligning the NMR monomer with different Xtal dimers PISA stable assemblies: A-B, B-C, trimer, hexamer OLIGOMER fits to SAXS data with dimer and trimer models created by aligning the NMR monomer with Xtal structure 2 0.85 Mixture 2 Hex-Mon 1.3 Tri-Mon 1.0 A-B+B-C 1.0