Folding, Assembly, Flexible Systems
Maxim Petoukhov EMBL, Hamburg Outstation
Folding, Assembly, Flexible Systems Maxim Petoukhov EMBL, Hamburg - - PowerPoint PPT Presentation
Folding, Assembly, Flexible Systems Maxim Petoukhov EMBL, Hamburg Outstation Outline Outline Introduction Combined rigid body and ab initio modelling Unfolded proteins Modular proteins with disordered linkers Transient
Folding, Assembly, Flexible Systems
Maxim Petoukhov EMBL, Hamburg Outstation
Outline Outline
quaternary structure analysis of macromolecular complexes, oligomers and modular proteins
reduces the ambiguity
– The resulting models are still of low resolution
Re: Atomic Structure Based Modelling Re: Atomic Structure Based Modelling
k k k
Re: Quantitative Analysis of Mixtures
OLIGOMER: volume fractions of individual components Prerequisite: models (profiles) of components are known
s, nm-1 5 10 15 lg I(s) 5 6 7 8 Resolution, nm 2.00 1.00 0.67 0.50 0.33
Small angle scattering: resolution
Shape F
Atomic structure
Resolution is defined by the magnitude of the largest scattering vector in the diffraction pattern d=2π/smax
Concept of Dummy Residues Concept of Dummy Residues
amino acids
represented as one entity (dummy residue)
– Identical – Centered at the Cα positions
= < … … >
D.I. Svergun, M.V. Petoukhov, & M.H.J. Koch (2001) Biophys. J. 80, 2946-53
Building native-like folds of polypeptides
Primary sequence Secondary structure Excluded volume
Shell radius, nm 0.2 0.4 0.6 0.8 1.0 Number of neighbours 1 2 3 4 5 6Neighbors distribution Knowledge-based potentials Bond angles & dihedrals distribution
(Mis)-Folding from SAXS?
Modelling Modelling of
multidomain proteins proteins
MRGSHHHHHH GSGVPSRVIH IRKLPIDVTE GEVISLGLPF GKVTNLLMLK GKNQAFIEMN TEEAANTMVN YYTSVTPVLR GQPIYIQFSN HKELKTDSSP NQARAQAALQ AVNSVQSGNL ALAASAAAVD AGMAMAGQSP VLRIIVENLF YPVTLDVLHQ IFSKFGTVLK IITFTKNNQF QALLQYADPV SAQHAKLSLD GQNIYNACCT LRIDFSKLTS LNVKYNNDKS RDYTRPDLPS GDSQPSLDQT MAAAFGLSVP NVHGALAPLA IPSAAAAAAA AGRIAIPGLA GAGNSVLLVS NLNPERVTPQ SLFILFGVYG DVQRVKILFN KKENALVQMA DGNQAQLAMS HLNGHKLHGK PIRITLSKHQ NVQLPREGQE DQGLTKDYGN SPLHRFKKPG
Modelling Modelling of
multidomain proteins proteins
combined approach is proposed to built the models
large and flexible interdomain linkers
DR chains which are attached to the appropriate terminals in rigid domains.
model is a rotation about one
DR(s).
Modelling Modelling of
multidomain proteins proteins
Building native Building native-
like folds of linkers
Absence of steric clashes
Bond angles, degrees 20 40 60 80 100 120 140 160 Dihedral angles, degreesBond angles & dihedrals distribution Loop compactness may also be required
3
3
l id
n Rg =
Neighbors distribution along the sequence
i i+K r
s, nm-1
0.5 1.0 1.5 2.0
lg I, relative
8 9 10 11
Simultaneous fitting of multiple data sets from Simultaneous fitting of multiple data sets from deletion mutants deletion mutants
BUNCH: BUNCH: Modelling Modelling of
multidomain proteins proteins
and probable conformations of DR linkers, those fit the SAXS data.
together with the absence of overlaps.
domains and form-factors of DR comprising the loops using spherical harmonics.
( ) ( ) ( ) 2
) ( ) ( 2
| | 2
∑ ∑ ∑ ∑
+ =
∞ = − = i lm i l l l m k lm k
s D s A s I π
Petoukhov M.V., Svergun, D.I. (2005). Biophys. J. 89, 1237-1250
Structure and RNA interactions of polypyrimidine tract binding protein
H62 F98 L136 L255 R185 I187 K238 R254 F216 K271 K266 K259 K218 Q223 K92 K94 R122 K65 K64 Q96 K137 K134 H133 4 1 3 2 2 5 3 1 4 C N N NNMR: high resolution structures
Collaboration: S.Curry (London)
A B C D
Multiple scattering curves from deletion mutants fitted simultaneously
PTB is an important regulator of alternative splicing, which allows the production of multiple mRNA transcripts from a single pre-mRNA species. PTB contains four domains (RNA recognition motifs, RRMs), whose structure is solved by NMR.
Structure and RNA interactions of polypyrimidine tract binding protein
Overlap of the typical ab initio and rigid body models Further restraints (e.g. from NMR) are required to resolve the orientational ambiguity
Petoukhov, M. V., Monie, T. P., Allain, F. H., Matthews, S., Curry, S., and Svergun, D. I. (2006). Structure 14, 1021-1027.
CORAL: Crossing SASREF & BUNCH
– Does not account for missing portions
– Single polypeptide chain – Impossible to fix more than one domain
Random Loop Library for Combined Modelling
RANLOGS database
CORAL
Hybrid Modelling in Coral
RANLOGS database
CORAL
Novel feature: consorted movements
Kratky Plots to Detect Disorder
20Unfolded Folded Multi-domains with flexible linkers
Patterns of globular and flexible proteins
► Local Protein Fluctuations: Backbone and side-chains Protein vibrations, loop motions and breathing to facilitate interactions and catalysis ► Concerted domain motions: Linkers as Hinges Specific and limited domain jumps between relative positions often linked with catalysis in on/off mechanisms ► Highly flexible regions or domains An astronomical number of conformations are available. Flexible multi-domain proteins (MD) and Intrinsically Disordered Proteins (IDPs) Linked with signaling and regulation
Biomolecules are Dynamic Entities
Flexible Proteins: Importance
► Many biological functions such as transcription, regulation, cell cycle control, require extensive flexibility ► Is more common in higher organisms that have to perform more and more controlled functions. Disorder is correlated with complexity ► High selectivity, moderate affinity, and promiscuity are properties often linked to flexibility ► In these systems partially structured conformations or transient long range interactions can be crucial for biological
► Smooth Scattering profiles and featureless Kratky Plots ► Large Rg and Dmax ► Absence of correlation peaks in the p(r) function ► Low correlation densities in ab initio reconstructions ► Isolated domains in rigid body modelling ► Prediction of disorder using bioinformatics tools
http://www.idpbynmr.eu/home/science/research-tools.html
Indications (not Proofs!) of Flexibility
Flexibility as mix of different conformations
24=
k k k
s I v s I ) ( ) (
dr sr sr r p s I
D
= sin ) ( 4 ) ( π
vk = volume fraction Ik(s) = scattering intensity from the k-th component For monodisperse systems the scattering is proportional to that of a single particle averaged
Detection of Flexibility
PolyUbiquitin Molecules
2,3,4 and 5 Ubiquitin (72 AA) domains connected by 20 AA linker (RanCH)
Flexible Rigid
Flexible Multidomain Proteins present less features than rigid counterparts Bernadó Eur. Biophys. J. 2009, 39, 769
SAXS curves
Analysis of the overall size descriptors (Rg, p(r), Kratky)
Modelling: ab initio (DAMMIN/DAMMIF) and Rigid body (BUNCH/CORAL) Analysis of the differences
Rigid Scenario Flexible Scenario
Detection of Flexibility
Go for flexibility!
Ensemble Optimization Method
structures curve
Ensemble Optimization Method
Rg
1
Rg
2
Rg
3
Rg
4
Rg
5 ...
ρ (Rg)
∑
==
N n n sI N s I
1) ( 1 ) (
Ensemble methods in SAXS
Elitism
Experimental Curve
Mutations C r
s i n g N
Rg Distribution
Elitism
Experimental Curve
Mutations Crossing N G Generations
…
R1 R2 R3 R2 R1 R3
…
Ensemble Optimization Method (EOM)
Bernadó, Mylonas, Petoukhov, Blackledge, and Svergun. J Am Chem Soc 2007, 129:5656-64.
ψ φ
Cα Cα
Kohn et al. PNAS, 2004, 101, 12491 Rg
Rg = R0·Nν R0 Persistence Length ν Solvent ‘quality’
Several experimental and theoretical studies establish ν ≈ 0.588 as an indication of the ‘random coil’ in chemically denatured (Urea or GuHCl) proteins. N
Theoretical distribution of the bond and dihedral angles for random chains Quasi Cα -Cα Ramachandran plot
coordinates alone, JMB, 1997, 273, 371-376
Bond angles vs. Dihedral angles
Modelling: Native vs. Random
Missing loops (i.e. flat electron density
map) …
33MRIGMV……..GGVQSHVLQ…..VLRDAGHEVS…….PHVKLPDYVS
missing loop 30 AA
Kratky Plot
apoferritin
vs.
pool
Nter.pdb Cter.pdb seq.seq curve.dat
Rg, Å Dmax, Å
Multi-curves fitting
pool
EOM 2.0 can handle multimeric assemblies
(full length protein measured in two buffers with low and high ionic strength respectively)
EOM: Impact of the Pool Size
35… the beauty of the Pentagon
Ensemble Optimization Method
curve actual structures Optimized ensemble
Assessing conformation variability: Assessing conformation variability: lysozyme lysozyme u unfolding nfolding
8M Urea, 10 mM DTT 8M Urea, 10 mM DTT Rg = 26.3 Å Rg (native) = 15.1 Å 8M Urea, 100 mM DTT Rg = 30.0 Å
…
FULL-LENGTH
Pool
M CONFORMATIONS (POOL)
…
DELETION-1
Pool-1
…
DELETION-2
Pool-2
EOM-1
…=
EOM
…= …=
EOM-2 Cα-Cα Average Distance Matrix
Experimental data
Multiple Curve Fitting with EOM: Reaching «Higher Resolution»
Adult Tau Fetal Tau
R1R2R3R4 P2 P1 I2 I1 N C R1R2R3R4 P2 R1R2R3R4 P2 P1 R1R2R3R4
K32 K16 K18 ht40
R1 R3R4 P2 P1 N C R1 R3R4 P2 R1 R3R4 P2
K27 K17 K19 K44
R4 R1 R3 R1 R3R4 P2 P1 N R1 R3R4 C
K10
P2 P1 N
K25 ht23
Mylonas et al. Biochemistry 2008, 47, 10345 <Cα-Cα>select <Cα-Cα>pool <Cα-Cα>select <Cα-Cα>pool
Application to Tau Protein
Other ensemble approaches
Maximum Occurrence Approach
calculated as the maximum weight that the conformation can have and be in agreement with the experimental data
– NMR data » Residual dipolar couplings » Pseudo contact shifts » SAXS
Paramagnetic Me
Bertini, Giachetti, Luchinat, Parigi, Petoukhov, Pierattelli, Ravera, Svergun, JACS, 2010
Maximum Occurrence Approach
Probe of Calmodulin Conformations by Combining SAXS with NMR
Blue: 5% Red: 40%
Flavorubredoxin
Modular enzyme endowed with nitric oxide and/or oxygen reductase activity
Collaboration:
Flavorubredoxin: : ab ab initio initio modelling modelling
Flavorubredoxin: : Xtal Xtal vs vs SAXS SAXS
Flavorubredoxin: : rigid body modelling rigid body modelling
Flavorubredoxin: various : various constraints constraints
Flavorubredoxin: EOM : EOM
Heterogeneous Assemblies with Flexibility
Cross-shaped extended p53 from SAXS and NMR
Tumor suppressor p53 and its complex with DNA
P53 is a transcription factor that regulates genes involved in cell cycle and apoptosis. Misfunction of p53 is related with cancer The homotetrameric p53 consists of folded core and tetramerization domains, linked and flanked by intrinsically disordered segments Tidow, H., Melero, R., Mylonas, E., Freund, S.M., Grossmann, J.G., Carazo, J.M., Svergun, D.I., Valle, M. & Fersht, A.R. (2007) Proc Natl Acad Sci USA, 104, 12324
Compact P53/DNA from SAXS also confirmed by EM
NC-dUTPase interactions with oligonucleotide
Németh-Pongrácz, V., Barabás, O., Fuxreiter, M., Simon, I., Pichová, I., Rumlová, M., Zábranská, H., Svergun, D., Petoukhov, M., Harmat, V., Klement, É., Hunyadi-Gulyás, É., Medzihradszky, K., Kónya, E. & Vértessy B. (2007) Nucleic Acid Res. 35, 495-505.
The homotrimeric fusion protein nucleocapsid (NC)- dUTPase combines domains that participate in RNA/DNA folding, reverse transcription, and DNA repair in Mason-Pfizer monkey betaretrovirus infected cells. + nucleocapsid dUtpase (TG)4 Xtal NMR
(1) NC-dUtpase (2) NC-dUtpase+peptide (3) NC (4) NC+peptide
P1 P3
Complex Complex Stoichiometry Stoichiometry
R.H.H. van den Heuvel, D.I. Svergun, M.V. Petoukhov, A. Coda, B. Curti, S. Ravasio, M.A. Vanoni & A. Mattevi (2003). J. Mol. Biol. 330, 113-128
+
s, nm-1
1 2 3
1 2 3 4
lg I(s), relative
Fd Fd-Glts Fd:Fd-Glts 1:1 Fd:Fd-Glts 2:1
Equimolar mixture
and free Fd
Molecular Assembly of Lumazine Synthase
s, nm-1
0.2 0.4 0.6 0.8 1.0 1.2 1.4
lgI, relative
1 2 3 4 5 Experiment Icosahedral P1 pH 7 pH 10
r, nm
5 10 15 20 25 30
p(r), relative
0.2 0.4 0.6 0.8 1.0
pH 7 pH10
2nm
pH 7 pH 10 LS catalyzes the formation of 6,7-dimethyl-8-ribityllumazine in the penultimate step
pentamers and icosahedral capsids Wild type LS in borate buffer T1 T4 (?)
Zhang, X., Konarev, P.V., Petoukhov, M.V., Svergun, D.I., Xing, L., Cheng, R.H., Haase, I., Fischer, M., Bacher, A., Ladenstein, R. & Meining, W. (2006) J Mol Biol. 362, 753-770
Quaternary structure of the human Cdt1-Geminin complex regulates DNA replication licensing
MX, SAXS and functional studies.
heterotrimer and an ‘‘inhibitory’’ heterohexamer
(2009) PNAS USA, 106, 19807
s, nm-1
0.5 1.0 1.5 2.0
lg I, relative
2 3 4
Calmodulin-Activated Glutamate Decarboxylase
Gut H, Dominici P, Pilati S, Astegno A, Petoukhov MV, Svergun DI, Grütter MG and Capitani G. J Mol Biol. 2009 392:334-51
axes of Gad
GASBOR_MX: quaternary structure of weak (symmetric)
SASREF_MX: structural analysis of transient complexes
3D Reconstruction from Polydisperse Data?
BSA Example
BSA Example
SASREF_MX GASBOR_MX
changes and assembly/dissociation processes
(potentially) flexible molecules
collected are simply a TOOL to describe the shape distributions
modelled (with caution) against SAXS data
moderate flexibility
Summary Summary