Introduction to Three Dimensional Structure Determination of - - PowerPoint PPT Presentation

introduction to three dimensional structure determination
SMART_READER_LITE
LIVE PREVIEW

Introduction to Three Dimensional Structure Determination of - - PowerPoint PPT Presentation

Introduction to Three Dimensional Structure Determination of Macromolecules by Cryo-Electron Microscopy Amit Singer Princeton University, Department of Mathematics and PACM July 23, 2014 Amit Singer (Princeton University) July 2014 1 / 21


slide-1
SLIDE 1

Introduction to Three Dimensional Structure Determination of Macromolecules by Cryo-Electron Microscopy

Amit Singer

Princeton University, Department of Mathematics and PACM

July 23, 2014

Amit Singer (Princeton University) July 2014 1 / 21

slide-2
SLIDE 2

Single Particle Reconstruction using cryo-EM

Schematic drawing of the imaging process: The cryo-EM problem:

Amit Singer (Princeton University) July 2014 2 / 21

slide-3
SLIDE 3

New detector technology: Exciting times for cryo-EM

www.sciencemag.org SCIENCE VOL 343 28 MARCH 2014

1443

The Resolution Revolution

BIOCHEMISTRY Werner Kühlbrandt Advances in detector technology and image processing are yielding high-resolution electron cryo-microscopy structures of biomolecules.

P

recise knowledge of the structure of macromolecules in the cell is essen- tial for understanding how they func-

  • tion. Structures of large macromolecules can

now be obtained at near-atomic resolution by averaging thousands of electron microscope images recorded before radiation damage

  • accumulates. This is what Amunts et al. have

done in their research article on page 1485 of this issue ( 1), reporting the structure of the large subunit of the mitochondrial ribosome at 3.2 Å resolution by electron cryo-micros- copy (cryo-EM). Together with other recent high-resolution cryo-EM structures ( 2– 4) (see the fi gure), this achievement heralds the beginning of a new era in molecular biology, where structures at near-atomic resolution are no longer the prerogative of x-ray crys- tallography or nuclear magnetic resonance (NMR) spectroscopy. Ribosomes are ancient, massive protein- RNA complexes that translate the linear genetic code into three-dimensional proteins. Mitochondria—semi-autonomous organelles A B C

Near-atomic resolution with cryo-EM. (A) The large subunit of the yeast mitochondrial ribosome at 3.2 Å reported by Amunts et al. In the detailed view below, the base pairs of an RNA double helix and a magnesium ion (blue) are clearly resolved. (B) TRPV1 ion channel at 3.4 Å ( 2), with a detailed view of residues lining the ion pore on the four-fold axis of the tetrameric channel. (C) F420-reducing [NiFe] hydrogenase at 3.36 Å ( 3). The detail shows an α helix in the FrhA subunit with resolved side chains. The maps are not drawn to scale.

Amit Singer (Princeton University) July 2014 3 / 21

slide-4
SLIDE 4

The Resolution Revolution

X-ray crystallography is one of the greatest innovations of the 20th century

  • W. K¨

uhlbrandt, Science 343, 1443, March 28 2014: “Does the resolution revolution in cryo-EM mean that the era of x-ray protein crystallography is coming to an end? Definitely not. For the foreseeable future, small proteins—in cryo-EM, anything below 100 kD counts as small—and resolutions of 2˚ A or better will remain the domain of x-rays. But for large, fragile, or flexible structures (such as membrane protein complexes) that are difficult to prepare yet hold the key to central biomedical questions, the new technology is a major breakthrough. In the future, it may no longer be necessary to crystallize large, well-defined complexes such as ribosomes. Instead, their structures can be determined elegantly and quickly by cryo-EM. These are exciting times.”

Amit Singer (Princeton University) July 2014 4 / 21

slide-5
SLIDE 5

Big “Movie” Data, Publicly Available

http://www.ebi.ac.uk/pdbe/emdb/empiar/

Amit Singer (Princeton University) July 2014 5 / 21

slide-6
SLIDE 6

ASPIRE: Algorithms for Single Particle Reconstruction

Open source toolbox, publicly available: http://spr.math.princeton.edu/

Amit Singer (Princeton University) July 2014 6 / 21

slide-7
SLIDE 7
  • E. coli 50S ribosomal subunit

27,000 particle images provided by Dr. Fred Sigworth, Yale Medical School 3D reconstruction by S, Lanhui Wang (now data scientist at TripAdvisor), and Jane Zhao (now postdoc at Courant Institute, NYU)

Amit Singer (Princeton University) July 2014 7 / 21

slide-8
SLIDE 8

Data-Driven Discovery: The cryo-EM experience

Amit Singer (Princeton University) July 2014 8 / 21

slide-9
SLIDE 9

Image Formation Model and Inverse Problem

Projection Ii Molecule φ Electron source Ri =    − R1

i T

− − R2

i T

− − R3

i T

−    ∈ SO(3)

Projection images Ii(x, y) = ∞

−∞ φ(xR1 i + yR2 i + zR3 i ) dz + “noise”.

φ : R3 → R is the electric potential of the molecule. Cryo-EM problem: Find φ and R1, . . . , Rn given I1, . . . , In.

Amit Singer (Princeton University) July 2014 9 / 21

slide-10
SLIDE 10

Toy Example

Amit Singer (Princeton University) July 2014 10 / 21

slide-11
SLIDE 11

Typical Reconstruction Procedure: Iterative Refinement

Alternating minimization or expectation-maximization, starting from an initial guess φ0 for the 3-D structure Ii = P(Ri · φ) + ǫi, i = 1, . . . , n.

Ri · φ(r) = φ(R−1

i

r) is the left group action P is integration in the z-direction.

Converges to a local optimum, not necessarily the global one. Model bias is a well-known pitfall “Einstein from noise” (Henderson, PNAS 2013) Is “reference free” orientation assignment and reconstruction possible?

Amit Singer (Princeton University) July 2014 11 / 21

slide-12
SLIDE 12

Orientation Estimation: Fourier projection-slice theorem

Projection Ii Projection Ij ˆ Ii ˆ Ij 3D Fourier space 3D Fourier space

(xij, yij) (xji, yji) Ricij cij = (xij, yij, 0)T Ricij = Rjcji

Amit Singer (Princeton University) July 2014 12 / 21

slide-13
SLIDE 13

Angular Reconstitution (Vainshtein and Goncharov 1986, Van Heel 1987)

Amit Singer (Princeton University) July 2014 13 / 21

slide-14
SLIDE 14

Experiments with simulated noisy projections

Each projection is 129x129 pixels. SNR = Var(Signal) Var(Noise) ,

(a) Clean (b) SNR=20 (c) SNR=2−1 (d) SNR=2−2 (e) SNR=2−3 (f) SNR=2−4 (g) SNR=2−5 (h) SNR=2−6 (i) SNR=2−7 (j) SNR=2−8

Amit Singer (Princeton University) July 2014 14 / 21

slide-15
SLIDE 15

Fraction of correctly identified common lines and the SNR

Define common line as being correctly identified if both radial lines deviate by no more than 10◦ from true directions.

log2(SNR) p 20 0.997 0.980

  • 1

0.956

  • 2

0.890

  • 3

0.764

  • 4

0.575

  • 5

0.345

  • 6

0.157

  • 7

0.064

  • 8

0.028

  • 9

0.019

Amit Singer (Princeton University) July 2014 15 / 21

slide-16
SLIDE 16

Outline for this week

1 Orientation assignment 2 Heterogeneity 3 Class averaging and symmetry detection Amit Singer (Princeton University) July 2014 16 / 21

slide-17
SLIDE 17

Orientation Estimation

Projection Ii Molecule φ Electron source Ri ∈ SO(3)

Projection Ii Projection Ij ˆ Ii ˆ Ij 3D Fourier space 3D Fourier space

(xij, yij) (xji , yji ) Ri cij cij = (xij, yij , 0)T Ri cij = Rj cji

n = 3: Vainshtein and Goncharov 1986, van Heel 1987 n > 3: S, Shkolnisky SIAM Imaging 2011

Minimizing a quadratic cost: n

i,j=1 Ricij − Rjcji2

Quadratic constraints: RT

i Ri = I3×3

Semidefinite Programming (SDP) Relaxation, Spectral Relaxation

n > 3: Maximum likelihood using SDP: Bandeira, Charikar, Chen, S (in preparation)

Amit Singer (Princeton University) July 2014 17 / 21

slide-18
SLIDE 18

The heterogeneity problem

What if the molecule has more than one possible structure?

(Image source: H. Liao and J. Frank, Classification by bootstrapping in single particle methods, Proceedings of the 2010 IEEE international conference on biomedical imaging, 2010.)

Covariance matrix estimation of the 3-D structures from their 2-D projections Katsevich, Katsevich, S (submitted)

Amit Singer (Princeton University) July 2014 18 / 21

slide-19
SLIDE 19

Class averaging for image denoising

Rotation invariant representation (steerable PCA, bispectrum) Vector diffusion maps S, Wu (Comm. Pure Appl. Math 2012) Zhao, S (J Struct. Bio. 2014) Generalization of Laplacian Eigenmaps (Belkin, Niyogi 2003) and Diffusion Maps (Coifman, Lafon 2006) Graph Connection Laplacian Experimental images (70S) courtesy of

  • Dr. Joachim Frank (Columbia)

Class averages by vector diffusion maps (averaging with 20 nearest neighbors)

Amit Singer (Princeton University) July 2014 19 / 21

slide-20
SLIDE 20

Algorithmic Pipeline

Particle Picking: manual, automatic or experimental image segmentation. Class Averaging: classify images with similar viewing directions, register and average to improve their signal-to-noise ratio (SNR). Orientation Estimation: common lines Three-dimensional Reconstruction: a 3D volume is generated by a tomographic inversion algorithm. Iterative Refinement Extensions Non-trivial point-group symmetries Heterogeneity (structural variability)

Amit Singer (Princeton University) July 2014 20 / 21

slide-21
SLIDE 21

References - Orientation Assignment

  • A. Singer, Y. Shkolnisky, “Three-Dimensional Structure Determination from

Common Lines in Cryo-EM by Eigenvectors and Semidefinite Programming”, SIAM Journal on Imaging Sciences, 4 (2), pp. 543–572 (2011).

  • R. Hadani, A. Singer, “Representation theoretic patterns in three

dimensional Cryo-Electron Microscopy I – The intrinsic reconstitution algorithm”, Annals of Mathematics, 174 (2), pp. 1219–1241 (2011).

  • L. Wang, A. Singer, Z. Wen, “Orientation Determination of Cryo-EM

images using Least Unsquared Deviations”, SIAM Journal on Imaging Sciences, 6(4), pp. 2450–2483 (2013).

  • A. S. Bandeira, M. Charikar, A. Singer, A. Zhu, “Multireference Alignment

using Semidefinite Programming”, 5th Innovations in Theoretical Computer Science (ITCS 2014).

Amit Singer (Princeton University) July 2014 21 / 21