microscopic advances with large scale learning stochastic
play

Microscopic Advances with Large-Scale Learning: Stochastic - PowerPoint PPT Presentation

Microscopic Advances with Large-Scale Learning: Stochastic Optimization for Cryo-EM Ali Punjani, Marcus Brubaker University of Toronto Department of Computer Science Structure Determination } Macromolecules } Protein structure determines


  1. Microscopic Advances with Large-Scale Learning: Stochastic Optimization for Cryo-EM Ali Punjani, Marcus Brubaker University of Toronto Department of Computer Science

  2. Structure Determination } Macromolecules } Protein structure determines function } Traditional approaches: } X-ray Crystallography } NMR Spectroscopy

  3. Electron Cryo-Microscopy (Cryo-EM) Computational Task: Low dose electron beam Recover 3D Electron Density Particles in unknown 3D pose Ice Transfer Function Film/CCD Corrupted Noisy Integral Projections } No crystals needed, large molecules and complexes

  4. Cryo-EM Image Formation Low dose electron beam 2D Particle Images Corruption by CTF Particles in unknown 3D pose = Ice Transfer Function Film/CCD Corrupted Noisy Integral Projections } Challenges for reconstruction: } Destructive CTF } Low SNR } Unknown pose

  5. Cryo-EM Image Formation R t θ I V K p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I )

  6. Cryo-EM Image Formation R t θ I V K Voxels p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I ) Integral Projection Linear

  7. Cryo-EM Image Formation R t θ I V K Voxels p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I ) Integral Projection Linear In Fourier Domain: Fourier Coefficients p (˜ I| θ , R , t , ˜ V ) = N (˜ I| ˜ S t ˜ C θ ˜ P R ˜ V , σ 2 I ) Slicing Diagonal

  8. Marginalization for Latent Variables R t θ I V K Z Z p (˜ I| θ , ˜ p (˜ I| θ , R , t , ˜ V ) = V ) p ( R ) p ( t ) d R d t R 2 SO (3)

  9. Marginalization for Latent Variables R t θ I V K Z Z p (˜ I| θ , ˜ p (˜ I| θ , R , t , ˜ V ) = V ) p ( R ) p ( t ) d R d t R 2 SO (3) M w j p (˜ I| θ , R j , t j , ˜ X V ) ≈ j =1 } Numerical Quadrature

  10. Maximum-a-Posteriori Estimation R t θ I V K K p (˜ I i | θ i , ˜ Y p ( V| D ) ∝ p ( V ) V ) i =1

  11. Optimization Problem R t θ I V K K p (˜ I i | θ i , ˜ Y p ( V| D ) ∝ p ( V ) V ) i =1 K ⇣ V ) + K − 1 log p ( V ) ⌘ log p (˜ I| θ , ˜ X arg min V − i =1

  12. Stochastic Optimization for Cryo-EM K V ) + K − 1 log p ( V ) ⇣ ⌘ X log p (˜ I| θ , ˜ arg min V − i =1 } Expensive to compute objective with large K } Stochastic Optimization: } Approximate objective with subset of images } Update based on approximate gradient } Various Algorithms (vary by update rule) } Advantages: speed, random initialization

  13. Experiments: Datasets } Real Dataset: } 46K Images of ATP Synthase from Thermus Thermophilius } Low SNR and known CTF parameters

  14. Experiments: Datasets } Synthetic Dataset: } 50,000 Projections of known artificial density } Low SNR and realistic CTF parameters

  15. Experiments: Seven Methods } Vanilla Stochastic Gradient Descent (SGD) } Momentum Methods: } Classical Momentum } Nesterov’s Accelerated Gradient } Adaptive Methods: } AdaGrad } TONGA } Quasi-Second Order Methods: } Online L-BFGS } Hessian Free

  16. Experiments: Results } Identical random initialization in all experiments

  17. Experiments: Results } Simplest Method

  18. Experiments: Results } Momentum Method

  19. Experiments: Results } Adaptive Step-size

  20. Experiments: Results } Quasi-second order

  21. Experiments: Results } Qualitatively Similar } Reasonable in one pass through data

  22. Experiments: Results

  23. Experiments: Results

  24. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 3 Hours – 1 Epochs

  25. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs

  26. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs

  27. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs } Random Initialization is difficult for other methods

  28. Conclusions } Introduced Cryo-EM Structure Determination } Stochastic Optimization solution } Simple methods are best } State of the art speed and robustness

  29. Recent Progress } Higher resolution reconstructions } Importance Sampling: 100,000x speedup

  30. Recent Progress } Higher resolution reconstructions } Importance Sampling: 100,000x speedup } Forward: } Heterogeneous mixtures of particles } Better priors } Video exposure

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend