CSE 527 Lecture 9 The Gibbs Sampler Talk Today Zasha Weinberg - PowerPoint PPT Presentation

CSE 527 Lecture 9 The Gibbs Sampler

Talk Today • Zasha Weinberg Combi HSB K-069, 1:30 “Fast, accurate annotation of non-coding RNAs”

The “Gibbs Sampler” • Lawrence et al. “Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Sequence Alignment” Science 1993

The Double Helix Los Alamos Science

Some DNA Binding Domains

Some History • Geman & Geman, IEEE PAMI 1984 • Hastings, Biometrika, 1970 • Metropolis, Rosenbluth, Rosenbluth, Teller, & Teller, “Equation of State Calculations by Fast Computing Machines,” J. Chem. Phys. 1953 • Josiah Willard Gibbs, 1839-1903, American physicist, a pioneer of thermodynamics

How to Average • An old problem: • k random variables: $x_1, x_2, \ldots, x_k$ • Joint distribution (p.d.f.): $P(x_1, x_2, \ldots, x_k)$ • Some function: $f(x_1, x_2, \ldots, x_k)$ • Want Expected Value: $E(f(x_1, x_2, \ldots, x_k))$

How to Average
$$E(f(x_1, x_2, \ldots, x_k)) = \int_{x_1} \int_{x_2} \cdots \int_{x_k} f(x_1, x_2, \ldots, x_k) \cdot P(x_1, x_2, \ldots, x_k) \, dx_1 \, dx_2 \ldots dx_k$$
• Approach 1: direct integration (rarely solvable analytically, esp. in high dim)
• Approach 2: numerical integration (often difficult, e.g., unstable, esp. in high dim)
• Approach 3: Monte Carlo integration: sample $\vec{x}^{(1)}, \vec{x}^{(2)}, \ldots, \vec{x}^{(n)} \sim P(\vec{x})$ and average: $E(f(\vec{x})) \approx \frac{1}{n} \sum_{i=1}^{n} f(\vec{x}^{(i)})$
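Approach 3 can be illustrated with a minimal Python sketch. The target distribution (two independent standard normals), the function `sum_of_squares`, and all helper names are assumptions chosen only because the exact answer, 2, is known; they do not come from the lecture.

```python
import random

def monte_carlo_expectation(f, sampler, n, rng):
    """Estimate E[f(x)] as the average of f over n independent draws."""
    total = 0.0
    for _ in range(n):
        total += f(sampler(rng))
    return total / n

# Illustrative target (an assumption): x = (x1, x2), independent standard
# normals, with f(x) = x1^2 + x2^2, whose exact expectation is 2.
def sample_two_normals(rng):
    return (rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0))

def sum_of_squares(x):
    return x[0] ** 2 + x[1] ** 2

rng = random.Random(0)
estimate = monte_carlo_expectation(sum_of_squares, sample_two_normals, 100_000, rng)
print(f"estimate = {estimate:.3f} (exact value is 2)")
```

The error of such an estimate shrinks roughly like $1/\sqrt{n}$ regardless of the dimension of $\vec{x}$, which is why sampling is attractive when the integral has many variables.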

Markov Chain Monte Carlo (MCMC)
• Independent sampling also often hard, but not required for expectation
• MCMC: each sample is drawn conditional on the previous one, $\vec{X}_{t+1} \mid \vec{X}_t$
• Simplest & most common: Gibbs Sampling, which draws from $P(x_i \mid x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_k)$
• Algorithm:
    for t = 1 to ∞
        for i = 1 to k do:
            $x_{t+1,i} \sim P(x_{t+1,i} \mid x_{t+1,1}, x_{t+1,2}, \ldots, x_{t+1,i-1}, x_{t,i+1}, \ldots, x_{t,k})$
  (components 1 to i−1 are already updated in sweep t+1; components i+1 to k still hold their values from sweep t)
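As a concrete instance of the Gibbs loop above, here is a hedged Python sketch for k = 2. The target, a standard bivariate normal with correlation `rho`, is an assumption picked because its full conditionals are known in closed form ($x_1 \mid x_2 \sim N(\rho x_2, 1 - \rho^2)$, and symmetrically for $x_2$); the function name and the burn-in length are likewise illustrative.

```python
import math
import random

def gibbs_bivariate_normal(rho, n_steps, rng, start=(0.0, 0.0)):
    """Gibbs-sample a standard bivariate normal with correlation rho."""
    cond_sd = math.sqrt(1.0 - rho * rho)  # std. dev. of each full conditional
    x1, x2 = start
    samples = []
    for _ in range(n_steps):
        # i = 1: draw x1 given the current x2.
        x1 = rng.gauss(rho * x2, cond_sd)
        # i = 2: draw x2 given the x1 just drawn in this sweep.
        x2 = rng.gauss(rho * x1, cond_sd)
        samples.append((x1, x2))
    return samples

rng = random.Random(1)
samples = gibbs_bivariate_normal(rho=0.8, n_steps=50_000, rng=rng)
kept = samples[1_000:]  # drop an arbitrarily chosen burn-in prefix
mean_product = sum(a * b for a, b in kept) / len(kept)
print(f"E[x1*x2] estimate = {mean_product:.3f} (exact value is rho = 0.8)")
```

Successive samples are correlated, yet their average still approximates the expectation, which is the point of the first bullet on this slide.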

• Input: again assume sequences $s_1, \ldots, s_k$ with one length-$w$ motif per sequence
• Motif model: WMM (weight matrix model)
• Parameters: Where are the motifs? For $1 \le i \le k$, have $1 \le x_i \le |s_i| - w + 1$
• “Full conditional”: to calc $P(x_i = j \mid x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_k)$, build WMM from motifs in all sequences except $i$, then calc prob that motif in $i$th seq occurs at $j$ by usual “scanning” alg.

Randomly initialize $x_i$'s
for t = 1 to ∞
    for i = 1 to k
        discard motif instance from $s_i$; recalc WMM from rest
        for j = 1 ... $|s_i| - w + 1$
            calculate prob that $i$th motif is at $j$: $P(x_i = j \mid x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_k)$
        pick new $x_i$ according to that distribution
Similar to MEME, but it would average over, rather than sample from, that distribution.
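The loop above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of Lawrence et al.: positions are 0-based, the alphabet is exactly `ACGT`, the background is uniform (0.25 per letter), a pseudocount of 1 is added to every count, and the function names and toy sequences are hypothetical.

```python
import random

ALPHABET = "ACGT"  # assumption: sequences contain only these letters

def build_wmm(seqs, starts, w, skip, pseudocount=1.0):
    """Per-position letter frequencies from every motif instance except seqs[skip]."""
    counts = [{c: pseudocount for c in ALPHABET} for _ in range(w)]
    for idx, (s, x) in enumerate(zip(seqs, starts)):
        if idx == skip:
            continue  # discard the motif instance from sequence i
        for p in range(w):
            counts[p][s[x + p]] += 1.0
    wmm = []
    for col in counts:
        total = sum(col.values())
        wmm.append({c: col[c] / total for c in ALPHABET})
    return wmm

def full_conditional(seq, wmm, w, background=0.25):
    """P(x_i = j | all other x's) for each start j, by likelihood-ratio scanning."""
    weights = []
    for j in range(len(seq) - w + 1):
        ratio = 1.0
        for p in range(w):
            ratio *= wmm[p][seq[j + p]] / background
        weights.append(ratio)
    total = sum(weights)
    return [wt / total for wt in weights]

def gibbs_motif_sampler(seqs, w, n_sweeps, rng):
    """Run n_sweeps Gibbs sweeps and return the final motif start positions."""
    starts = [rng.randrange(len(s) - w + 1) for s in seqs]  # random initialization
    for _ in range(n_sweeps):
        for i, s in enumerate(seqs):
            wmm = build_wmm(seqs, starts, w, skip=i)
            probs = full_conditional(s, wmm, w)
            # Sample the new start rather than taking the most probable one.
            starts[i] = rng.choices(range(len(probs)), weights=probs)[0]
    return starts

rng = random.Random(2)
seqs = ["TTGACGTCAT", "GACGTCAACC", "CCTGACGTCA", "ATGACGTCGG"]  # toy data
starts = gibbs_motif_sampler(seqs, w=6, n_sweeps=200, rng=rng)
for s, x in zip(seqs, starts):
    print(x, s[x:x + 6])
```

Because each new position is sampled, the final state is one draw from the chain rather than a guaranteed optimum; different seeds can end on different alignments.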

Issues • Burn-in - how long must we run the chain to reach stationarity? • Mixing - how long a post-burn-in sample must we take to get a good sample of the stationary distribution? (Recall that individual samples are not independent, and may not “move” freely through the sample space.)
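One common way to look at mixing, offered here as a heuristic rather than something the slides prescribe, is the sample autocorrelation of the chain at increasing lags. The sketch below uses a stand-in chain $x_{t+1} = a x_t + \text{noise}$ (an assumption, not a motif sampler), started far from its stationary mean so that a burn-in prefix is needed; the burn-in length of 500 is chosen by eye.

```python
import random

def autocorrelation(xs, lag):
    """Sample autocorrelation of the sequence xs at the given lag."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    cov = sum((xs[t] - mean) * (xs[t + lag] - mean) for t in range(n - lag)) / n
    return cov / var

def toy_chain(a, n_steps, rng, start=10.0):
    """Stand-in Markov chain x_{t+1} = a * x_t + noise, started away from its mean of 0."""
    x = start
    out = []
    for _ in range(n_steps):
        x = a * x + rng.gauss(0.0, 1.0)
        out.append(x)
    return out

rng = random.Random(3)
chain = toy_chain(a=0.9, n_steps=20_000, rng=rng)
kept = chain[500:]  # discard a burn-in prefix (length is a judgment call)
for lag in (1, 10, 50):
    print(lag, round(autocorrelation(kept, lag), 3))
```

Autocorrelation that decays slowly with lag indicates poor mixing: many post-burn-in samples are then needed to match the information in a few independent ones.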

Variants & Extensions • “Phase Shift” - may settle on suboptimal solution that overlaps part of motif. Periodically try moving all motif instances a few spaces left or right. • Algorithmic adjustment of pattern width: Periodically add/remove flanking positions to maximize (roughly) average relative entropy per position • Multiple patterns per string
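The quantity used for width adjustment, average relative entropy per position, can be computed as in this hedged sketch. It assumes a uniform background of 0.25 per letter and no pseudocounts, and the function names and example alignments are hypothetical; the per-column value is $\sum_c f_c \log_2 (f_c / b_c)$.

```python
import math

ALPHABET = "ACGT"  # assumption: DNA alphabet

def column_relative_entropy(column, background=0.25):
    """Relative entropy (bits) of one alignment column versus a uniform background."""
    n = len(column)
    total = 0.0
    for c in ALPHABET:
        f = column.count(c) / n
        if f > 0.0:  # a term with f = 0 contributes 0
            total += f * math.log2(f / background)
    return total

def average_relative_entropy(instances):
    """Mean per-position relative entropy of equal-length motif instances."""
    w = len(instances[0])
    columns = ["".join(inst[p] for inst in instances) for p in range(w)]
    return sum(column_relative_entropy(col) for col in columns) / w

narrow = ["GACGTC", "GACGTC", "GACGTC", "GACGTC"]
wide = ["TGACGTCA", "CGACGTCT", "AGACGTCG", "GGACGTCC"]  # same core, noisy flanks
print(average_relative_entropy(narrow), average_relative_entropy(wide))
```

In this toy case the flanking columns of `wide` carry no information, so adding them lowers the per-position average; a width-adjustment step would therefore prefer the narrower pattern.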
