Lecture 7: RNA folding Chapter 6 Problem 6.51 in Jones and Pevzner - - PowerPoint PPT Presentation

lecture 7 rna folding
SMART_READER_LITE
LIVE PREVIEW

Lecture 7: RNA folding Chapter 6 Problem 6.51 in Jones and Pevzner - - PowerPoint PPT Presentation

Lecture 7: RNA folding Chapter 6 Problem 6.51 in Jones and Pevzner and the Turner model Fall 2019 September 19, 2019 RNA Basics RNA bases A,C,G,U Canonical Base Pairs A-U G-C G-U wobble pairing Bases can only pair


slide-1
SLIDE 1

Lecture 7: RNA folding

Chapter 6 – Problem 6.51 in Jones and Pevzner and the Turner model Fall 2019 September 19, 2019

slide-2
SLIDE 2

RNA Basics

2

— RNA bases A,C,G,U — Canonical Base Pairs

  • A-U
  • G-C
  • G-U “wobble” pairing
  • Bases can only pair with
  • ne other base.

Image: http://www.bioalgorithms.info/

slide-3
SLIDE 3

RNA Structural Levels

3

Primary

AAUCG...CUUCUUCCA Primary Secondary Tertiary

slide-4
SLIDE 4

RNA Secondary Structure

4

Hairpin loop Junction (Multiloop) Bulge Loop Single-Stranded Internal Loop Stack Pseudoknot

slide-5
SLIDE 5

Base Pair Maximization

5

U C C A G G A C

Zuker (1981) Nucleic Acids Research 9(1) 133-149

slide-6
SLIDE 6

Base Pair Maximization – Dynamic Programming Algorithm

6

Simple Example: Maximizing Base Pairing

slide-7
SLIDE 7

Base Pair Maximization – Dynamic Programming Algorithm

7

S(i,j) is the folding of the subsequence of the RNA strand from index i to index j which results in the highest number of base pairs

slide-8
SLIDE 8

Base Pair Maximization – Dynamic Programming Algorithm

8

slide-9
SLIDE 9

Base Pair Maximization – Dynamic Programming Algorithm

9

slide-10
SLIDE 10

Base Pair Maximization – Dynamic Programming Algorithm

10

slide-11
SLIDE 11

Base Pair Maximization – Dynamic Programming Algorithm

11

slide-12
SLIDE 12

Circular Representation

12

Images – David Mount

slide-13
SLIDE 13

Pseudoknots

13

— Pseudoknots cause a breakdown in the presented Dynamic

Programming Algorithm.

— In order to form a pseudoknot, checks must be made to ensure

base is not already paired – this breaks down the divide and conquer recurrence relations.

Images – David Mount

slide-14
SLIDE 14

Simplifying Assumptions

  • RNA folds into one minimum free-energy

structure.

  • There are no knots (base pairs never cross).
  • The energy of a particular base pair in a double

stranded region is sequence independent.

  • Neighbors do not influence the energy.
  • Was solved by dynamic programming, Zucker and

Steigler 1981

14

slide-15
SLIDE 15

Sequence Dependent Base Pair Energy Values (Nearest Neighbor Model)

15

U U C G G C A U G C A UCGAC 3’ 5’ U U C G U A A U G C A UCGAC 3’ 5’

Example values: GC GC GC GC AU GC CG UA

  • 2.3 -2.9 -3.4 -2.1
slide-16
SLIDE 16

Free Energy Computation (Nearest Neighbor Model)

16

U U A A G C G C A G C U A A U C G A U A 3’ A 5’

  • 0.3
  • 0.3
  • 1.1 mismatch of hairpin
  • 2.9 stacking

+3.3 1nt bulge

  • 2.9 stacking
  • 1.8 stacking

5’ dangling

  • 0.9 stacking
  • 1.8 stacking
  • 2.1 stacking

G= - 4.9 kcal/mol

+5.9 4 nt loop

slide-17
SLIDE 17

RNA Secondary Structure

17

Stack

slide-18
SLIDE 18

Nearest Neighbor Model

  • Stacking energy - assign negative energies to these

between base pair regions.

  • Energy is influenced by the nearest closing base pair
  • These energies are estimated experimentally from small

synthetic RNAs.

  • Positive energy - added for low entropy regions such

as bulges, loops, etc.

18

slide-19
SLIDE 19

RNA Secondary Structure

19

Hairpin loop

slide-20
SLIDE 20

Nearest Neighbor Model

  • Hairpin energy:
  • Experimentally measured for hairpins of length 5, 6, 7, 8, …

up to a maximum. Extrapolation above the maximum.

  • The closing pair affects the energy. Distinguish between A-

U and C-G.

20

slide-21
SLIDE 21

RNA Secondary Structure

21

Bulge Loop Internal Loop

slide-22
SLIDE 22

Nearest Neighbor Model

  • Bulge/Internal energy:
  • Let L1, L2 denote the lengths of the two sides of the bulge/

internal loop.

  • Experimentally measured for different values of L1, L2.
  • In practice for computational convenience, the energy is

given as function of L1 + L2 by a lookup table and extrapolation.

22

slide-23
SLIDE 23

RNA Secondary Structure

23

Junction (Multiloop)

slide-24
SLIDE 24

Nearest Neighbor Model

  • Multiloop energy:
  • Let U denote the number of unpaired bases.
  • Let P denote the number of base pairs.
  • The free energy is an affine function of U and P:

a1 + a2 U + a3 P.

  • This is the least accurate component of the NN model.

24