RNA Secondary Structure CSE 417 W.L. Ruzzo

The Double Helix
[Figure; source: Los Alamos Science]

The “Central Dogma” of Molecular Biology
DNA → RNA → Protein
[Figure labels: gene, DNA (chromosome), RNA (messenger), protein, cell]

Non-coding RNA
• Messenger RNA - codes for proteins
• Non-coding RNA - all the rest
– Before, say, the mid-1990s, 1-2 dozen known (critically important, but narrow roles: e.g. ribosomal and transfer RNA, splicing, SRP)
• Since the mid-90s, dramatic discoveries
• Regulation, transport, stability/degradation
• E.g. “microRNA”: hundreds in humans
• E.g. “riboswitches”: thousands in bacteria

DNA structure: dull
…ACCGCTAGATG…
…TGGCGATCTAC…

RNA Structure: Rich
• RNAs fold, and function
• Nature uses what works

Why is Structure Important?
• For protein-coding, similarity in sequence is a powerful tool for finding related sequences
– e.g. “hemoglobin” is easily recognized in all vertebrates
• For non-coding RNA, many different sequences have the same structure, and structure is most important for function.
– So, using structure plus sequence, one can find related sequences at much greater evolutionary distances

Q: What’s so hard?
[Figure: RNA bases (A, C, G, U) laid out in a diagram; layout lost in extraction]
A: Structure often more important than sequence

6S mimics an open promoter
[Figure label: E. coli]
Barrick et al. RNA 2005; Trotochaud et al. NSMB 2005; Willkomm et al. NAR 2005

[Figure labels: Chloroflexi (Chloroflexus aurantiacus); δ-Proteobacteria (Geobacter metallireducens, Geobacter sulfurreducens); Symbiobacterium thermophilum. Legend: used by CMfinder; found by scan]

“Central Dogma” = “Central Chicken & Egg”?
DNA → RNA → Protein
[Figure labels: gene, DNA (chromosome), RNA (messenger), protein, cell]
Was there once an “RNA World”?

6.5 RNA Secondary Structure Algorithms

RNA Secondary Structure
RNA. String B = b_1 b_2 … b_n over alphabet { A, C, G, U }.
Secondary structure. RNA is single-stranded, so it tends to loop back and form base pairs with itself. This structure is essential for understanding the behavior of the molecule.
Ex: GUCGAUUGAGCGAAUGUAACAACGUGGCUACGGCGAGA
[Figure: the example sequence drawn as a folded structure]
Complementary base pairs: A-U, C-G

RNA Secondary Structure
Secondary structure. A set of pairs S = { (b_i, b_j) } that satisfy:
• [Watson-Crick.]
– S is a matching and
– each pair in S is a Watson-Crick pair: A-U, U-A, C-G, or G-C.
• [No sharp turns.] The ends of each pair are separated by at least 4 intervening bases: if (b_i, b_j) ∈ S, then i < j - 4.
• [Non-crossing.] If (b_i, b_j) and (b_k, b_l) are two pairs in S, then we cannot have i < k < j < l.
Free energy. The usual hypothesis is that an RNA molecule will form the secondary structure with the optimum total free energy (approximated here by the number of base pairs).
Goal. Given an RNA molecule B = b_1 b_2 … b_n, find a secondary structure S that maximizes the number of base pairs.
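The three conditions can be checked mechanically. The following is a minimal sketch, assuming Python, 1-based positions as on the slide, and pairs given as (i, j) tuples; the names `is_valid_structure`, `WATSON_CRICK` and `min_loop` are illustrative and not from the lecture.

```python
# Allowed base pairs, written as (left base, right base).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C")}


def is_valid_structure(seq, pairs, min_loop=4):
    """Return True if `pairs` is a secondary structure of `seq`.

    `pairs` holds 1-based (i, j) positions. `min_loop` is the number of
    intervening bases required between the ends of a pair (4 on the slide).
    """
    used = set()
    normalized = []
    for i, j in pairs:
        if i > j:
            i, j = j, i
        if i < 1 or j > len(seq):
            return False
        # Matching: every position takes part in at most one pair.
        if i in used or j in used:
            return False
        used.update((i, j))
        # Watson-Crick: only A-U, U-A, C-G, G-C.
        if (seq[i - 1], seq[j - 1]) not in WATSON_CRICK:
            return False
        # No sharp turns: i < j - 4.
        if not i < j - min_loop:
            return False
        normalized.append((i, j))
    # Non-crossing: never i < k < j < l for two pairs (i, j), (k, l).
    for i, j in normalized:
        for k, l in normalized:
            if i < k < j < l:
                return False
    return True
```

For example, on the string CUCCGGUUGCAAUGUC the pair set {(1,14), (2,11), (4,9)} passes all three checks, while {(1,6), (5,10)} is rejected as crossing.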

RNA Secondary Structure: Examples
[Figure: three example foldings, labeled “ok”, “sharp turn” and “crossing”; annotations: “base pair”, “≤ 4”]

RNA Secondary Structure: Subproblems
First attempt. OPT(j) = maximum number of base pairs in a secondary structure of the substring b_1 b_2 … b_j.
[Figure: b_t matched with b_n; positions 1, t and n marked]
Difficulty. Matching b_t with b_n results in two sub-problems:
• Finding secondary structure in b_1 b_2 … b_{t-1}: this is OPT(t-1).
• Finding secondary structure in b_{t+1} b_{t+2} … b_{n-1}: this is not of the form OPT(j), so we need more sub-problems.

Dynamic Programming Over Intervals
Notation. OPT(i, j) = maximum number of base pairs in a secondary structure of the substring b_i b_{i+1} … b_j.
• Case 1. If i ≥ j - 4.
– OPT(i, j) = 0 by the no-sharp-turns condition.
• Case 2. Base b_j is not involved in a pair.
– OPT(i, j) = OPT(i, j-1)
• Case 3. Base b_j pairs with b_t for some i ≤ t < j - 4.
– non-crossing constraint decouples resulting sub-problems
– OPT(i, j) = 1 + max_t { OPT(i, t-1) + OPT(t+1, j-1) }, taking the max over t such that i ≤ t < j - 4 and b_t and b_j are Watson-Crick complements
For i < j - 4, OPT(i, j) is the larger of the Case 2 and Case 3 values.
Remark. Same core idea in CKY algorithm to parse context-free grammars.
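The three cases translate almost line for line into a memoized recursion. This is a minimal sketch, assuming Python and 1-based indices as in the recurrence; `max_pairs`, `opt` and `PAIRS` are illustrative names, and memoization via `functools.lru_cache` is a convenience choice here, not something the slides prescribe.

```python
from functools import lru_cache

# Watson-Crick complements, as (b_t, b_j).
PAIRS = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C")}


def max_pairs(seq):
    """Maximum number of base pairs in a secondary structure of seq."""

    @lru_cache(maxsize=None)
    def opt(i, j):
        # Case 1: interval too short for any pair (no sharp turns).
        if i >= j - 4:
            return 0
        # Case 2: b_j is not involved in a pair.
        best = opt(i, j - 1)
        # Case 3: b_j pairs with b_t for some i <= t < j - 4.
        for t in range(i, j - 4):
            if (seq[t - 1], seq[j - 1]) in PAIRS:
                best = max(best, 1 + opt(i, t - 1) + opt(t + 1, j - 1))
        return best

    return opt(1, len(seq))
```

Usage: `max_pairs("CUCCGG")` gives 1, since C at position 1 can pair with G at position 6 with exactly four bases in between. Recursion depth grows with sequence length, so for long inputs a bottom-up table is the safer form.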

Bottom Up Dynamic Programming Over Intervals
Q. What order to solve the sub-problems?
A. Do shortest intervals first.
RNA(b_1, …, b_n) {
   for k = 5, 6, …, n-1
      for i = 1, 2, …, n-k
         j = i + k
         Compute M[i, j] using recurrence
   return M[1, n]
}
[Figure: table M with rows i = 1..4 and columns j = 6..9; the entries for the shortest intervals are 0]
Running time. O(n^3): there are O(n^2) table entries, and each takes O(n) time for the max over t.
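The pseudocode can be made concrete as follows. This is a minimal sketch, assuming Python; the table is padded by one extra row and column so that the empty intervals M[i][i-1] read as 0, which is an implementation choice of this sketch. The names `rna_table` and `PAIRS` are illustrative.

```python
# Watson-Crick complements, as (b_t, b_j).
PAIRS = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C")}


def rna_table(seq):
    """Fill M[i][j] = OPT(i, j) for 1 <= i, j <= n, shortest intervals first."""
    n = len(seq)
    # Indices 0..n+1; entries not touched by the loops stay 0 (Case 1).
    M = [[0] * (n + 2) for _ in range(n + 2)]
    for k in range(5, n):              # k = j - i = 5, 6, ..., n-1
        for i in range(1, n - k + 1):  # i = 1, 2, ..., n-k
            j = i + k
            best = M[i][j - 1]         # Case 2: b_j unpaired
            for t in range(i, j - 4):  # Case 3: b_j pairs with b_t
                if (seq[t - 1], seq[j - 1]) in PAIRS:
                    best = max(best, 1 + M[i][t - 1] + M[t + 1][j - 1])
            M[i][j] = best
    return M
```

Usage: `rna_table(seq)[1][len(seq)]` is the answer for the whole string. The three nested loops make the O(n^3) bound visible directly.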

Example: CUCCGGUUGCAAUGUC, n = 16
An optimal structure: ((.(....).)..)..
Table of OPT(i, j), row i, column j:
    j =  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16
i =  1   0  0  0  0  0  1  1  1  1  1  2  2  2  3  3  3
i =  2   0  0  0  0  0  0  0  0  1  1  2  2  2  2  2  2
i =  3   0  0  0  0  0  0  0  0  1  1  1  1  1  2  2  2
i =  4   0  0  0  0  0  0  0  0  1  1  1  1  1  2  2  2
i =  5   0  0  0  0  0  0  0  0  0  1  1  1  1  1  1  2
i =  6   0  0  0  0  0  0  0  0  0  0  0  1  1  1  1  2
i =  7   0  0  0  0  0  0  0  0  0  0  0  1  1  1  1  1
i =  8   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  1
i =  9   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  1
i = 10   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 11   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 12   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 13   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 14   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 15   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
i = 16   0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
E.g.: OPT(1,6) = 1:
CUCCGG
(....)
E.g.: OPT(6,16) = 2:
GUUGCAAUGUC
((....)...)
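The table only records counts; a structure achieving the count can be recovered by walking back through the recurrence. This is a minimal sketch, assuming Python and dot-bracket output; the function name `fold` and the explicit-stack traceback are choices of this sketch, not taken from the slides. When several optimal structures exist, it returns whichever one the traceback meets first, which need not match the one drawn in the example.

```python
# Watson-Crick complements, as (b_t, b_j).
PAIRS = {("A", "U"), ("U", "A"), ("C", "G"), ("G", "C")}


def fold(seq):
    """Return (max number of pairs, one optimal structure in dot-bracket form)."""
    n = len(seq)
    # Fill the table bottom-up, shortest intervals first.
    M = [[0] * (n + 2) for _ in range(n + 2)]
    for k in range(5, n):
        for i in range(1, n - k + 1):
            j = i + k
            best = M[i][j - 1]
            for t in range(i, j - 4):
                if (seq[t - 1], seq[j - 1]) in PAIRS:
                    best = max(best, 1 + M[i][t - 1] + M[t + 1][j - 1])
            M[i][j] = best

    # Traceback: re-derive which case produced each table entry.
    structure = ["."] * n
    stack = [(1, n)]
    while stack:
        i, j = stack.pop()
        if i >= j - 4:
            continue                     # Case 1: nothing to pair
        if M[i][j] == M[i][j - 1]:
            stack.append((i, j - 1))     # Case 2: b_j unpaired
            continue
        for t in range(i, j - 4):        # Case 3: find a partner b_t
            if ((seq[t - 1], seq[j - 1]) in PAIRS
                    and M[i][j] == 1 + M[i][t - 1] + M[t + 1][j - 1]):
                structure[t - 1] = "("
                structure[j - 1] = ")"
                stack.append((i, t - 1))
                stack.append((t + 1, j - 1))
                break
    return M[1][n], "".join(structure)
```

Usage: `count, s = fold("CUCCGGUUGCAAUGUC")` gives `count == 3` and a 16-character string `s` containing three matched bracket pairs.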
