Modeling and studying RNA secondary structure
Eugène Asarin LIAFA, CNRS & Univ. Paris Diderot
Modeling and studying RNA secondary structure Eugne Asarin LIAFA, - - PowerPoint PPT Presentation
Modeling and studying RNA secondary structure Eugne Asarin LIAFA, CNRS & Univ. Paris Diderot Credits Co-authors, partners and teachers: Vassily Lyubetsky, Alexander Seliverstov (IITP) Thierry Cachat, Tayssir Touili (LIAFA)
Eugène Asarin LIAFA, CNRS & Univ. Paris Diderot
Co-authors, partners and teachers:
Sponsor:
EVOLVER/REVERA
Special thanks to:
background
– everything is a transition system – one should explore its state space in a smart way
– More informatics than byology+physics – More models than solutions – More questions and speculations than answers
Motivating example: Classical Attenuation Regulation
DNA A G C T G C
Polymerase DNA RNA Ribosome Amino Acids Transcription: DNA to RNA, done by Polymerase Translation: RNA to Amino Acids , done by Ribosome Gene
Gene DNA RNA Ribosome Amino Acids
Gene expressed if Polymerase reach the Gene
Gene DNA RNA Ribosome
Depends on the structure of the RNA between Ribosome and Polymerase
A C A C U G G C U C A C C U U C G G G U G G G C C U U U C U C G
RNA: a sequence of nucleotides A, G, C, U with links A-U and G-C ACACUG C C A C U C G G G U G A G C CUUUCUGCG U U G C Helix
(a simplified view)
This structure is dynamic and changes very fast and can cause the slippage of the Polymerase
Gene RNA Ribosome
T-rich region: connection of Pol and DNA weakens
Gene RNA Ribosome
T-rich region: connection of Pol and DNA weakens Slippage of the polymerase and the gene is not expressed: Termination
Gene RNA Ribosome
T-rich region: connection of Pol and DNA weakens Polymerase reaches Gene and the Gene is expressed: Antitermination
Each of these two situations can happen with some probability.
secondary structure capable to predict the probability of gene expression.
– Should be quantitative – Should represent transient behaviours (steady state not enough) – Should be validated by biological data on regulation
RNA, e.g. in ribozymes
– The sequence is fixed. – States of the MC : all possible secondary structures on this sequence (or part of it) – Transitions: simple events (see below) – Rates: determined by E
– As usual, many parameters are difficult to find – The Markov Chain is huge and complex – Only Monte-Carlo simulation is possible – It is still heavy and slow
simulation etc.
We suggest Probabilistic Rewriting Systems
by a term
structure by rewriting rules
RNA Ribosome Polymerase
w =(R,P) R: position of the Ribosome in the RNA P: position of the Polymerase in the RNA W={w=(R,P) s.t. 13 R P l } l: the length of the regulatory region
A C B D f = (A,B,C,D)
A C B D f = (A,B,C,D) g = (E,F,G,H) F E G H
f(g)
A C B D f = (A,B,C,D) w =(R,P) P R
w(f)
A C B D f = (A,B,C,D) g = (E,F,G,H) w =(R,P) P R
w(f , g)
E G F H
f j k h g R P
w =(R,P) w(f , g(h,k) , j) The order is not important
A C B D H G F E
f = (A,B,C,D) g = (E,F,G,H)
Rewriting rule: f g One rule for multipe contexts
f j k h
f j k h g
Rewriting rule: w(f , h , k , j) w(f , g(h , k) , j) Meta Rewriting rule: w( , h , k ) w( , g(h , k) )
f j k h m
f j k h g m
Meta Rewriting rule: m( , h , k ) m( , g(h , k) )
One rule for multipe contexts
RNA Ribosome Polymerase
w =(R,P) Movement of the Ribosome: (R , P)(x) (R+3 , P) (x’) Movement of the Polymerase: (R , P) (x) (R , P+1) (x) Termination: Slippage of the Polymerase: (R , P) (x) three rules for multipe contexts
– Stacking energy (easy) – Free energy (a bit obscure)
Implemented or not
visited.
states
successors of a given state (and the rates)
repeat 1000 times s=empty window repeat find all successors s’ of s and rates s->s’ s= a random s’ until expressed or aborted
(not used yet) A data structure (close to a formula) to represent current set of states or probability distribution.
explode
(everybody use it in a naive way)
helices stored.
800000->300000 states
done systematically
(terminator, antiterminator, noise)
level?
any progress, major event rare
together
steady state
Evaluation – mostly for N^k
not a big deal
shortcuts, a colored graphs?
and Performance evaluation and backwards
units is:
units is:
' , ( ) ( ), 1 ( ) ( ) ' , (
) (
s s s E e s E s s
t s E