And now for something completely different: CFG utility beyond compilers

Dec 31, 2023


And now for something completely different: CFG utility beyond compilers

An RNA Structure

An RNA Sensor & On/Off Switch
L19 absent: Gene On
L19 present: Gene Off
(Figure labels: mRNA leader, switch?)

An RNA Grammar
S → LS | L
L → s | “dFd”
F → LS | “dFd”
“dFd” means a Watson-Crick base pair around F: aFu | uFa | gFc | cFg. This gives paren-like nesting.
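The paren-like nesting of Watson-Crick pairs can be illustrated with a small stack-based check. This is only a sketch: the dot-bracket structure string, the name `is_valid_structure`, and the `PAIRS` table are assumptions introduced for illustration, and the function tests only the pairing and nesting constraint, not full membership in the grammar above.

```python
# Watson-Crick pairs from the slide: aFu | uFa | gFc | cFg
PAIRS = {("a", "u"), ("u", "a"), ("g", "c"), ("c", "g")}


def is_valid_structure(seq, structure):
    """Check that every '(' ... ')' pair in `structure` is properly nested
    and joins two nucleotides of `seq` that form a Watson-Crick pair.

    `structure` uses dot-bracket notation (an assumption, not from the
    slides): '.' marks an unpaired base, '(' and ')' mark the two ends
    of a pair.
    """
    if len(seq) != len(structure):
        return False
    stack = []  # positions of '(' that are still waiting for a partner
    for pos, mark in enumerate(structure):
        if mark == "(":
            stack.append(pos)
        elif mark == ")":
            if not stack:
                return False  # closing bracket with no opener
            open_pos = stack.pop()
            if (seq[open_pos], seq[pos]) not in PAIRS:
                return False  # nested correctly, but not a legal pair
        elif mark != ".":
            return False  # unknown symbol
    return not stack  # every opener must have been closed


print(is_valid_structure("gcaagc", "((..))"))
print(is_valid_structure("gaaag", "(...)"))
```

The stack plays the same role as the recursion in “dFd”: the most recently opened pair must be the first one closed.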
Actually, a Stochastic CFG
Associate probabilities with rules:
S → LS (0.87) | L (0.13)
L → s (0.89*p(s)) | dFd (0.11*p(dd))
F → LS (0.21) | dFd (0.79*p(dd))
Where p(s) & p(dd) are the probabilities of the specific single/paired nucleotides, perhaps from empirical data or a model of sequence evolution.

What SCFG Gives
“Prior” probabilities for frequencies of nucleotides/pairs, fraction paired vs unpaired, average lengths of each, etc.
Result: a probability distribution on sequences/structures.
E.g., is my sequence more likely to arise under this RNA model or a simple “background” model, say where A/C/G/T = 1/4?

Cocke-Kasami-Younger Parser
Suppose all rules are of the form A → BC or A → a (by mechanically transforming the grammar, or algorithm below…).
Given x = x_1…x_n, want M[i,j] = { A | A ⇒* x_{i+1}…x_j }.
For j = 1 to n
    M[j-1,j] = { A | A → x_j is a rule }
    for i = j-2 down to 0
        M[i,j] = ∪_{i<k<j} M[i,k] ⊗ M[k,j]
Where X ⊗ Y = { A | A → BC, B ∈ X, and C ∈ Y }.
Time: O(n^3)

“Inside” Algorithm for SCFG
Just like CKY, but instead of just recording the possibility of A in M[i,j], record its probability: for each A, do sum instead of union, over all possible k and all possible A → BC rules, of the products of their respective probabilities.
Result: for each i, j, A, have Pr(A ⇒* x_{i+1}…x_j).
(Figure: A at the root spanning x_{i+1}…x_j, with child B over x_{i+1}…x_k and child C over x_{k+1}…x_j.)
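The CKY and inside recurrences can be sketched in a few lines of Python. This is a minimal illustration, not the code behind the slides: the rule encoding (dictionaries keyed by tuples), the function names `cky` and `inside`, and the toy grammar S → SS (0.4) | a (0.6) are all assumptions chosen so the numbers are easy to check by hand. With 0-based Python slices, `M[i][j]` covers `x[i:j]`, which is x_{i+1}…x_j in the slide's notation.

```python
# Toy SCFG in the required form A -> BC or A -> a (hypothetical example).
BINARY = {("S", "S", "S"): 0.4}   # (A, B, C) -> Pr(A -> B C)
LEXICAL = {("S", "a"): 0.6}       # (A, a)    -> Pr(A -> a)


def cky(x, binary, lexical):
    """Set-valued CKY: M[i][j] is the set of A that derive x[i:j]."""
    n = len(x)
    M = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    for j in range(1, n + 1):
        # Width-1 spans come straight from the A -> a rules.
        M[j - 1][j] = {A for (A, a) in lexical if a == x[j - 1]}
        for i in range(j - 2, -1, -1):
            for k in range(i + 1, j):          # split point, i < k < j
                for (A, B, C) in binary:
                    if B in M[i][k] and C in M[k][j]:
                        M[i][j].add(A)
    return M


def inside(x, binary, lexical):
    """Inside algorithm: P[i][j][A] = Pr(A derives x[i:j]).

    Same loops as cky(), with sum of products in place of set union.
    """
    n = len(x)
    P = [[{} for _ in range(n + 1)] for _ in range(n + 1)]
    for j in range(1, n + 1):
        for (A, a), p in lexical.items():
            if a == x[j - 1]:
                P[j - 1][j][A] = P[j - 1][j].get(A, 0.0) + p
        for i in range(j - 2, -1, -1):
            for k in range(i + 1, j):
                for (A, B, C), p in binary.items():
                    term = p * P[i][k].get(B, 0.0) * P[k][j].get(C, 0.0)
                    if term > 0.0:
                        P[i][j][A] = P[i][j].get(A, 0.0) + term
    return P


x = "aaa"
print("S" in cky(x, BINARY, LEXICAL)[0][len(x)])
print(inside(x, BINARY, LEXICAL)[0][len(x)]["S"])
```

For "aaa" the toy grammar has two parse trees, each with probability 0.4 * 0.4 * 0.6^3 = 0.03456, so the inside probability of S over the whole string is their sum, 0.06912 (up to floating-point rounding). The three nested loops over j, i and k give the O(n^3) running time noted above.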
The SCFG “Viterbi” algorithm
Like inside, but use max instead of sum.
Gives the probability of the single parse tree having max probability (inside sums probability over all legal trees).

ncRNA Discovery in Bacteria
CMfinder: A Covariance Model Based RNA Motif Finding Algorithm. Yao, Weinberg, Ruzzo. Bioinformatics, 2006, 22(4): 445-452.
A Computational Pipeline for High Throughput Discovery of cis-Regulatory Noncoding RNA in Prokaryotes. Yao, Barrick, Weinberg, Neph, Breaker, Tompa and Ruzzo. PLoS Comput Biol. 3(7): e126, July 6, 2007.
Identification of 22 candidate structured RNAs in bacteria using the CMfinder comparative genomics pipeline. Weinberg, Barrick, Yao, Roth, Kim, Gore, Wang, Lee, Block, Sudarsan, Neph, Tompa, Ruzzo and Breaker. Nucl. Acids Res., July 2007 35: 4809-4819.
(Figure legend: boxed = confirmed riboswitch (+2 more))

ncRNA Discovery in Vertebrates
Comparative genomics beyond sequence based alignments: RNA structures in the ENCODE regions. Torarinsson, Yao, Wiklund, Bramsen, Hansen, Kjems, Tommerup, Ruzzo and Gorodkin. Genome Research, to appear.
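For the SCFG “Viterbi” algorithm described above, the change from inside is a single operation: keep the maximum over split points and rules instead of the sum. The sketch below is an illustration under assumptions, not the CMfinder implementation: the rule encoding, the function name `viterbi`, the tuple representation of parse trees, and the toy grammar S → SS (0.4) | a (0.6) are all hypothetical.

```python
# Toy SCFG with rules of the form A -> BC or A -> a (hypothetical example).
BINARY = {("S", "S", "S"): 0.4}   # (A, B, C) -> Pr(A -> B C)
LEXICAL = {("S", "a"): 0.6}       # (A, a)    -> Pr(A -> a)


def viterbi(x, binary, lexical, start="S"):
    """Return (probability, tree) of the most probable parse of x.

    best[i][j][A] holds the best (probability, tree) for A deriving x[i:j].
    Trees are nested tuples: (A, a) for a leaf, (A, left, right) otherwise.
    Returns (0.0, None) when x has no parse from `start`.
    """
    n = len(x)
    best = [[{} for _ in range(n + 1)] for _ in range(n + 1)]
    for j in range(1, n + 1):
        for (A, a), p in lexical.items():
            if a == x[j - 1] and p > best[j - 1][j].get(A, (0.0, None))[0]:
                best[j - 1][j][A] = (p, (A, a))
        for i in range(j - 2, -1, -1):
            for k in range(i + 1, j):          # split point, i < k < j
                for (A, B, C), p in binary.items():
                    if B in best[i][k] and C in best[k][j]:
                        p_left, t_left = best[i][k][B]
                        p_right, t_right = best[k][j][C]
                        cand = p * p_left * p_right
                        # max instead of sum: keep only the best candidate
                        if cand > best[i][j].get(A, (0.0, None))[0]:
                            best[i][j][A] = (cand, (A, t_left, t_right))
    return best[0][n].get(start, (0.0, None))


prob, tree = viterbi("aaa", BINARY, LEXICAL)
print(prob)
print(tree)
```

On "aaa" the two parse trees tie at 0.4 * 0.4 * 0.6^3 = 0.03456 each, so the Viterbi probability is 0.03456 while the inside probability for the same string is 0.06912, the sum over both trees.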
Experimental Validation

Bottom Line
CFG technology is a key tool for RNA description, discovery and search.
A very active research area. (Some call RNA the “dark matter” of the genome.)
Huge compute hog: the results above represent hundreds of CPU-years, and smart algorithms can have a big impact.
More? Check out CSE 427.
