CU U sequences, using an iterative training procedure that is - PDF document

.. 1994 Oxford University Press Nucleic Acids Research, 1994, Vol. 22, No. 11 2079-2088 RNA sequence analysis using covariance models Sean R.Eddy* and Richard Durbin MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK Received February 16, 1994; Revised and Accepted April 26, 1994 ABSTRACT We describe a general approach to several RNA molecules fit for a particular function, such as protein binding sequence analysis problems using probabilistic models (19, 20) or even catalysis (21), out of randomized repertoires. that flexibly describe the secondary structure and One wants to be able to detect similar RNAs and RNA motifs primary sequence consensus of an RNA sequence the primary sequence based in sequence data. However, family. We call these models 'covariance models'. A techniques that generally work quite well for protein sequence covariance model of tRNA sequences is an extremely analysis are not well suited for studying RNA. sensitive and discriminative tool for searching for Most functional RNAs appear to be selected more for additional tRNAs and tRNA-related sequences in particular base-paired than maintenance of a structure sequence databases. A model can be built conservation of primary sequence. RNA secondary structure automatically from an existing sequence alignment. We induces strong pairwise correlations in RNA sequence, usually also describe an algorithm for learning a model and manifested as Watson-Crick complementarity. RNA sequence hence a consensus secondary structure from initially analysis therefore must work with this pattern of correlations in unaligned example sequences and no prior structural addition to primary sequence conservation, and methods for information. Models trained unaligned tRNA searching databases for new members of RNA families have on examples correctly predict tRNA scondary structure consequently lagged behind those for analysis of protein. Transfer and produce high-quality multiple alignments. The RNA or group I introns can be recognized by specialized, custom- approach may be applied to any family of small RNA built programs (22-25). Programs that use manually constructed and relatively inflexible patterns of conserved residues and base- sequences. pairs, analogous to PROSITE patterns of protein motif sequences (26), have been described for RNA (27, 28). More general INTRODUCTION methods that capture both primary and secondary structure A major role of computational methods in molecular biology is consensus information while still flexibly scoring insertions, to identify similarities between sequences. Similarity between deletions, and mismatches are desirable (29, 30). sequences generally implies functional and/or evolutionary Database searching for RNAs is not the only problem affected homology and therefore provides important biological by the lack of mathematical models that deal with secondary structure. Multiple RNA sequence alignment, a prerequisite for information. The analysis of large-scale genome sequence data the inference of phylogenetic trees and for RNA structure is particularly dependent upon similarity searching methods (1-4). Sirnilarity searching methods are fairly well developed prediction, is a markedly circular problem: accurate multiple alignment relies on an accurate secondary structure prediction, for protein sequence analysis. Fast algorithms such as BLAST are in widespread use for detecting (5) and FASTA (6) and vice versa. RNA sequences that share a common function and structure can appear to be unrelated and unalignable until homologues of new protein sequences. Even more sensitive methods such as profiles (7, 8) or hidden Markov models (9, a common secondary structure is recognized. The most reliable 10) are available which use consensus information from multiple means of consensus RNA secondary structure prediction and sequence alignments to detect new members of protein sequence multiple alignment is the iterative, laborious refinement process families. of comparative sequence analysis (31, 32)-a process of computer-aided recognition of strongly correlated positions in There are also many biologically important macromolecules a multiple alignment followed by manual refinement of the that are composed of RNA. These include transfer RNA(1 1, 12), ribosomal RNA (13), group I and group II catalytic introns (14, alignment. The rapid discovery of new RNA sequence families 15), and spliceosomal small nuclear RNAs (16), to name just by in vitro selection methods, in particular, is creating a need a few. Target sites for genetic regulation are often specific for automatic RNA structure prediction and multiple alignment structures in mRNA molecules, such as the TAR or RRE binding methods (19-21, 33). sites in the human immunodeficiency virus genome (17) or the Here we introduce a probabilistic model, which we call a iron response elements in ferritin and transferrin receptor mRNA 'covariance model' (CM), which cleanly describes both the (18). In vitro selection methods select families of small RNA secondary structure and the primary sequence consensus of an *To whom correspondence should be addressed

CU U sequences, using an iterative training procedure that is - PDF document

.. 1994 Oxford University Press Nucleic Acids Research, 1994, Vol. 22, No. 11 2079-2088 RNA sequence analysis using covariance models Sean R.Eddy* and Richard Durbin MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK Received

Massively Multiplexed Zinc Finger Protein Engineering Harvard iGEM 2011 K. Barclay, J. Chew, S.

10 Gbps (or) 1 Gbps Ethernet Tester PacketExpert 818 West Diamond Avenue - Third Floor,

HIV tropism assessment HIV tropism assessment HIV tropism assessment HIV tropism assessment

microbial amplicon reads Robert C Edgar Seminar in Computational Methods in Metagenomics and

Pathway Analysis Jenny Wu Outline Introduction to NGS data analysis in Cancer Genomics

VIRANA: A Standardized Analysis of Viral Next Generation Sequencing Data Bastian Beggel

Paper accepted to poster presentation at the IUSSP International Population Conference, Cape Town,

mtDNAprofiler A web based Program for Nomenclature and Comparison of mtDNA Sequences In Seok

The Role Of Mutation Analysis in Porphyria. Dr SharonWhatley Cardiff SAS Porphyria Service

Davis-Besse Nuclear Power Station August 15, 2002 1 Introduction FENOC Chief Operating Officer

Vision for the Cohort and the Precision Medicine Initiative Francis S. Collins, M.D., Ph.D.

BioMake BioMake Chris Mungall Mungall Chris Berkeley Drosophila Genome Project Berkeley

Successful gene expression studies using validated qPCR assays Jan Hellemans, CEO Biogazelle

BOLERO Hair volume/surface measurement fly-away/frizz analysis system Sebastien BREUGNOT &

It All Started in Ulithi, Micronesia Colonized a long time ago Hawaii Japan by polynesian

X-Line 101 June 2019 X-Line 101 X-Line Unit Overview What makes X-Line unique X-Line 101

Instrumentation best practices in Brewing Slide 1 Ola Wesstrom Instrumentation best practices in

Schouw & Co. Capital Markets Day Langelinie Pavillonen, 15 June 2017 Schouw & Co. CMD

marine refrigeration and air conditioning Our History Headquarters Shipbuilding Industry

Residential Sector AIM Training Workshop Tokyo, Japan Oct 22- 26, 2007 Residential Sector

Accelerating Condensate Development in the Heart of the Montney While Retaining Capital

Baselines for Retail Demand Response Programs Bruce Kaneshiro California Public Utilities

Baseline Budget Projections A Joint Seminar by the Congressional Budget Office and the

Goal II: Math 1 Key Performance Indicators Baseline Presentation March 22, 2018 S H A R O N L

Sambuz

Useful Links

Newsletter

Mail Us

CU U sequences, using an iterative training procedure that is - PDF document

.. 1994 Oxford University Press Nucleic Acids Research, 1994, Vol. 22, No. 11 2079-2088 RNA sequence analysis using covariance models Sean R.Eddy* and Richard Durbin MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK Received

Massively Multiplexed Zinc Finger Protein Engineering Harvard iGEM 2011 K. Barclay, J. Chew, S.

10 Gbps (or) 1 Gbps Ethernet Tester PacketExpert 818 West Diamond Avenue - Third Floor,

HIV tropism assessment HIV tropism assessment HIV tropism assessment HIV tropism assessment

microbial amplicon reads Robert C Edgar Seminar in Computational Methods in Metagenomics and

Pathway Analysis Jenny Wu Outline Introduction to NGS data analysis in Cancer Genomics

VIRANA: A Standardized Analysis of Viral Next Generation Sequencing Data Bastian Beggel

Paper accepted to poster presentation at the IUSSP International Population Conference, Cape Town,

mtDNAprofiler A web based Program for Nomenclature and Comparison of mtDNA Sequences In Seok

The Role Of Mutation Analysis in Porphyria. Dr SharonWhatley Cardiff SAS Porphyria Service

Davis-Besse Nuclear Power Station August 15, 2002 1 Introduction FENOC Chief Operating Officer

Vision for the Cohort and the Precision Medicine Initiative Francis S. Collins, M.D., Ph.D.

BioMake BioMake Chris Mungall Mungall Chris Berkeley Drosophila Genome Project Berkeley

Successful gene expression studies using validated qPCR assays Jan Hellemans, CEO Biogazelle

BOLERO Hair volume/surface measurement fly-away/frizz analysis system Sebastien BREUGNOT &amp;

It All Started in Ulithi, Micronesia Colonized a long time ago Hawaii Japan by polynesian

X-Line 101 June 2019 X-Line 101 X-Line Unit Overview What makes X-Line unique X-Line 101

Instrumentation best practices in Brewing Slide 1 Ola Wesstrom Instrumentation best practices in

Schouw &amp; Co. Capital Markets Day Langelinie Pavillonen, 15 June 2017 Schouw &amp; Co. CMD

marine refrigeration and air conditioning Our History Headquarters Shipbuilding Industry

Residential Sector AIM Training Workshop Tokyo, Japan Oct 22- 26, 2007 Residential Sector

Accelerating Condensate Development in the Heart of the Montney While Retaining Capital

Baselines for Retail Demand Response Programs Bruce Kaneshiro California Public Utilities

Baseline Budget Projections A Joint Seminar by the Congressional Budget Office and the

Goal II: Math 1 Key Performance Indicators Baseline Presentation March 22, 2018 S H A R O N L

Sambuz

Useful Links

Newsletter

Mail Us

BOLERO Hair volume/surface measurement fly-away/frizz analysis system Sebastien BREUGNOT &

Schouw & Co. Capital Markets Day Langelinie Pavillonen, 15 June 2017 Schouw & Co. CMD