discussion session
play

Discussion Session Thursday May 15, 2008 1 Find out whos working - PowerPoint PPT Presentation

Discussion Session Thursday May 15, 2008 1 Find out whos working on bioinformatics at Miami Find out who has tools and expertise at Miami that can be applied to bioinformatics research Get biologists and non-biologists to talk


  1. Discussion Session Thursday May 15, 2008 1

  2.  Find out who’s working on bioinformatics at Miami  Find out who has tools and expertise at Miami that can be applied to bioinformatics research  Get biologists and non-biologists to talk to (and maybe even understand!) each other 2

  3.  Two-hour session  Brief introduction of attendees  Biologists – state research problems that desire collaboration on  Non-biologists – give tools and expertise available for collaboration on bioinformatics/biology problems  Break up into informal discussion session, with facilitation by  Chun Liang and Quinn Li, botany  Valerie Cross and John Karro, computer science 3

  4.  Expertise in biostatistics  Analysis of dose-related tumorigenic trends in the presence of treatment-related toxicity  Analysis of pharmacokinetic data, particularly, methods for testing the equivalence of the areas under concentration-time profile curves  Risk assessment  Inverse regression/calibration problems where the dose associated with a particular level of response is estimated and tested  Optimal design of experiments for simple compartmental models  Integration of model uncertainty in the generation of risk estimates 4

  5.  Analysis and optimization of algorithms  Interested in developing efficient algorithms for finding similar sequences in genomic databases  Work with problems that have well-defined measure of similarity or difference between objects  Improve problem solutions that currently use too much memory or take too much time  Edit distance (number of operations to change one text/genomic string to another) 5

  6.  Study how insect viral genes (esp. baculovirus and ascovirus) are regulated in insect cells  Baculovirus – would like bioinformatic prediction of which AATAAA used in certain processing  Ascovirus – would like bioinformatic search for particular stem loop structure, which could then be verified in lab 6

  7. Ontology - a vocabulary that represents a set of concepts of a particular domain and the relationships between those concepts  Gene Ontology (GO) guarantees the consistency of the referenced biological concepts in different databases  Use to annotate genes in various databases  Annotations used to determine similarity between genes and gene products  Group has made various ontology software tools 7

  8. QUOTA Multi-view FCA OntoSELF 1: lagging and leading strand elongation, 8 CDC2, DBP11, POL2

  9.  Primary focus on computationally-based analysis of DNA and RNA sequences  Develop tools to help with analysis  Example: Working on identification of functional genomic regions through comparison of genomes from related species  Example: Developed tools for the estimation of neutral substitution rates on a local scale  Study structure of rates  Study effect on evolution of genomic structure 9

  10.  Software engineering  Software risk management and assessment  Probabilistic risk assessment  Software design methodology  Experimental verification of software design methodology effectiveness  Visual programming languages 10

  11.  DNA tiling microarrays  Massive data sets  Broad coverage of genome  Low signal/noise ratio  Want to extract statistically significant information to justify validation experiments in a wet lab  Seek collaboration from statisticians to develop appropriate statistics  Seek collaboration from computer scientists to effectively implement statistical and data processing algorithms 11

  12.  Looking for collaboration on genomic sequence assembly and clustering  Work with expressed sequence tags (EST) from complementary DNA (cDNA)  How trace a given set of ESTs back to their original genes?  New technologies can now very quickly sequence enormous amounts of short pieces of cDNA  Want computational tools to do correct assembly and clustering 12

  13.  Software development in C/C++/Fortran for numerical computation  Conversion of software for parallel computation  Application support for various physics and biophysics packages, e.g., ANSYS, Abaqus  Modeling and simulation of vascular systems  Geometric model generation  Flow solving  Data visualization 13

  14.  Expertise is applied probability  Served on graduate committees in zoology  Helped graduate students with data analysis  Experience in  Analysis of variance  Markov chains  Hidden Markov Models 14

  15.  Expertise in optimization and simulation of complex systems  Bioinformatics experience  Sequencing by hybridization  Clustering the avian-flu viruses (with Henry Wan)  Working with Chun Liang (Botany) and CSA colleagues to cluster Expressed Sequence Tags (ESTs) to identify genes for conifers  Would like to hear from other biologists with similar research, e.g., use of ESTs for gene identification and regulation 15

  16.  Has taught classes in introductory statistics, regression analysis, and time series analysis  Extensive experience applying statistics in business, social science, and natural science  Time series analysis to study chemical concentrations of stream flows into Acton Lake  Applications of regression techniques 16

  17.  Scientific programming, especially C++ and MATLAB  Parallel programs on cluster  Graphical user interfaces (GUI) for programs  Mathematical modeling  Digital image processing  Basic knowledge of variety of mathematical techniques 17

  18.  Works in Michael Kennedy’s lab  Seek collaboration and support for  Nuclear Magnetic Resonance (NMR) data  Use of principal component analysis (PCA)  Use of partial least squares discriminant analysis (PLS-DA) 18

  19.  Installation and configuration of bioinformatics applications on the cluster  IT infrastructure planning and support - servers, network, storage, etc.  Scripting (writing programs for cluster) and help with cluster batch system  Database creation and advice on use  General support of cluster users 19

  20.  Principal expertise  Mathematical optimization (theory, algorithms, software)  Modeling of decision problems  Research interests  Reformulating mathematical problems for efficiency  Applications of optimization to data-fitting  Parallel processing in optimization  Optimal design of experiments  Areas of application (to date)  Crystallography, statistics, hydrology, econometrics, toxicology, engineering, ecology 20

  21.  Knowledge of statistics useful in  Microarray studies (separating signal from noise, cluster analysis, missing data), image analysis  Clinical studies, forestry and wild life, public health  Specific statistical tools  Bayesian hierarchical modeling and Markov chain Monte Carlo (MCMC) algorithms  Spatial analysis (areal data and point-referenced data), including prediction and model checking 21

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend