Statistical LeaRning
Katja Nowick, Lydia Mueller
Bioinformatics group,
Markus Kreuz
IMISE
Statistical LeaRning Katja Nowick, Lydia Mueller Bioinformatics - - PowerPoint PPT Presentation
Statistical LeaRning Katja Nowick, Lydia Mueller Bioinformatics group, Markus Kreuz IMISE What is R? Programming/scripting language Comprehensive statistical environment Strength : statistical data analysis + graphical display
Bioinformatics group,
IMISE
http://www.bioconductor.org/packages/release/bioc/
query(MotifDb, "DAL80") pfm.dal80.jaspar = query(MotifDb, "DAL80")[[1]] seqLogo(pfm.dal80.jaspar)
Finding binding motifs for a transcription factor from a database and draw logo With only 3 lines of code:
files = list.files("fastq", full=TRUE) names(files) = sub(".fastq", "", basename(files)) qas = lapply(seq_along(files), function(i, files) qa(readFastq(files[i]), names(files)[i]), files) qa <- do.call(rbind, qas) save(qa, file=file.path("output", "qa.rda")) browseURL(report(qa))
With 6 lines of code: From a directory of FastQ files to a full quality report:
@SEQ_ID_1 GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTAAATCCATTTGTTCAACTCACAGTTT + !''*((((***+))%%%++)(%%%%).1***-+*''))**55CCF>>>>>>CCCCCCC65 @SEQ_ID_2 GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTAAATCCATTTGTTCAACTCACAGTTT + !''*((((***+))%%%++)(%%%%).1***-+*''))**55CCF>>>>>>CCCCCCC65 @SEQ_ID_3
http://www.r-project.org/
– http://www.math.ilstu.edu/dhkim/Rstuff/Rtutor.html – http://www.statmethods.net/index.html – http://faculty.ucr.edu/~tgirke/Documents/R_BioCond/R_Bi
Nov 3rd: Introduction to R Nov 17th: Statistics and Graphics Nov 24th: A small programming project Dec 1st : Analysis of gene expression data Dec 15rd: Clustering and Gene Ontology
Multiple exercises in between
Multiple exercises in between