Examples of online analysis tools for gene expression data Tools - - PowerPoint PPT Presentation

examples of online analysis tools for gene expression data
SMART_READER_LITE
LIVE PREVIEW

Examples of online analysis tools for gene expression data Tools - - PowerPoint PPT Presentation

Examples of online analysis tools for gene expression data Tools integrated in data repositories Tools for raw data analysis (cel files, or other scanner output) Processed data analysis tools Tools linking gene expression with gene function


slide-1
SLIDE 1

Examples of online analysis tools for gene expression data

Tools integrated in data repositories Tools for raw data analysis (cel files, or other scanner output) Processed data analysis tools Tools linking gene expression with gene function Tools linking gene expression with sequence analysis

slide-2
SLIDE 2

Tools from the data repositories

Advantages : Fast Done for a huge amount of public data Allow quick & dirty overview of “what's already known” Drawbacks Not usable for custom data Not flexible, poor tuning Examples GEO ArrayExpress SAGEmap

slide-3
SLIDE 3

GEO Tools

Raw data retrieval (soft or matrix-formatted objects) GEO DataSet Cluster Analysis : a visualization tool for displaying precomputed cluster heat maps GEO Profiles : expression profiles per each gene/spot of one selected dataset

slide-4
SLIDE 4

GEO DataSet cluster analysis : example

slide-5
SLIDE 5

GEO DataSet cluster analysis : example

slide-6
SLIDE 6

GEO DataSet cluster analysis : example

slide-7
SLIDE 7

GEO differential expression analysis : example

slide-8
SLIDE 8

ArrayExpress Tools

Processed (matrix) or Raw data retrieval Expression Profiles (per gene and per experiment)

slide-9
SLIDE 9

SAGE Anatomic Viewer (SAV)

Displays gene expression results based on SAGE tags counts in human normal and malignant tissues

slide-10
SLIDE 10

Tools for raw files transformation

Input : Affymetrix cel files Genepix or Scanalyze output files Functions : Standard microarray corrections and normalization Background correction Spot filtering Intra- and Interchip normalization Replicate scaling Data quality assessment and scoring

slide-11
SLIDE 11

Tools for raw files transformation : Express Yourself

slide-12
SLIDE 12

Processed data analysis tools

Drawbacks Can be quite slow Input data format is very important Need to know well your data before using them Advantages Usually contains lots of functionalities Usable for custom data Can be very flexible Examples CIMminer GEDA Expression Profiler GEPAS

slide-13
SLIDE 13

CIMminer

Generates color-coded Clustered Image Maps (CIMs) ("heat maps") Easy to use, but few tuning possibilities Good start for online clustering tools

slide-14
SLIDE 14

GEDA

Specifically designed for the integrated analysis of global gene expression patterns in cancer Easy to use BUT : careful with the results interpretation

slide-15
SLIDE 15

GEDA : A few Screenshots

slide-16
SLIDE 16

GEDA : A few Screenshots

slide-17
SLIDE 17

GEDA : A few Screenshots

slide-18
SLIDE 18

GEDA : A few Screenshots

slide-19
SLIDE 19

GEDA : A few Screenshots

slide-20
SLIDE 20

GEDA : A few Screenshots

slide-21
SLIDE 21

Expression Profiler at EBI

slide-22
SLIDE 22

Expression Profiler at EBI

slide-23
SLIDE 23

Expression Profiler at EBI

slide-24
SLIDE 24

GEPAS

slide-25
SLIDE 25

GEPAS

slide-26
SLIDE 26

GEPAS

slide-27
SLIDE 27

GEPAS

slide-28
SLIDE 28

GEPAS

slide-29
SLIDE 29

GEPAS

slide-30
SLIDE 30

Tools to retrieve gene functions and annotations

Goals Link Gene Ontology information to co-expressed genes Find pathways specificities under certain biological conditions Find promoter elements common in co-expressed genes Input files Expression data matrix with classes AND gene names Gene lists to compare Promoter sequences in FASTA format Examples Carrie Babelomics DAVID : Database for Annotation, Visualization and Integrated Discovery Inclusive : MotifSampler SSA

slide-31
SLIDE 31

CARRIE Computational Ascertainment of Regulatory Relationships Inferred from Expression

Input Expression data matrix with gene Ids and sample classes Associated promoter sequences Output Known transcription factors associated with co-expressed genes KEGG pathways associated with genes Gene Ontology for selected genes

slide-32
SLIDE 32

CARRIE

slide-33
SLIDE 33

CARRIE

slide-34
SLIDE 34

Babelomics : FatiGO

Linked to the GEPAS gene expression analysis tools Web-tools for functional annotation and analysis of group of genes in high- throughput experiments.

slide-35
SLIDE 35

Babelomics : FatiGO

Input : Two gene lists to compare (differentially expressed genes) Different gene IDs supported (Entrez, HUGO, RefSeq, Affy...) Uses GO (Gene Ontology) database Output : Summary with the input parameters Summary input data: Initial number of genes, number of genes have ensembl correspondence and number of genes that have been used for the analysis. Links with the results for each repository that has been selected and the number of genes for which gene ontology annotation exist. Graphical view of GO terms represented in gene lists

slide-36
SLIDE 36

Babelomics : FatiGO

slide-37
SLIDE 37

Babelomics : FatiGO

slide-38
SLIDE 38

Babelomics : FatiGO

slide-39
SLIDE 39

MotifSampler

Description Part of the INCLUSive suite which also contains gene expression data analysis Tries to find motifs in a given list of sequences Input Sequences in FASTA format An organism-specific background model (given) Motif length Number of motifs to retrieve Output A list of motifs instances for each input sequence

slide-40
SLIDE 40

Other online Tool : ArrayQuest

Applies to data from GEO or custom data Contains Bioconductor methods, BioPerl and C++ based scripts Accepts new analysis method submission