Bioinformatics Fox Chase Cancer Center
Bayesian Decomposition Michael Ochs Fox Chase Cancer Center - - PowerPoint PPT Presentation
Bayesian Decomposition Michael Ochs Fox Chase Cancer Center - - PowerPoint PPT Presentation
Bayesian Decomposition Michael Ochs Fox Chase Cancer Center Bioinformatics Fox Chase Cancer Center Making Proteins Bioinformatics Fox Chase Cancer Center A Closer Look at Translation Post-Trans- lational Modification RNA Splicing miRNA
Bioinformatics Fox Chase Cancer Center
Making Proteins
Bioinformatics Fox Chase Cancer Center
A Closer Look at Translation
RNA Splicing miRNA Post-Trans- lational Modification
Bioinformatics Fox Chase Cancer Center
Identifying Pathways
www.promega.com
A B C D
A B C D 1 2 3
Bioinformatics Fox Chase Cancer Center
Goal of Analysis
Take measurements of thousands of genes, some of which are responding to stimuli
- f interest
* * 1 2 3 * * * * then identify the pathways And find the correct set of basis vectors that link to pathways
Bioinformatics Fox Chase Cancer Center
BD: Matrix Decomposition
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
Data X
gene 1 gene N * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * gene 1 gene N pattern 1 pattern k condition 1 condition M * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * pattern 1 pattern k condition 1 condition M
Distribution of Patterns Patterns of Behavior =
The behavior of
- ne gene can be
explained as a mixture of patterns with different behaviors
Bioinformatics Fox Chase Cancer Center
Patterns as Basis Vectors
Bioinformatics Fox Chase Cancer Center
BD with Knowledge of Classes
* * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * *
Data X
gene 1 gene N * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * gene 1 gene N pattern 1 pattern k condition 1 condition M * * * * 0 0 0 0 0 0 0 0 0 0 * * * * 0 0 0 0 0 0 0 0 0 0 * * * * * * * * * * * * pattern 1 pattern k condition 1 condition M
Distribution of Patterns Patterns of Behavior =
Bioinformatics Fox Chase Cancer Center
BD Structure
Atomic Domains Allow Encoding
- f Biological Information
Markov Chain Monte Carlo is used to explore possible sets of distributions and patterns
Bioinformatics Fox Chase Cancer Center
Project Normal Data
- Download Data from CAMDA Site
- Adjust for Background Measurement
- Take Ratios
- Calc Mean and SDOM for Each Ratio
- Eliminate M3T and M4T Data
- Eliminate 24 Points with Only 1 Data Pt
– 99% 4 Pts, 1% 3 Pts, 0.1% 2 Pts
Bioinformatics Fox Chase Cancer Center
Filtering of Genes
- Eliminated all ESTs
– Annotated Remaining Genes from Gene Ontology on Unigene Name
- Annotated all Genes on Clone ID
– 24% Changed Unigene Cluster – 948 Clones Had GO Process Information
Bioinformatics Fox Chase Cancer Center
Updating Annotations: ASAP
http://bioinformatics.fccc.edu/
Bioinformatics Fox Chase Cancer Center
Bayesian Decomposition
- Encoded 3 Known Patterns
– Kidney, 6 Conditions – Liver, 6 Conditions – Testis, 4 Conditions
- Allowed 1 - 3 Additional Patterns
– Account for Behavior Unrelated to Tissue Specific Expression
Bioinformatics Fox Chase Cancer Center
Fitting the Data
Bioinformatics Fox Chase Cancer Center
Four Patterns
0.05 0.1 0.15 0.2 0.25 0.3 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Kidney Liver Testis Background
Bioinformatics Fox Chase Cancer Center
Five Patterns
0.05 0.1 0.15 0.2 0.25 0.3 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
Kidney Liver Testis Background 1 Background 2
Bioinformatics Fox Chase Cancer Center
Four vs Five Patterns
Bioinformatics Fox Chase Cancer Center
Gene Ontology
- Identify Genes “Only” in One Pattern
– See if Pattern Enhanced in GO
- Identify Genes in a Pattern
– 3σ above Zero in Distribution – Look at GO Assignments
- Identify Genes Lacking in Pattern
– Eliminate Background (Genes > 70%) – Look for Genes Not in Pattern (3σ)
Bioinformatics Fox Chase Cancer Center
Genes Only in Kidney by GO
endocytosis * neuropeptide signaling pathway anion transport * inorganic anion transport * vitamin B12 transport vitamin/cofactor transport * transmembrane receptor protein tyrosine kinase signaling pathway * enzyme linked receptor protein signaling pathway * receptor mediated endocytosis chloride transport * neurotransmitter transport *
Sodium transport, vesicle- mediated transport, amino acid transport, folate transport, homophilic cell adhesion, cell-cell adhesion, monovalent inorganic cation transport metal ion transport
From Old Annotations
> 10x Enhancement
Bioinformatics Fox Chase Cancer Center
Genes Only in Liver by GO
small molecule transport, histogenesis and
- rganogenesis,
embryogenesis and morphogenesis, lipid metabolism
protein targeting response to pest/pathogen/parasite response to chemical substance response to abiotic stimulus homophilic cell adhesion * cell-cell adhesion * drug susceptibility/resistance * response to drug cellular defense response antigen processing, endogenous antigen via MHC class I" antigen processing
From Old Annotations
> 10x Enhancement
Bioinformatics Fox Chase Cancer Center
Genes Only in Testis by GO
nuclear organization and biogenesis, chromosome
- rganization and
biogenesis, cell
- rganization and
biosynthesis
development * microtubule-based process microtubule-based movement regulation of transcription from Pol II promoter spermatogenesis * gametogenesis * reproduction * meiotic recombination DNA recombination
From Old Annotations
> 10x Enhancement
Bioinformatics Fox Chase Cancer Center
Kidney Genes, 3σ , > 2 fold
G-protein coupled receptor protein signaling pathway endocytosis S phase of mitotic cell cycle homophilic cell adhesion cell-cell adhesion perception of light perception of abiotic stimulus nitrogen metabolism anion transport amine metabolism mitotic cell cycle inflammatory response amino acid metabolism
Bioinformatics Fox Chase Cancer Center
Testis Genes, 3σ, >4 fold
S phase of mitotic cell cycle protein folding microtubule-based movement mitotic cell cycle regulation of cell shape and cell size spermatogenesis gametogenesis reproduction
Bioinformatics Fox Chase Cancer Center
Liver Genes, 3σ, >3 fold
tricarboxylic acid cycle
- rganogenesis
morphogenesis sensory perception response to chemical substance response to abiotic stimulus catabolic carbohydrate metabolism main pathways of carbohydrate metabolism energy derivation by oxidation of organic compounds energy pathways drug susceptibility/resistance response to drug amino acid metabolism
Bioinformatics Fox Chase Cancer Center
Genes Absent in Patterns
microtubule-based process spermatogenesis gametogenesis reproduction
- bsolete
biological_process unknown regulation of cell shape and cell size regulation of transcription from Pol II promoter monosaccharide metabolism
Absent in Kidney
microtubule-based movement actin cytoskeleton
- rganization and biogenesis
actin filament-based process cell differentiation spermatogenesis gametogenesis reproduction
Absent in Liver
Bioinformatics Fox Chase Cancer Center
Genes Absent in Background 1
mitotic cell cycle spermatogenesis gametogenesis reproduction regulation of transcription from Pol II promoter endocytosis actin cytoskeleton organization and biogenesis actin filament-based process protein targeting protein modification
- bsolete
biological_process unknown
Bioinformatics Fox Chase Cancer Center
Genes Present in Two Tissues
perception of light perception of abiotic stimulus amine metabolism amino acid metabolism immune response defense response homophilic cell adhesion cell-cell adhesion
Kidney/Testis not Liver
mitotic cell cycle
Kidney/Liver not Testis
Bioinformatics Fox Chase Cancer Center
Acknowledgements
- This Work
– Tom Moloshok – DJ Datta (Cambridge) – Andrew Kossenkov – Bill Speier (JHU)
- Colleagues
– J. Robert Beck – Frank Manion
- Programming
– Jeffrey Grant – Elizabeth Goralczyk – Luke Somers
- Others