ISLab
Intelligent Systems Lab Piet van Remortel piet.vanremortel@ua.ac.be
1
ISLab Intelligent Systems Lab Piet van Remortel - - PowerPoint PPT Presentation
ISLab Intelligent Systems Lab Piet van Remortel piet.vanremortel@ua.ac.be 1 Overview Who we are What we do Applied Machine learning Bio-informatics What we look for 2 Who ? Prof. Dr. Alain Verschoren Who we are Dr. Piet van Remortel
1
2
Who we are What we do Applied ML Bio-i What we look for
3
Who we are What we do Applied ML Bio-i What we look for
4
Who we are What we do Machine Learning Bio-i What we look for
5
What is it good for? extract knowledge from given data typically “hard” problems computationally hard enumeration of possibilities takes forever etc lack of traditional formalisms traditional statistics complete mathematical models of the problem high dimensions, discontinuities, ...
Who we are What we do Machine Learning Bio-i What we look for
6
Use biological evolution as a metaphor for search/
‘evolve’ solutions to problems is an existing biological organism a ‘solution’ to the problem of surviving in the environment ? iterate generate diversity select for some fitness measure
Who we are What we do Machine Learning Bio-i What we look for
7
Who we are What we do Machine Learning Bio-i What we look for
8
Who we are What we do Machine Learning Bio-i What we look for
PRO very robust: solve a problem given some fitness measure actually means: can cope with non-convex fitness landscapes actually means: can solve problems where good solutions are not necessarily alike each other actully means: solving the problem is harder then ‘climbing one hill’ in the landscape can be easily parallellized (CalcUA !)
9
Who we are What we do Machine Learning Bio-i What we look for
CON inherently slow probabilistic in nature true mechanics not well understood ‘it just works’ (more or less)
10
Who we are What we do Machine Learning Bio-i What we look for
What we used to do with GAs study fitness landscapes predict problem hardness etc What we do now apply ! e.g. Feature selection in predictive toxicology use a number of genes to characterise an unknown toxic compound
11
Who we are What we do Machine Learning Bio-i What we look for
Typical applications of GAs: high dimensional problem lots of interaction between the different elements of the solution the impact of one parameter depends on the value of other parameters e.g. hot-water tap in the shower possibility to assign numerical quality to a candidate solution
12
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
13
Who we are What we do Machine Learning Bio-i What we look for
13
Cooperation with EB&T (De Coen/Blust) Goal: Characterise unknown toxic compound by means of genetic expression fingerprint(s) 40-50 known chemicals as training set goal: pre-compliance screening for toxic mode of action algorithmic foundation: SVM/PLS/GP
14
Who we are What we do Machine Learning Bio-i What we look for
14
15
Who we are What we do Machine Learning Bio-i What we look for
15
16
Who we are What we do Machine Learning Bio-i What we look for
16
17
Who we are What we do Machine Learning Bio-i What we look for
17
17
Who we are What we do Machine Learning Bio-i What we look for
17
17
Who we are What we do Machine Learning Bio-i What we look for
17
17
Who we are What we do Machine Learning Bio-i What we look for
17
17
Who we are What we do Machine Learning Bio-i What we look for
17
18
Who we are What we do Machine Learning Bio-i What we look for
18
18
Who we are What we do Machine Learning Bio-i What we look for
18
18
Who we are What we do Machine Learning Bio-i What we look for
18
18
Who we are What we do Machine Learning Bio-i What we look for
18
18
Who we are What we do Machine Learning Bio-i What we look for
18
18
Who we are What we do Machine Learning Bio-i What we look for
18
19
Who we are What we do Machine Learning Bio-i What we look for
19
19
Who we are What we do Machine Learning Bio-i What we look for
19
19
Who we are What we do Machine Learning Bio-i What we look for
19
20
Who we are What we do Machine Learning Bio-i What we look for
20
21
Who we are What we do Machine Learning Bio-i What we look for
21
21
Who we are What we do Machine Learning Bio-i What we look for
21
22
Goal: Characterise unknown toxic compound by means of genetic expression fingerprint(s)
22
22
Goal: Characterise unknown toxic compound by means of genetic expression fingerprint(s)
22
Who we are What we do Machine Learning Bio-i What we look for
23
Who we are What we do Machine Learning Bio-i What we look for
23
Who we are What we do Machine Learning Bio-i What we look for
23
Who we are What we do Machine Learning Bio-i What we look for
23
Who we are What we do Machine Learning Bio-i What we look for
23
Who we are What we do Machine Learning Bio-i What we look for
24
Who we are What we do Machine Learning Bio-i What we look for
25
Who we are What we do Machine Learning Bio-i What we look for
26
Who we are What we do Machine Learning Bio-i What we look for
27
Who we are What we do Machine Learning Bio-i What we look for
28
Who we are What we do Machine Learning Bio-i What we look for
28
Who we are What we do Machine Learning Bio-i What we look for
28
Who we are What we do Machine Learning Bio-i What we look for
28
Who we are What we do Machine Learning Bio-i What we look for
28
Who we are What we do Machine Learning Bio-i What we look for
29
NETWORK TOPOLOGY INTERACTIONS SYNTHETIC GENE NETWORK SYNTHETIC EXPRESSION DATA INFERENCE Aracne SAMBA Genomica ADJACENCY MATRIX INFERRED CALCULATE PERFORMANCE METRIC
Topology type E.coli,Yeast, AB, ER, SW, DSF Interactions Interaction type ratio Activ/Inhib Noise
Input Noise Amount of Data # experiments # samples
SynTReN
ADJACENCY MATRIX ORIGINAL
Single Experiment run
COMPARE
(MODULE) NETWORK TOPOLOGY
Who we are What we do Machine Learning Bio-i What we look for
30
aracne interactiontypes: linear
Sensitivity Specificity
graphModel DSF graphModel ecoli_cluster graphModel ecoli_neighbor graphModel ER graphModel WS graphModel yeast_cluster graphModel yeast_neighbor
31
“SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms”
Tim Van den Bulcke , Koenraad Van Leemput, Bart Naudts , Piet van Remortel , Hongwu Ma , Alain Verschoren , Bart De Moor and Kathleen Marchal BMC Bioinformatics 2006, 7:43 http:/ /homes.esat.kuleuven.be/~kmarchal/SynTReN/index.html
32
Who we are What we do Machine Learning Bio-i What we look for
33
Who we are What we do Machine Learning Bio-i What we look for
34
Who we are What we do Machine Learning Bio-i What we look for
35
small frequent subgraph also known as significant subgraph, graphlets
A D B C A B C A C
4C 3C R
Who we are What we do Machine Learning Bio-i What we look for
36
Searching for motifs in a graph means mapping onto existing vertices Vertices can overlap -> connections between motifs Edges do not overlap -> every edge explained by
V0 V3 V1 V2 V6 V4 V5 V7 V8
start vertex MOTIF SEARCH SPACE UNEXPLAINED GRAPH
Who we are What we do Machine Learning Bio-i What we look for
37
Who we are What we do Machine Learning Bio-i What we look for Who we are What we do Machine Learning Bio-i What we look for
38
A D B C A B C A C
4C 3C R
Who we are What we do Machine Learning Bio-i What we look for
39
Who we are What we do Machine Learning Bio-i What we look for
40
Who we are What we do Machine Learning Bio-i What we look for
41
For each attaching vertex To other motifs in the Motif Set
Who we are What we do Machine Learning Bio-i What we look for
42
Distance
P r
a b i l i t y
Who we are What we do Machine Learning Bio-i What we look for
43
Motif with just 1 edge With different sequence distance rule and attaching vertex can create stars, hubs
(a) (b) (c)
V8 V9 V10 V0 V1 V2 V3 V4 V5 V7 V6 V10 V8 V9Who we are What we do Machine Learning Bio-i What we look for
44
Who we are What we do Machine Learning Bio-i What we look for
45
Who we are What we do Machine Learning Bio-i What we look for
46
Histogram of decomposition likelihood
!log(P) Frequency
1000 2000 3000 4000 20 40 60 80 100 Chain!like Trees Highly!branched Trees
Who we are What we do Machine Learning Bio-i What we look for
47
Who we are What we do Machine Learning Bio-i What we look for
48
Projects regarding applications of machine learning (prediction, classification, optimization , ...) domains biology / bio-informatics / systems-bio finance (stock rate time series etc) planning, engineering ... contact: piet.vanremortel@ua.ac.be
Who we are What we do Machine Learning Bio-i What we look for
49