Including prior knowledge in machine learning for genomic data
Jean-Philippe Vert
Mines ParisTech / Curie Institute / Inserm
StatLearn workshop, Grenoble, March 17, 2011
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 1 / 68
Including prior knowledge in machine learning for genomic data - - PowerPoint PPT Presentation
Including prior knowledge in machine learning for genomic data Jean-Philippe Vert Mines ParisTech / Curie Institute / Inserm StatLearn workshop, Grenoble, March 17, 2011 J.P Vert (ParisTech) Prior knowlege in ML StatLearn 1 / 68 Outline
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 1 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 2 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 3 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 4 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 5 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 6 / 68
200 400 600 800 1000 1200 1400 1600 1800 2000 −1 −0.5 0.5 1 200 400 600 800 1000 1200 1400 1600 1800 2000 −1 −0.5 0.5 1 200 400 600 800 1000 1200 1400 1600 1800 2000 −1 −0.5 0.5 1 200 400 600 800 1000 1200 1400 1600 1800 2000 −1 −0.5 0.5 1
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 7 / 68
500 1000 1500 2000 2500 −0.5 0.5 500 1000 1500 2000 2500 −1 1 2 500 1000 1500 2000 2500 −2 −1 1 500 1000 1500 2000 2500 −2 2 4 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 1
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 8 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 9 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 10 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 11 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 12 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 13 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 14 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 15 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 15 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 16 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 16 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 16 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 16 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 17 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 18 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 19 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 20 / 68
1 2 3 4 5 6 7 8 9 10 x 10
5
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 signal length seconds Speed for K=1, 10, 1e2, 1e3, 1e4, 1e5
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 21 / 68
100 200 300 400 500 600 700 800 900 1000 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 22 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 23 / 68
100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 24 / 68
100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 24 / 68
100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5 100 200 300 400 500 600 700 800 900 1000 −0.5 0.5 1 1.5
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 25 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 26 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 27 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 28 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 29 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 30 / 68
100 200 300 400 500 600 700 800 900 1000 −2 2 4 100 200 300 400 500 600 700 800 900 1000 −2 2 4 100 200 300 400 500 600 700 800 900 1000 −2 2 4
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 31 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 32 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 33 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 34 / 68
100 200 300 400 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 p Accuracy 100 200 300 400 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 p Accuracy 100 200 300 400 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 p Accuracy ULARS WLARS ULasso WLasso ULARS WLARS ULasso WLasso ULARS WLARS ULasso WLasso
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 35 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 36 / 68
500 1000 1500 2000 2500 −0.5 0.5 500 1000 1500 2000 2500 −1 1 2 500 1000 1500 2000 2500 −2 −1 1 500 1000 1500 2000 2500 −2 2 4 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 1
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 37 / 68
500 1000 1500 2000 2500 −0.5 0.5 500 1000 1500 2000 2500 −1 1 2 500 1000 1500 2000 2500 −2 −1 1 500 1000 1500 2000 2500 −2 2 4 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −1 −0.5 0.5 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −4 −2 2 500 1000 1500 2000 2500 −1 1
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 38 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 39 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 40 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 40 / 68
500 1000 1500 2000 2500 3000 3500 4000 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 BAC Weight
500 1000 1500 2000 2500 3000 3500 4000 −1 −0.8 −0.6 −0.4 −0.2 0.2 0.4 0.6 0.8 1 BAC Weight
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 41 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 42 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 43 / 68
N
biosynthesis
Protein kinases
DNA and RNA polymerase subunits Glycolysis / Gluconeogenesis Sulfur metabolism Porphyrin and chlorophyll metabolism Riboflavin metabolism Folate biosynthesis
Biosynthesis of steroids, ergosterol metabolism Lysine biosynthesis Phenylalanine, tyrosine and tryptophan biosynthesis Purine metabolism Oxidative phosphorylation, TCA cycle
Nitrogen, asparagine metabolism
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 44 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 45 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 46 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 47 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 47 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 48 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 49 / 68
N
biosynthesis
Protein kinases
DNA and RNA polymerase subunits Glycolysis / Gluconeogenesis Sulfur metabolism Porphyrin and chlorophyll metabolism Riboflavin metabolism Folate biosynthesis
Biosynthesis of steroids, ergosterol metabolism Lysine biosynthesis Phenylalanine, tyrosine and tryptophan biosynthesis Purine metabolism Oxidative phosphorylation, TCA cycle
Nitrogen, asparagine metabolism
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 50 / 68
a) b)
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 51 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 52 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 53 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 54 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 55 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 56 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 57 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 58 / 68
group (·) (middle) and ΩG
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 59 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 60 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 60 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 61 / 68
i +α2 j ≤1
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 62 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 63 / 68
OVERLAP (.)
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 64 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 65 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 66 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 67 / 68
J.P Vert (ParisTech) Prior knowlege in ML StatLearn 68 / 68