Functional Brain Imaging, Multi-Objective Optimisation and - - PowerPoint PPT Presentation

functional brain imaging multi objective optimisation and
SMART_READER_LITE
LIVE PREVIEW

Functional Brain Imaging, Multi-Objective Optimisation and - - PowerPoint PPT Presentation

Functional Brain Imaging, Multi-Objective Optimisation and Spatio-Temporal Data Mining Michle Sebag TAO, Universit Paris-Sud With Nicolas Tarrisson, Olivier Teytaud, Vojtech Krmicek, UPS Julien Lefevre, Sylvain Baillet, La


slide-1
SLIDE 1

Functional Brain Imaging, Multi-Objective Optimisation and Spatio-Temporal Data Mining

Michèle Sebag TAO, Université Paris-Sud

With Nicolas Tarrisson, Olivier Teytaud, Vojtech Krmicek, UPS Julien Lefevre, Sylvain Baillet, La Pitié-Salpétrière

slide-2
SLIDE 2

Motivations

Functional Brain Imagery

✎ Patients, Experiments, Measures ✎ Magneto-Encephalography

1,000 measures per sensor per second.

slide-3
SLIDE 3

The data

Spatio-temporal structure

✎ Sensors ✐ ❂ ✶✿✿◆ ✎ ✐ ✦ ✚ ▼✐ ❂ ✭①✐❀ ②✐❀ ③✐✮ ✷ ■ ❘✸ ❢❈✐❬t❪❀ t ❂ ✶✿✿❚❣ ✷ ■ ❘❚

slide-4
SLIDE 4

Overview

✎ Spatio-temporal Data Mining ✎ Multi-Objective Optimisation ✎ .. + Multi-modal Optimisation ✎ 4dMiner & Experimental Validation ✎ Discriminant Spatio-Temporal Patterns

slide-5
SLIDE 5

Goal

Find spatio-temporal patterns

✎ Spatial region ❆ ✚ ■ ❘✸ ✎ Temporal interval ■ ✚ ❢✶✿✿❚❣

defining

❱✭❆❀ ■✮ ❂ ❢❈❦❬t❪❀ ❦ ✷ ❆❀ t ✷ ■❣

SUCH THAT the variance of signals within ❱✭❆❀ ■✮ is low and ❆ ✂ ■ is a large spatio-temporal region “active areas of the brain”

slide-6
SLIDE 6

Position of the problem

In practice: done manually

✎ tedious ✎ non reproducible

Standard approach

✶❂ Extract a global spatio-temporal model

Independent Component Analysis

Hyvarinen et al., 2001

EM-based clustering of curves

Chudova et al., 2003

Markov Random Field

McCallum, 2004

  • r,

Inductive Database

Mannila et al., 1997

✷❂ Find specific spatio-temporal patterns from the global model

99.9 of the global model learned is useless

slide-7
SLIDE 7

Data Mining

Goal

✎ From massive amounts of data

and knowledge

✎ find novel, useful and valid knowledge

Vision Ideally pervasive knowledge Actually specialized expertise The need [human] knowledge management does not scale up The opportunity huge amounts of accessible data

slide-8
SLIDE 8

Discussion

Goals

✎ subjective

(novel & useful knowledge)

✎ multi-objective

(valid = precise or general ?) Requirements

✎ Scalability ✎ Flexibility ✦ tunable ✦ calibrated ✦ computational cost must be controllable

any-time algorithm Zilberstein 98

slide-9
SLIDE 9

MEG Mining: Multi-objective optimisation

Search space: Stable Spatio-Temporal Patterns

❳ ❂ ✽ ❃ ❁ ❃ ✿ ■ ❂ ❬t✶❀ t✷❪

temporal interval

center of the spatial region

r

radius of the spatial region

❞✇ ❂ ✭❛❀ ❜❀ ❝✮

distance weights ellipsoidal regions Objectives

✎ Temporal length ❵✭❳✮ ❂ t✷ t✶ ✎ Spatial area ❛✭❳✮ ❂ ❥❱✭❳✮❥ ❂ ❥❢❥ ❂ ❞✇✭✐❀ ❥✮ ❁ r❣❥ ✎ Spatio-temporal alignment ✛✭❳✮ ❂ ✶ ❵✭❳✮ ✂ ❛✭❳✮ ❳

❥✷❱✭❳✮

✛■✭✐❀ ❥✮

slide-10
SLIDE 10

✛■✭✐❀ ❥✮: I-alignment of sensors ✐ and ❥

✛■✭✐❀ ❥✮ ❂ ❁ ✐❀ ❥ ❃■ ✂ ✏ ✶

❥ ✖ ❈■

✐ ✖

❈■

❥ ❥

❥ ✖ ❈■

✐ ❥

with

❁ ✐❀ ❥ ❃■ ❂

Pt✷

t❂t✶ ❈✐✭t✮✿❈❥✭t✮

qPt✷

t❂t✶ ❈✐✭t✮✷ ✂ Pt✷ t❂t✶ ❈❥✭t✮✷

✖ ❈■

❥ ❂

Average ❢❈❥❬t❪❀ t ✷ ■❣

slide-11
SLIDE 11

Multi-objective Optimisation

Find ❆r❣▼❛①❢❋✐❀ ✐ ❂ ✶❀ ✷✿✿✿❀ ❋✐ ✿ ✡ ✦ ■

❘❣

Pareto domination

✎ ① ❁ ② iff ✚ ✽✐❀ ❋✐✭①✮ ✔ ❋✐✭②✮ ✾✐✵ ❋✐✵✭①✮ ❁ ❋✐✵✭②✮

Pareto Front

✎ Set of non dominated solutions.

  • bjects with highest quality and smallest cost...

Quality Pareto front Cost

slide-12
SLIDE 12

An Any-time Approach

Spatio-temporal data mining

✎ Monotonous criteria (variance increases with ■ and ❆) ✎ Antagonistic criteria (decrease ■ or ❆ to keep the variance low)

Complete vs Stochastic Search

✎ For experts to look at ✎ Agregate the results of several runs

Multi-Objective Evolutionary Computation

Kalyan Deb 2001

slide-13
SLIDE 13

Evolutionary Computation, the skeletton

❋ ✿ ✡ ✼✦ ■ ❘

✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✁ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✂ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ✄ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ☎ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✆ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✝ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✞ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✟ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✠ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ✡ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☛ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ☞ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✌ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✍ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✎ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✏ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✑ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✒ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕ ✕

Evaluation Initialisation

Replacement

Best individual Stop ? Selection

Crossover, Mutation, ...

Offspring Evaluation Generation Parents Genitors

slide-14
SLIDE 14

Multi-objective evolutionary optimisation

Standard EC Find ❆r❣❖♣t✭❋✮

✎ Initialisation ✎ Selection ✎ Variation (crossover, mutation)

Multi-objective EC Find ❆r❣❖♣t❢❵✭❳✮❀ ❛✭❳✮❀ ✛✭❳✮❣ Differences Goal: sample the Pareto front Archive Selection after ❋ ✵✭❳✮, measuring: The Pareto rank of ❳ in the current population The percentage of the archive dominated by ❳ ...

slide-15
SLIDE 15

4d Miner: Multi-Objective Evolutionary Algorithm

Components

✎ Search space ✡ ✒ ■ ◆✸ ✂ ■ ❘✹ ❢❳ ❂ ✭■❀ ✐❀ r❀ ✇✮❀ ■ ✚ ❬✶❀ ❚❪❀ ✐ ✷ ❬✶❀ ◆❪❀ r ✷ ■ ❘❀ ✇ ✷ ■ ❘✸❣ ✎ Objectives ❛❀ ❵❀ ✛ ✎ Operators

– Initialisation sampling mechanism average interval length minimal acceptable alignement minimal pattern size – Variation operators

slide-16
SLIDE 16

Sampling mechanism ❳ ❂ ✭✐❀ ✇❀ ■❀ r✮

Daida 1999

✎ ✐ : uniformly drawn in ❬✶❀ ◆❪; ✎ ✇ ❂ ✭✶❀ ✶❀ ✶✮

initial = Euclidean

✎ ■ ❂

– t✶ : uniformly drawn in ❬✶❀ ❚❪ – ❵✭■✮ drawn ✘ ◆✭♠✐♥❵❀ ♠✐♥❵❂✶✵✮

♠✐♥❵ user supplied

– reject if t✶ ✰ ❵✭■✮ ❃ ❚

✎ r : such that the ball contains all neighbors with bounded ■-

alignment:

♠✐♥✛ user-supplied r ❂ ♠✐♥❦❢❞✇✭✐❀ ❦✮ s✿t✿ ✛■

✐❀❦ ❃ ♠✐♥✛✮❣

✎ reject if ❛✭❳✮ ❂ ❥❇✭✐❀ r✮❥ ❁ ♠✐♥❛ ♠✐♥❛ user supplied

Complexity: ❖✭◆ ❧♦❣ ◆ ✂ ♠✐♥❵✮

slide-17
SLIDE 17

First results: Failure!

Diversity of stable spatio temporal patterns

✎ seems OK...

Spatio−temporal width + + + + + + Pareto front Variance

...but all patterns represent the same spatio-temporal region... Failure analysis

✎ Experts are not interested in the Pareto front only. ✎... but in ALL active areas of the brain...

slide-18
SLIDE 18

Overview

✎ Spatio-temporal Data Mining ✎ Multi-Objective Optimisation ✎ .. + Multi-modal Optimisation ✎ Experimental Validation ✎ Discussion

slide-19
SLIDE 19

Multi-modal optimisation

Goal FIND ALL global (and local) optima. Multi-modal heuristics in EC

since Mahfoud, 95

Niching or Fitness sharing Selection based on ❋ ✵✭❳✮ ❂

❋✭❳✮ ✝❳✵s✐♠✭❳❀❳✵✮

1 Sim(X,X’) d(X,X’)=d

1 2 3 4 1.5 2.5 3.5 10 5 15 b dphimax 1 2 3 4 1.5 2.5 3.5 10 5 15

slide-20
SLIDE 20

Multi-objective multi-modal optimisation

Sharing-Pareto dominance

❳ ❂ ✭■❀ ✐❀ r❀ ✇✮ sp-dominates ❨ ❂ ✭■✵❀ ✐✵❀ r✵❀ ✇✵✮ iff ✎ ❳ Pareto dominates ❨

wrt ❛❀ ❵❀ ✛

✎ ❳ and ❨ overlap ❱✭❳✮ ❭ ❱✭❨ ✮ ✻❂ ❀ ■ ❭ ■✵ ✻❂ ❀

slide-21
SLIDE 21

Experimental validation

Goals of experiment

✎ Usability ✎ Scalability ✎ Performance / Recall

Datasets

✎ La Pitié-Salpétrière ✎ Artificial datasets

slide-22
SLIDE 22

Artificial Datasets

◆: number of sensors: ❢✺✵✵ ✿✿ ✹✵✵✵❣ ❚: number of time steps: ❢✶✵✵✵ ✿✿ ✽✵✵✵❣

1000 2000 500 1500 1 0.5 1.5

slide-23
SLIDE 23

Artificial Datasets, 2

Draw 10 spatio-temporal patterns Bias signals accordingly

Time Activity 1000 2000 500 1500 1 0.5 1.5 Artificial STP

slide-24
SLIDE 24

Performances

Recall : percentage of target patterns with representants in the archive.

◆ ❚

1,000 2,000 4,000 8,000 500 98 ✝ 5 93 ✝ 9 92 ✝ 7 79 ✝ 16 1000 96 ✝ 6 96 ✝ 6 82 ✝ 14 67 ✝ 12 2000 96 ✝ 5 87 ✝ 12 72 ✝ 14 49 ✝ 15 4000 89 ✝ 10 81 ✝ 13 56 ✝ 14 32 ✝ 16

Online Performance

Generations Recall 100 200 300 400 1 0.5

N = 500 N = 1000 N = 2000 N = 4000

4D-Miner, T = 4000

slide-25
SLIDE 25

Results, cont’d

N runtime (sec.) 1000 2000 3000 4000 100 200

T = 1000 T = 2000 T = 4000 T = 8000

Computational Cost

Limitations Very sensitive to the initialisation parameters

slide-26
SLIDE 26

Functional Brain Imagery

Time Activity 100 300 500 700

  • 100

100 Stable spatio-temporal pattern Time Activity 100 300 500 700 100

  • 50

50 Stable spatio-temporal pattern

slide-27
SLIDE 27

Discriminant Spatio-Temporal Patterns

Experimental setting

✎ A single person ✎ Setting 1 : sees a ball and let it go ✎ Setting 2 : sees a ball and catches it

Goal

✎ Find STPs with different activities in Setting 1 and Setting 2

(should be related to motor skills)

slide-28
SLIDE 28

Discriminant Spatio-Temporal Patterns ?

Time Activity 1000 2000

  • 100

100

  • 50

50 MEG Time Activity 410 430 450 470 490

  • 20
  • 10

10 20 MEG

slide-29
SLIDE 29

Discriminant Spatio-Temporal Patterns ?

Time Activity 1000 2000 100

  • 50

50 MEG Time Activity 2200 2150

  • 30
  • 20
  • 10

10 20 30 MEG

slide-30
SLIDE 30

Discussion

Discriminative learning

✎ Given ❍, hypothesis space ✎ Find ❤, discriminating positive and negative examples.

Generative learning

✎ build ❉✰, ❉ distribution of positive / negative examples ✎ Example ❳ is positive iff P❉✰✭❳✮ ❃ P❉✭❳✮

slide-31
SLIDE 31

Discussion, 2

Remark

✎ Generative learning more demanding ✎ But often more efficient

Why ?

✎ Easier to incorporate prior knowledge...

Discriminant STPs : a generative approach

✎ Find relevant hypotheses (STPs) ✎ Sort the discriminant ones

slide-32
SLIDE 32

Contributions: Spatio-temporal data mining

Based on multi-objective optimization as opposed to, constraints

Mannila Toivonen 97

An any-time algorithm controllable cost ✦ effective flexibility

slide-33
SLIDE 33

Perspectives

Convergence Type I and Type II errors Pruning a posteriori, increases the precision (a priori, kills the recall...) Functional brain imagery and variability among patients; among trials Activation scenarios The “grammar” of cell assemblies activity Learn the user’s criteria Interactive optimization

slide-34
SLIDE 34

Discriminant Spatio-Temporal Patterns

Time Activity 1100 1200 1300

  • 50

MEG Time Activity 1000 2000 100

  • 50

50 MEG

slide-35
SLIDE 35

Sampling mechanism ❳ ❂ ✭✐❀ ✇❀ ■❀ r✮

✎ ✐ : uniformly drawn in ❬✶❀ ◆❪; ✎ ✇ ❂ ✭✶❀ ✶❀ ✶✮

initial = Euclidean

✎ ■ ❂

– t✶ : uniformly drawn in ❬✶❀ ❚❪ – ❵✭■✮ drawn ✘ ◆✭♠✐♥❵❀ ♠✐♥❵❂✶✵✮

♠✐♥❵ user supplied

– reject if t✶ ✰ ❵✭■✮ ❃ ❚

✎ r : such that the ball contains all neighbors with bounded ■-

alignment:

r ❂ ♠✐♥❦❢❞✇✭✐❀ ❦✮ s✿t✿ ✛■

✐❀❦ ❃ ♠✐♥✛✮❣

✎ reject if ❛✭❳✮ ❂ ❥❇✭✐❀ r✮❥ ❁ ♠✐♥❛ ♠✐♥❛ user supplied

Complexity: ❖✭◆ ❧♦❣ ◆ ✂ ♠✐♥❵✮

slide-36
SLIDE 36

Variation operators

Mutation ❳ ❂ ✭✐❀ ✇❀ ■❀ r✮

✎ Self adaptive mutation of ✇❀ r ✎ Specific mutation operators for ✐ and ■. ✎ Random (initialisation operator)

Crossover ❳ ❂ ✭✐❀ ✇❀ ■❀ r✮ ✂ ❨ ❂ ✭✐✵❀ ✇✵❀ ■✵❀ r✵✮

✎ Restricted mating:

if the spatio-temporal areas are “close enough” user-supplied

slide-37
SLIDE 37

Selection : Pareto Archive

Steady state

✎ In each step, select an individual (tournament wrt Archive) ✎ Apply crossover or mutation ✎ Evaluate ✎ If non dominated in the population, store : ✎ Replace an individual (anti-selection)

slide-38
SLIDE 38

Artificial datasets

Curves

◆ ❂ ✺✵✵❀ ✿✿✹ ✵✵✵

nb sensors

❚ ❂ ✶ ✵✵✵❀ ✿✿✽ ✵✵✵

nb time steps For ✐ ❂ ✶✿✿◆ For t ❂ ✶✿✿❚

❈✐✭t✮ ❂ ❈✐✭t ✶✮ ✰ ✎ ✄ ✝✶

10 Target patterns

P = (✐ in ✶✿✿◆; ■ ✚ ❬✶❀ ❚❪; ✇ ✷ ■ ❘✸; r ✷ ■ ❘). ❈P = average of ❢❈❥✭t✮❀ t ✷ ■❀ ❞✇✭✐❀ ❥✮ ❁ r❣

Action

❈❥✭t✮ ❂ ✭✶ ☛✮❈❥✭t✮ ✰ ☛❈P ✂ ❡①♣✭❞✭t❀ ■✮ ❞✭✐❀ ❥✮✮