MACHINE LEARNING ON NEUROIMAGING DATA
Ilya Kuzovkin
AACIMP, August 2014
LECTURE 2: INTRODUCTION TO MACHINE LEARNING
M ACHINE L EARNING ON N EUROIMAGING D ATA L ECTURE 2: I NTRODUCTION - - PowerPoint PPT Presentation
M ACHINE L EARNING ON N EUROIMAGING D ATA L ECTURE 2: I NTRODUCTION TO M ACHINE L EARNING Ilya Kuzovkin AACIMP, August 2014 P REVIOUSLY ON SLIDES P REVIOUSLY ON SLIDES P REVIOUSLY ON SLIDES P REVIOUSLY ON SLIDES _____ data? _____
Ilya Kuzovkin
AACIMP, August 2014
LECTURE 2: INTRODUCTION TO MACHINE LEARNING
_____ data? _____ plot? _____ curve?
F? M? R? I?
BOLD? pixels? Data?
measure?
abilities to catch interesting stuff
datasets
data is good and which is not
what you asked for
about it
are free to do other things
Machine learning algorithm learns from examples
Machine learning algorithm learns from examples
Each object (instance) is described as a set set of parameters, called features
Length of the tail
…
Length of the tail
…
Amount of fur
…
Length of the tail
…
Amount of fur
…
Infer a rule to classify these cats.
Congratulations! You have invented “OneR” algorithm*
?
?
base
programming
regression
programming
handling (GMDH)
Quantization
learning (PAC
algorithms
algorithms
Create
Classifier Training set
Create
Classifier Training set
A p p l y t
Create
Classifier Accuracy is the % of correctly classified instances Training set
A p p l y t
Create
Classifier Accuracy is the % of correctly classified instances Training set
A p p l y t
How good is 50% accurate classifier?
Stupid classifier
Stupid classifier
Accuracy is ?
Stupid classifier
Accuracy is 0.5
Stupid classifier
Accuracy is 0.5 Accuracy is ?
Stupid classifier
Accuracy is 0.5 Accuracy is 0.9
Correctly identified as Everything identified as
Correctly identified as Everything identified as
Correctly identified as Everything identified as
Precision = ?
Correctly identified as Everything identified as
Precision = 0.9
Correctly identified as Everything identified as
Precision = 0.9 Precision = ?
Correctly identified as Everything identified as
Precision = 0.9 Precision = 0
Correctly identified as Everything identified as
Precision = 0.9 Precision = 0
Stupid classifier
Precision = 0.45
Identified All in the test set
Identified All in the test set
Recall = ?
Identified All in the test set
Recall = 1
Identified All in the test set
Recall = 1 Recall = ?
Identified All in the test set
Recall = 1 Recall = 0
Identified All in the test set
Recall = 1 Recall = 0
Stupid classifier
Recall = 0.5
Identified All in the test set
Recall = 1 Recall = 0 Precision = 0.9 Precision = 0
Recall = 1 Recall = 0 Precision = 0.9 Precision = 0 F1 ≈ 0.95 F1 = 0
Recall = 1 Recall = 0 Precision = 0.9 Precision = 0 F1 ≈ 0.95 F1 = 0 Average F1 ≈ 0.48
Fit model on a training set Tune parameters
50% 25% 25% Final test on a test set
50% 25% 25% But why can’t we do it all on
Fit model on a training set Tune parameters
Final test on a test set
a.k.a Too stupid
a.k.a Too stupid Balanced bias-variance tradeoff a.k.a OK
a.k.a Too stupid Balanced bias-variance tradeoff a.k.a OK
a.k.a Too smart
TRAINING VALIDATION
TRAINING VALIDATION
TRAINING VALIDATION
TRAINING VALIDATION
TRAINING VALIDATION
Electrodes Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes Time
What is your next move?
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes Time
Fourier transform of what?
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes
Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes
Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes
Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes
Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
Electrodes
Time
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
What is one instance in this case?
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
What are the features?
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
What is the dimension of feature space
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
What is the class of this instance?
http://fc09.deviantart.net/fs70/i/2011/213/b/a/open_closed_eye__by_hydrofaux-d42e82y.jpg
How you
training set?
Now I know how your brain signal looks like when you think “LEFT” and “RIGHT”
Now I know how your brain signal looks like when you think “LEFT” and “RIGHT” Try me — think
Now I know how your brain signal looks like when you think “LEFT” and “RIGHT” Try me — think
It was wasn’t it?
Now I know how your brain signal looks like when you think “LEFT” and “RIGHT” Try me — think
How would you use such technology? It was wasn’t it?
?
?
?
Technology Electrical Magnetic Optical Name EEG ECoG Intracortical MEG fMRI fNIRS Invasive Portable Cost From $100 to $30,000+ $1000 grid $2000 per array $1 mln $2-3 mln $200,000 Temporal resolution 50 ms 3 ms 3 ms 50ms 1-2 s 1 s Spatial resolution 1+ cm 1 mm 0.5 mm - 0.05 mm 5 mm 1 mm voxels 5 mm Pattern classification VEP ERD/ ERS P300 Performance 2 class 90% 3 class 80% 4 class ? Large number
2 cls 90% Large number
8 cls 90% High* ~ same as EEG based 4 cls 90% 2 cls 90%
Technology Electrical Magnetic Optical Name EEG ECoG Intracortical MEG fMRI fNIRS Invasive Portable Cost From $100 to $30,000+ $1000 grid $2000 per array $1 mln $2-3 mln $200,000 Temporal resolution 50 ms 3 ms 3 ms 50ms 1-2 s 1 s Spatial resolution 1+ cm 1 mm 0.5 mm - 0.05 mm 5 mm 1 mm voxels 5 mm Signal classification VEP P300 Performance 2-class 90% 3-class 80% 4-class 60% Large number
Large number
8-cls 90% High* ~ same as EEG based 4-cls 90% 2-cls 90%