BUT WERE FORCED TO FIND OUT Ivan tajduhar istajduh@riteh.hr SSIP - PowerPoint PPT Presentation

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT Ivan Štajduhar istajduh@riteh.hr SSIP 2019 27 TH SUMMER SCHOOL ON IMAGE PROCESSING, TIMISOARA, ROMANIA July 10 th 2019

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT IN INTRODUCTION AND MOTIVATION

Challenges

Solution • Model-based techniques – Manually tailored – Variation and complexity in clinical data – Limited by current insights into clinical conditions, diagnostic modelling and therapy – Hard to establish analytical solutions

Hržić , Franko, et al. "Local-Entropy Based Approach for X-Ray Image Segmentation and Fracture Detection." Entropy 21.4 (2019): 338. (un uniri-tehn hnic-18 18-15) 15)

Machine learning • Model-based techniques – Manually tailored – Variation and complexity in clinical data – Limited by current insights into clinical conditions, diagnostic modelling and therapy – Hard to establish analytical solutions • An alternative: learning from data – Minimising an objective function

CT MRI PACS US PET

Summary • Introduction and motivation • Representation, optimisation & stuff • Evaluation metrics & experimental setup • Improving model performance

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT REPRESENTATION, , OPTIMISATION & STUFF

Machine learning • Machine learning techniques mainly deal with representation, performance assessment and optimisation: – The learning process is always preceded by the choice of a formal representation of the model. A set of possible models is called the space of the hypothesis. – Learning algorithm uses a cost function to determine (evalu luate) how successful a model is – Op Optimisation is the process of choosing the most successful models

Hypothesis • Learning type: supervised vs unsupervised unlabelled labelled data data • Hypothesis type: regression vs classification categorical continuous outcome outcome

Hypothesis Data Learning algorithm Observation Outcome h (known variables, (prediction) easily obtainable)

feature predictor extraction

Hypothesis and parameter estimation

Regularisation • A way of reducing overfitting by ignoring non-informative features • What is overfitting? • Many types of regularisation – Quadratic regulariser

Multilayer perceptron (MLP) • An extension of the logistic-regression idea

Multilayer perceptron (MLP) • Parameters estimated through backpropagation algorithm evidence error CS231n: Convolutional Neural Networks for Visual Recognition, Stanford University http://cs231n.stanford.edu/2016/

Multilayer perceptron (MLP) Convolutional layer Normalisation layer Activation function Fully-connected layer Normalisation layer Activation function CS231n: Convolutional Neural Networks for Visual Recognition, Stanford University http://cs231n.stanford.edu/2016/

Support vector machine (SVM) • Maximum margin classifier

Support vector machine (SVM) • Often used with kernels for dealing with linearly non- separable problems • Quadratic programming solver optimisation

Support vector machine (SVM)

Kernels • A possible way of dealing with linearly non-separable problems • A measure of similarity between data points • Kernel trick can implicitly escalate dimensionality

Tree models • An alternative form of mapping • Partition the input space into cuboid regions – Can be easily interpretable Temperature hot cold moderate yes yes Sky rainy cloudy sunny yes no no

Tree models • An alternative form of mapping • Partition the input space into cuboid regions – Can be easily interpretable

CART • CART hypothesis • Local optimisation (greedy) – recursive partitioning • Cost function

| CART |

Short recap • Representation, optimisation & stuff |

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT EVALUATION METRICS & EXPERIMENTAL SETUP

Evaluation metrics • The choice of an adequate model-evaluation metric depends on the modelling goal • Common metrics for common problems: – Mean squared error (for regression) – Classification accuracy

outcome = class = label Evaluation metrics Confusion matrix (normalised) • Confusion matrix – binary case – multiple classes (K>2) Observed outcome Predicted Predicted outcome outcome Observed True False outcome positive negative (TP) (FN) Observed False True outcome positive negative (FP) (TN) 36 Predicted outcome

Evaluation metrics • Common metrics for class-imbalanced problems, or when misclassifications are not equally bad: – Sensitivity (recall, true positive rate) Predicted Predicted – Specificity outcome outcome Observed True False – F1 score outcome positive negative (TP) (FN) Observed False True outcome positive negative (FP) (TN) 37

Evaluation metrics • For class-imbalanced problems, or when misclassifications are not equally bad (probabilistic classification): – Receiver operating characteristic (ROC) curve Predicted Predicted outcome outcome Observed True False outcome positive negative (TP) (FN) 1-SPEC Observed False True outcome positive negative (FP) (TN) Fawcett, Tom. "An introduction to ROC analysis." Pattern recognition letters 27.8 (2006): 861-874.

Evaluation metrics • For highly-skewed class distributions (probabilistic classification): – Precision-recall (PR) curve – Area under the curve (AUC) AUROC Davis, Jesse, and Mark Goadrich. "The relationship between Precision-Recall and ROC curves." Proceedings of the 23rd international conference on Machine learning. ACM, 2006.

Evaluation metrics • Uncertain labellings, e.g., censored survival data • Kaplan-Meier estimate of a survival function Kaplan-Meier estimate • Alternative evaluation metrics: – Log-rank test – Concordance index – Explained residual variation of integrated Brier score Risk group: LOW HIGH

Experimental setup • Setting up an unbiased experiment – How well will the model perform on new, yet unseen, data? train test • Dataset split: training • n-fold cross-validation or leave-one-out test • Fold stratification of class-wise distribution • Multiple iterations of fold splits

Experimental setup • Estimating a model wisely – validation data used for tuning hyperparameters Data Learning algorithm training MoA

Experimental setup training • Fair estimate of a classifier / regression model – data preprocessing can bias your conclusions – watch out for naturally correlated observations, e.g. • diagnostic scan of a patient now and before • different imaging modalities of the same subject – artificially generated data can cause a mess – use testing data only after you are done with the training • Fair estimate of a segmenter – splitting pixels of the same image into separate sets is usually not the best idea – also, use a separate testing set of images How well will the model perform on new, yet unseen, data?

Method performance comparison • You devised a new method – What makes your method better? – Is it significantly better? • Is the sample representative of a population? • Hypothesis: It is better (maybe) POPULATION

Method performance comparison • Hypothesis: A equals B! – Non-parametric statistical tests – A level of significance α is used to determine at which level the hypothesis may be rejected – The smaller the p -value, the stronger the evidence against the null hypothesis Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan ( 2006): 1-30. Derrac, Joaquín, et al. "A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms." Swarm and Evolutionary Computation 1.1 (2011): 3-18.

Two classifiers • Comparing performance against baseline (over multiple datasets) • Wilcoxon signed ranks test Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan ( 2006): 1-30.

Multiple classifiers • Comparing performance of multiple classifiers (over multiple datasets) • Friedman test • Post-hoc tests Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan ( 2006): 1-30.

Multiple classifiers • Friedman test Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan ( 2006): 1-30.

Multiple classifiers • Post-hoc tests • All-vs-all – Nemenyi test • One-vs-all – Bonferroni-Dunn test worse Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan ( 2006): 1-30.

BUT WERE FORCED TO FIND OUT Ivan tajduhar istajduh@riteh.hr SSIP - PowerPoint PPT Presentation

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT Ivan tajduhar istajduh@riteh.hr SSIP 2019 27 TH SUMMER SCHOOL ON IMAGE PROCESSING, TIMISOARA, ROMANIA July 10 th 2019 EVERYTHING YOU NEVER WANTED TO KNOW

Diferential Equations Forced vibrations ITI 26/03/2020 ITI Forced Vibrations Forced

Math 211 Math 211 Lecture #34 Forced Harmonic Motion November 14, 2003 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion November 18, 2002 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion November 19, 2001 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion April 16, 2001 2 Forced Harmonic Motion

Forced Marriage (FM) & Honour Based Violence (HBV) Merseyside Afrah Qassim Chair of Savera

Subject comparability, forced Subject comparability, forced policy policy- -making and the

What is forced labour? ILO Forced Labour Convention, 1930 (No. 29) All work or service that

Forced/forceful browsing sws2 1 Forced browsing (not in book!) Supplying a URL directly

Class 16: Damped and forced oscillation Class 16: Damped and forced oscillation Damped SHM Equation

Math 211 Math 211 Lecture #36 Forced Harmonic Motion Nonlinear Systems November 21, 2001 2

Egyptian Pharaohs Who were the pharaohs? The pharaohs of Egypt were the kings and queens of

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Find Us! Find Us! Find Us! Find Us! Like Us! Like Us! Interact With Us! Interact With Us!

Jewish Celebrations Learning Objective: To find out about the festival of Hanukkah.

Global Trends in Forced Migration Refugee issues What are the challenges refugees face?

Lung Cancer Diagnosis in 2007 Greatest cause of cancer deaths worldwide Greatest cause of

Diep veneuze thrombose van de arm De chirurgische insteek Gert J. de Borst Dep of Vascular

30 Gy single dose Stereotactic Body Radiation Therapy to lung lesions: outcome in a large series

Base of Tongue/ Head and Neck 2019 2019-2020 NAACCR W EBINAR SERIES 2 Q&A Please submit

Approximating 1-Qubit Gates: Energy and Discrepancy Steven Damelin 2 Joint work with: Alec Greene

FJDALOMCSILLAPTS KLNLEGES KRLMNYEK KZTT skor etanol kor Mezopotmia

1-Lar arge ge Vei eins ns of of t the he bod ody 2- large arteries of the body Dr.Amjad d

THE TEMPORAL EVOLUTION OF SELF-HEALING AMITABH TREHAN LOUGHBOROUGH UNIVERSITY (Joint work with

BUT WERE FORCED TO FIND OUT Ivan tajduhar istajduh@riteh.hr SSIP - PowerPoint PPT Presentation

EVERYTHING YOU NEVER WANTED TO KNOW ABOUT MACHINE LEARNING, BUT WERE FORCED TO FIND OUT Ivan tajduhar istajduh@riteh.hr SSIP 2019 27 TH SUMMER SCHOOL ON IMAGE PROCESSING, TIMISOARA, ROMANIA July 10 th 2019 EVERYTHING YOU NEVER WANTED TO KNOW

Diferential Equations Forced vibrations ITI 26/03/2020 ITI Forced Vibrations Forced

Math 211 Math 211 Lecture #34 Forced Harmonic Motion November 14, 2003 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion November 18, 2002 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion November 19, 2001 2 Forced Harmonic

Math 211 Math 211 Lecture #35 Forced Harmonic Motion April 16, 2001 2 Forced Harmonic Motion

Forced Marriage (FM) &amp; Honour Based Violence (HBV) Merseyside Afrah Qassim Chair of Savera

Subject comparability, forced Subject comparability, forced policy policy- -making and the

What is forced labour? ILO Forced Labour Convention, 1930 (No. 29) All work or service that

Forced/forceful browsing sws2 1 Forced browsing (not in book!) Supplying a URL directly

Class 16: Damped and forced oscillation Class 16: Damped and forced oscillation Damped SHM Equation

Math 211 Math 211 Lecture #36 Forced Harmonic Motion Nonlinear Systems November 21, 2001 2

Egyptian Pharaohs Who were the pharaohs? The pharaohs of Egypt were the kings and queens of

Commission: Out of touch, out of date, out of pocket April 2017 Commission: Out of touch, out of

Find Us! Find Us! Find Us! Find Us! Like Us! Like Us! Interact With Us! Interact With Us!

Jewish Celebrations Learning Objective: To find out about the festival of Hanukkah.

Global Trends in Forced Migration Refugee issues What are the challenges refugees face?

Lung Cancer Diagnosis in 2007 Greatest cause of cancer deaths worldwide Greatest cause of

Diep veneuze thrombose van de arm De chirurgische insteek Gert J. de Borst Dep of Vascular

30 Gy single dose Stereotactic Body Radiation Therapy to lung lesions: outcome in a large series

Base of Tongue/ Head and Neck 2019 2019-2020 NAACCR W EBINAR SERIES 2 Q&amp;A Please submit

Approximating 1-Qubit Gates: Energy and Discrepancy Steven Damelin 2 Joint work with: Alec Greene

FJDALOMCSILLAPTS KLNLEGES KRLMNYEK KZTT skor etanol kor Mezopotmia

1-Lar arge ge Vei eins ns of of t the he bod ody 2- large arteries of the body Dr.Amjad d

THE TEMPORAL EVOLUTION OF SELF-HEALING AMITABH TREHAN LOUGHBOROUGH UNIVERSITY (Joint work with

Forced Marriage (FM) & Honour Based Violence (HBV) Merseyside Afrah Qassim Chair of Savera

Base of Tongue/ Head and Neck 2019 2019-2020 NAACCR W EBINAR SERIES 2 Q&A Please submit