MLCC 2015 machine learning applications Francesca Odone ML - PowerPoint PPT Presentation

MLCC 2015 machine learning applications Francesca Odone

ML applications Machine Learning systems are trained on examples rather than being programmed MLCC machine learning applications

Data challenges big data - extract real knowledge from very large dimensional datasets •computation, communication, privacy small data - bridge the gap between biological and artificial intelligence (generalize from few supervised data) • unsupervised, weakly supervised learning • prior knowledge and task/data structure MLCC machine learning applications

Big data & unsupervised learning MLCC machine learning applications

but how do they relate with the course contents? MLCC machine learning applications

plan (longer than needed) medical image analysis : image segmentation bioinformatics : gene selection computer vision : object detection, object recognition, ... human-machine interaction : action recognition, emotion recognition video-surveillance : behavior analysis, pose detection MLCC machine learning applications

Dynamic Contrast Enhanced MRI analysis Goal: study and implement methods to automatically discriminate different tissues based on different enhancement curve types Approach: •learn from data basis signals and express the enhancement curves as linear combinations of those signals MLCC machine learning applications

Dynamic Contrast Enhanced MRI analysis the dictionary is learnt from data: Vessels Atom #15 Synovial tissue 1 Atom #5 1.2 Synovial tissue 2 Atom Enhancement 1.2 Atom #16 Atom #14 1.0 1.0 Voxel Intensity 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0.0 0 50 100 150 200 0.0 Time 50 100 150 200 0 Time Left: the three different types of generated ECs corresponding to different tissues in the simulated phantom. Right: the four most used atoms, corresponding to the EC patterns associated with each phantom regions. MLCC machine learning applications

Dynamic Contrast Enhanced MRI analysis • Automatic segmentation is obtained by means of an unsupervised method: each voxel is represented by its code (the coefficients u providing the lower reconstruction error w.r.t. the learnt basis D ) • Codes are clustered in 7 main groups (following the expert prior) manual annotation automatic segmentation provided by the expert MLCC machine learning applications

Microarray analysis Goals: •Design methods able to identify a gene signature, i.e., a panel of genes potentially interesting for further screening •Learn the gene signatures, i.e., select the most discriminant subset of genes on the available data MLCC machine learning applications

Microarray analysis A typical “-omics” scenario: High dimensional data - Few samples per class • tenths of data - tenths of thousands genes   → Variable selection High risk of selection bias • data distortion arising from the way the data are collected due to the small amount of data available   → Model assessment needed MLCC machine learning applications

  Elastic net and gene selection β ∈ R p || Y − � X || 2 + ⌧ ( || � || 1 + ✏ || � || 2 min 2 ) Consistency guaranteed - the more samples available the better the estimator Multivariate - it takes into account many genes at once Output: One-parameter family of nested lists with equivalent prediction ability and increasing correlation among genes • minimal list of prototype genes ✏ → 0 • longer lists including correlated genes ✏ 1 < ✏ 2 < ✏ 3 < . . . MLCC machine learning applications

Double optimization approach Variable selection step (elastic net) β ∈ R p || Y − � X || 2 + ⌧ ( || � || 1 + ✏ || � || 2 min 2 ) Classification step (OLS or RLS) || Y − β X || 2 2 + λ || β || 2 2 for each ✏ we have to choose � and ⌧ the combination prevents the elastic net shrinking effect MLCC machine learning applications

Dealing with selection bias ( λ 1 , . . . , λ A ) λ → ( τ 1 , . . . , τ B ) τ → the optimal pair ( λ ∗ , τ ∗ ) is one of the possible A · B pairs ( λ , τ ) MLCC machine learning applications

Computational issues • Computational time for LOO (for one task) time 1 − optim = (2 . 5 s to 25 s ) depending on the correlation parameter total time = A · B · N samples · time 1 − optim 20 · 20 · 30 · time 1 − optim ∼ 2 · 10 4 s to 2 · 10 5 s ∼ • 6 tasks → 1 week!! MLCC machine learning applications

Image understanding Image understanding as a general problem is still unsolved • today we are able to answer complex but specific questions such as object detection, image categorization, ... Machine learning has been the key to solve this kind of problems: • it deals with noise and intra-class variability by collecting appropriate data and finding suitable descriptions • Notice that images are relatively easy to gather (but not to label!) • many large benchmark datasets (with some bias) MLCC machine learning applications

gathering data with some help - iCubWorld MLCC machine learning applications

Object detection in images object detection is in essence a binary classification problem • image regions of variable size are classified: is it an instance of the object or not? unbalanced classes • in this 380x220 px image we perform ~6.5x10 5 tests and we should find only 11 positives the training set contains • images of positive examples (the object) • negative examples (background) MLCC machine learning applications

Representing the image content There is a lot of prior knowledge coming from the computer vision literature (filters, features, ...) •often it is easier and more effective to find explicit mappings towards high dimensional feature spaces •feature selection has been used to get rid of redundancy and speed up computation MLCC machine learning applications

Image feature selection rectangle features or Haar-like features (Viola & Jones) are one of the most effective representations of images for face detection •size of the initial dictionary: a 19 x 19 px image is mapped into a 64.000-dim feature vector! •feature selection may help us reducing the size and keeping only informative elements MLCC machine learning applications

Selecting feature groups Many image features have a characteristic internal structure An image patch is divided in regions or cells and represented according to the specific description, then all representations are concatenated Feature selection can be designed so to extract an entire group instead than a single feature MLCC machine learning applications

an interesting study case: Eigenfaces •Goal: represent face images for recognition purposes (who’s that face?) •build X - data matrix where each row is a face image (unfolded) •PCA(X T X): each eigenvector can be seen as an image, the eigenface ; • they are the directions in which the images differ from the mean image. •eigenvectors with the largest eigenvalues are kept •at run time an image is represented by projecting it onto the chosen directions •many variants... • this simple idea is more appropriate for image matching • not robust to illumination and view-point changes MLCC machine learning applications

Learning common patterns in temporal sequences MLCC machine learning applications

Learning common patterns in temporal sequences temporal sequences adaptive P-spectrum kernel for sequences space quantization φ P u ( s ) = |{ ( v 1 , v 2 ) : s = v 1 uv 2 }| where u 2 A P , while v 1 , v 2 are substrings such that v 1 2 A P 1 , v 2 2 A P 2 , and P 1 + P 2 + P = | s | . The associated kernel between two strings s 1 and s 2 is defined as: K P ( s 1 , s 2 ) = h φ P ( s 1 ) , φ P ( s 2 ) i = X φ P u ( s 1 ) φ P u ( s 2 ) . u ∈ A P String length independence is achieved with an appropriate normalization K P ( s 1 , s 2 ) ˆ K P ( s 1 , s 2 ) = . p p K P ( s 1 , s 1 ) K P ( s 2 , s 2 ) i , . . . , x k i { x i } N i =1 with x i = ( x 1 i , x 2 i ) > MLCC machine learning applications

HMI: iCub recognizing actions MLCC machine learning applications

HMI: emotion recognition from body movements •input data: streams of 3D measurements •intermediate representations: dimensions suggested by psychologists, related to space occupation or the quality of motion •gesture segmentation •multi-class classification of 6 emotions based on a combination of binary SVM classifiers MLCC machine learning applications

MLCC machine learning applications

Learning the appropriate type of grasp estimate the most likely grasps estimate the hand posture vector MLCC machine learning applications

Semi-supervised pose classification The capability of classifying people with respect to their orientation in space is important for a number of tasks • An example is the analysis of collective activities, where the reciprocal orientation of people within a group is an important feature Back • The typical approach relies Back Back Left Right on quantizing the possible Left Right orientations in 8 main angles Front Front Left Right Front • Appearance changes very Back smoothly and labeling may Left Left Front Left MLCC machine learning applications

MLCC 2015 machine learning applications Francesca Odone ML - PowerPoint PPT Presentation

MLCC 2015 machine learning applications Francesca Odone ML applications Machine Learning systems are trained on examples rather than being programmed MLCC machine learning applications Data challenges big data - extract real knowledge from

Applications to high dimensional problems Francesca Odone and Lorenzo Rosasco RegML 2013

MLCC 2018 Statistical Learning: Basic Concepts Lorenzo Rosasco UNIGE-MIT-IIT Outline Learning

RegMet Regularization Methods for High Dimensional Learning Francesca Odone , Lorenzo Rosasco

W HAT IS IT USEFUL FOR ? The learning paradigm is useful whenever the underlying process is

MLCC 2019 Variable Selection and Sparsity Lorenzo Rosasco UNIGE-MIT-IIT Outline Variable

N OISE ... p (y|x) x Y X the same x can generate different y (according to p ( y | x ) ): the

W HY S ELECTING V ARIABLES ? Nowadays many research areas produce data with tenth or hundred

MLCC 2017 Machine Learning Crash Course Universita' di Genova, Summer, 2017 Instructor : Lorenzo

L INEAR KERNEL CONT . For the linear kernel, 1 2 + 2 || Y K c || 2 2 c T K c min c R

H OW C AN W E D ESIGN AN A LGORITHM ? In all the above problems one can think of improving

Strategies in CLIL Francesca Costa francesca.costa@unicatt.it What will I talk about?

MLCC 2015 Dimensionality Reduction and PCA Lorenzo Rosasco UNIGE-MIT-IIT June 25, 2015 Outline

Logics in Machine Learning and Data Mining: Achievements and Open Issues Francesca A. Lisi

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MLCC 2017 Deep Learning Lorenzo Rosasco UNIGE-MIT-IIT June 29, 2017 What? Classification

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

CSSE463: Image Recognition Day 26 This week Today: Finding lines and circles using the

Malicious Documents Trends: a Gmail Perspective Google, @elie with the help of many Googlers

Memor ory D y Defen enses es The Elevation from Obscurity to Headlines Rajeev Balasubramonian

Cloud Control Design Review Team Andrew Thompson Anna Lee Project Lead, Audio Streaming

VU Augmented Reality on Mobile Devices VU Augmented Reality on Mobile Devices Introduction

Improves Engagement Guest Presenter: Ken Gibson Senior Vice President The VisionLink Advisory

Media Retrieval (2) Prepared by Ling Guan Jose Lay Paisarn Muneesawang Ning Zhang Rui Zhang

Tax-Favored Benefits: A Key Reten6on Tool for Farm