10. Support Vector Machines
Foundations of Machine Learning, CentraleSupélec, Fall 2017
Chloé-Agathe Azencott, Centre for Computational Biology, Mines ParisTech
chloe-agathe.azencott@mines-paristech.fr
Learning objectives
- Define a large-margin classifier in the separable case.
- Write the corresponding primal and dual optimization problems.
- Re-write the optimization problem in the case of non-separable data.
- Use the kernel trick to apply soft-margin SVMs to non-linear cases.
- Define kernels for real-valued data, strings, and graphs.
The linearly separable case: hard-margin SVMs
Linear classifier
Assume the data is linearly separable: there exists a line that separates + from -.
Linear classifier
Which one is better?
Margin of a linear classifier
Margin: twice the distance from the separating hyperplane to the closest training point.
Largest margin classifier: Support vector machines
Support vectors
Formalization
- Training set: {(x_i, y_i)}, i = 1, ..., n, with x_i in ℝ^p and y_i in {-1, +1}.
- What are the equations of the 3 parallel hyperplanes?
- How is the "blue" region defined? The "orange" one?
Largest margin hyperplane
What is the size of the margin γ?
For the canonical hyperplanes ⟨w, x⟩ + b = +1 and ⟨w, x⟩ + b = -1, the margin is γ = 2 / ||w||.
Optimization problem
- Training set: {(x_i, y_i)}, i = 1, ..., n.
- Assume the data to be linearly separable.
- Goal: find (w, b) that define the hyperplane with the largest margin.
Optimization problem
- Margin maximization: maximizing γ = 2 / ||w|| amounts to minimizing (1/2) ||w||².
- Correct classification of the training points:
– For positive examples: ⟨w, x_i⟩ + b ≥ +1
– For negative examples: ⟨w, x_i⟩ + b ≤ -1
– Summarized as: y_i (⟨w, x_i⟩ + b) ≥ 1
- Optimization problem:
minimize (1/2) ||w||² over (w, b), subject to y_i (⟨w, x_i⟩ + b) ≥ 1 for i = 1, ..., n.
Optimization problem
- Find (w, b) that minimize (1/2) ||w||² under the n constraints y_i (⟨w, x_i⟩ + b) - 1 ≥ 0.
- We introduce one dual variable α_i for each constraint (i.e. each training point).
- Lagrangian: L(w, b, α) = (1/2) ||w||² - Σ_i α_i [y_i (⟨w, x_i⟩ + b) - 1].
Lagrange dual of the SVM
- Lagrange dual function: q(α) = min over (w, b) of L(w, b, α).
- Lagrange dual problem: maximize q(α) subject to α ≥ 0.
- Strong duality: under Slater's conditions, the optimum of the primal is the optimum of the dual. Here the function to optimize is convex and the constraints are affine, so strong duality holds.
Minimizing the Lagrangian of the SVM
- L(w, b, α) is convex quadratic in w and minimized for: ∇_w L = 0, i.e. w = Σ_i α_i y_i x_i.
- L(w, b, α) is affine in b. Its minimum is -∞, except if: Σ_i α_i y_i = 0.
SVM dual problem
- Lagrange dual function: q(α) = Σ_i α_i - (1/2) Σ_i Σ_j α_i α_j y_i y_j ⟨x_i, x_j⟩ when Σ_i α_i y_i = 0 (and -∞ otherwise).
- Dual problem: maximize q(α) subject to α ≥ 0.
Maximizing a quadratic function under box constraints can be solved efficiently using dedicated software (a minimal sketch follows).
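To make the dual problem concrete, here is a minimal sketch (not from the slides) that solves this quadratic program on a tiny, hypothetical toy set using SciPy's SLSQP solver; the variable names (X, y, alpha) are illustrative.

# Hard-margin SVM dual: maximize sum_i alpha_i - 1/2 sum_ij alpha_i alpha_j y_i y_j <x_i, x_j>
# subject to alpha_i >= 0 and sum_i alpha_i y_i = 0.
import numpy as np
from scipy.optimize import minimize

X = np.array([[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -1.0]])  # toy separable data
y = np.array([1.0, 1.0, -1.0, -1.0])
n = len(y)

G = (y[:, None] * X) @ (y[:, None] * X).T            # G_ij = y_i y_j <x_i, x_j>
neg_dual = lambda a: 0.5 * a @ G @ a - a.sum()       # we minimize the negative dual

res = minimize(neg_dual, np.zeros(n), method="SLSQP",
               bounds=[(0.0, None)] * n,                             # alpha_i >= 0
               constraints=[{"type": "eq", "fun": lambda a: a @ y}])  # sum_i alpha_i y_i = 0
alpha = res.x

w = (alpha * y) @ X                  # w* = sum_i alpha_i y_i x_i
sv = np.argmax(alpha)                # any point with alpha_i > 0 is a support vector
b = y[sv] - X[sv] @ w                # from y_i (<w*, x_i> + b*) = 1
print(np.round(alpha, 3), w, b)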
Optimal hyperplane
- Once the optimal α* is found, we recover (w*, b*): w* = Σ_i α_i* y_i x_i.
- Determining b*:
– The closest positive point to the separating hyperplane verifies ⟨w*, x⟩ + b* = +1.
– The closest negative point to the separating hyperplane verifies ⟨w*, x⟩ + b* = -1.
- The decision function is hence: f(x) = sign(Σ_i α_i* y_i ⟨x_i, x⟩ + b*).
Lagrangian
- Minimize f(w) under the constraint g(w) ≥ 0 (abusive notation: g(w) stands for g(w, b)).
(Figure: iso-contours of f, the feasible region where g(w) ≥ 0, and the unconstrained minimum of f.)
- If the unconstrained minimum of f lies in the feasible region, it is the solution.
- If it does not, the solution lies on the boundary g(w) = 0; in terms of gradients, ∇f(w) and ∇g(w) are then parallel: ∇f(w) = α ∇g(w) for some α ≥ 0.
- Case 1: the unconstrained minimum lies in the feasible region; then ∇f(w) = 0 and we can take α = 0.
- Case 2: it does not; then g(w) = 0 and ∇f(w) = α ∇g(w) with α > 0.
- Both cases are summarized as: ∇f(w) = α ∇g(w), with α ≥ 0 and α g(w) = 0.
- Lagrangian: L(w, α) = f(w) - α g(w); α is called the Lagrange multiplier.
- Minimize f(w) under the n constraints g_i(w) ≥ 0, i = 1, ..., n.
- How do we deal with n constraints? Use n Lagrange multipliers α_1, ..., α_n ≥ 0.
- Lagrangian: L(w, α) = f(w) - Σ_i α_i g_i(w).
Support vectors
- Karush-Kuhn-Tucker conditions: at the optimum, α_i g_i(w) = 0 for each i.
Either α_i = 0 (case 1) or g_i(w) = 0 (case 2).
- For the SVM, g_i(w) = 0 means y_i (⟨w, x_i⟩ + b) = 1: the point lies exactly on the margin.
- Training points with α_i > 0 are the support vectors; points with α_i = 0 do not contribute to the solution.
The non-linearly separable case: soft-margin SVMs
Soft-margin SVMs
What if the data are not linearly separable?
Soft-margin SVMs
- Find a trade-off between a large margin and few errors. What does this remind you of?
SVM error: hinge loss
- We want, for all i: y_i (⟨w, x_i⟩ + b) ≥ 1.
- Hinge loss function: max(0, 1 - y f(x)).
- The hinge loss is zero when y f(x) ≥ 1 and increases linearly as y f(x) decreases below 1.
Soft-margin SVMs
- Find a trade-off between a large margin and few errors.
- Error: the sum of the hinge losses over the training points, Σ_i max(0, 1 - y_i (⟨w, x_i⟩ + b)).
- The soft-margin SVM solves (see the sketch below):
minimize (1/2) ||w||² + C Σ_i max(0, 1 - y_i (⟨w, x_i⟩ + b)).
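As a small illustration (mine, not the lecture's code), the objective above can be evaluated directly with NumPy for any candidate (w, b); the data below is a hypothetical stand-in.

# Soft-margin SVM objective: 1/2 ||w||^2 + C * sum_i max(0, 1 - y_i (<w, x_i> + b))
import numpy as np

def hinge_loss(y, scores):
    # hinge loss max(0, 1 - y f(x)), labels y in {-1, +1}
    return np.maximum(0.0, 1.0 - y * scores)

def soft_margin_objective(w, b, X, y, C=1.0):
    scores = X @ w + b
    return 0.5 * np.dot(w, w) + C * hinge_loss(y, scores).sum()

X = np.array([[1.0, 1.0], [2.0, 0.5], [-1.0, -1.0], [-0.5, -2.0], [0.2, -0.1]])
y = np.array([1, 1, -1, -1, 1])
print(soft_margin_objective(np.array([0.5, 0.5]), 0.0, X, y, C=10.0))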
The C parameter
- Large C: makes few errors.
- Small C: ensures a large margin.
- Intermediate C: finds a trade-off.
It is important to control C
(Figure: prediction error as a function of C, on the training data and on new data.)
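A curve like the one described above can be reproduced with scikit-learn's validation_curve; this is a minimal sketch on synthetic data, not the figure from the slides.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.model_selection import validation_curve
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=10, random_state=0)  # synthetic stand-in
C_range = np.logspace(-3, 3, 10)
train_scores, valid_scores = validation_curve(SVC(kernel="linear"), X, y,
                                              param_name="C", param_range=C_range, cv=5)

plt.semilogx(C_range, 1 - train_scores.mean(axis=1), label="error on training data")
plt.semilogx(C_range, 1 - valid_scores.mean(axis=1), label="error on held-out data")
plt.xlabel("C"); plt.ylabel("prediction error"); plt.legend(); plt.show()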
Slack variables
- The problem above is equivalent to:
minimize (1/2) ||w||² + C Σ_i ξ_i subject to y_i (⟨w, x_i⟩ + b) ≥ 1 - ξ_i and ξ_i ≥ 0.
- The slack variable ξ_i measures the distance between y_i f(x_i) and 1 when the margin constraint is violated.
Lagrangian of the soft-margin SVM
- Primal: minimize (1/2) ||w||² + C Σ_i ξ_i subject to y_i (⟨w, x_i⟩ + b) ≥ 1 - ξ_i and ξ_i ≥ 0.
- Lagrangian: L(w, b, ξ, α, β) = (1/2) ||w||² + C Σ_i ξ_i - Σ_i α_i [y_i (⟨w, x_i⟩ + b) - 1 + ξ_i] - Σ_i β_i ξ_i.
- Minimize the Lagrangian (partial derivatives in w, b, ξ).
- Apply the KKT conditions.
Dual formulation of the soft-margin SVM
- Dual: maximize q(α) = Σ_i α_i - (1/2) Σ_i Σ_j α_i α_j y_i y_j ⟨x_i, x_j⟩
- under the constraints: 0 ≤ α_i ≤ C and Σ_i α_i y_i = 0.
- KKT conditions: α_i = 0 for points outside the margin ("easy"), α_i = C for points that violate the margin ("hard"), and 0 < α_i < C for points exactly on the margin ("somewhat hard").
Support vectors of the soft-margin SVM
- α_i = 0: the point is correctly classified, outside the margin (not a support vector).
- 0 < α_i < C: the point lies exactly on the margin.
- α_i = C: the point is inside the margin or misclassified.
Primal vs. dual
- What is the dimension of the primal problem? What about the dual?
- Primal: (w, b) has dimension (p+1). Favored if the data is low-dimensional.
- Dual: α has dimension n. Favored if there is little data available.
The non-linear case: kernel SVMs.
Non-linear SVMs
Non-linear mapping to a feature space
Map the data from the input space (here ℝ) to a feature space (here ℝ²) in which it becomes linearly separable.
SVM in the feature space
- Train: maximize Σ_i α_i - (1/2) Σ_i Σ_j α_i α_j y_i y_j ⟨φ(x_i), φ(x_j)⟩
under the constraints 0 ≤ α_i ≤ C and Σ_i α_i y_i = 0.
- Predict with the decision function f(x) = Σ_i α_i y_i ⟨φ(x_i), φ(x)⟩ + b*.
Kernels
- For a given mapping φ from the space of objects X to some Hilbert space H, the kernel between two objects x and x' is the inner product of their images in the feature space:
K(x, x') = ⟨φ(x), φ(x')⟩.
- E.g. the linear kernel: X = ℝ^p, φ the identity, K(x, x') = ⟨x, x'⟩.
- Kernels allow us to formalize the notion of similarity.
Dot product and similarity
- Normalized dot product = cosine of the angle between the two feature vectors.
(Figure: two vectors plotted against feature 1 and feature 2.)
Kernel trick
- Many linear algorithms (in particular, linear SVMs) can be performed in the feature space H without explicitly computing the images φ(x), but instead by computing the kernels K(x, x').
- It is sometimes easy to compute kernels which correspond to large-dimensional feature spaces: K(x, x') is often much simpler to compute than φ(x).
SVM with a kernel
- Train: maximize Σ_i α_i - (1/2) Σ_i Σ_j α_i α_j y_i y_j K(x_i, x_j)
under the constraints 0 ≤ α_i ≤ C and Σ_i α_i y_i = 0.
- Predict with the decision function f(x) = Σ_i α_i y_i K(x_i, x) + b* (see the sketch below).
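In practice (a minimal sketch with scikit-learn, not the lecture's code), the kernel can be passed either as a built-in option or as a precomputed Gram matrix:

import numpy as np
from sklearn.datasets import make_moons
from sklearn.metrics.pairwise import rbf_kernel
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.2, random_state=0)   # synthetic non-linear data

clf = SVC(kernel="rbf", gamma=1.0, C=1.0).fit(X, y)           # built-in Gaussian kernel

K = rbf_kernel(X, X, gamma=1.0)                                # Gram matrix K_ij = K(x_i, x_j)
clf_pre = SVC(kernel="precomputed", C=1.0).fit(K, y)           # same SVM, precomputed kernel

x_new = np.array([[0.0, 0.5]])
print(clf.predict(x_new), clf_pre.predict(rbf_kernel(x_new, X, gamma=1.0)))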
Which functions are kernels?
- A function K(x, x') defined on a set X is a kernel iff there exists a Hilbert space H and a mapping φ: X → H such that, for any x, x' in X: K(x, x') = ⟨φ(x), φ(x')⟩.
- A function K(x, x') defined on a set X is positive definite iff it is symmetric and satisfies:
Σ_i Σ_j a_i a_j K(x_i, x_j) ≥ 0 for any n, any x_1, ..., x_n in X, and any a_1, ..., a_n in ℝ.
- Theorem [Aronszajn, 1950]: K is a kernel iff it is positive definite.
Positive definite matrices
- Have a unique Cholesky decomposition K = L L*, with L lower triangular, with positive elements on the diagonal.
- Their sesquilinear form ⟨a, a'⟩ = a* K a' is an inner product:
– conjugate symmetry
– linearity in the first argument
– positive definiteness
Polynomial kernels
- Compute (1 + ⟨x, x'⟩)² for x, x' in ℝ²: it is the inner product of the explicit feature vectors φ(x) = (1, √2 x₁, √2 x₂, x₁², x₂², √2 x₁x₂).
- More generally, K(x, x') = (1 + ⟨x, x'⟩)^d is an inner product in a feature space of all monomials of degree up to d.
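A quick numerical check (mine) that the degree-2 polynomial kernel on ℝ² really is an inner product of explicit feature maps:

import numpy as np

def phi(x):
    # explicit feature map for (1 + <x, x'>)^2 on R^2
    x1, x2 = x
    return np.array([1.0, np.sqrt(2) * x1, np.sqrt(2) * x2,
                     x1 ** 2, x2 ** 2, np.sqrt(2) * x1 * x2])

x, xp = np.array([1.0, 2.0]), np.array([-0.5, 3.0])
print((1.0 + x @ xp) ** 2, phi(x) @ phi(xp))   # both values coincide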
Gaussian kernel
K(x, x') = exp(-||x - x'||² / (2σ²))
What is the dimension of the feature space?
The feature space has infinite dimension.
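Even though the feature space is infinite-dimensional, the kernel matrix itself is cheap to compute; a minimal sketch (mine) comparing an explicit computation with scikit-learn's rbf_kernel:

import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

X = np.random.default_rng(0).normal(size=(5, 3))
sigma = 1.5

sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)   # ||x_i - x_j||^2
K_manual = np.exp(-sq_dists / (2 * sigma ** 2))

# scikit-learn uses exp(-gamma ||x - x'||^2), i.e. gamma = 1 / (2 sigma^2)
K_sklearn = rbf_kernel(X, gamma=1.0 / (2 * sigma ** 2))
print(np.allclose(K_manual, K_sklearn))   # True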
Toy example
Toy example: linear SVM
Toy example: polynomial SVM (d=2)
Kernels for strings
Protein sequence classification
Goal: predict which proteins are secreted or not, based on their sequence.
Substring-based representations
- Represent strings based on the presence/absence of substrings u of fixed length k.
– Number of occurrences of u in x: spectrum kernel [Leslie et al., 2002].
– Number of occurrences of u in x, up to m mismatches: mismatch kernel [Leslie et al., 2004].
– Number of occurrences of u in x, allowing gaps, with a weight decaying exponentially with the number of gaps: substring kernel [Lodhi et al., 2002].
Spectrum kernel
- Implementation:
– Formally, K(x, x') = Σ_u N_u(x) N_u(x') is a sum over |A^k| terms, where N_u(x) is the number of occurrences of the k-mer u in x.
– At most |x| - k + 1 of the terms N_u(x) are non-zero.
– Hence: computation in O(|x| + |x'|).
- Fast prediction for a new sequence x:
f(x) = Σ_i α_i y_i K(x_i, x) + b can be written as a function of only the |x| - k + 1 k-mers of x, i.e. f(x) = Σ_u w_u N_u(x) with weights w_u precomputed from the training sequences (a sketch follows).
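A minimal implementation sketch (mine, not the lecture's) of the spectrum kernel using k-mer count dictionaries, which realizes the O(|x| + |x'|) computation:

from collections import Counter

def kmer_counts(x, k):
    # counts of the at most |x| - k + 1 k-mers occurring in the string x
    return Counter(x[i:i + k] for i in range(len(x) - k + 1))

def spectrum_kernel(x, xp, k=3):
    # K(x, x') = sum over k-mers u of N_u(x) * N_u(x')
    cx, cxp = kmer_counts(x, k), kmer_counts(xp, k)
    return sum(n * cxp[u] for u, n in cx.items() if u in cxp)

print(spectrum_kernel("MKVLAAGLLAL", "MKTLLLALAVA", k=3))   # toy protein-like strings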
The choice of kernel matters
Performance of several kernels on the SCOP superfamily recognition task [Saigo et al., 2004].
Kernels for graphs
Graph data
- Molecules
- Images [Harchaoui & Bach, 2007]
Subgraph-based representations
Represent a graph by a vector indexed by subgraphs: an entry is 0 when there is no occurrence of the corresponding subgraph (e.g. the 1st feature) and 1 (or the number of occurrences) when it occurs at least once (e.g. the 10th feature).
Tanimoto & MinMax
- For binary fingerprints x, x', the Tanimoto similarity is K(x, x') = ⟨x, x'⟩ / (⟨x, x⟩ + ⟨x', x'⟩ - ⟨x, x'⟩).
- For count vectors, the MinMax similarity is K(x, x') = Σ_u min(x_u, x'_u) / Σ_u max(x_u, x'_u).
- The Tanimoto and MinMax similarities are kernels.
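A minimal sketch (mine) of both similarities computed from subgraph-count vectors:

import numpy as np

def tanimoto(x, xp):
    # Tanimoto similarity for binary fingerprint vectors
    dot = np.dot(x, xp)
    return dot / (np.dot(x, x) + np.dot(xp, xp) - dot)

def minmax(x, xp):
    # MinMax similarity for non-negative count vectors
    return np.minimum(x, xp).sum() / np.maximum(x, xp).sum()

x = np.array([1, 0, 1, 1, 0, 2, 0])        # hypothetical subgraph counts
xp = np.array([1, 1, 0, 1, 0, 1, 0])
print(tanimoto((x > 0).astype(float), (xp > 0).astype(float)), minmax(x, xp))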
Which subgraphs to use?
- Indexing by all subgraphs...
– Computing all subgraph occurrences is NP-hard.
– Actually, finding whether a given subgraph occurs in a graph is NP-hard in general.
http://jeremykun.com/2015/11/12/a-quasipolynomial-time-algorithm-for-graph-isomorphism-the-details/
Which subgraphs to use?
- Specific subgraphs that lead to computationally efficient indexing:
– Subgraphs selected based on domain knowledge, e.g. chemical fingerprints
– All frequent subgraphs [Helma et al., 2004]
– All paths up to length k [Nicholls 2005]
– All walks up to length k [Mahé et al., 2005]
– All trees up to depth k [Rogers, 2004]
– All shortest paths [Borgwardt & Kriegel, 2005]
– All subgraphs up to k vertices (graphlets) [Shervashidze et al., 2009]
Which subgraphs to use?
(Figure: a path of length 5, a walk of length 5, and a tree of depth 2.)
Which subgraphs to use?
(Figure: examples of paths, walks, and trees [Harchaoui & Bach, 2007].)
The choice of kernel matters
Predicting inhibitors for 60 cancer cell lines [Mahé & Vert, 2009].
The choice of kernel matters
[Harchaoui & Bach, 2007]
- COREL14: 1400 natural images, 14 classes.
- Kernels: histogram (H), walk kernel (W), subtree kernel (TW), weighted subtree kernel (wTW), combination (M).
Summary
- Linearly separable case: hard-margin SVM
- Non-separable, but still linear: soft-margin SVM
- Non-linear: kernel SVM
- Kernels for
– real-valued data
– strings
– graphs.
References
- A Course in Machine Learning. http://ciml.info/dl/v0_99/ciml-v0_99-all.pdf
– Soft-margin SVM: Chap 7.7
– Kernel SVM: Chap 11.1 – 11.6
- The Elements of Statistical Learning. http://web.stanford.edu/~hastie/ElemStatLearn/
– Separating hyperplane: Chap 4.5.2
– Soft-margin SVM: Chap 12.1 – 12.2
– Kernel SVM: Chap 12.3
– String kernels: Chap 18.5.1
- Learning with Kernels. http://agbs.kyb.tuebingen.mpg.de/lwk/
– Soft-margin SVM: Chap 1.4
– Kernel SVM: Chap 1.5
– SVR: Chap 1.6
– Kernels: Chap 2.1
- Convex Optimization. https://web.stanford.edu/~boyd/cvxbook/
– SVM optimization: Chap 8.6.1
Practical matters
- Preparing for the exam
– Previous exams with solutions on the course website
- Next week: special session! 2 x 1.5 hrs
– Introduction to artificial neural networks
– Introduction to deep learning and TensorFlow (J. Boyd); a Jupyter notebook will be available for download
– Deep learning for bioimaging (P. Naylor)
Lab
- Redefining cross_validate (a minimal sketch follows)
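A minimal sketch of what such a cross_validate function might look like; the signature and the choice of returning decision-function scores are assumptions, not necessarily the lab's exact interface.

import numpy as np
from sklearn.base import clone
from sklearn.model_selection import KFold

def cross_validate(design_matrix, labels, classifier, cv_folds=None):
    # return one cross-validated score per sample (here: decision-function values)
    if cv_folds is None:
        cv_folds = KFold(n_splits=5, shuffle=True, random_state=0).split(design_matrix)
    pred = np.zeros(labels.shape)
    for tr, te in cv_folds:
        clf = clone(classifier)                      # fresh estimator for each fold
        clf.fit(design_matrix[tr], labels[tr])
        pred[te] = clf.decision_function(design_matrix[te])
    return pred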
Linear SVM
The data is not easily separated by a hyperplane. Support vectors are either correctly classified points that support the margin, or errors. Many support vectors suggest the data is not easy to separate and that there are many errors.
Linear kernel matrix
No visible pattern. Dark lines correspond to vectors with the highest magnitude.
Linear kernel matrix (after feature scaling)
The kernel values are on a smaller scale than previously. The diagonal emerges (the most similar sample to an observation is itself). Many small values.
Linear SVM with optimal C
An SVM classifier with optimized C. On each (train, test) pair (tr, te):
- scaling factors are computed on Xtr
- Xtr and Xte are scaled accordingly
- for each value of C, an SVM is cross-validated on Xtr_scaled
- the best of these SVMs is trained on the full Xtr_scaled and applied to Xte_scaled (this produces one prediction per data point of X)
A sketch of this procedure is given below.
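The procedure above can be written compactly with scikit-learn; this is a sketch of one possible implementation, with synthetic data standing in for the lab's X and y.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, KFold, cross_val_predict
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=20, random_state=0)   # stand-in data

pipe = Pipeline([("scale", StandardScaler()),        # scaling factors computed on Xtr only
                 ("svm", SVC(kernel="linear"))])
inner = GridSearchCV(pipe, {"svm__C": np.logspace(-3, 3, 7)}, cv=5)         # picks the best C

outer = KFold(n_splits=5, shuffle=True, random_state=0)
y_pred = cross_val_predict(inner, X, y, cv=outer)    # one prediction per data point of X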
Polynomial kernel SVM
Polynomial kernel with r=0, d=2, computed on X_scaled: the matrix is really close to the identity, so nothing can be learned. This gets worse if you increase d. Changing r can give us a more reasonable matrix.
(Figure: kernel matrices for r = 10, 100, 1000, 10000, 100000, 1000000. For small r the kernel matrix is almost the identity matrix; for r = 1000000 it is almost all 1s; the reasonable range of values for r lies in between.)
- For a fair comparison with the linear kernel, cross-validate C and r (see the sketch below).
- For r, use a logspace between 10000 and 100000, based on your observation of the kernel matrix.
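A sketch of this cross-validation (mine); in scikit-learn's SVC the offset r of the polynomial kernel is the coef0 parameter, and synthetic data stands in for the lab's scaled matrix.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X_scaled, y = make_classification(n_samples=200, n_features=20, random_state=0)   # stand-in

param_grid = {"C": np.logspace(-3, 3, 7),
              "coef0": np.logspace(4, 5, 5)}        # r between 10000 and 100000
grid = GridSearchCV(SVC(kernel="poly", degree=2), param_grid, cv=5).fit(X_scaled, y)
print(grid.best_params_)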
Gaussian kernel SVM
- What values of gamma should we use? Start by spreading out values.
- When gamma > 1e-2, the kernel matrix is close to the identity.
- When gamma = 1e-5, the kernel matrix is getting close to a matrix of all 1s.
- If we choose gamma much smaller, the kernel matrix is going to be so close to a matrix of all 1s that the SVM won't learn well.
Gaussian kernel SVM
- The kernel matrix is more reasonable when gamma is between 5e-5 and 5e-4.
Gaussian kernel SVM
- The best performance we obtain is indeed for a gamma of 5e-5.
- To fairly compare to the linear SVM, one should cross-validate C (a sketch follows).
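A sketch (mine) of the joint cross-validation of C and gamma, again with synthetic stand-in data:

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X_scaled, y = make_classification(n_samples=200, n_features=20, random_state=0)   # stand-in

param_grid = {"C": np.logspace(-2, 2, 5), "gamma": [5e-5, 1e-4, 5e-4, 1e-3]}
grid = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5).fit(X_scaled, y)
print(grid.best_params_, grid.best_score_)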
Linear SVM decision boundary
Quadratic SVM decision boundary