SLIDE 1

Exploration of classification methods: SVM and KDE

Xi Cheng, Heng Xu, Jing Peng, Zimeng Wang, Andy Wu, Shiyuan Li

University of California, Davis
Instructor: Xiaodong Li
RTG

June 2017

SLIDE 2

Introduction

Project Summary

Goal: to publish a Wiki page and draft text notes detailing the classification methods Support Vector Machines (SVM) and Kernel Density Classification (KDC) so that anyone may learn about them.

Part 1: Conceptual Study
Part 2: Empirical Analysis

SLIDE 3

Introduction

What is Classification?

Classification is the problem of identifying which category a new observation belongs to, given a set of features for that observation and a set of observations whose category is known.

Example: classifying email into spam vs. non-spam

SLIDE 4

Support Vector Machine

Hard Margin

Figure: Infinitely many separating hyperplanes; a hyperplane separates the data into 2 classes. Our goal is to use training data to develop a classifier that correctly classifies test data, subject to certain constraints.

SLIDE 5

Support Vector Machine

Maximal Margin Classifier

Optimal separating hyperplane: the hyperplane that has the farthest minimum distance to the training observations.

Primal optimization problem:

$$\max_{w,b} \ \frac{2}{\|w\|} \quad \text{s.t.} \quad y_i(w^T x_i + b) \ge 1, \quad i = 1, \dots, n$$
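
To make this concrete, here is a minimal sketch of a hard-margin linear SVM, assuming scikit-learn and synthetic separable data (both illustrative choices, not from the slides); SVC has no exact hard-margin mode, so a very large cost parameter approximates it.

```python
# A minimal sketch of a (near) hard-margin linear SVM; toy data are made up.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Two linearly separable Gaussian blobs (hypothetical data)
X = np.vstack([rng.normal(-2, 0.5, (20, 2)), rng.normal(2, 0.5, (20, 2))])
y = np.array([-1] * 20 + [1] * 20)

clf = SVC(kernel="linear", C=1e10)  # very large C approximates the hard margin
clf.fit(X, y)
w = clf.coef_[0]
print("margin width 2/||w|| =", 2 / np.linalg.norm(w))
```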

SLIDE 6

Support Vector Machine

Support Vector Classifier

Described by a soft margin, allowing some observations to be on the wrong side of the margin or even the wrong side of the hyperplane, subject to a cost parameter.

Primal optimization problem:

$$\min_{w,b,\varepsilon_i} \ \frac{1}{2} w^T w + C \sum_{i=1}^{n} \varepsilon_i \quad \text{s.t.} \quad y_i(w^T x_i + b) \ge 1 - \varepsilon_i, \ \varepsilon_i \ge 0, \quad i = 1, \dots, n$$
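
As an illustration of the cost parameter, the following sketch (synthetic overlapping data, scikit-learn assumed) fits the soft-margin problem for several values of C; smaller C tolerates more margin violations and yields a wider margin.

```python
# A hypothetical sketch of how the cost parameter C controls the soft margin.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-1, 1.0, (50, 2)), rng.normal(1, 1.0, (50, 2))])
y = np.array([-1] * 50 + [1] * 50)  # overlapping classes

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    print(f"C={C}: {clf.n_support_.sum()} support vectors, "
          f"margin 2/||w|| = {2 / np.linalg.norm(clf.coef_[0]):.3f}")
```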

SLIDE 7

Support Vector Machine

Support Vector Machine

An extension of Support Vector Classifiers that enlarges the feature space using kernels to create a non-linear decision boundary.

The dual optimization problem differs depending on the choice of kernel; notably, it depends on the observations only through their inner products.
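
A brief sketch of how a non-linear kernel changes the picture, using scikit-learn's make_circles as a stand-in data set (an illustrative choice, not from the slides):

```python
# Contrast linear and RBF-kernel SVMs on data that is not linearly
# separable (two concentric rings). Illustrative only.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, noise=0.1, factor=0.3, random_state=0)

for kernel in ("linear", "rbf"):
    clf = SVC(kernel=kernel).fit(X, y)
    print(kernel, "training accuracy:", clf.score(X, y))
```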

SLIDE 8

Support Vector Machine

Support Vector Machine

Maximal Margin Classifiers, Support Vector Classifiers, and Support Vector Machines are all considered Support Vector Machines:

  • Maximal Margin Classifier: linear kernel, $\varepsilon_i = 0$
  • Support Vector Classifier: linear kernel, $\varepsilon_i > 0$
  • Support Vector Machine: radial (or other non-linear) kernel

SLIDE 9

Kernel Density Classification

Naive Bayes Classifier

Given a feature vector $x = (x_1, \dots, x_m)^T$, we assign the probability $P(C_k \mid x_1, \dots, x_m)$ to the event that the observation belongs to the class $C_k$. We assume each feature is conditionally independent of every other feature given the class variable. Using Bayes' theorem, the Naive Bayes classifier assigns the observation to the class

$$\hat{y} = \underset{k \in \{1, \dots, K\}}{\operatorname{argmax}} \ P(C_k) \prod_{i=1}^{m} P(x_i \mid C_k)$$
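
The decision rule above can be sketched directly; the Gaussian choice for the class-conditional densities $P(x_i \mid C_k)$ and all parameter values below are illustrative assumptions:

```python
# A small sketch of the Naive Bayes decision rule with Gaussian
# class-conditional densities. Data and parameters are made up.
import numpy as np
from scipy.stats import norm

def naive_bayes_predict(x, priors, means, stds):
    """argmax_k P(C_k) * prod_i P(x_i | C_k) with Gaussian conditionals."""
    scores = [priors[k] * np.prod(norm.pdf(x, means[k], stds[k]))
              for k in range(len(priors))]
    return int(np.argmax(scores))

# Two classes, two features (hypothetical parameters)
priors = [0.5, 0.5]
means = [np.array([0.0, 0.0]), np.array([2.0, 2.0])]
stds = [np.array([1.0, 1.0]), np.array([1.0, 1.0])]
print(naive_bayes_predict(np.array([1.8, 2.1]), priors, means, stds))  # expected: 1
```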

SLIDE 10

Kernel Density Classification

Kernel Density Estimation

Next, we want to know how to calculate the conditional probability $P(x_i \mid C_k)$ in a non-parametric way.

Using histograms, we can estimate the density as

$$\hat{f}(x_0) = \frac{\#\{x_i \in N(x_0)\}}{nh}$$

where $h > 0$ is a parameter called the bandwidth and $N(x_0)$ is the bin of width $h$ containing $x_0$.
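
A quick numerical sketch of this histogram estimate, with a made-up sample and bin width:

```python
# Count the sample points in the bin around x0 and divide by n*h.
import numpy as np

rng = np.random.default_rng(7)
sample = rng.normal(0, 1, 1000)           # illustrative N(0,1) sample
h = 0.5                                   # bin width (bandwidth)
x0 = 0.25
bin_left = np.floor(x0 / h) * h           # bin containing x0: [bin_left, bin_left + h)
count = np.sum((sample >= bin_left) & (sample < bin_left + h))
print("f_hat(x0) =", count / (len(sample) * h))   # roughly the N(0,1) pdf near 0.25
```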

SLIDE 11

Kernel Density Classification

Kernel Density Estimation

Using kernels, we can obtain a smooth estimate of the pdf:

$$\hat{f}_n(x) = \frac{1}{nh} \sum_{i=1}^{n} K\left(\frac{x - x_i}{h}\right)$$

where $h > 0$ is the bandwidth and $K(u)$ is the kernel function.
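
A minimal sketch of this estimator with a Gaussian kernel; the bandwidth and sample below are illustrative choices:

```python
# f_hat(x) = (1/(n*h)) * sum_i K((x - x_i)/h) with a Gaussian kernel K.
import numpy as np

def kde(x, sample, h):
    u = (x - sample[:, None]) / h          # shape (n, len(x))
    K = np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)
    return K.sum(axis=0) / (len(sample) * h)

rng = np.random.default_rng(2)
sample = rng.normal(0, 1, 200)             # data from a standard normal
grid = np.linspace(-4, 4, 9)
print(np.round(kde(grid, sample, h=0.4), 3))
```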

SLIDE 12

Kernel Density Classification

Bias-Variance Tradeoff

The choice of bandwidth h is important because of the bias-variance tradeoff: a small h gives a wiggly estimate (low bias, high variance), while a large h oversmooths (high bias, low variance).
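
One way to see the tradeoff numerically is to compare the estimate against a known density for several bandwidths; the following sketch (synthetic standard-normal data) is illustrative:

```python
# Compare KDE error on a known density for undersmoothed, moderate, and
# oversmoothed bandwidth choices. All values are illustrative.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(3)
sample = rng.normal(0, 1, 200)
grid = np.linspace(-4, 4, 201)
true_pdf = norm.pdf(grid)

for h in (0.05, 0.4, 2.0):
    u = (grid - sample[:, None]) / h
    f_hat = (np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)).sum(axis=0) / (len(sample) * h)
    print(f"h={h}: mean squared error = {np.mean((f_hat - true_pdf)**2):.5f}")
```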

SLIDE 13

Empirical Study

Overview of Our Empirical Studies

Six individual empirical studies:

  • Heart Disease Data Analysis: Andy Wu
  • Text Classification (BBC News Data Set...): Shiyuan Li
  • Categorical Predictors (Connect-4 Data Set...): Xi Cheng
  • Sentiment Analysis (IMDB Reviews Data Set...): Zimeng Wang
  • SVM for Unbalanced Data: Jing Peng
  • Connection between SVM, LDA and QDA: Heng Xu

SLIDE 14

Empirical Study

Connection between SVM, LDA and QDA

What are LDA and QDA? Linear Discriminant Analysis (LDA) and Quadratic Discriminant Analysis (QDA) model each class with a Gaussian density and assign an observation to the class with the highest posterior probability.

SLIDE 15

Empirical Study

Connection between SVM, LDA and QDA

When do we want to use LDA and QDA?

  • LDA: assumes each class has the same variance-covariance matrix; the decision boundary is a straight line.
  • QDA: assumes each class has a different variance-covariance matrix; the decision boundary is a quadratic curve.
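
A brief sketch comparing the two classifiers with scikit-learn on synthetic data whose classes have different covariance matrices (all values illustrative):

```python
# QDA should have an edge here because the class covariances differ.
import numpy as np
from sklearn.discriminant_analysis import (LinearDiscriminantAnalysis,
                                           QuadraticDiscriminantAnalysis)

rng = np.random.default_rng(4)
X0 = rng.multivariate_normal([0, 0], [[1, 0], [0, 1]], 100)
X1 = rng.multivariate_normal([2, 2], [[3, 1], [1, 2]], 100)   # different covariance
X = np.vstack([X0, X1])
y = np.array([0] * 100 + [1] * 100)

for model in (LinearDiscriminantAnalysis(), QuadraticDiscriminantAnalysis()):
    print(type(model).__name__, "accuracy:", model.fit(X, y).score(X, y))
```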

SLIDE 16

Empirical Study

Covariance Adjusted SVM

Linear SVM with soft margin:

$$\min_{w,b,\varepsilon_i} \ \frac{1}{2} w^T w + C \sum_{i=1}^{n} \varepsilon_i \quad \text{s.t.} \quad y_i(w^T x_i + b) \ge 1 - \varepsilon_i, \ \varepsilon_i \ge 0, \quad i = 1, \dots, n \tag{1}$$

Dual form of kernel SVM:

$$\max_{\alpha} \ \sum_{i=1}^{n} \alpha_i - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \alpha_j y_i y_j K(x_i, x_j) \quad \text{s.t.} \quad \sum_{i=1}^{n} y_i \alpha_i = 0, \ 0 \le \alpha_i \le C, \quad i = 1, \dots, n \tag{2}$$
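
For reference, a small sketch relating the dual form (2) to scikit-learn's fitted SVC: its dual_coef_ attribute stores $y_i \alpha_i$ for the support vectors, so the dual constraints can be checked numerically (toy data, illustrative):

```python
# Check the constraints of the dual problem (2) on a fitted SVC.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(8)
X = np.vstack([rng.normal(-1, 1, (50, 2)), rng.normal(1, 1, (50, 2))])
y = np.array([-1] * 50 + [1] * 50)

clf = SVC(kernel="rbf", C=1.0).fit(X, y)
yi_alpha = clf.dual_coef_[0]               # y_i * alpha_i for support vectors
print("sum of y_i * alpha_i:", yi_alpha.sum())            # ~ 0, matching (2)
print("0 <= alpha_i <= C:", np.all(np.abs(yi_alpha) <= 1.0 + 1e-9))  # since y_i = +-1
```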

SLIDE 17

Empirical Study

Covariance Adjusted SVM

We use $S$ to denote the pooled covariance matrix, and we want to take the variance-covariance structure into consideration:

$$\min_{w,b,\varepsilon_i} \ \frac{1}{2} w^T S w + C \sum_{i=1}^{n} \varepsilon_i \quad \text{s.t.} \quad y_i(w^T x_i + b) \ge 1 - \varepsilon_i, \ \varepsilon_i \ge 0, \quad i = 1, \dots, n$$

We can verify that this model is equivalent to multiplying the data by the inverse of the square root of the pooled covariance matrix and then applying SVM to the new data, since $S = (S^{1/2})^T S^{1/2}$ and $S^{1/2} S^{-1/2} = I$:

$$\min_{w,b,\varepsilon_i} \ \frac{1}{2} w^T (S^{1/2})^T S^{1/2} w + C \sum_{i=1}^{n} \varepsilon_i \quad \text{s.t.} \quad y_i(w^T S^{1/2} S^{-1/2} x_i + b) \ge 1 - \varepsilon_i, \quad i = 1, \dots, n$$

Setting $\tilde{w} = S^{1/2} w$ and $\tilde{x}_i = S^{-1/2} x_i$ recovers the standard linear SVM on the transformed data.
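
A hedged sketch of this whitening view: form a pooled covariance $S$, transform the data by $S^{-1/2}$, and fit a linear SVM. The pooled-covariance formula and the toy data below are illustrative assumptions:

```python
# Covariance-adjusted SVM as whitening: x_i -> S^{-1/2} x_i, then linear SVM.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(5)
X0 = rng.multivariate_normal([0, 0], [[2, 1], [1, 1]], 100)
X1 = rng.multivariate_normal([2, 2], [[2, 1], [1, 1]], 100)
X = np.vstack([X0, X1])
y = np.array([-1] * 100 + [1] * 100)

# Pooled (within-class) covariance matrix S
S = ((len(X0) - 1) * np.cov(X0.T) + (len(X1) - 1) * np.cov(X1.T)) / (len(X) - 2)

# S^{-1/2} via the eigendecomposition of the symmetric matrix S
vals, vecs = np.linalg.eigh(S)
S_inv_sqrt = vecs @ np.diag(vals ** -0.5) @ vecs.T
X_white = X @ S_inv_sqrt                   # rows become (S^{-1/2} x_i)^T

print("plain SVM:   ", SVC(kernel="linear").fit(X, y).score(X, y))
print("whitened SVM:", SVC(kernel="linear").fit(X_white, y).score(X_white, y))
```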

SLIDE 18

Empirical Study

Connection between SVM, LDA and QDA

Case 1: 2 dimensions, same variance-covariance matrix, classes heavily overlapping

SLIDE 19

Empirical Study

Connection between SVM, LDA and QDA

Case 2: 2 dimensions, same variance-covariance matrix, classes not heavily overlapping

SLIDE 20

Empirical Study

Connection between SVM, LDA and QDA

2 dimensions, different variance-covariance matrices (using SVM with a polynomial kernel of degree 2, and QDA)

SLIDE 21

Empirical Study

Connection between SVM, LDA and QDA

Case 3: the two classes are mixed with each other

SLIDE 22

Empirical Study

Connection between SVM, LDA and QDA

Case 5: the two classes are not mixed as heavily

SLIDE 23

Empirical Study

Connection between SVM, LDA and QDA

Case 4: mixed heavily, even in some extreme cases

SLIDE 24

Empirical Study

Connection between SVM, LDA and QDA

Observations: if the two classes are heavily intertwined, linear SVM and LDA (likewise polynomial-kernel SVM and QDA) construct extremely similar classifiers.
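
A rough sketch of the kind of comparison behind these cases: fit a linear SVM and LDA on heavily overlapping classes with a shared covariance and compare the normalized boundary directions (synthetic data, illustrative):

```python
# Compare the decision-boundary directions of linear SVM and LDA on
# heavily overlapping Gaussian classes with a common covariance.
import numpy as np
from sklearn.svm import SVC
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(6)
cov = [[1.0, 0.3], [0.3, 1.0]]                 # same covariance for both classes
X = np.vstack([rng.multivariate_normal([0, 0], cov, 200),
               rng.multivariate_normal([1, 1], cov, 200)])
y = np.array([0] * 200 + [1] * 200)            # heavily overlapping classes

svm = SVC(kernel="linear").fit(X, y)
lda = LinearDiscriminantAnalysis().fit(X, y)

# Unit-normalized normal vectors of the two separating hyperplanes
w_svm = svm.coef_[0] / np.linalg.norm(svm.coef_[0])
w_lda = lda.coef_[0] / np.linalg.norm(lda.coef_[0])
print("SVM direction:", np.round(w_svm, 3))
print("LDA direction:", np.round(w_lda, 3))
```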

SLIDE 25

Empirical Study

We are writing all of our thoughts and work in an INTERESTING report here!
