

SLIDE 1

CS-473

Text Categorization (II)

Luo Si Department of Computer Science Purdue University

SLIDE 2

Text Categorization (II)

Outline

• Support Vector Machine (SVM): A Large-Margin Classifier

  • Introduction to SVM
  • Linear, hard margin
  • Linear, soft margin
  • Non-Linear SVM
  • Discussion
SLIDE 3

History of SVM

[1] B. E. Boser et al. A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144-152, Pittsburgh, 1992.
[2] L. Bottou et al. Comparison of classifier methods: a case study in handwritten digit recognition. Proceedings of the 12th IAPR International Conference on Pattern Recognition, vol. 2, pp. 77-82, 1994.
[3] V. Vapnik. The Nature of Statistical Learning Theory. 2nd edition, Springer, 1999.

A brief history of SVM

• SVM is inspired by the statistical learning theory of Vapnik (1979) [3]
• It was put into practical application as a "Large Margin Classifier" in 1992 [1]
• SVM became famous for its success in handwritten digit recognition [2]
• SVM has been successfully applied in:

  • Image detection
  • Speaker identification
  • Text categorization
  • Many other problems…
SLIDE 4

Support Vector Machine

Consider a two-class (binary) classification problem, such as text categorization: find a line that separates the data points of the two classes. There are many possible solutions! Are those decision boundaries equally good?

SLIDE 5

Support Vector Machine

A slight variation of the data makes some decision boundaries incorrect.

SLIDE 6

Large-Margin Decision Criterion

The decision boundary should be as far from the data points of both classes as possible; that is, the margin between the data points and the decision boundary should be large.

Positive and negative data points have equal margin.

SLIDE 7

Large-Margin Decision Criterion

Closest positive data point to the boundary: \(W^T x_i + b = 1\)

Closest negative data point to the boundary: \(W^T x_j + b = -1\)

The margin is: \(m = \dfrac{2}{\|W\|}\)
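As a quick illustrative check (numbers not from the slides): if \(W = (3, 4)^T\), then \(\|W\| = \sqrt{9 + 16} = 5\), so the margin is \(2/\|W\| = 0.4\). Rescaling \(W\) and \(b\) by a constant would change this value, which is why the formulation pins the closest points of each class at \(\pm 1\).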

SLIDE 8

Linear SVM

Let \(\{x_1, \ldots, x_n\}\) denote the input data, e.g., the vector representations of all documents. Let \(y_i \in \{1, -1\}\) be the binary indicator of whether \(x_i\) belongs to a particular category \(c\) or not. The decision boundary should classify all points correctly. The decision boundary can be found by solving the following constrained optimization problem:

Minimize \(\frac{1}{2}\|W\|^2\) subject to \(y_i (W^T x_i + b) \ge 1\) for all \(i = 1, \ldots, n\)
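To make the formulation concrete, here is a minimal sketch in Python, assuming scikit-learn is available (the slides do not prescribe a solver); a hard margin is approximated by making the soft-margin penalty C very large:

```python
import numpy as np
from sklearn.svm import SVC

# Two small linearly separable classes in 2D (toy data, for illustration).
X = np.array([[2.0, 2.0], [3.0, 3.0], [2.5, 3.5],   # positive class
              [0.0, 0.0], [1.0, 0.5], [0.5, 1.0]])  # negative class
y = np.array([1, 1, 1, -1, -1, -1])

# Approximate hard margin: a very large C leaves (almost) no slack, so the
# solver effectively minimizes (1/2)||W||^2 s.t. y_i (W^T x_i + b) >= 1.
clf = SVC(kernel="linear", C=1e10).fit(X, y)

W, b = clf.coef_[0], clf.intercept_[0]
print("W =", W, ", b =", b)
print("margin = 2/||W|| =", 2 / np.linalg.norm(W))
```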
SLIDE 9

Hard Margin Linear SVM Solution

The optimal parameters are:

\(W^* = \sum_{i \in SV} \alpha_i y_i x_i\)

\(y_i (W^{*T} x_i + b^*) = 1, \quad \forall i \in SV\)

where \(SV\) denotes the set of support vectors and the \(\alpha_i\) are the dual coefficients.

Prediction is made by:

\(\operatorname{sign}(W^{*T} x + b^*) = \operatorname{sign}\Big(\sum_{i \in SV} \alpha_i y_i (x_i^T x) + b^*\Big)\)
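A short sketch of this prediction rule, again assuming scikit-learn (whose `dual_coef_` attribute stores \(\alpha_i y_i\) for each support vector), checks that summing over the support vectors reproduces the library's own prediction:

```python
import numpy as np
from sklearn.svm import SVC

X = np.array([[2.0, 2.0], [3.0, 3.0], [0.0, 0.0], [1.0, 0.5]])
y = np.array([1, 1, -1, -1])
clf = SVC(kernel="linear", C=1e10).fit(X, y)

alpha_y = clf.dual_coef_[0]     # alpha_i * y_i, one entry per support vector
sv = clf.support_vectors_       # the x_i with i in SV
b = clf.intercept_[0]

x_new = np.array([1.5, 2.0])
score = alpha_y @ (sv @ x_new) + b   # sum_i alpha_i y_i (x_i . x_new) + b*
print(int(np.sign(score)), clf.predict([x_new])[0])  # the two should agree
```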

SLIDE 10

Soft Margin Linear SVM Solution

What about linearly non-separable data?

SLIDE 11

Soft Margin Linear SVM Solution

We tolerate some error \(\xi_i\) for specific data points (the figure marks two such errors, \(\xi_1\) and \(\xi_2\)).

SLIDE 12

Soft Margin Linear SVM

Introduce "slack variables" \(\xi_i\); slack variables are always non-negative. Introduce a constant \(C\) to balance the error of the linear boundary against the margin. The optimization problem becomes:

Minimize \(\frac{1}{2}\|W\|^2 + C \sum_{i=1}^{n} \xi_i\) subject to \(y_i (W^T x_i + b) \ge 1 - \xi_i\) and \(\xi_i \ge 0\) for all \(i\)
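A minimal sketch of the role of \(C\), assuming scikit-learn and invented overlapping toy data: a small \(C\) tolerates more slack and keeps the margin wide, while a large \(C\) penalizes errors heavily and narrows the margin:

```python
import numpy as np
from sklearn.svm import SVC

# Two overlapping Gaussian clouds: not linearly separable.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (50, 2)),
               rng.normal(2.0, 1.0, (50, 2))])
y = np.array([-1] * 50 + [1] * 50)

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    margin = 2 / np.linalg.norm(clf.coef_[0])
    print(f"C = {C:6}: margin = {margin:.3f}, "
          f"support vectors = {len(clf.support_)}")
```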

SLIDE 13

Non-linear SVM

Linear SVM uses only a line to separate the data points; how can it be generalized to the non-linear case? Key idea: transform \(x_i\) to a higher-dimensional space.

  • Input space: the space where the points \(x_i\) are located
  • Feature space: the space of \(\phi(x_i)\) after the transformation
SLIDE 14

Non-linear SVM

Key idea: transform \(x_i\) to a higher-dimensional space.

[Figure: one-dimensional data along the \(x_1\) axis becomes separable after adding a second dimension \(x_2\).]

SLIDE 15

Non-linear SVM

Key idea: transform \(x_i\) to a higher-dimensional space.

  • Input space: the space where the points \(x_i\) are located
  • Feature space: the space of \(\phi(x_i)\) after the transformation

Use \(\phi(x_i)\) to transform low-level features into high-level features. Sometimes the \(\phi(x_i)\) transformation maps into a very high-dimensional or even infinite-dimensional space.
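Here is a minimal sketch of the classic one-dimensional example (toy data invented for illustration, assuming scikit-learn): points far from the origin vs. points near it cannot be split by a single threshold on \(x\), but the explicit map \(\phi(x) = (x, x^2)\) makes them linearly separable:

```python
import numpy as np
from sklearn.svm import SVC

# 1D inputs: the classes interleave, so no threshold on x separates them.
x = np.array([-3.0, -2.5, -0.5, 0.0, 0.5, 2.5, 3.0])
y = np.array([1, 1, -1, -1, -1, 1, 1])   # +1 = far from the origin

# Explicit feature map phi(x) = (x, x^2): input space -> feature space.
phi = np.column_stack([x, x ** 2])

clf = SVC(kernel="linear", C=1e10).fit(phi, y)
print(clf.predict(phi))   # recovers all seven labels
```

In practice the transformation is usually applied implicitly through a kernel function (e.g., `SVC(kernel="rbf")` in the library assumed above), which avoids ever materializing the very high- or infinite-dimensional feature vectors.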

SLIDE 16

Text Categorization: Evaluation

Performance of different algorithms on the Reuters-21578 corpus: 90 categories, 7,769 training documents, 3,019 test documents (Yang, JIR 1999).

SLIDE 17

SVM Toolkit

  • SMO: Sequential Minimal Optimization
  • SVM-Light
  • LibSVM
  • BSVM
  • …
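As one concrete way to try a toolkit from this list: scikit-learn's `SVC` is built on LIBSVM, so a tiny text-categorization pipeline can be run directly from Python (the four-document corpus below is invented for illustration):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import SVC   # SVC wraps the LIBSVM solver

docs = ["stock market rises", "shares and bonds fall",
        "team wins the match", "player scores a goal"]
labels = ["finance", "finance", "sports", "sports"]

vec = TfidfVectorizer()
X = vec.fit_transform(docs)               # documents -> tf-idf vectors

clf = SVC(kernel="linear").fit(X, labels)
print(clf.predict(vec.transform(["market shares rise again"])))
```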

SLIDE 18

Text Categorization (II)

Outline

• Support Vector Machine (SVM): A Large-Margin Classifier

  • Introduction to SVM
  • Linear, hard margin
  • Linear, soft margin
  • Non-Linear SVM
  • Discussion