SLIDE 1

CS489/698 Lecture 10: Feb 6, 2017

Kernel methods. Readings: [D] Chap. 11; [B] Sec. 6.1, 6.2; [M] Sec. 14.1, 14.2; [H] Chap. 9; [HTF] Chap. 6

CS489/698 (c) 2017 P. Poupart

SLIDE 2

Non-linear Models Recap

  • Generalized linear models: $y(\mathbf{x}) = \mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x})$ (linear combination of fixed non-linear basis functions)
  • Neural networks: $y(\mathbf{x}) = h\!\left(\mathbf{W}_2\, h(\mathbf{W}_1 \mathbf{x})\right)$ (the non-linear basis functions are themselves adaptive)


SLIDE 3

Kernel Methods

  • Idea: use a large (possibly infinite) set of fixed non-linear basis functions
  • Normally, complexity depends on the number of basis functions, but by a “dual trick”, complexity depends on the amount of data
  • Examples:
– Gaussian Processes (next class)
– Support Vector Machines (next week)
– Kernel Perceptron
– Kernel Principal Component Analysis


SLIDE 4

Kernel Function

  • Let $\boldsymbol{\phi}(\mathbf{x}) = (\phi_1(\mathbf{x}), \ldots, \phi_M(\mathbf{x}))^\top$ be a set of basis functions that map inputs $\mathbf{x}$ to a feature space.
  • In many algorithms, this feature space only appears in the dot product $\boldsymbol{\phi}(\mathbf{x})^\top \boldsymbol{\phi}(\mathbf{x}')$ of input pairs $\mathbf{x}, \mathbf{x}'$.
  • Define the kernel function $k(\mathbf{x}, \mathbf{x}') = \boldsymbol{\phi}(\mathbf{x})^\top \boldsymbol{\phi}(\mathbf{x}')$ to be the dot product of any pair in feature space.
– We only need to know $k(\mathbf{x}, \mathbf{x}')$, not $\boldsymbol{\phi}(\mathbf{x})$ (verified numerically below)
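
A minimal numeric check of this identity, assuming the degree-2 polynomial kernel $k(\mathbf{x}, \mathbf{z}) = (\mathbf{x}^\top \mathbf{z})^2$ and its standard explicit feature map in 2-D (the example kernel and vectors are illustrative, not from the slides):

```python
import numpy as np

def phi(x):
    # Explicit feature map for k(x, z) = (x^T z)^2 in 2-D:
    # phi(x) = (x1^2, sqrt(2) x1 x2, x2^2)
    return np.array([x[0]**2, np.sqrt(2) * x[0] * x[1], x[1]**2])

def k(x, z):
    # Kernel evaluated directly, without ever touching feature space
    return (x @ z) ** 2

x = np.array([1.0, 2.0])
z = np.array([3.0, -1.0])

# Both routes give the same number: phi(x)^T phi(z) == k(x, z)
print(phi(x) @ phi(z), k(x, z))   # 1.0 1.0
```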


SLIDE 5

Dual Representations

  • Recall the linear regression objective: $\min_{\mathbf{w}} \; \frac{1}{2} \sum_n \left(\mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x}_n) - y_n\right)^2 + \frac{\lambda}{2}\, \mathbf{w}^\top \mathbf{w}$
  • Solution: set gradient to 0: $\mathbf{w} = -\frac{1}{\lambda} \sum_n \left(\mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x}_n) - y_n\right) \boldsymbol{\phi}(\mathbf{x}_n) = \sum_n a_n\, \boldsymbol{\phi}(\mathbf{x}_n) = \boldsymbol{\Phi}^\top \mathbf{a}$
  • So $\mathbf{w}$ is a linear combination of inputs in feature space


SLIDE 6

Dual Representations

  • Substitute $\mathbf{w} = \boldsymbol{\Phi}^\top \mathbf{a}$, where $\boldsymbol{\Phi} = \left(\boldsymbol{\phi}(\mathbf{x}_1), \ldots, \boldsymbol{\phi}(\mathbf{x}_N)\right)^\top$ and $\mathbf{a} = (a_1, \ldots, a_N)^\top$
  • Dual objective: minimize $J(\mathbf{a}) = \frac{1}{2}\, \mathbf{a}^\top \boldsymbol{\Phi}\boldsymbol{\Phi}^\top \boldsymbol{\Phi}\boldsymbol{\Phi}^\top \mathbf{a} - \mathbf{a}^\top \boldsymbol{\Phi}\boldsymbol{\Phi}^\top \mathbf{y} + \frac{1}{2}\, \mathbf{y}^\top \mathbf{y} + \frac{\lambda}{2}\, \mathbf{a}^\top \boldsymbol{\Phi}\boldsymbol{\Phi}^\top \mathbf{a}$ with respect to $\mathbf{a}$


SLIDE 7

Gram Matrix

  • Let $\mathbf{K} = \boldsymbol{\Phi}\boldsymbol{\Phi}^\top$ be the Gram matrix, with entries $K_{nm} = \boldsymbol{\phi}(\mathbf{x}_n)^\top \boldsymbol{\phi}(\mathbf{x}_m) = k(\mathbf{x}_n, \mathbf{x}_m)$
  • Substitute in objective: $J(\mathbf{a}) = \frac{1}{2}\, \mathbf{a}^\top \mathbf{K}\mathbf{K}\, \mathbf{a} - \mathbf{a}^\top \mathbf{K} \mathbf{y} + \frac{1}{2}\, \mathbf{y}^\top \mathbf{y} + \frac{\lambda}{2}\, \mathbf{a}^\top \mathbf{K} \mathbf{a}$
  • Solution: set gradient to 0: $\mathbf{a} = (\mathbf{K} + \lambda \mathbf{I})^{-1} \mathbf{y}$
  • Prediction: $y(\mathbf{x}^*) = \mathbf{w}^\top \boldsymbol{\phi}(\mathbf{x}^*) = \mathbf{k}(\mathbf{x}^*)^\top (\mathbf{K} + \lambda \mathbf{I})^{-1} \mathbf{y}$ with $k_n(\mathbf{x}^*) = k(\mathbf{x}_n, \mathbf{x}^*)$, where $\{\mathbf{x}_n\}$ is the training set and $\mathbf{x}^*$ is a test instance


SLIDE 8

Dual Linear Regression

  • Prediction: $y(\mathbf{x}^*) = \mathbf{k}(\mathbf{x}^*)^\top (\mathbf{K} + \lambda \mathbf{I})^{-1} \mathbf{y}$
  • Linear regression where we find the dual solution $\mathbf{a}$ instead of the primal solution $\mathbf{w}$.
  • Complexity:
– Primal solution: depends on the # of basis functions $M$ (invert an $M \times M$ matrix)
– Dual solution: depends on the amount of data $N$ (invert the $N \times N$ matrix $\mathbf{K} + \lambda\mathbf{I}$)
  • Advantage: can use a very large # of basis functions
  • Just need to know the kernel $k(\mathbf{x}, \mathbf{x}')$ (see the sketch below)
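
A minimal sketch of dual linear regression using these formulas, with the Gaussian kernel introduced later in the lecture; the toy data, $\lambda$ and $\sigma$ values are made-up illustrations:

```python
import numpy as np

def gaussian_kernel(X1, X2, sigma=1.0):
    # k(x, x') = exp(-||x - x'||^2 / (2 sigma^2)), computed pairwise
    sq = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq / (2 * sigma**2))

# Toy 1-D training data: y = sin(x) plus noise
rng = np.random.default_rng(0)
X = rng.uniform(0, 6, size=(30, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(30)

lam = 0.1
K = gaussian_kernel(X, X)                          # Gram matrix
a = np.linalg.solve(K + lam * np.eye(len(X)), y)   # a = (K + lam I)^{-1} y

# Prediction at test points: y(x*) = k(x*)^T a
X_test = np.linspace(0, 6, 5).reshape(-1, 1)
y_pred = gaussian_kernel(X_test, X) @ a
print(y_pred)
```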


SLIDE 9

Constructing Kernels

  • Two possibilities:
– Find a mapping $\boldsymbol{\phi}$ to feature space and let $k(\mathbf{x}, \mathbf{x}') = \boldsymbol{\phi}(\mathbf{x})^\top \boldsymbol{\phi}(\mathbf{x}')$
– Directly specify $k(\mathbf{x}, \mathbf{x}')$
  • Can any function that takes two arguments serve as a kernel?
  • No, a valid kernel must be positive semi-definite
– In other words, the Gram matrix $\mathbf{K}$ must factor into the product of a transposed matrix by itself (e.g., $\mathbf{K} = \boldsymbol{\Phi}\boldsymbol{\Phi}^\top$)
– Or, all eigenvalues of $\mathbf{K}$ must be greater than or equal to 0 (tested empirically in the sketch below)
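
A quick empirical test of positive semi-definiteness on sample data (it can refute validity but not prove it, since it checks only one Gram matrix); the two candidate kernels below are illustrative choices:

```python
import numpy as np

def min_gram_eigenvalue(kernel, X):
    # Build the Gram matrix K_nm = k(x_n, x_m) and return its smallest
    # eigenvalue; a valid kernel gives >= 0 (up to floating-point rounding)
    K = np.array([[kernel(xn, xm) for xm in X] for xn in X])
    return np.linalg.eigvalsh(K).min()

rng = np.random.default_rng(0)
X = rng.standard_normal((20, 3))

valid = lambda x, z: (x @ z) ** 2               # degree-2 polynomial kernel
invalid = lambda x, z: -np.linalg.norm(x - z)   # not PSD in general

print(min_gram_eigenvalue(valid, X))    # ~0 or positive
print(min_gram_eigenvalue(invalid, X))  # clearly negative
```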


SLIDE 10

Example

  • Let $k(\mathbf{x}, \mathbf{z}) = (\mathbf{x}^\top \mathbf{z})^2$ with $\mathbf{x}, \mathbf{z} \in \mathbb{R}^2$. Then $(\mathbf{x}^\top \mathbf{z})^2 = x_1^2 z_1^2 + 2 x_1 x_2 z_1 z_2 + x_2^2 z_2^2 = \boldsymbol{\phi}(\mathbf{x})^\top \boldsymbol{\phi}(\mathbf{z})$ with $\boldsymbol{\phi}(\mathbf{x}) = (x_1^2, \sqrt{2}\, x_1 x_2, x_2^2)^\top$, so $k$ is a valid kernel.


SLIDE 11

Constructing Kernels

  • Can we construct $k(\mathbf{x}, \mathbf{x}')$ directly without knowing $\boldsymbol{\phi}(\mathbf{x})$?
  • Yes, any positive semi-definite $k(\mathbf{x}, \mathbf{x}')$ is fine since there is a corresponding implicit feature space. But positive semi-definiteness is not always easy to verify.
  • Alternative: construct kernels from other kernels using rules that preserve positive semi-definiteness


SLIDE 12

Rules to construct Kernels

  • Let $k_1(\mathbf{x}, \mathbf{x}')$ and $k_2(\mathbf{x}, \mathbf{x}')$ be valid kernels
  • The following kernels are also valid (a few are sketched in code below):
1. $k(\mathbf{x}, \mathbf{x}') = c\, k_1(\mathbf{x}, \mathbf{x}')$, where $c > 0$
2. $k(\mathbf{x}, \mathbf{x}') = f(\mathbf{x})\, k_1(\mathbf{x}, \mathbf{x}')\, f(\mathbf{x}')$, where $f$ is any function
3. $k(\mathbf{x}, \mathbf{x}') = q(k_1(\mathbf{x}, \mathbf{x}'))$, where $q$ is polynomial with coeffs $\geq 0$
4. $k(\mathbf{x}, \mathbf{x}') = \exp(k_1(\mathbf{x}, \mathbf{x}'))$
5. $k(\mathbf{x}, \mathbf{x}') = k_1(\mathbf{x}, \mathbf{x}') + k_2(\mathbf{x}, \mathbf{x}')$
6. $k(\mathbf{x}, \mathbf{x}') = k_1(\mathbf{x}, \mathbf{x}')\, k_2(\mathbf{x}, \mathbf{x}')$
7. $k(\mathbf{x}, \mathbf{x}') = k_3(\boldsymbol{\phi}(\mathbf{x}), \boldsymbol{\phi}(\mathbf{x}'))$, where $k_3$ is a valid kernel over the range of $\boldsymbol{\phi}$
8. $k(\mathbf{x}, \mathbf{x}') = \mathbf{x}^\top \mathbf{A}\, \mathbf{x}'$, where $\mathbf{A}$ is symmetric positive semi-definite
9. $k(\mathbf{x}, \mathbf{x}') = k_a(\mathbf{x}_a, \mathbf{x}_a') + k_b(\mathbf{x}_b, \mathbf{x}_b')$, where $\mathbf{x} = (\mathbf{x}_a, \mathbf{x}_b)$ and $k_a$, $k_b$ are valid kernels
10. $k(\mathbf{x}, \mathbf{x}') = k_a(\mathbf{x}_a, \mathbf{x}_a')\, k_b(\mathbf{x}_b, \mathbf{x}_b')$
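
A small sketch of rules 4-6 in code, composing new kernels from a base linear kernel (the helper names are mine, not from the slides):

```python
import numpy as np

# Base kernel: plain dot product (valid, with phi = identity)
linear = lambda x, z: x @ z

# Rule 5: sum of valid kernels is valid
def k_sum(k1, k2):
    return lambda x, z: k1(x, z) + k2(x, z)

# Rule 6: product of valid kernels is valid
def k_prod(k1, k2):
    return lambda x, z: k1(x, z) * k2(x, z)

# Rule 4: exp of a valid kernel is valid
def k_exp(k1):
    return lambda x, z: np.exp(k1(x, z))

# e.g. build exp(x.z + (x.z)^2), valid by rules 6, 5, 4 in turn
k = k_exp(k_sum(linear, k_prod(linear, linear)))
x, z = np.array([1.0, 0.5]), np.array([0.2, -1.0])
print(k(x, z))
```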

SLIDE 13

Common Kernels

  • Polynomial kernel: $k(\mathbf{x}, \mathbf{x}') = (\mathbf{x}^\top \mathbf{x}')^M$
– $M$ is the degree
– Feature space: all degree-$M$ products of entries in $\mathbf{x}$
– Example: let $\mathbf{x}$ and $\mathbf{x}'$ be two images; then the feature space could be all products of $M$ pixel intensities
  • More general polynomial kernel: $k(\mathbf{x}, \mathbf{x}') = (\mathbf{x}^\top \mathbf{x}' + c)^M$ with $c > 0$
– Feature space: all products of up to $M$ entries in $\mathbf{x}$ (verified numerically below)
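
A numeric check of the more general kernel for $M = 2$, $c = 1$ in 2-D, with the standard explicit feature map containing all products of up to 2 entries (the example values are illustrative):

```python
import numpy as np

def phi(x):
    # Feature map for k(x, z) = (x^T z + 1)^2 in 2-D: constant, linear,
    # and degree-2 terms, with sqrt weights chosen so that
    # phi(x)^T phi(z) reproduces the kernel exactly
    x1, x2 = x
    return np.array([1.0,
                     np.sqrt(2) * x1, np.sqrt(2) * x2,
                     x1**2, np.sqrt(2) * x1 * x2, x2**2])

k = lambda x, z: (x @ z + 1.0) ** 2

x, z = np.array([1.0, 2.0]), np.array([3.0, -1.0])
print(phi(x) @ phi(z), k(x, z))  # both 4.0
```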


SLIDE 14

Common Kernels

  • Gaussian kernel: $k(\mathbf{x}, \mathbf{x}') = \exp\!\left(-\frac{\|\mathbf{x} - \mathbf{x}'\|^2}{2\sigma^2}\right)$
  • Valid kernel because $\exp\!\left(-\frac{\|\mathbf{x} - \mathbf{x}'\|^2}{2\sigma^2}\right) = \exp\!\left(-\frac{\mathbf{x}^\top\mathbf{x}}{2\sigma^2}\right) \exp\!\left(\frac{\mathbf{x}^\top\mathbf{x}'}{\sigma^2}\right) \exp\!\left(-\frac{\mathbf{x}'^\top\mathbf{x}'}{2\sigma^2}\right)$, which is valid by rules 2 and 4 applied to the linear kernel (checked numerically below)
  • Implicit feature space is infinite! (the Taylor expansion of $\exp(\mathbf{x}^\top\mathbf{x}'/\sigma^2)$ has infinitely many terms)
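
A numeric check of that factorization ($\sigma = 1$ and the vectors are arbitrary choices):

```python
import numpy as np

sigma = 1.0
x = np.array([1.0, 2.0])
z = np.array([0.5, -1.0])

# Left-hand side: Gaussian kernel computed directly
direct = np.exp(-np.sum((x - z)**2) / (2 * sigma**2))

# Right-hand side: the three-factor decomposition from the slide
factored = (np.exp(-(x @ x) / (2 * sigma**2))
            * np.exp((x @ z) / sigma**2)
            * np.exp(-(z @ z) / (2 * sigma**2)))

print(direct, factored)  # equal up to floating point
```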


SLIDE 15

Non-vectorial Kernels

  • Kernels can be defined with respect to things other than vectors, such as sets, strings or graphs
  • Example for strings: similarity between two documents, computed as a weighted sum over all non-contiguous substrings that appear in both documents (a brute-force sketch appears below)

  • Lodhi, Saunders, Shawe-Taylor, Cristianini, Watkins, Text Classification Using String Kernels, JMLR, 2:419-444, 2002.
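
A brute-force sketch of such a gap-weighted subsequence kernel, workable only for toy strings (Lodhi et al. give an efficient dynamic program; the decay parameter `lam` and subsequence length `p` here are illustrative):

```python
from itertools import combinations

def subsequence_kernel(s, t, p=2, lam=0.5):
    # phi_u(s) = sum over occurrences of subsequence u in s of
    # lam^(span of the occurrence), where span = last - first index + 1;
    # k(s, t) = sum_u phi_u(s) phi_u(t). Brute force: enumerate all
    # length-p index tuples, so only usable on very short strings.
    def weights(x):
        w = {}
        for idx in combinations(range(len(x)), p):
            u = ''.join(x[i] for i in idx)
            span = idx[-1] - idx[0] + 1
            w[u] = w.get(u, 0.0) + lam ** span
        return w
    ws, wt = weights(s), weights(t)
    return sum(v * wt.get(u, 0.0) for u, v in ws.items())

# Shared subsequences like "ca", "cr", "ar" contribute; gaps are penalized
print(subsequence_kernel("card", "cart"))
```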
