SLIDE 1

Instance Based Learning

  • k-Nearest Neighbor
  • Locally weighted regression
  • Radial basis functions
  • Case-based reasoning
  • Lazy and eager learning

SLIDE 2

Instance-Based Learning

Key idea: just store all training examples ⟨xi, f(xi)⟩

Nearest neighbor:

  • Given query instance xq, first locate nearest training example xn, then estimate f̂(xq) ← f(xn)

k-Nearest neighbor:

  • Given xq, take vote among its k nearest neighbors (if discrete-valued target function)
  • Take mean of f values of its k nearest neighbors (if real-valued):

    \hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}
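A minimal sketch of both variants, assuming Euclidean distance over NumPy arrays (the function and variable names are illustrative, not from the slides):

    import numpy as np
    from collections import Counter

    def knn_predict(X_train, y_train, x_q, k=3, classification=True):
        """Predict f(x_q) from the k nearest stored examples."""
        # Euclidean distance from the query to every stored instance
        dists = np.linalg.norm(X_train - x_q, axis=1)
        nearest = np.argsort(dists)[:k]          # indices of the k nearest neighbors
        if classification:
            # discrete-valued target: majority vote among the k neighbors
            return Counter(y_train[nearest]).most_common(1)[0][0]
        # real-valued target: mean of the neighbors' f values
        return y_train[nearest].mean()

    # tiny usage example
    X = np.array([[0.0, 0.0], [1.0, 1.0], [0.9, 1.1], [5.0, 5.0]])
    y = np.array([0, 1, 1, 0])
    print(knn_predict(X, y, np.array([1.0, 0.9]), k=3))   # -> 1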

SLIDE 3

When To Consider Nearest Neighbor

  • Instances map to points in ℜn
  • Less than 20 attributes per instance
  • Lots of training data

Advantages:

  • Training is very fast
  • Learn complex target functions
  • Don’t lose information

Disadvantages:

  • Slow at query time
  • Easily fooled by irrelevant attributes

SLIDE 4

Voronoi Diagram

[Figure: Voronoi diagram induced by 1-nearest neighbor over positive (+) and negative (−) training examples, with query point xq]

SLIDE 5

Behavior in the Limit

Consider p(x), which defines the probability that instance x will be labeled 1 (positive) versus 0 (negative).

Nearest neighbor:

  • As number of training examples → ∞, approaches Gibbs Algorithm
    (Gibbs: with probability p(x) predict 1, else 0)

k-Nearest neighbor:

  • As number of training examples → ∞ and k gets large, approaches Bayes optimal
    (Bayes optimal: if p(x) > 0.5 then predict 1, else 0)

Note Gibbs has at most twice the expected error of Bayes optimal.
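A short justification of that last claim (a standard one-line calculation, not spelled out on the slide): at any x the Gibbs predictor errs with probability

    p(x)\,(1 - p(x)) + (1 - p(x))\,p(x) = 2\,p(x)\,(1 - p(x))

while Bayes optimal errs with probability min(p(x), 1 − p(x)). Since 2 p(x)(1 − p(x)) = 2 · min(p(x), 1 − p(x)) · max(p(x), 1 − p(x)) ≤ 2 · min(p(x), 1 − p(x)), the Gibbs error is at most twice the Bayes-optimal error at every x, and hence in expectation.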

SLIDE 6

Distance-Weighted kNN

Might want to weight nearer neighbors more heavily...

    \hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i\, f(x_i)}{\sum_{i=1}^{k} w_i}
    \qquad \text{where } w_i \equiv \frac{1}{d(x_q, x_i)^2}

and d(xq, xi) is the distance between xq and xi.

Note it now makes sense to use all training examples instead of just k
→ Shepard's method
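A minimal sketch of the distance-weighted estimate for a real-valued target, assuming Euclidean distance (names are illustrative; the small epsilon guarding against a zero distance is my addition, not from the slides):

    import numpy as np

    def distance_weighted_knn(X_train, y_train, x_q, k=None):
        """Shepard-style estimate: weight each neighbor by 1 / d(x_q, x_i)^2."""
        dists = np.linalg.norm(X_train - x_q, axis=1)
        if k is not None:
            idx = np.argsort(dists)[:k]          # use only the k nearest...
        else:
            idx = np.arange(len(X_train))        # ...or all examples (Shepard's method)
        w = 1.0 / (dists[idx] ** 2 + 1e-12)      # epsilon avoids division by zero
        return np.dot(w, y_train[idx]) / w.sum()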

SLIDE 7

Curse of Dimensionality

Imagine instances described by 20 attributes, but only 2 are relevant to the target function.

Curse of dimensionality: nearest neighbor is easily misled when X is high-dimensional.

One approach (a sketch follows below):

  • Stretch jth axis by weight zj, where z1, . . . , zn are chosen to minimize prediction error
  • Use cross-validation to automatically choose weights z1, . . . , zn
  • Note setting zj to zero eliminates this dimension altogether

see [Moore and Lee, 1994]
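A minimal sketch of axis stretching, assuming the weight vector z is scored by leave-one-out 1-NN error; the greedy coordinate search over a tiny candidate grid is purely illustrative, not the method of the cited paper:

    import numpy as np

    def loo_1nn_error(X, y, z):
        """Leave-one-out error of 1-NN after stretching axis j by weight z[j]."""
        Xz = X * z                                   # stretch each axis
        errors = 0
        for i in range(len(X)):
            d = np.linalg.norm(Xz - Xz[i], axis=1)
            d[i] = np.inf                            # exclude the held-out point
            errors += y[np.argmin(d)] != y[i]
        return errors / len(X)

    def choose_axis_weights(X, y, candidates=(0.0, 0.5, 1.0, 2.0)):
        """Greedy coordinate search for z_1..z_n; z_j = 0 drops that dimension."""
        z = np.ones(X.shape[1])
        for j in range(X.shape[1]):
            scores = []
            for c in candidates:
                zc = z.copy(); zc[j] = c
                scores.append(loo_1nn_error(X, y, zc))
            z[j] = candidates[int(np.argmin(scores))]
        return z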

SLIDE 8

Locally Weighted Regression

Note kNN forms a local approximation to f for each query point xq. Why not form an explicit approximation f̂(x) for the region surrounding xq?

  • Fit linear function to k nearest neighbors
  • Fit quadratic, ...
  • Produces "piecewise approximation" to f

Several choices of error to minimize:

  • Squared error over k nearest neighbors:

    E_1(x_q) \equiv \frac{1}{2} \sum_{x \,\in\, k \text{ nearest nbrs of } x_q} (f(x) - \hat{f}(x))^2

  • Distance-weighted squared error over all of D:

    E_2(x_q) \equiv \frac{1}{2} \sum_{x \in D} (f(x) - \hat{f}(x))^2 \, K(d(x_q, x))

  • . . .
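A minimal sketch of locally weighted linear regression in the E2 style, assuming a Gaussian kernel K and NumPy's least-squares solver (the bandwidth name `tau` is my choice, not from the slides):

    import numpy as np

    def lwr_predict(X, y, x_q, tau=1.0):
        """Fit a weighted linear model around x_q, then evaluate it at x_q."""
        d = np.linalg.norm(X - x_q, axis=1)
        k = np.exp(-(d ** 2) / (2 * tau ** 2))       # kernel weight K(d(x_q, x))
        A = np.hstack([np.ones((len(X), 1)), X])     # add intercept column
        W = np.sqrt(k)[:, None]
        # weighted least squares: minimize sum_x K(d(x_q, x)) * (f(x) - w . a(x))^2
        w, *_ = np.linalg.lstsq(W * A, np.sqrt(k) * y, rcond=None)
        return np.concatenate(([1.0], x_q)) @ w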

SLIDE 9

Radial Basis Function Networks

  • Global approximation to target function, in terms of linear combination of local approximations
  • Used, e.g., for image classification
  • A different kind of neural network
  • Closely related to distance-weighted regression, but "eager" instead of "lazy"

SLIDE 10

Radial Basis Function Networks

[Figure: RBF network with input attributes a1(x), . . . , an(x), a layer of kernel units, and an output unit with weights w0, w1, . . . , wk]

where ai(x) are the attributes describing instance x, and

    f(x) = w_0 + \sum_{u=1}^{k} w_u \, K_u(d(x_u, x))

One common choice for Ku(d(xu, x)) is the Gaussian

    K_u(d(x_u, x)) = e^{-\frac{1}{2\sigma_u^2} d^2(x_u, x)}
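A minimal sketch of evaluating such a network, assuming Gaussian kernels with given centers and widths (all names here are illustrative):

    import numpy as np

    def rbf_output(x, centers, sigmas, w):
        """f(x) = w[0] + sum_u w[u+1] * exp(-d(x_u, x)^2 / (2 sigma_u^2))."""
        d2 = np.sum((centers - x) ** 2, axis=1)      # squared distance to each center x_u
        phi = np.exp(-d2 / (2 * sigmas ** 2))        # one Gaussian kernel value per center
        return w[0] + np.dot(w[1:], phi)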

SLIDE 11

Training Radial Basis Function Networks

Q1: What xu to use for each kernel function Ku(d(xu, x))?

  • Scatter uniformly throughout instance space
  • One for each cluster of instances (use prototypes)
  • Or use training instances (reflects instance distribution)

Q2: How to train weights (assume here Gaussian Ku)?

  • First choose variance (and perhaps mean) for each Ku
    – e.g., use EM
  • Then hold Ku fixed, and train linear output layer
    – efficient methods to fit linear function
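A minimal two-stage training sketch in that spirit; to stay dependency-light it simply samples training instances as centers and uses one shared width, then fits the output layer by least squares (both simplifications are illustrative, not prescribed by the slide):

    import numpy as np

    def train_rbf(X, y, k=10, sigma=1.0, rng=np.random.default_rng(0)):
        """Stage 1: pick centers; stage 2: fit the linear output layer."""
        centers = X[rng.choice(len(X), size=k, replace=False)]    # centers = training instances
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        Phi = np.exp(-d2 / (2 * sigma ** 2))                      # kernel activations, shape (n, k)
        Phi = np.hstack([np.ones((len(X), 1)), Phi])              # bias column for w0
        w, *_ = np.linalg.lstsq(Phi, y, rcond=None)               # efficient linear fit
        return centers, w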

SLIDE 12

Case-Based Reasoning

Can apply instance-based learning even when X ≠ ℜn
→ need a different "distance" metric

Case-Based Reasoning is instance-based learning applied to instances with symbolic logic descriptions:

  ((user-complaint error53-on-shutdown)
   (cpu-model PowerPC)
   (operating-system Windows)
   (network-connection PCIA)
   (memory 48meg)
   (installed-applications Excel Netscape VirusScan)
   (disk 1gig)
   (likely-cause ???))
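One simple "distance" for symbolic cases like the one above would be the fraction of shared attributes whose values disagree; a minimal sketch, assuming each case is a dict of attribute → value (this particular metric is an illustration, not the one any specific CBR system uses):

    def symbolic_distance(case_a, case_b):
        """Fraction of shared attributes on which two symbolic cases disagree."""
        attrs = set(case_a) & set(case_b)
        if not attrs:
            return 1.0
        mismatches = sum(case_a[a] != case_b[a] for a in attrs)
        return mismatches / len(attrs)

    # usage: compare a new fault report against a stored case
    stored = {"cpu-model": "PowerPC", "operating-system": "Windows", "memory": "48meg"}
    query  = {"cpu-model": "PowerPC", "operating-system": "Linux",   "memory": "48meg"}
    print(symbolic_distance(stored, query))   # -> 0.333...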

SLIDE 13

Case-Based Reasoning in CADET

CADET: 75 stored examples of mechanical devices

  • each training example: qualitative function, mechanical structure
  • new query: desired function
  • target value: mechanical structure for this function

Distance metric: match qualitative function descriptions

SLIDE 14

Case-Based Reasoning in CADET

[Figure: a stored case (T-junction pipe) with its qualitative Function (Q = waterflow, T = temperature) and its Structure, alongside a problem specification (water faucet) with its desired Function given and its Structure marked "?"]

SLIDE 15

Case-Based Reasoning in CADET

  • Instances represented by rich structural descriptions
  • Multiple cases retrieved (and combined) to form solution to new problem
  • Tight coupling between case retrieval and problem solving

Bottom line:

  • Simple matching of cases useful for tasks such as answering help-desk queries
  • Area of ongoing research

SLIDE 16

Lazy and Eager Learning

Lazy: wait for query before generalizing

  • k-Nearest Neighbor, Case-based reasoning

Eager: generalize before seeing query

  • Radial basis function networks, ID3, C4.5, Backpropagation, NaiveBayes, . . .

Does it matter?

  • Eager learner creates one global approximation
  • Lazy learner can create many local approximations
  • If they use the same H, lazy can represent more complex functions (e.g., consider H = linear functions)
