weighted sums of random kitchen
play

WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with - PowerPoint PPT Presentation

WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with randomization in learning The model Given a set of training data in a domain Fit a function to minimize risk Empirical Risk Risk Loss Function Hinge loss


  1. WEIGHTED SUMS OF RANDOM KITCHEN SINKS Replacing minimization with randomization in learning

  2. The model • Given a set of training data in a domain • Fit a function to minimize risk • Empirical Risk • Risk

  3. Loss Function • Hinge loss (SVM) • Exponential loss (AdaBoost) • Quadratic loss

  4. Form of solution function • Consider solutions in the form weights feature functions • Feature functions • Eigenfunctions (kernel SVM) • Decision trees/stumps (AdaBoost) • More feature functions gives better classification

  5. Solving f • Approximate • This is hard! • New approach: • Randomly choose and minimize over

  6. Randomized approach • Training data • Feature function • Number of features • Parameter distribution • Scaling factor • Algorithm • Draw feature parameters randomly from • Let • Minimize empirical risk

  7. Experimental Results vs AdaBoost • Three datasets • adult • activity • KDDCUP99 • Feature function • sampled uniformly at random • sampled from Gaussian

  8. Experimental Results vs AdaBoost

  9. Pros and Cons • Pros • Much faster • Allows simple and efficient experimentation of feature functions • Cons • Some loss in quality • Need to tune probability distribution (not needed in practice)

  10. Concentration of Risk • The randomized algorithm returns a function such that with probability • Number of training points • Number of feature vectors • Lipschitz constant of loss function • Bound approximation error • Lowest risk versus lowest risk from functions returned is not large • Bound estimation error • True risk of every function returned is close to its empirical risk

  11. Proof • minimizer of risk over all solution functions • minimizer of risk over functions returned • minimizer of empirical risk over functions returned • Then with probability

  12. Bounding approximation error • Lemma 1. Let be i.i.d. random variables in a ball of radius centered about the origin in a Hilbert space. Then with probability • Construct functions • Then there exists • So that

  13. Bounding approximation error • If the loss function has Lipschitz constant , for any two functions • Then

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend