Machine learning Boltzmann Machines Dima Kochkov 1 1 Department of - PowerPoint PPT Presentation

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Machine learning Boltzmann Machines Dima Kochkov 1 1 Department of Physics University of Illinois at Urbana-Champaign Algorithm interest meeting, 2016

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Outline Algorithmic improvements 4 Machine learning 1 Markov Chain Monte Carlo Motivation Mean Field approximation Hall of fame Restricted Boltzmann Machine learning basics 2 Machines General framework Neural networks Examples 5 Energy based models Matching probability 3 Hopfield Nets distribution Boltzmann machines Image features extraction

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Motivation for machine learning Machine learning is a problem solving approach that comes in handy when algorithmic solution is hard to obtain. Pros: requires minimum prior knowledge solution can adapt to a new environment Cons: inefficient use of hardware weaker guarantees of correctness requires big datasets for complex problems

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Examples of successful applications Just to name a few: Speech recognition - Siri, ok Google Image recognition - ImageNet Fraud detection Recommendation systems - Netflix competition Games - AlphaGo Funky stuff like self driving cars, robotics etc

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Optimizational point of view Solution to the problem is given in a variational form of a black box with a gazillion of knobs. Algorithm tunes those knobs to have a better solution. Algorithm is data driven. supervised learning (SVM, BackProp, Decision Trees etc) unsupervised learning (Kmeans, EM, Boltzmann Machines )

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Artificial Neural Networks Artificial Neural Networks represent a class of models that constitute a set of connected units (neurons). Most of the time one can define following properties of a neuron: input values, x i vector < bool > vector < double > output value, f(input, links), usually f( w i x i ) f = tanh ( w i x i ) 1 f = 1+ e − ( wi xi ) f = max (0 , w i x i ) Activity pattern evolves according to a specific rule of the network. I will use s i to represent i th neuron, or v i and h i .

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Memory storage One of the simplest energy based models - Hopfield Net: Binary units s i ∈ 0 , 1 Symmetric weights w i , j = w j , i Features a global energy function E Energy minimas correspond to memories � � E = − b i s i − w i , j s i s j (1) i i , j Figure: Hopfield Net

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Basic Boltzmann machine Ingredients for the Boltzmann machine: Hopfield net + hidden units Gibbs probability − E distribution P = e T Z � � E = − b i s i − w i , j s i s j (2) i i , j Figure: Boltzmann machine s can be either visible ( v ) or hidden ( h ) units of the model A generative model with a potential for data interpretation.

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary How does BM ”interpret” the data States of the hidden units correspond to interpretations of the data. Low energy states of the hidden units given visible units correspond to ”good” interpretations Common structure allows low energy interpretations Figure: Data interpretation

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary How do we learn? Learning objective We want minimize the ”distance” between the probability distribution our Boltzmann machine generates and the distribution from which the data was drawn. � � P = log( P ( v = d i )) = log( P ( v = d i ) (3) i ∈ data i ∈ data ∂ P ∂ ′ ) − ′ , h = h ′ )) � � � = ( − E ( v = d i , h = h − E ( v = v ∂ w α,β ∂ w α,β i h ′ v ′ , h ′ (4) ∂ P � � � = ( s α s β ) | v = d i − ( s α s β ) (5) ∂ w α,β i h ′ v ′ , h ′ ∂ P = < s α s β > data − < s α s β > (6) ∂ w α,β

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Algorithm To train a Boltzmann machine on a given dataset we: fix visible units to the values of the data instance compute < s α s β > (positive phase) set visible units free and again compute < s α s β > (negative phase) after processing a batch of data, update parameters Potential issues Can’t efficiently compute < s α s β >

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary MCMC Markov Chain Monte Carlo: e − E � � < s α s β > = s α s β p ( s ) = (7) s α s β Z s s clamp the data on the visible neurons sample s α s β from the Markov chain (positive phase) set visible units free and again sample s α s β (negative phase) repeat for the dataset, update weights Potential issues Markov chain might take a very long time to equilibrate How do we know if we have a good estimate?

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary better MCMC We can use a clever trick to have a warm start. We keep a set of equilibrated Markov chains with fixed and free visible units. Equilibrated chains with clamped units (”Particles”) are used to evaluate < s α s β > data Equilibrated chains with free visible units (”Fantasy particles”) are used to evaluate < s α s β > Status Still to slow for most applications In theory should work well only for a full batch learning Much better than previously described method

Machine learning Boltzmann Machines Dima Kochkov 1 1 Department of - PowerPoint PPT Presentation

Machine learning Machine learning basics Energy based models Algorithmic improvements Examples Summary Machine learning Boltzmann Machines Dima Kochkov 1 1 Department of Physics University of Illinois at Urbana-Champaign Algorithm interest

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

INTRODUCTION TO MACHINE LEARNING Joseph C. Osborn CS 51A Spring 2020 Machine Learning is

Human and Machine Learning Tom Mitchell Machine Learning Department Carnegie Mellon University

Machine Learning Algorithms for Classification Machine Learning Algorithms for Classification

Machine Learning - Intro Aarti Singh Machine Learning 10-701/15-781 Sept 8, 2010 You tell me

MACHINE LEARNING Kernel Canonical Correlation Analysis 1 ADVANCED MACHINE LEARNING ADVANCED

Machine learning for finance Nathan George Data Science Professor DataCamp Machine Learning

APPLIED MACHINE LEARNING Methods for Clustering K-means, Soft K-means DBSCAN 1 MACHINE

Introduction to Machine Learning COMPSCI 371D Machine Learning COMPSCI 371D Machine

Fifteen Minutes of Unwanted Fame: Detecting and Characterizing Doxing Peter Snyder*

Channels inclusion, falsification, and verification Francesco Buscemi 1 in coll. with: S.

Computer Security DD2395 http://www.csc.kth.se/utbildning/kth/kurser/DD2395/dasakh11/ Fall 2011

Core Claim Formal Methods in Software Development Computer programs/systems are subject to exact

Latest on DEMETRA: usage and I T concepts for today and tom orrow On: 11 th May, 2006 By:

Prog 2, Kangaroo Hall of Fame, Midterm Review, (Interpolation if time) Week 6, Wed Feb 9

Farm Business Management: The Fundamentals of Good Practice Peter L. Nuthall Chapter 17 The

Sales Update Dubai Oct 2017 Engine Leasing Introduction to ELF The market Market size