Learning Discrete Graphical Models with Neural Networks Andrey - PowerPoint PPT Presentation

Learning Discrete Graphical Models with Neural Networks Andrey Lokhov joint work with Abhijith Jayakumar, Sidhant Misra, Marc Vuffray UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Graphical Models Probability distribution ! " has conditional dependency structure according to a given graph Factorization property Separation property " ( " # " # " ( " ' " % " $ " ' " % " $ " & " & ! " ∝ exp 1 : 2 (" 2 ) " ( |(" ' , " # ) is independent of (" % , " & , " $ ) 2∈2456789 UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Graphical Model Learning Informally Unsupervised learning task Dimensions of the problem - Number of samples: % - Observe draws of random vectors ! - Number of variables: & - Learn structure and parameters of - Alphabet size: ' a positive distribution " ! > 0 (! ) ∈ 1, … , ' ) Prior work in computationally efficient learning Convex optimization based methods Mutual Information based greedy methods Vuffray, Misra, Lokhov ( 2016 , 2018 ) Bresler ( 2015 ) Klivans, Meka ( 2017 ) Hamilton, Koehler, Moitra ( 2017 ) Wu, Sanghavi, Dimakis ( 2019 ) UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Setting of Graphical Model Learning The model has a parametric form: - Observe random draws of " - Recover parameters ∗ - ( (" ( ) ≤ 5 ! " ∝ exp ' + ( 2 + − + ∗ 2 (∈* Basis functions are centered: Prior ℓ 8 -bound on parameters: ∗ ≤ < ∗ + 9 8 = ' + ( = ' - ( " ( = 0, 0 ∈ 1 (∋9 > ? UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Method for Solving the Inverse problem: GRISE Generalized Regularized Interaction Screening (GRISE) Arbitrary parametric form C 1 ∗ - ( (" ( ) 5 A ) + 0 = arg min @ ' exp − ' + ( - ( (" ( ! " ∝ exp ' + ( = > AB1 (∈* > (∈* s.t. + 0 1 ≤ 3 4 Local Reconstruction (one neighborhood at a time) Convex Function (with low complexity minimization using entropic descent) UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Intuition Behind GRISE: Infinite Sample Size Limit ∗ + 1 0 1 〈 S i *(J i ,H i ) 〉 ∗ - ( (" ( ) ! " ∝ exp ' + ( 〈 S i *(J i =0,H i =0) 〉 =1 (∈* i i 2→4 0 1 ∗ + 1 = 6 exp 9 0 1 + 1 − ' + ( - ( " ( (∈* 8 (=) + 1 ∗ + 1 J i ∗ 0 1 〈 S i *(J i *,H i *) 〉 ∗ + 1 ∗ = 0 H i ∇ ; 8 0 1 (<) i + 1 UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Theorem for Learning Gibbs Distributions with GRISE (Informal) With high probability, GRISE estimates : ≤ & ! " − " ∗ 2 with a number of samples : + , -. log 2 /& 4 ( = * and computational complexity : * + 2 . Precise finite sample analysis with proofs: arXiv:1902.00600 UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Complete Basis Function Hierarchies: Monomial Basis Example # < # % ∗ # % # % + ∗ # % # ∗ # % + - # ∝ exp 2 4 % 2 4 %' 2 4 %'" ' # " + ⋯ %∈3 (%,')∈8 9 (%,',")∈8 : # = # " # ' Binary alphabet # ∈ −1, +1 , ! " # " ∈ # % , # % # ' , # % # ' # " , … Monomial basis functions UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Complete Basis Function Hierarchies: Monomial Basis Example " 7 " ( ∗ " ( " ( + ∗ " ( " ∗ " ( + ! " ∝ exp ' + ( ' + (0 ' + (04 0 " 4 + ⋯ (∈* ((,0)∈2 3 ((,0,4)∈2 5 " 8 " 4 " 0 Interaction Screening Loss: M 1 > + ( = arg min I ' exp −" ( + ( + ' + (0 " ( + ' + (04 " 0 " 4 + ⋯ F G JKL 0 0,4 ; < = . For 9 -wise models, the computational complexity of GRISE is : UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Neural Net Parametrization of the Partial Energy Function Interaction Screening Loss: 3 1 > ? # = arg min . / exp −8 # ? # + / ? #B 8 # + / ? #BD 8 B 8 D + ⋯ @ , 012 B B,D Neural Net Interaction Screening Loss: 3 1 " # = arg min ! . / exp −8 # ΝΝ(8\8 # ; " # ) + , 012 If Neural Net is expressive enough, the global minima of NN-GRISE loss are interaction screening minima corresponding to recovered local energy UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Illustration on a small ( ! = #$ ) tractable model of order % = & NN-GRISE hierarchy contains higher-order polynomials in its hypothesis space NN-GRISE explores a different basis functions hierarchy, and gets close to the true model with less parameters UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Comparison of conditional distributions for a larger problem For p=15, L=6 problem, monomial basis contains 3472 terms, and GRISE becomes intractable Only order L=4 is practically feasible with GRISE NN basis has less parameters (349) and uses less training samples UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Structure Learning with NN-GRISE 3 1 (2) " # = arg min ! . / exp −8 # ΝΝ(8\8 # ; " # ) + ? " # 2 + , 012 Regularization through penalty on first layer weights Variables @ outside of the neighborhood of A do not influence the output at the interaction screening minima UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Summary - GRISE is a convex estimator for learning arbitrary discrete graphical models with rigorous guarantees, improving upon sampling complexities of previous methods Efficient Learning of Discrete Graphical Models M. Vuffray, S. Misra, A. Y. Lokhov (2020) - NN-GRISE is a computationally efficient non-convex estimator that uses the non-linear representation power of Neural Nets to exploit sparse basis hierarchies - NN-GRISE can still learn the MRF structure, full energy function representation, and conditional distributions that can be used for re-sampling from the learned model Learning of Discrete Graphical Models with Neural Networks Abhijith J., A. Y. Lokhov, S. Misra, M. Vuffray (2020) UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Questions? UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energy’s NNSA

Learning Discrete Graphical Models with Neural Networks Andrey - PowerPoint PPT Presentation

Slide 1 Learning Discrete Graphical Models with Neural Networks Andrey Lokhov joint work with Abhijith Jayakumar, Sidhant Misra, Marc Vuffray UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energys NNSA

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Probabilistic Graphical Models Probabilistic Graphical Models introduction to learning Siamak

Neural Networks Neural networks arise from attempts to model Neural Networks human/animal

Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven

Probabilistic Graphical Models Probabilistic Graphical Models Structure learning in Bayesian

Probabilistic Graphical Models Probabilistic Graphical Models Variable elimination Siamak

Learning in Graphical Models Andrea Passerini passerini@disi.unitn.it Machine Learning Learning

Probabilistic Graphical Models Probabilistic Graphical Models parameter learning in undirected

Probabilistic Graphical Models Probabilistic Graphical Models Undirected Models Fall 2019

Neural Networks and their Application to Go Neural Networks Learning Blackjack Theory Training

Probabilistic Graphical Models Probabilistic Graphical Models Parameter learning in Bayesian

Sequential Data with Neural Networks Recurrent Neural Networks Sequential input / output Greg

Probabilistic Graphical Models CMSC 678 UMBC Probabilistic Graphical Models A graph G that

Graphical models for Neuroscience Part I Giuseppe Vinci Department of Statistics Rice

PubPol 201 Trade Policies under Trump Module 3: International Safeguards Trade Policy

Spack in Shared Environment 2018 Scalable Tools Workshop LANL Programming and Runtime

Charles Tracy Kate Bickert Linda Cook Haifa Bint-Kadi Rivers, Trails & Conservation Golden

Meeting 31: 17 August 2017 Karakia 2 Karakia Ko te tumanako Kia pai tenei r Kia tutuki i

SciLifeLab Bioinformatics Platform National Bioinformatics Infrastructure Sweden (NBIS) Bjrn

Factors affecting students re- enrolment at a public university system David Rodriguez-Gomez 1

Knowledge Graph Reasoning CSCI 699: ML4Know Instructor: Xiang Ren USC Computer Science Overview

5 Now there was in the citadel of Susa a Jew of the tribe of Benjamin, named Mordecai ... 6 who

Learning Discrete Graphical Models with Neural Networks Andrey - PowerPoint PPT Presentation

Slide 1 Learning Discrete Graphical Models with Neural Networks Andrey Lokhov joint work with Abhijith Jayakumar, Sidhant Misra, Marc Vuffray UNCLASSIFIED Managed by Triad National Security, LLC for the U.S. Department of Energys NNSA

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Graphical Models Graphical Models Bayesian Networks Siamak Ravanbakhsh Fall 2019 Previously on

Transforming Graphical System Models to Graphical Attack Models ! Joint work with Marieta

Probabilistic Graphical Models Probabilistic Graphical Models introduction to learning Siamak

Neural Networks Neural networks arise from attempts to model Neural Networks human/animal

Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven

Probabilistic Graphical Models Probabilistic Graphical Models Structure learning in Bayesian

Probabilistic Graphical Models Probabilistic Graphical Models Variable elimination Siamak

Learning in Graphical Models Andrea Passerini passerini@disi.unitn.it Machine Learning Learning

Probabilistic Graphical Models Probabilistic Graphical Models parameter learning in undirected

Probabilistic Graphical Models Probabilistic Graphical Models Undirected Models Fall 2019

Neural Networks and their Application to Go Neural Networks Learning Blackjack Theory Training

Probabilistic Graphical Models Probabilistic Graphical Models Parameter learning in Bayesian

Sequential Data with Neural Networks Recurrent Neural Networks Sequential input / output Greg

Probabilistic Graphical Models CMSC 678 UMBC Probabilistic Graphical Models A graph G that

Graphical models for Neuroscience Part I Giuseppe Vinci Department of Statistics Rice

PubPol 201 Trade Policies under Trump Module 3: International Safeguards Trade Policy

Spack in Shared Environment 2018 Scalable Tools Workshop LANL Programming and Runtime

Charles Tracy Kate Bickert Linda Cook Haifa Bint-Kadi Rivers, Trails &amp; Conservation Golden

Meeting 31: 17 August 2017 Karakia 2 Karakia Ko te tumanako Kia pai tenei r Kia tutuki i

SciLifeLab Bioinformatics Platform National Bioinformatics Infrastructure Sweden (NBIS) Bjrn

Factors affecting students re- enrolment at a public university system David Rodriguez-Gomez 1

Knowledge Graph Reasoning CSCI 699: ML4Know Instructor: Xiang Ren USC Computer Science Overview

5 Now there was in the citadel of Susa a Jew of the tribe of Benjamin, named Mordecai ... 6 who

Charles Tracy Kate Bickert Linda Cook Haifa Bint-Kadi Rivers, Trails & Conservation Golden