SLIDE 1
Modern machine learning methods for trustworthy science
Tom Charnock, Institut d'Astrophysique de Paris
Why neural networks don't work (and how to use them)
SLIDE 2
SLIDE 3
Why neural networks don't work
Tom Charnock Institut d'Astrophysique de Paris
SLIDE 4
Apologies about the term bias
- When something is intrinsically unknowable, it is biased
- If there is some offset, which could in principle be corrected, it is biased
SLIDE 5
Apologies about the term bias
- When something is intrinsically unknowable, it is biased
- If there is some offset, which could in principle be corrected, it is biased
I (almost always) mean the top one
SLIDE 6
An approximation to a model, $f : d \to t$:
$\mathbb{NN}(w, \theta) : d \to t$
SLIDE 7
A crazy likelihood surface of how likely we are to get targets from data
SLIDE 8
What are we actually interested in?
SLIDE 9
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}\theta\, P(t|d, w, \theta)\, P(w, \theta)$
SLIDE 10
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}\theta\, P(t|d, w, \theta)\, P(w, \theta)$
- Posterior predictive density $P(t|d)$: how likely are the true targets given some data?
- Likelihood $P(t|d, w, \theta)$: how likely are the targets to be generated by a particular network?
- Probability density $P(w, \theta)$: what is the probability of obtaining a particular network with particular parameter values?
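The posterior predictive integral can be estimated by Monte Carlo: draw networks from $P(w, \theta)$ and average the likelihood over the draws. A minimal sketch, assuming a hypothetical one-parameter "network" `nn` and Gaussian noise (both stand-ins for illustration, not anything from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

def nn(w, d):
    # Toy one-parameter "network": a linear map standing in for NN(w, theta)
    return w * d

def likelihood(t, d, w, sigma=0.1):
    # P(t | d, w): Gaussian noise around the network output
    return np.exp(-0.5 * ((t - nn(w, d)) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Monte Carlo estimate of the posterior predictive:
# P(t|d) = ∫ dw P(t|d, w) P(w)  ≈  average of the likelihood over prior samples
w_samples = rng.normal(loc=1.0, scale=0.5, size=10_000)  # draws from P(w)
d, t = 2.0, 2.1
predictive = likelihood(t, d, w_samples).mean()
print(predictive)
```

The same average works with any sampler for $P(w, \theta)$; only the source of the weight samples changes in the methods that follow.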
SLIDE 11
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}\theta\, P(t|d, w, \theta)\, P(w, \theta)$
SLIDE 12
Where does this information about the weights and hyperparameters come from?
SLIDE 13
Training and validation data
SLIDE 14
Training and validation data
Training data and targets: $\{d, t\}_\mathrm{train} \equiv \{d_i, t_i \,|\, i \in [1, n_\mathrm{train}]\}$
Validation data and targets: $\{d, t\}_\mathrm{val} \equiv \{d_i, t_i \,|\, i \in [1, n_\mathrm{val}]\}$
Posterior distribution of weights and hyperparameters:
$P(w, \theta \,|\, \{d, t\}_\mathrm{train}, \{d, t\}_\mathrm{val}) \propto \mathcal{L}(w, \theta \,|\, \{d, t\}_\mathrm{train}, \{d, t\}_\mathrm{val})\, P(w, \theta)$
SLIDE 15
The failing of traditional training
SLIDE 16
The failing of traditional training
- Approximator: $\mathbb{NN}(w, \theta) : d \to t$ for the model $f : d \to t$
- Cost function: $\Lambda(w, \theta) = -\ln P(t^*|d^*, w, \theta)$, smooth and convex
- Likelihood: $P(t^*|d^*, w, \theta)$, complex and non-convex in $w$ and $\theta$
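For Gaussian noise, the negative log-likelihood cost reduces to the familiar squared error plus a constant. A toy sketch, where the linear `nn` and the noise scale `sigma` are assumptions for illustration; in this one-parameter toy the cost is convex in $w$, whereas for a deep network the same expression is non-convex in the weights:

```python
import numpy as np

sigma = 0.1  # assumed Gaussian noise scale

def nn(w, d):
    # Stand-in for NN(w, theta); a real network is non-linear in w
    return w * d

def cost(w, d, t):
    # Λ(w) = -ln P(t|d, w) for Gaussian noise: squared error plus a constant
    return 0.5 * ((t - nn(w, d)) / sigma) ** 2 + np.log(sigma * np.sqrt(2 * np.pi))

d_star, t_star = 2.0, 4.0
print(cost(2.0, d_star, t_star))  # minimum: nn(2, d*) = 4 = t*
```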
SLIDE 17
Optimising (or training) a network
SLIDE 18
Optimising (or training) a network
What are the maximum likelihood estimates of the weights?
$w_\mathrm{MLE} = \underset{w}{\mathrm{argmax}} \left[ P(\{t\}_\mathrm{train} \,|\, \{d\}_\mathrm{train}, w, \theta^*) \right]$
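In practice the argmax is found by gradient descent on the negative log-likelihood. A minimal sketch with an assumed linear model and synthetic training data (for a deep network this same loop only finds a local maximum of the likelihood):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy training set {d, t}_train from an assumed "true" model t = 3 d + noise
d_train = rng.uniform(-1, 1, size=50)
t_train = 3.0 * d_train + rng.normal(scale=0.1, size=50)

def grad_nll(w):
    # Gradient of Λ(w) = Σ_i (t_i - w d_i)² / (2 σ²) with respect to w
    return -np.sum((t_train - w * d_train) * d_train) / 0.1**2

# Plain gradient descent to a (local) maximum-likelihood estimate w_MLE
w = 0.0
for _ in range(200):
    w -= 1e-4 * grad_nll(w)
print(w)  # close to 3
```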
SLIDE 19
Local maximum likelihood estimates
SLIDE 20
The main problem...
SLIDE 21
We degenerate the posterior
$P(w, \theta \,|\, \{d, t\}_\mathrm{train}) \propto \mathcal{L}(w, \theta \,|\, \{d, t\}_\mathrm{train})\, P(w, \theta) \to \delta(w - w_\mathrm{MLE},\, \theta - \theta^*)$
SLIDE 22
We degenerate the posterior
$P(w, \theta \,|\, \{d, t\}_\mathrm{train}) \propto \mathcal{L}(w, \theta \,|\, \{d, t\}_\mathrm{train})\, P(w, \theta) \to \delta(w - w_\mathrm{MLE},\, \theta - \theta^*)$
SLIDE 23
All predictions are (probably incorrect) estimates
$P(t|d) = P(t \,|\, d, w_\mathrm{MLE}, \theta^*)$
SLIDE 24
There is no way to interpret how close $\mathbb{NN}(w_\mathrm{MLE}, \theta^*)$ is to $f$...
SLIDE 25
There is no way to interpret how close $\mathbb{NN}(w_\mathrm{MLE}, \theta^*)$ is to $f$...
- Because the likelihood is non-interpretably complex
SLIDE 26
Are there better methods?
SLIDE 27
Variational inference
SLIDE 28
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}w'\,\mathrm{d}\theta\, P(t|d, w, \theta)\, Q(w \,|\, w', \theta, \{d, t\}_\mathrm{train})\, P(w', \theta)$
SLIDE 29
Still depends on fixed weights in the complex likelihood surface and on the choice of variational distribution:
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}w'\,\mathrm{d}\theta\, P(t|d, w, \theta)\, Q(w \,|\, w', \theta, \{d, t\}_\mathrm{train})\, \delta(w' - w_\mathrm{MLE},\, \theta - \theta^*)$
$= \int \mathrm{d}w\, P(t|d, w, \theta^*)\, Q(w \,|\, w_\mathrm{MLE}, \theta^*, \{d, t\}_\mathrm{train})$
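Once fitted, such a network predicts by sampling weights from the variational distribution. The sketch below assumes a Gaussian $Q$ centred on fixed maximum-likelihood weights with a chosen width `s`; both the centre and the form of $Q$ are inputs we choose, which is exactly the dependence being pointed out:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical variational distribution Q(w | w_MLE, s): a Gaussian centred on
# the fixed maximum-likelihood weight, with a width s that we picked by hand.
w_mle, s = 3.0, 0.2

def nn(w, d):
    # Stand-in for NN(w, theta)
    return w * d

# Predictive samples: draw w ~ Q and push each draw through the network
d = 1.0
w_samples = rng.normal(w_mle, s, size=5000)
predictions = nn(w_samples, d)
print(predictions.mean(), predictions.std())  # spread is set by our choice of Q
```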
SLIDE 30
Bayesian neural networks
SLIDE 31
Bayesian neural networks
SLIDE 32
Sample the likelihood of the training data
$P(t|d) = \int \mathrm{d}w\,\mathrm{d}\theta\, P(t|d, w, \theta)\, P(w, \theta \,|\, \{d, t\}_\mathrm{train})$
$\propto \int \mathrm{d}w\,\mathrm{d}\theta\, P(t|d, w, \theta) \times \prod_{i=1}^{n_\mathrm{train}} P(t_i \,|\, d_i, w, \theta)\, P(w, \theta)$
SLIDE 33
Still dependent on the training data!
Classical network: $P(w, \theta \,|\, \{d, t\}_\mathrm{train}) \to \delta(w - w_\mathrm{MLE},\, \theta - \theta^*)$
Variational inference: $P(w, \theta \,|\, \{d, t\}_\mathrm{train}) = Q(w \,|\, w_\mathrm{MLE}, \theta^*, \{d, t\}_\mathrm{train})$
Bayesian networks: $P(w, \theta \,|\, \{d, t\}_\mathrm{train}) \propto \prod_{i=1}^{n_\mathrm{train}} P(t_i \,|\, d_i, w, \theta)\, P(w, \theta)$
SLIDE 34
Problems with physical models...
SLIDE 35
Problems with physical models...
SLIDE 36
How can we use a neural network then?
SLIDE 37
Build it into the physical model
SLIDE 38
Method 1 : Infer the data, physics and the neural network
SLIDE 39
SLIDE 40
Method 2 : Understand the likelihood (using neural physical engines)
SLIDE 41
SLIDE 42
Method 3 : Likelihood-free inference
SLIDE 43
SLIDE 44
Compare the distance between observed summaries and simulation summaries, and select results within some tolerance $\epsilon$
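This accept/reject scheme is rejection approximate Bayesian computation. A minimal sketch, assuming a hypothetical simulator whose summary is a sample mean, a uniform prior, and an absolute-difference distance (all choices made for illustration):

```python
import numpy as np

rng = np.random.default_rng(4)

def simulate_summary(mu, n=100):
    # Hypothetical simulator: the summary is the mean of n noisy draws
    return rng.normal(mu, 1.0, size=n).mean()

observed_summary = 0.5
epsilon = 0.05  # tolerance on the summary distance

# Rejection ABC: draw from the prior, simulate, keep draws within epsilon
accepted = []
for _ in range(20_000):
    mu = rng.uniform(-2, 2)                   # draw from the prior
    s = simulate_summary(mu)                  # summary of one simulation
    if abs(s - observed_summary) < epsilon:   # distance criterion
        accepted.append(mu)
accepted = np.array(accepted)
print(accepted.mean(), len(accepted))  # approximate posterior samples of mu
```

Shrinking `epsilon` tightens the approximation to the true posterior at the cost of a lower acceptance rate, which is the usual ABC trade-off.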
SLIDE 45
Conclusions
SLIDE 46
Conclusions
- Neural networks are not to be trusted
- They can make trusty companions when the correct framework is introduced
- Using statistics we can build neural networks into the forward model to get unbiased results
SLIDE 47