SLIDE 1
Tutorial on Methods for Interpreting and Understanding Deep Neural Networks
Wojciech Samek (Fraunhofer HHI), Grégoire Montavon (TU Berlin), Klaus-Robert Müller (TU Berlin)
1:30 - 2:00 Part 1: Introduction
2:00 - 3:00 Part 2a: Making Deep …
SLIDE 2
SLIDE 3
- W. Samek, G. Montavon, K.-R. Müller
Tutorial on Methods for Interpreting and Understanding Deep Neural Networks Part 1: Introduction
SLIDE 4
Recent ML Systems Achieve Superhuman Performance
- AlphaGo beats human Go champion
- Deep net outperforms humans in image classification
- Deep net beats humans at recognizing traffic signs
- DeepStack beats professional poker players
- Computer out-plays humans in "Doom"
- Autonomous search-and-rescue drones outperform humans
- IBM's Watson destroys humans at Jeopardy!
SLIDE 5
From Data to Information
[Diagram: huge volumes of data + computing power → Deep Nets / Kernel Machines / … solve the task, but the information they learn stays implicit. Goal: extract interpretable information.]
SLIDE 6
From Data to Information
[Chart, performance vs. interpretability: ImageNet top-5 error of AlexNet (16.4%), Clarifai (11.1%), VGG (7.3%), GoogleNet (6.7%), ResNet (3.57%)]
Data → Information that is interpretable for humans
Crucial in many applications (industry, sciences …)
SLIDE 7
Interpretable vs. Powerful Models ?
Linear model vs. non-linear model:
- Linear model: poor fit, but easily interpretable
- Non-linear model: can be very complex
SLIDE 8
Interpretable vs. Powerful Models ?
Linear model vs. non-linear model:
- Linear model: poor fit, but easily interpretable
- Non-linear model: can be very complex (e.g. 60 million parameters, 650,000 neurons)
We have techniques to interpret and explain such complex models !
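To see concretely why the linear case is easy, here is a minimal sketch (toy data and names are assumptions, not from the slides): the weight vector of a linear model is itself a global explanation, and the products w_i * x_i explain an individual prediction.

import numpy as np

# Toy data (assumption): y depends on the first two features only.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 2.0 * X[:, 0] - 1.0 * X[:, 1] + 0.1 * rng.normal(size=200)

# Fit a linear model f(x) = w . x by least squares.
w, *_ = np.linalg.lstsq(X, y, rcond=None)
print("global explanation (weights):", np.round(w, 2))

# Individual explanation: per-feature contribution to one prediction.
x = X[0]
print("prediction f(x):", x @ w)
print("contributions w_i * x_i:", np.round(w * x, 2))

For a deep non-linear network no such direct reading of the parameters exists, which is what the techniques below address.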
SLIDE 9
Interpretable vs. Powerful Models ?
- train the best model, then interpret it
vs.
- train an interpretable model: suboptimal or biased due to assumptions (linearity, sparsity …)
SLIDE 10
Dimensions of Interpretability
Different dimensions of “interpretability” for a model f:
- prediction: “Explain why a certain pattern x has been classified in a certain way f(x).”
- model: “What would a pattern belonging to a certain category typically look like according to the model?”
- data: “Which dimensions of the data are most relevant for the task?”
SLIDE 11
Why Interpretability ?
1) Verify that the classifier works as expected. Wrong decisions can be costly and dangerous:
“Autonomous car crashes, because it wrongly recognizes …”
“AI medical diagnosis system misclassifies patient’s disease …”
SLIDE 12
Why Interpretability ?
2) Improve the classifier
Generalization error vs. generalization error + human experience
SLIDE 13
Why Interpretability ?
3) Learn from the learning machine
Old promise: “Learn about the human brain.”
“It's not a human move. I've never seen a human play this move.” (Fan Hui, on AlphaGo)
SLIDE 14
Why Interpretability ?
4) Interpretability in the sciences
Stock market analysis: “Model predicts share value with __% accuracy.” Great !!!
Medical diagnosis: “Model predicts that X will survive with probability __.” What to do with this information ?
SLIDE 15
Why Interpretability ?
4) Interpretability in the sciences
Learn about the physical / biological / chemical mechanisms (e.g. find genes linked to cancer, identify binding sites …)
SLIDE 16
Why Interpretability ?
5) Compliance with legislation
European Union’s new General Data Protection Regulation: “right to explanation”
“With interpretability we can ensure that ML models work in compliance with proposed legislation.”
Retain the human decision in order to assign responsibility.
SLIDE 17
Why Interpretability ?
Interpretability as a gateway between ML and society:
- Make complex models acceptable for certain applications.
- Retain human decision in order to assign responsibility.
- “Right to explanation”
Interpretability as a powerful engineering tool:
- Optimize models / architectures
- Detect flaws / biases in the data
- Gain new insights about the problem
- Make sure that ML models behave “correctly”
SLIDE 18
Techniques of Interpretation
SLIDE 19
Techniques of Interpretation
Interpreting models (ensemble):
- find prototypical example of a category
- find pattern maximizing activity of a neuron
- better understand internal representation
Explaining decisions (individual):
- “why” does the model arrive at this particular prediction
- verify that model behaves as expected
- crucial for many practical applications
SLIDE 20
Techniques of Interpretation
In the medical context:
- Population view (ensemble):
  - Which symptoms are most common for the disease ?
  - Which drugs are most helpful for patients ?
- Patient’s view (individual):
  - Which particular symptoms does this patient have ?
  - Which drugs does this patient need to take in order to recover ?
Both aspects can be important depending on who you are (FDA, doctor, patient).
SLIDE 21
Techniques of Interpretation
Interpreting models:
- find prototypical example of a category
- find pattern maximizing activity of a neuron
[Figure: prototype images for the classes goose, cheeseburger, car]
SLIDE 22
Techniques of Interpretation
Interpreting models:
- find prototypical example of a category
- find pattern maximizing activity of a neuron
[Figure: goose, cheeseburger, car prototypes obtained with a simple regularizer (Simonyan et al. 2013)]
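A minimal sketch of this activation-maximization idea with a simple L2 regularizer, in the spirit of Simonyan et al. 2013. The tiny stand-in network, class index and hyperparameters are assumptions; the tutorial uses deep image classifiers.

import torch
import torch.nn as nn

# Stand-in classifier (assumption: any differentiable model would do here).
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
model.eval()

target_class = 2   # hypothetical index of the class to visualize
lam = 0.01         # weight of the simple L2 regularizer

# Start from noise and do gradient ascent on the class score:
#   max_x  f_c(x) - lam * ||x||^2
x = torch.randn(1, 3, 32, 32, requires_grad=True)
optimizer = torch.optim.SGD([x], lr=1.0)

for _ in range(200):
    optimizer.zero_grad()
    score = model(x)[0, target_class]
    loss = -score + lam * (x ** 2).sum()
    loss.backward()
    optimizer.step()

# x now approximates a prototype the model associates with the target class.

The norm penalty only keeps pixel values in a plausible range, which is why the resulting prototypes look noisy.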
SLIDE 23
Techniques of Interpretation
Interpreting models:
- find prototypical example of a category
- find pattern maximizing activity of a neuron
[Figure: goose, cheeseburger, car prototypes obtained with a complex regularizer (Nguyen et al. 2016)]
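The “complex regularizer” of Nguyen et al. 2016 replaces the norm penalty with a learned natural-image prior: one optimizes the latent code of a generator network rather than raw pixels. A minimal sketch under that reading; both stand-in networks G and f below are assumptions.

import torch
import torch.nn as nn

# Stand-in networks (assumptions): G maps a latent code to an image and acts
# as the prior; f is the classifier. Nguyen et al. use a learned deep
# generator network trained on natural images.
G = nn.Sequential(nn.Linear(64, 3 * 32 * 32), nn.Tanh())
f = nn.Sequential(nn.Linear(3 * 32 * 32, 10))

target_class = 2
z = torch.randn(1, 64, requires_grad=True)   # optimize the code, not pixels
optimizer = torch.optim.Adam([z], lr=0.05)

for _ in range(200):
    optimizer.zero_grad()
    score = f(G(z))[0, target_class]
    (-score).backward()                      # gradient ascent on class score
    optimizer.step()

# G(z) is now a prototype constrained to the generator's image manifold,
# which is why these prototypes look far more natural than the simple ones.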
SLIDE 24
Techniques of Interpretation
Explaining decisions:
- “why” does the model arrive at a certain prediction
- verify that model behaves as expected
SLIDE 25
Techniques of Interpretation
Explaining decisions:
- “why” does the model arrive at a certain prediction
- verify that model behaves as expected
Two techniques covered next:
- Sensitivity Analysis
- Layer-wise Relevance Propagation (LRP)
SLIDE 26
Techniques of Interpretation
Sensitivity Analysis (Simonyan et al. 2014)
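A minimal sketch of sensitivity analysis: the heatmap R_i = (∂f/∂x_i)^2 is the standard squared-gradient formulation from the paper, while the stand-in model and random input are assumptions.

import torch
import torch.nn as nn

# Stand-in classifier and input (assumptions).
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
model.eval()

x = torch.randn(1, 3, 32, 32, requires_grad=True)
score = model(x)[0].max()    # score of the predicted class
score.backward()

# Sensitivity heatmap: R_i = (d f(x) / d x_i)^2 for every input pixel.
relevance = x.grad ** 2
print(relevance.shape)       # same shape as the input image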
SLIDE 27
Techniques of Interpretation
Layer-wise Relevance Propagation (LRP) (Bach et al. 2015)
Theoretical interpretation: Deep Taylor Decomposition (Montavon et al. 2017)
“Every neuron gets its share of relevance depending on activation and strength of connection.”
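A minimal sketch of one common variant, the LRP-epsilon redistribution rule, on a toy two-layer ReLU network. The random weights and layer sizes are assumptions; real LRP implementations handle each layer type of the actual network.

import numpy as np

rng = np.random.default_rng(0)
W1 = rng.normal(size=(8, 6))
W2 = rng.normal(size=(6, 4))

# Forward pass through a toy ReLU network, storing every activation.
x = rng.normal(size=8)
activations = [x]
for W in (W1, W2):
    activations.append(np.maximum(0, activations[-1] @ W))

# Relevance starts at the strongest output neuron.
out = activations[-1]
R = np.zeros_like(out)
R[out.argmax()] = out[out.argmax()]

# LRP-epsilon rule, applied layer by layer from the top down:
#   R_j = a_j * sum_k  w_jk * R_k / (z_k + eps * sign(z_k))
eps = 1e-9
for W, a in zip((W2, W1), (activations[1], activations[0])):
    z = a @ W                            # pre-activations of the upper layer
    s = R / (z + eps * np.sign(z))       # stabilized ratio R_k / z_k
    R = a * (W @ s)                      # relevance of the lower layer

# Conservation: input relevances sum approximately to the explained score.
print(R.sum(), out.max())

The conservation check at the end illustrates the quote above: the output score is redistributed, layer by layer, to the neurons (and finally the inputs) in proportion to activation and connection strength.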
SLIDE 28
Techniques of Interpretation
Sensitivity Analysis: “what makes this image less / more ‘scooter’ ?”
LRP / Taylor Decomposition: “what makes this image ‘scooter’ at all ?”
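In formulas (standard forms from the cited papers), the two methods compute different quantities:

R_i = (∂f(x)/∂x_i)^2   (sensitivity analysis: how strongly the prediction varies with input i)
Σ_i R_i = f(x)          (LRP: conservation, the prediction value itself is redistributed onto the inputs)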
SLIDE 29