Layer-wise Relevance Propagation in Neural Networks

SLIDE 1

Layer-wise Relevance Propagation in Neural Networks to have more interpretable Machine Learning models

Ariyan Zarei

University of Arizona
ariyanzarei@email.arizona.edu

February 25, 2020

SLIDE 2

Overview

Motivation
◮ Having More Interpretable Neural Networks
◮ Deep Learning Shortcomings
◮ Papers and Demo

Introduction
◮ Terminology and Notations
◮ Relevance Properties
◮ Examples of Relevance
◮ Taylor Decomposition as Relevance

Layer-wise Relevance Propagation
◮ Local Layer-wise Relevance
◮ Notes on Relevance Rules
◮ General Algorithm
◮ LRP Rules: LRP-0, LRP-Epsilon, LRP-Gamma
◮ LRP Rules Comparison
◮ Which Rule to Use for Each Layer
◮ Different Starting Relevance for the Output Layer

Conclusion

SLIDE 3

Motivation

SLIDE 4

Having More Interpretable Neural Networks

◮ Interpretable Machine Learning (ML): a theme in our colloquium
◮ Medical applications of ML, especially medical image analysis
◮ Deep Learning (DL) for analyzing histopathological slides

Figure: A sampled window inside the cancerous region of a slide

SLIDE 5

Deep Learning Shortcomings

◮ Paying attention to irrelevant and spurious features
◮ Feature selection is not useful here.

SLIDE 6

Deep Learning Shortcomings

◮ Paying attention to irrelevant and spurious features. A simple example:

SLIDE 7

Deep Learning Shortcomings

◮ Deep neural networks' challenges in the medical sciences
◮ How do we fix this problem?
◮ Explain the predictions of the models.

SLIDE 8

Papers and Demo

◮ Layer-Wise Relevance Propagation: An Overview (Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, Chapter 10)
◮ Explaining nonlinear classification decisions with deep Taylor decomposition (Elsevier Pattern Recognition)
◮ Demo: Link

SLIDE 9

Introduction

Why is the neural network making a particular decision?
◮ Assess and validate the prediction, and the reasoning behind it, with another inexpensive method.
◮ Given the final output for a class (softmax), where in the input is the network attending?
◮ Which parts of the input affect the prediction, positively and negatively?

SLIDE 10

Terminology and Notations

Note: we focus on images and CNNs in this talk, but LRP can be applied to other forms of data, networks, and models.

◮ Input image: $x \in \mathbb{R}^d = \{x_p\}$, $p \in \{1, 2, \ldots, d\}$
◮ Prediction: $f(x) : \mathbb{R}^d \to \mathbb{R}^+$ quantifies the presence of an object in the input.
  ◮ Zero: absence of the object
  ◮ Other values: degree of certainty
◮ Relevance: $R(x) : \mathbb{R}^d \to \mathbb{R}^d_+$, a heatmap with the same size as the input

SLIDE 11

Relevance Properties

1. Conservation: $\forall x : f(x) = \sum_p R(x)_p$
2. Positivity: $\forall x, p : R(x)_p \ge 0$
3. Consistency: holds if properties 1 and 2 both hold; in particular, $f(x) = 0 \Rightarrow \forall p : R(x)_p = 0$
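
These properties are straightforward to verify numerically. A minimal sketch (my own illustration, not from the slides; assuming numpy, with f_x the scalar prediction and R a candidate relevance map):

import numpy as np

def check_relevance(f_x, R, tol=1e-6):
    # Property 1 (conservation): the relevance sums to the prediction value
    conservation = abs(f_x - R.sum()) < tol
    # Property 2 (positivity): every entry of the heatmap is non-negative
    positivity = bool((R >= 0).all())
    # Property 3 (consistency) follows: if f_x == 0, properties 1 and 2
    # together force every R_p to be exactly zero
    return conservation and positivity

# Example: an equal split of f(x) = 4.0 over d = 4 pixels satisfies both
print(check_relevance(4.0, np.full(4, 1.0)))  # True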

SLIDE 12

Examples of Relevance

1. Put all the relevance on a single pixel.
2. Divide the relevance equally among all input pixels:
   $\forall p : R(x)_p = \frac{1}{d} f(x)$
3. Natural decomposition: if the function $f$ naturally decomposes over the input pixels:
   $f(x) = \sum_p f_p(x_p) \Rightarrow \forall p : R(x)_p = f_p(x_p)$
4. Taylor decomposition around a reference point $\tilde{x}$ with $f(\tilde{x}) = 0$:
   $f(x) = f(\tilde{x}) + \left(\frac{\partial f}{\partial x}\Big|_{x=\tilde{x}}\right)^{\top} (x - \tilde{x}) + \epsilon = 0 + \sum_p \frac{\partial f}{\partial x_p}\Big|_{x=\tilde{x}} (x_p - \tilde{x}_p) + \epsilon$
   $\forall p : R(x)_p = \frac{\partial f}{\partial x_p}\Big|_{x=\tilde{x}} (x_p - \tilde{x}_p)$
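
As a concrete illustration of cases 2 and 3 (my own example, not from the slides): take the linear score $f(x) = 3x_1 + x_2$ at $x = (1, 2)$, so $f(x) = 5$. Equal splitting gives $R_1 = R_2 = 5/2$, while the natural decomposition $f_p(x_p) = w_p x_p$ gives $R_1 = 3$ and $R_2 = 2$. Both satisfy conservation, but only the latter reflects what each input actually contributed.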

SLIDE 13

Taylor Decomposition as Relevance

Taylor decomposition around a reference point $\tilde{x}$ chosen so that $f(\tilde{x}) = 0$:

$f(x) = f(\tilde{x}) + \left(\frac{\partial f}{\partial x}\Big|_{x=\tilde{x}}\right)^{\top} (x - \tilde{x}) + \epsilon = 0 + \sum_p \frac{\partial f}{\partial x_p}\Big|_{x=\tilde{x}} \times (x_p - \tilde{x}_p) + \epsilon$

$\forall p : R(x)_p = \frac{\partial f}{\partial x_p}\Big|_{x=\tilde{x}} \times (x_p - \tilde{x}_p)$

◮ Relevance: the amount of change in $f$ when we substitute the reference point with our input image.
◮ Not good in practice:
  ◮ Shattered (noisy) gradients
  ◮ Adversarial examples: a small perturbation in $x$ changes $f$ a lot.
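
A minimal PyTorch sketch of Taylor-decomposition relevance (my own illustration; f is any differentiable scalar-valued function and x_ref a chosen root point):

import torch

def taylor_relevance(f, x, x_ref):
    # R_p = (df/dx_p at x_ref) * (x_p - x_ref_p), the first-order Taylor term
    x_ref = x_ref.clone().requires_grad_(True)
    y = f(x_ref)                            # scalar score at the reference point
    (grad,) = torch.autograd.grad(y, x_ref)
    return (grad * (x - x_ref)).detach()    # per-input relevance

# Toy usage: the linear score f(x) = 3*x1 + x2 from the earlier example,
# with the root point at zero, recovers the natural decomposition exactly
w = torch.tensor([3.0, 1.0])
f = lambda x: (w * x).sum()
x = torch.tensor([1.0, 2.0])
print(taylor_relevance(f, x, torch.zeros(2)))  # tensor([3., 2.]), sums to f(x) = 5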

SLIDE 14

Layer-wise Relevance Propagation

◮ Propagate the prediction $f(x)$ backwards through the network to the input layer using local propagation rules.
◮ Highlight the regions of the input that are relevant and irrelevant to the value of the prediction for a given class.
◮ The conservation property holds, both locally and globally.

Figure: Relevance of each pixel for the class 'Castle'

SLIDE 15

Local Layer-wise Relevance

Figure: LRP at a glance

◮ Propagate relevance from the neurons $k$ at layer $l_2$ onto neuron $j$ of the lower layer $l_1$ with the following rule:

$R_j = \sum_{k \in A} \frac{z_{jk}}{\sum_{j \in B_k} z_{jk}} R_k$

where $A = \{n \mid n \in l_2,\ n \in N(j)\}$ and, for all $k \in A$, $B_k = \{m \mid m \in l_1,\ k \in N(m)\}$.

Note: be aware of the notation change!

SLIDE 16

Local Layer-wise Relevance

◮ Propagate relevance from the neurons $k$ at layer $l_2$ onto neuron $j$ of the lower layer $l_1$ with the following rule:

$R_j = \sum_{k \in A} \frac{z_{jk}}{\sum_{j \in B_k} z_{jk}} R_k$

where $A = \{n \mid n \in l_2,\ n \in N(j)\}$ and, for all $k \in A$, $B_k = \{m \mid m \in l_1,\ k \in N(m)\}$.

◮ $z_{jk}$ is the extent to which neuron $j$ has contributed to making neuron $k$ relevant (i.e., the activation of $j$ times the weight).
◮ $\frac{z_{jk}}{\sum_{j \in B_k} z_{jk}}$ is the proportion of relevance propagated from neuron $k$ to neuron $j$ (the conservation property).
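
A minimal numpy sketch of this local propagation for one fully connected layer (my own illustration; a holds the lower-layer activations, W the weights w_jk, and R_upper the relevance arriving at layer l2):

import numpy as np

def lrp_dense(a, W, R_upper, stab=1e-9):
    # z_jk: contribution of lower neuron j to upper neuron k
    z = a[:, None] * W
    # sum_j z_jk per upper neuron k; a tiny stabilizer guards against 0/0
    denom = z.sum(axis=0) + stab
    # R_j = sum_k (z_jk / sum_j z_jk) * R_k
    return (z / denom * R_upper).sum(axis=1)

Because each column of z is normalized by its own sum, the fractions per upper neuron sum to one, so R_lower.sum() matches R_upper.sum() up to the stabilizer: exactly the conservation property above.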

SLIDE 17

Notes on Relevance Rules

◮ Activations should be ReLU.
◮ Substitute $z_{jk}$ with activation times weight:

$R_j = \sum_{k \in A} \frac{a_j w_{jk}}{\sum_{j \in B_k} a_j w_{jk}} R_k$

Figure: The propagation rule. 'a' corresponds to the outer sum, where we calculate the total amount of relevance going to neuron $j$; 'b' corresponds to the inner sum in the denominator, where we calculate the total amount of signal going to neuron $k$, in order to obtain the proportion by which $j$ has contributed to making $k$ relevant.

SLIDE 18

General Algorithm

1. Forward pass: feed the image into the network and run the forward pass, keeping the activation values at each neuron.
2. Initialize relevance: at the final (output) layer, choose a class $c$ (not necessarily the predicted class) and set the relevance of that neuron, $R_c$, to its activation $a_c$* (softmax or sigmoid). Set the relevance of the rest of the output neurons to zero.
3. Apply relevance rules: propagate the relevance backward using the relevance rule(s) until you reach the input layer.
4. Visualize: generate a heatmap over the relevance of the input nodes (a minimal sketch of all four steps follows).
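
Putting the four steps together, a minimal sketch for a stack of fully connected ReLU layers (my own illustration, reusing the lrp_dense helper from earlier; Ws is a list of numpy weight matrices):

import numpy as np

def explain(x, Ws, target_class):
    # 1. Forward pass, keeping the activation vector at every layer
    #    (for simplicity, every layer here is ReLU, including the output)
    activations = [x]
    for W in Ws:
        activations.append(np.maximum(0.0, activations[-1] @ W))

    # 2. Initialize relevance: the chosen output neuron gets its activation,
    #    every other output neuron gets zero
    R = np.zeros_like(activations[-1])
    R[target_class] = activations[-1][target_class]

    # 3. Propagate relevance backward through every layer
    for a, W in zip(activations[-2::-1], Ws[::-1]):
        R = lrp_dense(a, W, R)

    # 4. R now has the same shape as the input; visualize it as a heatmap
    return R
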
SLIDE 19

LRP Rules

The general form of the LRP rule:

$R_j = \sum_k \frac{a_j \rho(w_{jk})}{\sum_j a_j \rho(w_{jk})} R_k$

◮ LRP-0
◮ LRP-$\epsilon$
◮ LRP-$\gamma$
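
All three rules fit this one template. A minimal sketch generalizing the earlier lrp_dense (my own illustration; rho transforms the weights and eps stabilizes the denominator):

import numpy as np

def lrp_generic(a, W, R_upper, rho=lambda W: W, eps=0.0):
    # a_j * rho(w_jk): transformed contribution of neuron j to neuron k
    z = a[:, None] * rho(W)
    # eps + sum_j a_j * rho(w_jk): stabilized normalizer per upper neuron k
    denom = eps + z.sum(axis=0)
    # R_j = sum_k (z_jk / denom_k) * R_k
    return (z / denom * R_upper).sum(axis=1)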

SLIDE 20

LRP-0

The basic case, which we saw earlier; $\rho(\cdot)$ is the identity function here:

$R_j = \sum_k \frac{a_j w_{jk}}{\sum_j a_j w_{jk}} R_k$

◮ One can show that this is simply Gradient $\times$ Input (the form we have in the backprop algorithm), and it is therefore unstable.

SLIDE 21

LRP-Epsilon

The first enhancement of LRP-0. $\rho(\cdot)$ is again the identity function, but a small positive term $\epsilon$ is added to the denominator to absorb weak or contradictory contributions:

$R_j = \sum_k \frac{a_j w_{jk}}{\epsilon + \sum_j a_j w_{jk}} R_k$

◮ Sparser and less noisy relevance.

SLIDE 22

LRP-Gamma

Another enhancement of LRP-0. Here $\rho(\cdot)$* is the following function:

$\rho(x) = \left(1 + \gamma \, \frac{\operatorname{sign}(x) + 1}{2}\right) x$

Applying this function to LRP-0, we get:

$R_j = \sum_k \frac{a_j (w_{jk} + \gamma w_{jk}^+)}{\epsilon + \sum_j a_j (w_{jk} + \gamma w_{jk}^+)} R_k$

◮ This favors positive contributions over negative ones; $(\cdot)^+$ is simply $\max(0, \cdot)$.
◮ As we increase $\gamma$, the negative contributions become less and less important.
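
With the generic template from earlier, the three rules differ only in the choice of rho and eps. A small usage sketch (toy values, my own illustration; lrp_generic is the helper sketched on the "LRP Rules" slide):

import numpy as np

a = np.array([1.0, 2.0])                 # lower-layer activations a_j
W = np.array([[0.5, -1.0],
              [1.5,  0.3]])              # weights w_jk
R_k = np.array([2.0, 1.0])               # relevance at the upper layer

gamma_rho = lambda W, gamma=0.25: W + gamma * np.clip(W, 0.0, None)  # w + gamma * w^+

print(lrp_generic(a, W, R_k))                            # LRP-0
print(lrp_generic(a, W, R_k, eps=0.1))                   # LRP-epsilon
print(lrp_generic(a, W, R_k, rho=gamma_rho, eps=1e-9))   # LRP-gamma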

SLIDE 23

LRP Rules Comparison

Figure: Comparison of using different LRP rules uniformly across the whole network.

SLIDE 24

Which Rule to use for each layer

◮ Measure of explanation quality (an active research topic)*:
  ◮ Fidelity: an accurate representation of the selected output neuron
  ◮ Understandability: easy for a human to interpret

Two strategies:

◮ Uniform LRP
◮ Composite strategy

Figure: Comparing different LRP rules

SLIDE 25

Which Rule to use for each layer

◮ Uniform LRP-0
  ◮ Tends to pick up many local artifacts of the prediction function (the shattered gradient problem).

Figure: Input relevance using uniform LRP-0

SLIDE 26

Which Rule to use for each layer

◮ Uniform LRP-$\epsilon$
  ◮ A faithful and accurate representation, but its sparsity makes it hard for a human to interpret.

Figure: Input relevance using uniform LRP-$\epsilon$

SLIDE 27

Which Rule to use for each layer

◮ Uniform LRP-$\gamma$
  ◮ Understandable by humans because of its densely highlighted features.
  ◮ But it picks up unrelated features, such as the lamp post.

Figure: Input relevance using uniform LRP-$\gamma$

SLIDE 28

Which Rule to use for each layer

◮ Composite LRP (a sketch follows this list)
  ◮ Upper layers (the fully connected top part): LRP-0. Concepts here are entangled, and the gradient is less sensitive to these entanglements, so we can tolerate the gradient problems.
  ◮ Middle layers: LRP-$\epsilon$. Weight sharing in convolutions introduces spurious variations, which this rule filters out; only the important explanations remain.
  ◮ Lower layers: LRP-$\gamma$. Same problem as the middle layers; either $\epsilon$ or $\gamma$ should work, but the latter is better because it has a stronger effect in spreading the explanations onto features rather than individual pixels.
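
One way to express the composite strategy in code (my own sketch; the depth thresholds are illustrative, and lrp_generic is the helper from earlier):

import numpy as np

def rule_for_layer(i, n_layers, gamma=0.25, eps=0.25):
    # Pick LRP hyperparameters by depth: LRP-gamma near the input,
    # LRP-epsilon in the middle, plain LRP-0 for the upper layers
    if i >= 2 * n_layers // 3:                 # upper fully connected layers: LRP-0
        return dict(rho=lambda W: W, eps=0.0)
    if i >= n_layers // 3:                     # middle layers: LRP-epsilon
        return dict(rho=lambda W: W, eps=eps)
    return dict(                               # lower layers: LRP-gamma
        rho=lambda W: W + gamma * np.clip(W, 0.0, None), eps=1e-9)

# In the backward pass of the general algorithm:
#   R = lrp_generic(a, W, R, **rule_for_layer(i, len(Ws)))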

SLIDE 29

Which Rule to use for each layer

◮ Composite LRP
  ◮ As you can see, we get both fidelity and understandability.

Figure: Input relevance using composite LRP

SLIDE 30

Different starting relevance for the output layer *

◮ What we have tried to explain so far:
  ◮ Version 1: $R_c = P(z_c) = \frac{e^{z_c}}{\sum_{c'} e^{z_{c'}}}$
  ◮ Version 2 (more stable): $R_c = z_c = \sum_k a_k w_{kc}$
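
In code, the two initializations differ only in whether the softmax is applied first (a small sketch with toy logits, my own illustration):

import numpy as np

logits = np.array([2.0, 0.5, -1.0])   # z_c for every output neuron
c = 0                                  # the class to explain

R_v1 = np.zeros_like(logits)
R_v1[c] = np.exp(logits[c]) / np.exp(logits).sum()  # version 1: softmax probability P(z_c)

R_v2 = np.zeros_like(logits)
R_v2[c] = logits[c]                                 # version 2 (more stable): the logit z_c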

SLIDE 31

Different starting relevance for the output layer **

◮ Instead, we can try to explain another type of score:
  ◮ Explain the presence of an object when objects from other classes are also present in the image. (I guess we could do this in a pairwise manner too.)

$\eta_c = \log\left(\frac{P(z_c)}{1 - P(z_c)}\right) = \log\left(\frac{P(z_c)}{\sum_{c'' \neq c} P(z_{c''})}\right)$

$z_{c,c''} = \log\left(\frac{P(z_c)}{P(z_{c''})}\right) = \log\left(\frac{e^{z_c} / \sum_{c'} e^{z_{c'}}}{e^{z_{c''}} / \sum_{c'} e^{z_{c'}}}\right) = \log\left(\frac{e^{z_c}}{e^{z_{c''}}}\right) = \log\left(e^{z_c - z_{c''}}\right) = z_c - z_{c''} = \sum_k a_k (w_{kc} - w_{kc''})$

SLIDE 32

Different starting relevance for the output layer **

◮ We can now use $z_{c,c''}$ to compute a new relevance for neuron $c$ in the output layer:

$z_{c,c''} = \log\left(\frac{P(z_c)}{P(z_{c''})}\right) = z_c - z_{c''} = \sum_k a_k (w_{kc} - w_{kc''})$

$R_{c,c''} = z_{c,c''} \times \frac{e^{-z_{c,c''}}}{\sum_{c' \neq c} e^{-z_{c,c'}}}$
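
Numerically, the pairwise scores $z_{c,c''}$ and the weighted starting relevance can be assembled in a few lines (a sketch with toy logits, my own illustration):

import numpy as np

logits = np.array([2.0, 0.5, -1.0])           # z_c for every class
c = 0                                          # the class whose presence we explain

others = np.delete(np.arange(len(logits)), c)
z_cc = logits[c] - logits[others]              # z_{c,c''} = z_c - z_{c''} for c'' != c
weights = np.exp(-z_cc) / np.exp(-z_cc).sum()  # softmin weighting: the closest competitor dominates
R_cc = z_cc * weights                          # R_{c,c''} per competing class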

SLIDE 33

Different starting relevance for the output layer **

◮ The new relevance: $R_{c,c''} = z_{c,c''} \times \frac{e^{-z_{c,c''}}}{\sum_{c' \neq c} e^{-z_{c,c'}}}$
◮ This results in better explanations.

Figure: Comparing the old initial relevance for $c$ and the new one.

SLIDE 34

Conclusion

◮ Some things are still vague to me:
  ◮ Is manipulating the rules to get better explanations OK?
  ◮ All those *s!
◮ We can illustrate the regions the network pays more attention to (positively or negatively).
◮ We can explain why the network is making (or not making) a particular decision.
◮ We can use Deep Learning for sensitive tasks with a little more peace of mind.

SLIDE 35

Thank You for your attention!