Attention and its (mis)interpretation Danish Pruthi 1 - PowerPoint PPT Presentation

Sequence-to-sequence Tasks Task Example Bigram Flipping {w 1 , w 2 … w 2n-1 , w 2n } → {w 2 , w 1 , … w 2n , w 2n-1 } {w 1 ,w 2, … w n-1 , w n } → {w 1 ,w 2 , … w n , w n-1 } Sequence Copying {w 1 ,w 2, … w n-1 , w n } → {w n ,w n-1 , … w 2 , w 1 } Sequence Reversal This is an example. → Dieser ist ein Beispiel. English - German MT 19

Manipulating Attention 20

  Manipulating Attention • Let be the impermissible tokens, m is the mask   𝖩 20

    Manipulating Attention • Let be the impermissible tokens, m is the mask   𝖩 • For any task-specific loss function, a penalty term is added   20

      Manipulating Attention • Let be the impermissible tokens, m is the mask   𝖩 • For any task-specific loss function, a penalty term is added   • The penalty term penalizes the model for allocating attention to impermissible tokens   20

Manipulating Attention Total attention mass on all the "allowed" tokens 21

Manipulating Attention Penalty coefficient that modulates attention on impermissible tokens Total attention mass on all the "allowed" tokens 21

Manipulating Attention Penalty coefficient that modulates attention on impermissible tokens Total attention mass on all the "allowed" tokens • Side note: In a parallel work, Wiegre ff e and Pinter (2019) propose a di ff erent penalty term 21

Manipulating Attention • Multiple attention heads 22

  Manipulating Attention • Multiple attention heads • Optimizing the mean over a set of attention heads   22

    Manipulating Attention • Multiple attention heads • Optimizing the mean over a set of attention heads   • One of the attention heads can be assigned a large amount of attention to impermissible tokens   22

Outline 1. What Is attention mechanism? 2. Attention-as-explanations 3. Manipulating attention weights 4. Results and discussion 5. Conclusion 23

BiLSTM + Attention y α 1 α 2 α 3 α n ….. biLSTM biLSTM biLSTM biLSTM x 1 x 2 x 3 x n 24

Embedding + Attention (No recurrent connections) y α 1 α 2 α 3 α n x 3 x 1 x 2 x n 25

Transformer-based Model Devlin et. al 26

Restricted BERT Predictions Movie Good [SEP] L 12 Movie Good [SEP] L.. L 1 L 0 [CLS] [CLS] Original 27

Restricted BERT Predictions Movie Good [SEP] L 12 Movie Good [SEP] L.. L 1 L 0 [CLS] [CLS] Original Predictions Movie L 12 L.. L 1 Good [SEP] Impermissible L 0 [CLS] Permissible Delhi [SEP] Capital Restricted 27

Occupation Prediction 28

Occupation Prediction Accuracy Attention Mass 100 75 50 25 0 Original Manipulated   Manipulated   ( λ = 0.1) ( λ = 1.0) Attention type 28

Occupation Prediction Accuracy Attention Mass 100 99.7 97.2 75 50 25 0 Original Manipulated   Manipulated   ( λ = 0.1) ( λ = 1.0) Attention type 28

Occupation Prediction Accuracy Attention Mass 100 99.7 97.2 97.1 75 50 25 0 0 Original Manipulated   Manipulated   ( λ = 0.1) ( λ = 1.0) Attention type 28

Occupation Prediction Accuracy Attention Mass 100 99.7 97.2 97.4 97.1 75 50 25 0 0 0 Original Manipulated   Manipulated   ( λ = 0.1) ( λ = 1.0) Attention type 28

Classification Tasks 29

Alternate mechanisms Gender-Identification 30

Alternate mechanisms Gender-Identification At inference time, what if we hard set the corresponding attention mass to ZERO? 30

Alternate mechanisms Gender-Identification 50 % 100% At inference time, what if we hard set the corresponding attention mass to ZERO? 30

Bigram Flip 31

Bigram Flip Accuracy Attention Mass 100 75 50 25 0 Original None Uniform Manipulated Attention type 31

Bigram Flip Accuracy Attention Mass 100 100 94.5 75 50 25 0 Original None Uniform Manipulated Attention type 31

Bigram Flip Accuracy Attention Mass 100 100 96.5 94.5 75 50 25 0 0 Original None Uniform Manipulated Attention type 31

Bigram Flip Accuracy Attention Mass 100 100 97.9 96.5 94.5 75 50 25 0 5.2 0 Original None Uniform Manipulated Attention type 31

Bigram Flip Accuracy Attention Mass 100 100 99.9 97.9 96.5 94.5 75 50 25 0 5.2 0.4 0 Original None Uniform Manipulated Attention type 31

Bigram Flip Original 32

Bigram Flip Original Manipulated 32

Bigram Flip A di ff erent seed Original Manipulated 32

Sequence Copy 33

Sequence Copy Accuracy Attention Mass 100 75 50 25 0 Original None Uniform Manipulated Attention type 33

Sequence Copy Accuracy Attention Mass 100 100 98.8 75 50 25 0 Original None Uniform Manipulated Attention type 33

Sequence Copy Accuracy Attention Mass 100 100 98.8 84.1 75 50 25 0 0 Original None Uniform Manipulated Attention type 33

Sequence Copy Accuracy Attention Mass 100 100 98.8 93.8 84.1 75 50 25 0 5.2 0 Original None Uniform Manipulated Attention type 33

Sequence Copy Accuracy Attention Mass 100 100 99.9 98.8 93.8 84.1 75 50 25 0 5.2 0.01 0 Original None Uniform Manipulated Attention type 33

Attention and its (mis)interpretation Danish Pruthi 1 - PowerPoint PPT Presentation

Attention and its (mis)interpretation Danish Pruthi 1 Acknowledgements Mansi Gupta Bhuwan Dhingra Graham Neubig Zachary C. Lipton 2 Outline 1. What is attention mechanism? 2. Attention-as-explanations 3. Manipulating attention weights 4.

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in

INTERPRETATION INTERPRETATION INTERPRETATION INTERPRETATION How can I know what How can I know

Realistic Image Synthesis - MIS and Path Tracing - Philipp Slusallek Karol Myszkowski Gurprit

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A.

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is

Trends in Interpretation SCIC-Universities Conference 6-7 April 2017 Ana MOUZINHO DE

MIS Planning Building strong plans for Major Improvement Strategies John Argue 1 COVID-19 and

Siriraj Hospital Siriraj Hospital Siriraj MIS Activities Siriraj MIS Activities Today

Napoleon Bunaparte Lecture : Dy Chunsong Group: Nine Mr. Ben Phorn Mis Sok ngim Mis. Em Theary

MIS Project MIS Project Hala Salah Salah Hala Hany El- -Sawah Sawah Hany El Hany El Hany

Algorithms in Nature SOP & MIS Maximal Independent Set (MIS) A fundamental problem in

Maximal Independent Set Stefan Schmid @ T-Labs, 2011 What is a MIS? MIS An independent set

Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial

The Attention Economy What is the attention economy? A business model where you (as the

Geometric Interpretation of the Derivative (Review) Geometric Interpretation of the Derivative

An interpretation of surface displacements An interpretation of surface displacements An

Behavior Patient Safety Connection Behavior Patient Safety Connection Does Any Doubt Remain?

Chapter 5 Born about 370 AD. She was the first woman to make a substantial contribution

CSE 158 Web Mining and Recommender Systems Introduction What is CSE 158? In this course we will

Variations on Nonparametric Additive Models: Computational and Statistical Aspects John Lafferty

PDL1 inhibitors in Relapsed/ Refractory Hodgkin Lymphoma: Robert Chen, MD Associate Professor

Welcome to the LIFE Webinar Series. We will be starting soon. The Low-Income Forum on Energy

JUNE CROP PRODUCTION Executive Summary 12:00 PM JUNE 11, 2014 Contents Field Crops Fruit

Get Online with a Multifaceted, Multilingual, Professional Development Program for School

Attention and its (mis)interpretation Danish Pruthi 1 - PowerPoint PPT Presentation

Attention and its (mis)interpretation Danish Pruthi 1 Acknowledgements Mansi Gupta Bhuwan Dhingra Graham Neubig Zachary C. Lipton 2 Outline 1. What is attention mechanism? 2. Attention-as-explanations 3. Manipulating attention weights 4.

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in

INTERPRETATION INTERPRETATION INTERPRETATION INTERPRETATION How can I know what How can I know

Realistic Image Synthesis - MIS and Path Tracing - Philipp Slusallek Karol Myszkowski Gurprit

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A.

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is

Trends in Interpretation SCIC-Universities Conference 6-7 April 2017 Ana MOUZINHO DE

MIS Planning Building strong plans for Major Improvement Strategies John Argue 1 COVID-19 and

Siriraj Hospital Siriraj Hospital Siriraj MIS Activities Siriraj MIS Activities Today

Napoleon Bunaparte Lecture : Dy Chunsong Group: Nine Mr. Ben Phorn Mis Sok ngim Mis. Em Theary

MIS Project MIS Project Hala Salah Salah Hala Hany El- -Sawah Sawah Hany El Hany El Hany

Algorithms in Nature SOP &amp; MIS Maximal Independent Set (MIS) A fundamental problem in

Maximal Independent Set Stefan Schmid @ T-Labs, 2011 What is a MIS? MIS An independent set

Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial

The Attention Economy What is the attention economy? A business model where you (as the

Geometric Interpretation of the Derivative (Review) Geometric Interpretation of the Derivative

An interpretation of surface displacements An interpretation of surface displacements An

Behavior Patient Safety Connection Behavior Patient Safety Connection Does Any Doubt Remain?

Chapter 5 Born about 370 AD. She was the first woman to make a substantial contribution

CSE 158 Web Mining and Recommender Systems Introduction What is CSE 158? In this course we will

Variations on Nonparametric Additive Models: Computational and Statistical Aspects John Lafferty

PDL1 inhibitors in Relapsed/ Refractory Hodgkin Lymphoma: Robert Chen, MD Associate Professor

Welcome to the LIFE Webinar Series. We will be starting soon. The Low-Income Forum on Energy

JUNE CROP PRODUCTION Executive Summary 12:00 PM JUNE 11, 2014 Contents Field Crops Fruit

Get Online with a Multifaceted, Multilingual, Professional Development Program for School

Algorithms in Nature SOP & MIS Maximal Independent Set (MIS) A fundamental problem in