Attention for Machine Comprehension Made by : Rishab Goel Based on - PowerPoint PPT Presentation

Jan 29, 2024 •204 likes •681 views

Attention for Machine Comprehension Made by : Rishab Goel Based on slides by: Alex Graves, Hien Quoc, Renjie Liao Highway Networks Benefits ... Benefits ... Importance ... For training very deep architectures By allowing better information

Attention for Machine Comprehension Made by : Rishab Goel Based on slides by: Alex Graves, Hien Quoc, Renjie Liao
Highway Networks
Benefits ...
Benefits ...
Importance ... For training very deep architectures By allowing better information flow Better optimization Intuition : linear transformation/input suffice for learning, language at higher level of http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/ abstraction???
Hien Quoc Dang
Idea of Maxout Hien Quoc Dang
Intuitions Inspired from dropout Similar to bagging but integrated as a part of single network Hien Quoc Dang
Idea of Maxout ... Hien Quoc Dang
Idea of Maxout ... Hien Quoc Dang
Comparison to Rectifiers Hien Quoc Dang
Why Maxout Work ? Hien Quoc Dang
Slides : Santi Pascual
LSTMs ... Chris Olah’s blog
Need for Attention The embeddings not sufficient to encode information over long distances Helps to attend to important patch of data Interpretability to the model
Attentive Reader
DYNAMIC COATTENTION NETWORKS FOR QUESTION ANSWERING Authors : Caiming Xiong, Victor Zhong, Richard Socher
Introduction Machine Comprehension No knowledge base required Till SQUAD no large scale, natural dataset Cloze style datasets like CNN/Mail Daily Synthetic/small size
About SQuAD Consists questions on a set of Wikipedia articles Wh type questions The answer is a segment of text, or span Source : Rajpurkar et al.
Model in nutshell ... Socher et al
Doc and Query Encoder Socher et al
Liked ● Gagan Socher et al
Liked : all Dynamic Decoder Socher et al
Highway Maxout Network ... Socher et al
Socher et al
Socher et al
Disliked ● Gagan (pt. 3) Implementation ● Akshay (pt. 4) claim not proven 1. CoreNLP for preprocessing 2. GloVe word vectors pretrained on 840B Common Crawl corpus 3. OOV set to 0 4. Sentinel vectors randomly initialized, optimized during training
Iterative process visualisation ... Socher et al
Socher et al
Disliked ● Haroun (ensemble gain too Results much) Socher et al
Liked ● Barun ● Nupur Socher et al
Liked Performance across diff. types of ques. ● Shantanu Socher et al
Liked ● Prachi Ablation studies ... Socher et al
Predictions Socher et al
Logistic Regression Prediction : Theatre Museum Socher et al
Comments : Trouble decoding multiple intuitive answer Socher et al
Cons Lack error analysis, need more ablation studies[Barun, Surag] System give extractive answer and not abstractive[Nupur] Do not compare HMN and MN[all] Unintuitive decoder[Dinesh]
Doubts ... Why HMN worked out? Role of sentinel vectors?? Error propagation in argmax function Maxout for LSTMs as well (not clear) Use multiple initialisation of start and end pointers ( how ??)
Extensions ... Use approach for others datasets like CNN/Daily Mail and MS COCO QA [Barun] Use different attention, Match LSTM [Barun] Bi-directional attention [Gagan] Use iterative idea to visual QA, classification, NER, SRL etc [Akshay, Surag] Find synonyms[Haroun]
Extensions ... Combine char2vec and word2vec embeddings to represent the document and query
Thanks!

Recommend

Comprehension Skills: Teacher Presentation Book, Comprehension Skills: Teacher Presentation Book,

HY5KX0GQ5VVT // eBook Comprehension Skills: Teacher Presentation Book, Comprehension B1 Comprehension Skills: Teacher Presentation Book, Comprehension Skills: Teacher Presentation Book, Comprehension B1 Comprehension B1 Filesize: 1.96 MB

47 views • 4 slides

Literacy Strategies Literacy Strategies What is comprehension? What is comprehension? Simply

Literacy Strategies Literacy Strategies What is comprehension? What is comprehension? Simply put, comprehension is the act of Simply put, comprehension is the act of understanding what you are reading. understanding what you are

709 views • 21 slides

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in

Attention in NLP CS 6956: Deep Learning for NLP Overview What is attention Attention in encoder-decoder networks Various kinds of attention 2 Overview What is attention? Attention in encoder-decoder networks 3 Visual

973 views • 73 slides

Quantifying Program Complexity and Comprehension Quantifying Program Complexity and Comprehension

Quantifying Program Complexity and Comprehension Quantifying Program Complexity and Comprehension Quantifying Program Complexity and Comprehension Michael Hansen, Andrew Lumsdaine, Rob Goldstone, Raquel Hill, Chen Yu Michael Hansen, Andrew

1.34k views • 99 slides

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is

Attention Eye tracking seminar 2/19/15 Presented by Tatiana Emmanouil Outline What is attention? How is attention allocated? How are eye movements related to attention? Further questions Attention Attention

332 views • 18 slides

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A.

Attention, Transformer and BERT Prof. Kuan-Ting Lai 2020/6/16 Attention is All You Need! A. Waswani et al., NIPS , 2017 Google Brain & University of Toronto 2 Attention Visual attention and textual attention

629 views • 21 slides

COMPREHENSION Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi Presenter: Wenda

BI-DIRECTIONAL ATTENTION FLOW FOR MACHINE COMPREHENSION Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi Presenter: Wenda Qiu 04/01/2020 Machine Comprehension Question Answering: Answer a query about a given context

655 views • 61 slides

(Age 7-11) A new solution for guided reading Agenda Why a comprehension programme? What is Bug

Introducing Bug Club Comprehension (Age 7-11) A new solution for guided reading Agenda Why a comprehension programme? What is Bug Club Comprehension? Pedagogical principles A week of teaching in practice Bug Club Family Positioning

793 views • 41 slides

End of Year Exam (SA2) Components 1) Language Usage and Comprehension 2) Oral 3) Listening

End of Year Exam (SA2) Components 1) Language Usage and Comprehension 2) Oral 3) Listening Comprehension 4) Writing Language Usage and Comprehension Duration: 1h 15 mins Booklet A Booklet B Grammar MCQ Grammar Cloze (2 passages

893 views • 31 slides

Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial

4/14/17 Attention! 1. Definitions and behavioral effects 2. Effects on neural firing rates: Spatial attention Attention to features 3. Directing attention: Posterior parietal cortex Frontal eye fields Top-down and bottom-up attention 1

338 views • 17 slides

The Attention Economy What is the attention economy? A business model where you (as the

The Attention Economy What is the attention economy? A business model where you (as the company) want to hold the users attention as much as possible. Attention is treat like a scarce resource What are ethical issues that have emerged

170 views • 3 slides

Advanced Neural Machine Translation Gongbo Tang 23 September 2019 Outline NMT with Attention

Advanced Neural Machine Translation Gongbo Tang 23 September 2019 Outline NMT with Attention Mechanisms 1 Attention Mechanisms Understanding Attention Mechanisms Attention Variants NMT at Different Granularities 2 Hybrid Models

831 views • 57 slides

Advanced Neural Machine Translation Gongbo Tang 21 September 2020 Outline NMT with Attention

Advanced Neural Machine Translation Gongbo Tang 21 September 2020 Outline NMT with Attention Mechanisms 1 Attention Mechanisms Understanding Attention Mechanisms Attention Variants NMT at Different Granularities 2 Hybrid Models

811 views • 56 slides

Coordinated Interplay of Scene, Utterance, and World Knowledge - Comprehension of spoken

Coordinated Interplay of Scene, Utterance, and World Knowledge - Comprehension of spoken utterances that relate to a visual scene - Eye movements in scenes during utterance comprehension - utterance can direct attention to an object in the scene

209 views • 10 slides

Using Natural Language Relations between Answer Choices for Machine Comprehension Rajkumar Pujari

Using Natural Language Relations between Answer Choices for Machine Comprehension Rajkumar Pujari and Dan Goldwasser June 5, 2019 Overview Model Results Conclusion Intuition Intuition When humans perform Reading Comprehension, we answer

329 views • 32 slides

Convolutional Spatial Attention Model for Reading Comprehension with Multiple- Choice Questions Z

Convolutional Spatial Attention Model for Reading Comprehension with Multiple- Choice Questions Z HIPENG C HEN , Y IMING C UI * , W ENTAO M A , S HIJIN W ANG , G UOPING H U J OINT L ABORATORY OF HIT AND I FLYTEK R ESEARCH ( HFL ), B EIJING , C

936 views • 28 slides

FlexDNN: Input-Adaptive On-Device Deep Learning for Efficient Mobile Vision ACM/IEEE Symposium on

FlexDNN: Input-Adaptive On-Device Deep Learning for Efficient Mobile Vision ACM/IEEE Symposium on Edge Computing (SEC) Biyi Fang, Xiao Zeng, Faen Zhang, Hui Xu and Mi Zhang 1 Mobile Vision Systems are Revolutionizing Our Lives Now Drones

422 views • 20 slides

Coupling Technical Assistance with Student Service Learning in Mine Water Reclamation KELSEA J.

Coupling Technical Assistance with Student Service Learning in Mine Water Reclamation KELSEA J. GREEN, MORGAN C. WHITED AND WILLIAM H. J. STROSNIDER IN ASSOCIATION WITH SAINT FRANCIS UNIVERSITY AND THE CENTER FOR WATERSHED RESEARCH AND SERVICE

504 views • 22 slides

Analysis of electronic voting protocols in applied pi calculus Mark Ryan University of

Analysis of electronic voting protocols in applied pi calculus Mark Ryan University of Birmingham based on joint work with Ben Smyth Steve Kremer Mounira Kourjieh IFIP WG 1.3, Udine, Italy September 2009 Outline Electronic voting Applied

695 views • 30 slides

III.3.1 Alluvial fans Geoscience: the Earth and its Resources

III.3.1 Alluvial fans Geoscience: the Earth and its Resources Prof. Dr. G. Berto, sink distribu3on distribu3on source source The high mountains:

361 views • 8 slides

Ice-sheet dynamics: the influence of glacier sliding on ice loss and sea level Ian Hewitt,

Ice-sheet dynamics: the influence of glacier sliding on ice loss and sea level Ian Hewitt, Mathematical Institute, University of Oxford Greenland How does meltwater penetrating to the bed affect ice-sheet motion? What implications does this

703 views • 34 slides

HVP contribution of the light quarks Davide Giusti to (g -2) including QED corrections with

HVP contribution of the light quarks Davide Giusti to (g -2) including QED corrections with twisted-mass fermions OUTLINE Isospin breaking effects on the lattice XXXVI International (RM123 method) Symposium on Lattice Field Theory

505 views • 22 slides

Adventures with AIRS: continued Tim P. Barnett David W. Pierce Eric Fetzer Andrew Gettleman

Adventures with AIRS: continued Tim P. Barnett David W. Pierce Eric Fetzer Andrew Gettleman Amy Braverman Sam Iacobellis Outline/Summary Water Vapor: AIRS vs. Climate Models (models wrong AND error is important) . Cloud Issues (what

430 views • 26 slides

Radon monitoring in the Kamioka mine Guillaume Pronost Kamioka Observatory, ICRR, University of

Radon monitoring in the Kamioka mine Guillaume Pronost Kamioka Observatory, ICRR, University of Tokyo TAUP conference, 2019 September 10th, Toyama (Supported by KAKENHI Grant-in-Aid for Scientific Research on Innovative Areas 26104008) Why

512 views • 17 slides