A Convolutional Neural Network for Modelling Sentences - PowerPoint PPT Presentation

SLIDE 1

A Convolutional Neural Network for Modelling Sentences

Nal Kalchbrenner Edward Grefenstette Phil Blunsom

Department of Computer Science, Oxford University

SLIDE 2

Overview of Model

Represent sentences by extracting increasingly abstract features.

Input: a sequence of word embeddings
Output: classification probabilities

Each layer involves:

  • 1. Convolution
  • 2. Dynamic k-max pooling
  • 3. A non-linearity (tanh)
SLIDE 3

One-Dimensional Convolution

  • 1. The filter m ∈ R^m
  • 2. The sequence s ∈ R^s

Returns the sequence c ∈ R^{s−m+1}:

c_j = m^T s_{j−m+1:j},  j = 1, ..., s − m + 1

i.e. a dot product between each length-m subsequence of s and the filter m. A wide convolution instead zero-pads s with m − 1 zeros on each side, so every filter position that overlaps s is kept, giving s + m − 1 output values.
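The narrow and wide variants can be sketched in NumPy. This is a minimal illustration, not the authors' implementation; `narrow_conv1d` and `wide_conv1d` are hypothetical names:

```python
import numpy as np

def narrow_conv1d(m, s):
    """Narrow 1-D convolution: dot product of the filter m with each
    length-m subsequence of s; output has length len(s) - len(m) + 1."""
    fw = len(m)
    return np.array([m @ s[j:j + fw] for j in range(len(s) - fw + 1)])

def wide_conv1d(m, s):
    """Wide convolution: zero-pad s with len(m) - 1 zeros on each side,
    so every filter position that overlaps s contributes an output."""
    fw = len(m)
    padded = np.concatenate([np.zeros(fw - 1), s, np.zeros(fw - 1)])
    return narrow_conv1d(m, padded)

m = np.array([1.0, 2.0])
s = np.array([1.0, 0.0, -1.0])
print(narrow_conv1d(m, s))  # 2 values: s - m + 1 = 2
print(wide_conv1d(m, s))    # 4 values: s + m - 1 = 4
```

Note this follows the slide's formula literally (no filter reversal, i.e. cross-correlation), which is the usual convention in neural networks.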

SLIDE 4

Convolution with Word Embeddings

Assume word embeddings of dimension d. The filter m is then in R^{d×m} and the sentence matrix s in R^{d×s}. Each row of m is convolved with the corresponding row of s.
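A minimal NumPy sketch of this row-wise wide convolution, one filter row per embedding dimension; `rowwise_wide_conv` is a hypothetical name, not from the paper:

```python
import numpy as np

def rowwise_wide_conv(M, S):
    """Convolve each row of the d x m filter M with the corresponding
    row of the d x s sentence matrix S (wide convolution per row)."""
    d, fw = M.shape
    _, slen = S.shape
    out = np.zeros((d, slen + fw - 1))
    for i in range(d):
        # pad this row with fw - 1 zeros on each side, then slide the filter
        pad = np.concatenate([np.zeros(fw - 1), S[i], np.zeros(fw - 1)])
        out[i] = [M[i] @ pad[j:j + fw] for j in range(slen + fw - 1)]
    return out

S = np.ones((2, 3))   # d = 2 embedding dimensions, sentence length 3
M = np.ones((2, 2))   # filter width 2
print(rowwise_wide_conv(M, S).shape)  # (2, 4): one output row per dimension
```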

SLIDE 5

k-Max Pooling (LeCun et al.)

Given k and a sequence p ∈ R^p, with p ≥ k:

  • 1. Return the k largest elements of p
  • 2. Keep the elements in their original order

Denoted p^k_max ∈ R^k
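The two steps above (select the k largest values, keep their original order) can be sketched in NumPy; `k_max_pool` is a hypothetical name:

```python
import numpy as np

def k_max_pool(p, k):
    """Return the k largest values of p, preserved in their original order."""
    # indices of the k largest entries, then sort the indices to keep order
    idx = np.sort(np.argpartition(p, -k)[-k:])
    return p[idx]

p = np.array([3.0, 1.0, 5.0, 2.0, 4.0])
print(k_max_pool(p, 3))  # [3. 5. 4.]
```

Sorting the selected indices, rather than the values, is what preserves the relative order of the features in the sequence.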

SLIDE 6

Dynamic k-Max Pooling

“Smooth extraction of higher-order features”

k_l = max( k_top, ⌈ (L − l) / L · s ⌉ )

  • k_top is a fixed parameter
  • l is the current layer
  • L is the total number of layers
  • s is the sentence length
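The schedule above can be sketched directly; `dynamic_k` is a hypothetical name, and the multiplication is done before the division so the intermediate stays exact for integer inputs:

```python
from math import ceil

def dynamic_k(l, L, s, k_top):
    """k for the pooling layer at depth l of L, sentence length s:
    interpolates from roughly s down to the fixed top-level k_top."""
    return max(k_top, ceil((L - l) * s / L))

# e.g. L = 3 layers, sentence of length 18, k_top = 3
print([dynamic_k(l, 3, 18, 3) for l in (1, 2, 3)])  # [12, 6, 3]
```

So earlier layers keep proportionally more features of a long sentence, while the top layer always pools down to exactly k_top values regardless of sentence length.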

SLIDE 7

Folding

Elementwise sum of pairs of rows of a matrix:

f : R^{d×n} → R^{d/2×n}

f(M) = N, where N[i, j] = M[2i, j] + M[2i+1, j] for i = 0, ..., d/2 − 1 and j = 0, ..., n − 1

  • Introduces dependencies between different feature rows
  • No added parameters
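Folding is a one-liner with NumPy strided slicing; a minimal sketch, with `fold` as a hypothetical name:

```python
import numpy as np

def fold(M):
    """Sum consecutive pairs of rows: shape (d, n) -> (d // 2, n).
    Adds no parameters but lets features in paired rows interact."""
    d, _ = M.shape
    assert d % 2 == 0, "folding assumes an even number of rows"
    return M[0::2] + M[1::2]  # even-indexed rows + odd-indexed rows

M = np.arange(8.0).reshape(4, 2)
print(fold(M))  # row0 + row1 and row2 + row3: [[2. 4.] [10. 12.]]
```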

SLIDE 8
SLIDE 9

Size of Network

Model         First layer       Second layer      k-top
              Width  Filters    Width  Filters
Binary          7       6         5      14         4
Multi-class    10       6         7      12         5

SLIDE 10

Training

The top layer is a softmax nonlinearity that predicts a probability distribution over classes. The objective function includes L2 regularization of the parameters. The parameters are the word embeddings, the filter weights, and the fully connected layers. Training uses Adagrad with mini-batches: “Processes multiple millions of sentences per hour on one GPU”.
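The slide names Adagrad as the optimizer; as a hedged sketch of the per-parameter update rule (illustrative learning rate and target, not the paper's hyperparameters):

```python
import numpy as np

def adagrad_step(w, grad, hist, lr=0.1, eps=1e-6):
    """One Adagrad update: scale the learning rate per parameter by the
    root of the accumulated squared gradients for that parameter."""
    hist += grad ** 2
    w -= lr * grad / (np.sqrt(hist) + eps)
    return w, hist

target = np.array([1.0, -1.0, 0.5])
w = np.zeros(3)
hist = np.zeros(3)
for _ in range(5):
    grad = 2 * (w - target)  # gradient of the quadratic ||w - target||^2
    w, hist = adagrad_step(w, grad, hist)
```

Parameters with a history of large gradients get a smaller effective step, which suits the mix of dense filter weights and sparse word-embedding updates in this model.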
SLIDE 11

Experiments

  • 1. Predicting sentiment of movie reviews - binary (Socher et al. 2013)
  • 2. Predicting sentiment of movie reviews - multi-class (Socher et al. 2013)
  • 3. Categorization of questions (Li and Roth 2002)
  • 4. Sentiment of tweets, with labels based on emoticons (Go et al. 2009)

Feature embedding dimensionality is chosen based on the size of the dataset.

SLIDE 12

Movies accuracy

SLIDE 13

First layer feature-detectors

SLIDE 14

TREC 6-way classification accuracy

SLIDE 15

Twitter sentiment

SLIDE 16

Conclusion

Dynamic Convolutional Neural Networks

  • Convolutions apply a function to n-grams
  • Dynamic k-max pooling extracts the most active features, choosing k based on the layer and the sentence length
  • Composing these two operations can be seen as feature detection
  • Outperformed or stayed competitive with other neural approaches, baseline models, and state-of-the-art approaches, without needing handcrafted features