Compositional Distributional Semantic Models for Semantic - PowerPoint PPT Presentation

Compositional Distributional Semantic Models for Semantic Relatedness and Entailment Sidharth Gupta Sai Krishna Prasad Guided By:- Amitabha Mukherjee

Distributional Semantic Models (DSMs) ● Distributional hypothesis - words that occur in the same context tend to have similar meanings ● Firth - “a word is characterised by the company it keeps” ● Collect distributional information for words in a corpus in high-dimensional vectors ● Unsupervised learning of vectors for words ● Semantic similarity for words - define in terms of vector similarity

Compositional DSMs ● How to combine the meanings of words, to understand the semantics of full sentences? ● Extend DSMs - compositionality ● Simple approaches: ○ Weighted sum of vectors ○ Element wise product of vectors ○ Commutative, no attention to syntax ● Operator words - modify the meanings of other words in their context (adjectives, transitive verbs) ● Model these as matrices - “act” on the meanings of other words

DataSet ● SICK Database ● 10,000 english sentence pairs divided equally between the training and test data sets ● The training data contains the following fields 1. sentence_A 2. sentence_B 3. relatedness_score 4. entailment_judgment - Entailment, Neutral or Contradiction

Task ● SemEval 2014 - task 1 ● SubTask 1 - output the degree of relatedness between two sentences ● SubTask 2 - output the semantic entailment holding between two sentences ● Relatedness score in the training data - average score given by 10 human beings collected for each pair. ● Entailment label - majority label of 5 human beings

Categorical CDSMs ● Grefenstette and Sadrzadeh, 2011 [2] ● Pregroup grammars specify syntax for sentences/phrases in the language ● Pregroup grammars - associate types (atomic or compound) with all words in the lexicon ● Eg. cats [ n ] like [ n r sn l ] milk [ n ] ● Syntax guided semantic composition ● Using distribution information for words provided by a DSM, construct matrices for relational words

Categorical CDSMs ● Matrices for a relational word P ○ dimensionality mr ( r x r x … r - m times) ○ m - adjoint types specified by grammar ○ sum over all instances in corpus appropriate element from the corresponding word vectors ( w 1 ,w 2 ,...,w m ) ● Sentence vector computation ○ Elementwise product over the the matrix for P and the appropriate element from w 1 x w 2 x … x w m

Recursive Matrix-Vector Spaces ● A word is represented using a vector and a matrix ● The vector contains the meaning of the word (a = R n ) ● The matrix Captures how the word changes the meaning of neighbouring words or phrases.(A = n*n) ● A composition of two words is represented as p = f(a, b, R, K) = P = Where R is a known syntactic relation, K is background knowledge, and W and W m are (n*2n) matrices

Recursive Matrix-Vector Spaces ● The model generalizes many earlier models such as 1. Mitchell and Lapata where p = Ba + Ab 2. Baroni and Zamparelli where p = Ab 3. Socher (2011) where p = a + b ● θ = (W,W M ,W label ,L,L M ) Learning is done by using gradient descent method over the parameter space ● To reduce the dimensionality we represent A = U*V + dia(a) ● It is also the only model that properly negates the sentiment

References 1. Socher, Richard, Brody Huval, Christopher Manning, and Andrew Ng, 2012. Semantic compositionality through recursive matrix--vector spaces. In Proceedings of EMNLP. 2. Grefenstette, Edward and Mehrnoosh Sadrzadeh, 2011. Experimental support for a categorical compositional distributional model of meaning. In Proceedings of EMNLP. 3. Mitchell, Jeff and Mirella Lapata, 2008. Vector -based models of semantic composition. In Proceedings of ACL. Columbus, OH. 4. Mitchell, Jeff and Mirella Lapata, 2010. Composition in distributional models of semantics. Cognitive Science, 34(8): 1388–1429.

Examples ● A man is jumping into an empty pool. There is no biker jumping in the air. Score :- 1.6 ● A person in black jacket is doing tricks on the motorbike. A man in black jacket is doing tricks on the motorbike. Score :- 4.9 ● Two teams are competing in a football match. Two groups of people are competing in a football match. Entailment ● The brown horse is near a red barrel. The brown horse is far from a red barrel. Contradiction ● A man in black jacket is doing tricks on a motorbike. A man is riding a cycle. Neutral

Compositional Distributional Semantic Models for Semantic - PowerPoint PPT Presentation

Compositional Distributional Semantic Models for Semantic Relatedness and Entailment Sidharth Gupta Sai Krishna Prasad Guided By:- Amitabha Mukherjee Distributional Semantic Models (DSMs) Distributional hypothesis - words that occur in

Compositional Distributional Models of Meaning Dimitri Kartsaklis Mehrnoosh Sadrzadeh School of

Synonymy in an approach to combined distributional and compositional semantics Ann Copestake and

Distributional Semantics The unsupervised modeling of meaning on a large scale Tim Van de Cruys

Linear mixed models with improper priors and flexible distributional assumptions for longitudinal

Compositional and Distributional Models of Meaning for Natural Language Stephen Clark Natural

Incrementality in Compositional Distributional Semantics M. Sadrzadeh, EECS, QMUL SemDial 2018

Compositionality in Semantic Spaces Martha Lewis ILLC University of Amsterdam 2nd Symposium on

Learning Compositional Semantics for Introduction Open Domain Semantic Parsing Meaning

Type Theory and Distributional Models of Meaning Shalom Lappin Kings College London Workshop

Distributional Compositionality Intro to Distributional Semantics Raffaella Bernardi University

Statistics and Samples in Distributional Reinforcement Learning Mark Rowland, Robert Dadashi,

Statistics and Samples in Distributional Reinforcement Learning Rowland, Dadashi, Kumar, Munos,

Automatic construction of distributional thesaurus (for multiple languages) Zheng ZHANG 1 st

Detecting Learner Errors in the Choice of Content Words using Compositional Distributional

Understanding compound words A new perspective from compositional systems in distributional

Event Knowledge in Compositional Distributional Semantics Ludovica Pannitto Master Thesis in

Welcome Feel free to use your computer at any time Introduce yourself to your table

Unit 5: Inference for categorical variables Lecture 3: Chi-square tests Statistics 101 Thomas

Classification of finite semigroups and categories using computational methods Najwa Ghannoum W.

MA/CSSE 473 Day 14 Strassen's Algorithm: Matrix Multiplication Decrease and Conquer DFS

Cloud Computing Gabriel Antoniu Inria Computing as a Utility first suggested by John McCarthy in

EC project calls & funding available for storgae projects Peter Szegedi TERENA TERENA

The Bright Side of Black Holes : dark matter, primordial black holes and the cosmic infrared

MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video Retrieval System Retrieval System

Compositional Distributional Semantic Models for Semantic - PowerPoint PPT Presentation

Compositional Distributional Semantic Models for Semantic Relatedness and Entailment Sidharth Gupta Sai Krishna Prasad Guided By:- Amitabha Mukherjee Distributional Semantic Models (DSMs) Distributional hypothesis - words that occur in

Compositional Distributional Models of Meaning Dimitri Kartsaklis Mehrnoosh Sadrzadeh School of

Synonymy in an approach to combined distributional and compositional semantics Ann Copestake and

Distributional Semantics The unsupervised modeling of meaning on a large scale Tim Van de Cruys

Linear mixed models with improper priors and flexible distributional assumptions for longitudinal

Compositional and Distributional Models of Meaning for Natural Language Stephen Clark Natural

Incrementality in Compositional Distributional Semantics M. Sadrzadeh, EECS, QMUL SemDial 2018

Compositionality in Semantic Spaces Martha Lewis ILLC University of Amsterdam 2nd Symposium on

Learning Compositional Semantics for Introduction Open Domain Semantic Parsing Meaning

Type Theory and Distributional Models of Meaning Shalom Lappin Kings College London Workshop

Distributional Compositionality Intro to Distributional Semantics Raffaella Bernardi University

Statistics and Samples in Distributional Reinforcement Learning Mark Rowland, Robert Dadashi,

Statistics and Samples in Distributional Reinforcement Learning Rowland, Dadashi, Kumar, Munos,

Automatic construction of distributional thesaurus (for multiple languages) Zheng ZHANG 1 st

Detecting Learner Errors in the Choice of Content Words using Compositional Distributional

Understanding compound words A new perspective from compositional systems in distributional

Event Knowledge in Compositional Distributional Semantics Ludovica Pannitto Master Thesis in

Welcome Feel free to use your computer at any time Introduce yourself to your table

Unit 5: Inference for categorical variables Lecture 3: Chi-square tests Statistics 101 Thomas

Classification of finite semigroups and categories using computational methods Najwa Ghannoum W.

MA/CSSE 473 Day 14 Strassen's Algorithm: Matrix Multiplication Decrease and Conquer DFS

Cloud Computing Gabriel Antoniu Inria Computing as a Utility first suggested by John McCarthy in

EC project calls &amp; funding available for storgae projects Peter Szegedi TERENA TERENA

The Bright Side of Black Holes : dark matter, primordial black holes and the cosmic infrared

MCG-ICT-CAS TRECVID 2008 Automatic Video 2008 Automatic Video Retrieval System Retrieval System

EC project calls & funding available for storgae projects Peter Szegedi TERENA TERENA