SLIDE 1

Decompositional Semantics for Improved Language Models

Pranjal Singh

Supervisor: Dr. Amitabha Mukerjee

B.Tech - M.Tech Dual Degree Thesis Defense Department of Computer Science & Engineering IIT Kanpur

June 15, 2015

SLIDE 2

Outline

1. Introduction
2. Background
3. Datasets
4. Method and Experiments
5. Results
6. Conclusion and Future Work
7. Appendix

SLIDE 4

Introduction to Decompositional Semantics

Decompositional Semantics describes a language entity (a word, paragraph, or document) by a constrained representation that identifies the most relevant aspects conveying the semantics of the whole. For example, a document can be decomposed into aspects such as its tf-idf representation, its distributed-semantics vector, and so on.

SLIDE 5

Introduction to Decompositional Semantics

Why do we need Decompositional Semantics?

- It is language independent
- It decomposes a language entity into the various aspects that are latent in its meaning
- Each aspect is important in its own way

SLIDE 6

Introduction to Decompositional Semantics

In the sentiment analysis domain, Decompositional Semantics works with:

- A set of documents D = {d1, . . . , d|D|}
- A set of aspects A = {a1, . . . , a|A|}
- Training labels for n (n < |D|) documents, T = {ld1, . . . , ldn}

Example:

| Documents | tf-idf | Word Vector Average | Document Vector | BOW |
|---|---|---|---|---|
| d1 | 1 | | | |
| d2 | 1 | 1 | | |
| d3 | 1 | 1 | | |
| d4 | x | x | x | x |
| d5 | 1 | 1 | 1 | 1 |

Using T, D, and A, a supervised classifier C learns a representation to predict the sentiment of individual documents.
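As an illustration, a minimal Python sketch of this setup, assuming scikit-learn; the toy documents, labels, and the random stand-in for the word-vector aspect are hypothetical:

```python
import numpy as np
from sklearn.svm import LinearSVC
from sklearn.feature_extraction.text import TfidfVectorizer

# Each document is decomposed into two aspects (tf-idf and an averaged word
# vector, here a random stand-in); the aspects are concatenated and fed to a
# supervised classifier C.
docs = ["the movie is funny", "the screenplay is bad", "a good film", "a dull film"]
labels = [1, 0, 1, 0]  # training labels T for the n labeled documents

tfidf = TfidfVectorizer().fit_transform(docs).toarray()  # aspect: tf-idf
rng = np.random.default_rng(0)
avg_word_vecs = rng.normal(size=(len(docs), 16))         # aspect: word vectors

features = np.hstack([tfidf, avg_word_vecs])  # composite representation
clf = LinearSVC().fit(features, labels)       # classifier C
print(clf.predict(features[:1]))
```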

SLIDE 7

Problem Statement

Better language representation:

- To highlight the importance of Decompositional Semantics in language representation
- To use Distributional Semantics for under-resourced languages such as Hindi
- To demonstrate the effect of various parameters on language representation

SLIDE 8

Contribution of this thesis

Hindi

- Better representation of Hindi text using distributional semantics
- Achieved state-of-the-art results for sentiment analysis on product and movie review corpora
- Paper accepted at ICON'15

New Corpus

- Released a corpus of 700 Hindi movie reviews, the largest Hindi corpus in the reviews domain

English

- Proposed a more generic representation of English text
- Achieved state-of-the-art results for sentiment analysis on IMDB movie reviews and Amazon electronics reviews
- Paper submitted to EMNLP'15

SLIDE 12

Background on Language Representation

Bag of Words (BOW) Model

- Document di is represented by vdi ∈ R^|V|
- Each element of vdi denotes the presence or absence of a word
- Drawbacks:
  - High dimensionality
  - Ignores word ordering
  - Ignores word context
  - Very sparse
  - No relative importance given to words
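A minimal sketch of building such binary BOW vectors (the toy documents are illustrative):

```python
# Each document becomes a |V|-dimensional 0/1 vector: presence/absence only,
# with no word ordering, no context, and no weighting.
def build_vocab(documents):
    vocab = {}
    for doc in documents:
        for word in doc.split():
            vocab.setdefault(word, len(vocab))
    return vocab

def bow_vector(doc, vocab):
    vec = [0] * len(vocab)  # very sparse in practice
    for word in doc.split():
        vec[vocab[word]] = 1
    return vec

docs = ["the movie is funny", "the screenplay is good"]
vocab = build_vocab(docs)
print(bow_vector(docs[0], vocab))
```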

SLIDE 14

Background on Language Representation

Term Frequency–Inverse Document Frequency (tf-idf) Model

- Document di is represented by vdi ∈ R^|V|
- Each element of vdi is the product of term frequency and inverse document frequency: tfidf(t, d) = tf(t, d) × log(|D| / df(t))
- Gives higher weight to terms that are less frequent across documents and hence more discriminative
- Drawbacks:
  - High dimensionality
  - Ignores word ordering
  - Ignores word context
  - Very sparse
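A minimal sketch of the tf-idf weighting above (the toy documents are illustrative):

```python
import math

# tfidf(t, d) = tf(t, d) * log(|D| / df(t)): words that appear in every
# document get weight log(1) = 0, rarer words get larger weights.
def tfidf_vectors(documents):
    docs = [doc.split() for doc in documents]
    vocab = sorted({w for doc in docs for w in doc})
    df = {w: sum(w in doc for doc in docs) for w in vocab}
    n_docs = len(docs)
    vectors = []
    for doc in docs:
        vec = [doc.count(w) * math.log(n_docs / df[w]) for w in vocab]
        vectors.append(vec)
    return vocab, vectors

vocab, vecs = tfidf_vectors(["the movie is funny", "the screenplay is good"])
print(dict(zip(vocab, vecs[0])))
```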

SLIDE 16

Background on Language Representation

Distributed Representation of Words (Mikolov et al., 2013b)

- Each word wi ∈ V is represented by a vector vwi ∈ R^k
- The vocabulary V can be represented by a matrix V ∈ R^{k×|V|}
- The vectors vwi should encode the semantics of the words in the vocabulary
- Drawbacks:
  - Ignores exact word ordering
  - Cannot represent documents as vectors without composition
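As a sketch, word vectors of this kind can be trained with an off-the-shelf toolkit; the snippet below assumes the gensim library (parameter names follow gensim >= 4.0; older versions use size=) and toy sentences:

```python
from gensim.models import Word2Vec

sentences = [
    ["the", "movie", "is", "funny"],
    ["the", "screenplay", "is", "good"],
]
model = Word2Vec(
    sentences,
    vector_size=100,  # dimensionality k of each word vector
    window=5,         # context size around the current word
    sg=1,             # sg=1 selects skip-gram, sg=0 selects CBOW
    min_count=1,
)
vec = model.wv["movie"]  # v_{w_i} in R^k
```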

SLIDE 18

Background on Language Representation

Distributed Representation of Documents (Le and Mikolov, 2014)

- Each document di ∈ D is represented by a vector vdi ∈ R^k
- The set D can be represented by a matrix D ∈ R^{k×|D|}
- The vectors vdi should encode the semantics of the documents
- Comments:
  - Can represent documents directly
  - Ignores the contribution of individual words while building document vectors
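A corresponding sketch for paragraph vectors, again assuming gensim's Doc2Vec implementation (names follow gensim >= 4.0) with toy documents:

```python
from gensim.models.doc2vec import Doc2Vec, TaggedDocument

corpus = [
    TaggedDocument(words=["the", "movie", "is", "funny"], tags=["d1"]),
    TaggedDocument(words=["the", "screenplay", "is", "good"], tags=["d2"]),
]
model = Doc2Vec(corpus, vector_size=100, window=5, min_count=1, epochs=20)
doc_vec = model.dv["d1"]  # v_{d_i} in R^k
```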

SLIDE 20

Background on Sentiment Analysis

- Pang et al. (2004) obtained 87.2% accuracy on a dataset that discarded objective sentences and applied text-categorization techniques to the subjective sentences
- Socher et al. (2013) used a recursive neural network over a sentiment treebank for sentiment classification
- Le and Mikolov (2014) used a document vector model and obtained 92.6% accuracy on the IMDB movie review dataset

SLIDE 21

Background on Sentiment Analysis

There has been limited work on sentiment analysis in Hindi:

- Joshi et al. (2010) combined in-language sentiment analysis, machine translation, and resource-based sentiment analysis to achieve 78.1% accuracy
- Mukherjee et al. (2012) showed that including discourse markers in a BOW model improves sentiment classification accuracy by 2-4%
- Mittal et al. (2013) incorporated hand-coded rules dealing with negation and discourse relations, achieving 80.2% accuracy

SLIDE 24

Hindi Product and Movie Review Corpus

- The Product Review dataset (LTG, IIIT Hyderabad) contains 350 positive and 350 negative reviews
- The Movie Review dataset (CFILT, IIT Bombay) contains 127 positive and 125 negative reviews
- Each review is around 1-2 sentences long, and the sentences are mainly focused on sentiment, either positive or negative

SLIDE 25

700-Movie Review Corpus

- We collected Hindi movie reviews from websites such as Dainik Jagran and Navbharat Times
- These reviews are longer than those in the previous corpora and cover subjects other than sentiment

| | Overall |
|---|---|
| Positive Reviews | 356 |
| Negative Reviews | 341 |
| Total Reviews | 697 |
| Sentences per document | 29.7 |
| Words per document | 494.6 |

Table 1: Statistics of Movie Reviews from Jagran and Navbharat

SLIDE 26

English Corpus

IMDB Movie Reviews

- Contains 25,000 positive and 25,000 negative reviews for training
- Also contains an additional 50,000 unlabeled documents for unsupervised learning

Amazon Product Reviews

- Three review datasets: Watches, Electronics, and MP3, of size 30.8MB, 728.4MB, and 27.7MB respectively
- The Electronics dataset contains 1,241,778 reviews, the Watches dataset 68,356 reviews, and the MP3 dataset 31,000 reviews
- Datasets are split 80-20 for training and testing

SLIDE 27

Wikipedia

- Hindi: Hindi Wikipedia text dump (approx. 290MB) containing around 24M words, with 724K words in the vocabulary
- English: English Wikipedia text dump (approx. 20.3GB) containing around 3.5B words, with 7.8M words in the vocabulary

SLIDE 29

Distributed Word Representation

Skip-gram

- Each current word acts as input to a log-linear classifier with a continuous projection layer, which predicts words within a certain range before and after the current word
- The objective is to maximize the probability of the context given a word:

p(c | w; θ) = exp(vc · vw) / Σc′∈C exp(vc′ · vw)

where vc and vw ∈ R^d are the vector representations of context c and word w respectively, C is the set of all available contexts, and the parameters θ are vci and vwi for w ∈ V, c ∈ C, i ∈ 1, . . . , d.
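A toy numeric sketch of this softmax objective, with random vectors standing in for learned parameters:

```python
import numpy as np

# p(c | w; theta) is a softmax over dot products of context and word vectors.
rng = np.random.default_rng(0)
d, n_contexts = 8, 5
V_context = rng.normal(size=(n_contexts, d))  # one row v_c per context
v_w = rng.normal(size=d)                      # vector for the input word w

scores = V_context @ v_w                      # v_c . v_w for every context
p = np.exp(scores) / np.exp(scores).sum()     # p(c | w; theta), sums to 1
print(p)
```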

SLIDE 30

Distributed Word Representation

- Weights between the input layer and the hidden layer can be represented by a V × N matrix W
- Each row of W is the N-dimensional vector representation vw of the associated word of the input layer
- Given an input word, assume xk = 1 and xk′ = 0 for k′ ≠ k; then

h = x^T W = W(k, ·) := vwI

uj = v′wj^T h

where vwI is the vector representation of the input word wI and uj is the score of each word in the vocabulary
- A different weight matrix W′ = {w′ij}, of size N × V, sits between the hidden and output layers
- A softmax function is used to predict probabilities, and stochastic gradient descent is used to update the parameters of the model
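A toy sketch of this forward pass, with random weight matrices as stand-ins for a trained model:

```python
import numpy as np

# One-hot input x selects a row of W; W' then scores the whole vocabulary.
rng = np.random.default_rng(0)
V_size, N = 10, 4                       # vocabulary size, hidden dimension
W = rng.normal(size=(V_size, N))        # input->hidden weights, rows are v_w
W_prime = rng.normal(size=(N, V_size))  # hidden->output weights

k = 3                              # index of the input word w_I
x = np.zeros(V_size)
x[k] = 1.0                         # one-hot encoding of w_I
h = x @ W                          # h = x^T W = W[k, :] = v_{w_I}
u = h @ W_prime                    # u_j: score of each word in the vocabulary
p = np.exp(u) / np.exp(u).sum()    # softmax over vocabulary scores
```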

SLIDE 31

Distributed Document Representation

Motivation

- BOW has drawbacks such as sparsity, high dimensionality, and an inability to encode context information or word ordering
- Composition models alone cannot represent documents (Blacoe and Lapata, 2012)
- Recursive Tensor Neural Networks (Socher et al., 2013) are computationally expensive and cannot be composed into document vectors for multi-sentence documents due to parsing issues
- Vector spaces come with similarity measures to deal with synonyms or semantically similar documents

SLIDE 32

Semantic Composition

The Principle of Compositionality, also known as Frege's Principle, states that the meaning of a complex expression is determined by the meanings of its constituents and the rules that guide their combination. For example:

The movie is funny and the screenplay is good

If the word vectors are represented by wi(x) and the sentence vector by S(x), then

S(x) = c1 w1(x) Θ c2 w2(x) Θ c3 w3(x) Θ . . . Θ ck wk(x)    (1)

where Θ can be any operation (e.g., addition, multiplication) and the ci are constants.
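A toy sketch of equation (1), with random word vectors and uniform constants ci as stand-ins:

```python
import numpy as np

# Compose word vectors into a sentence vector with an operation Θ (here:
# addition or elementwise multiplication) and per-word constants c_i.
rng = np.random.default_rng(0)
words = ["the", "movie", "is", "funny"]
word_vecs = {w: rng.normal(size=8) for w in words}
c = {w: 1.0 for w in words}  # constants c_i; uniform here

def compose(sentence, op=np.add):
    vecs = [c[w] * word_vecs[w] for w in sentence]
    result = vecs[0]
    for v in vecs[1:]:
        result = op(result, v)  # Θ applied left to right
    return result

s_add = compose(words, op=np.add)       # additive composition
s_mul = compose(words, op=np.multiply)  # multiplicative composition
```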

SLIDE 33

Semantic Composition

We describe two approaches that incorporate graded weighting into word vectors when building document vectors. Let vwk be the vector representation of the kth word. Then the document vector vdi for the ith document is:

vdi = Σwk∈di g(wk) · vwk,  where g(wk) = 0 if wk ∈ stopwords, and 1 otherwise

This is a 0-1 step function which ignores the contribution of all stop words.

Another scheme incorporates idf weights:

vdi = Σwk∈di g(wk) · vwk,  where g(wk) = 0 if idf(wk, di) ≤ δ, and idf(wk, di) otherwise

where δ is a pre-defined threshold below which a word has no importance, and above which the idf term gives importance to that particular word.
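A minimal sketch of both weighting schemes; word_vecs, stopwords, idf, and delta are assumed inputs:

```python
import numpy as np

def document_vector(doc, word_vecs, stopwords=(), idf=None, delta=0.0):
    parts = []
    for w in doc:
        if w not in word_vecs:
            continue
        if idf is None:
            weight = 0.0 if w in stopwords else 1.0     # 0-1 step function
        else:
            weight = idf[w] if idf[w] > delta else 0.0  # graded idf weighting
        parts.append(weight * word_vecs[w])
    return np.sum(parts, axis=0)  # v_{d_i}: weighted sum of word vectors
```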

SLIDE 36

Semantic Composition

| Composition | Accuracy |
|---|---|
| Multiplication | 50.30 |
| Average | 88.42 |
| Weighted Average | 89.56 |

Table 2: Results of Vector Composition with Different Operations

| Method | Weight | Accuracy(1) | Accuracy(2) |
|---|---|---|---|
| 0-1 Weighting | | 93.84 | 93.06 |
| Graded idf Weighting | 1 | 93.91 | 93.18 |
| | 2 | 93.89 | 93.17 |
| | 2.5 | 93.87 | 93.16 |
| | 2.8 | 93.86 | 93.16 |
| | 3 | 93.86 | 93.22 |
| | 4 | 93.83 | 93.12 |

Table 3: Results on IMDB Movie Reviews (Composite Document Vector); Accuracy(2) is when we exclude tf-idf features

SLIDE 37

Effect of Context Size

Figure 1 : High Dimensional representation of Wiki Text with Context Size 5

SLIDE 38

Effect of Context Size

Figure 2 : High Dimensional representation of Wiki Text with Context Size 10

SLIDE 39

Effect of Context Size

Figure 3: Variation of Accuracy with Different Context Sizes on the Watches and MP3 Datasets (x-axis: context size 5-10; y-axis: accuracy 86-94%)

SLIDE 40

SkipGram or CBOW?

Figure 4: Variation of Accuracy with skip-gram and CBOW on the Watches and MP3 Datasets (reported accuracies: 88.98, 91.15, 88.39, 90.69)

SLIDE 41

Work Flow

Figure 5 : Work Flow

SLIDE 43

Result on English Dataset

| Method | IMDB | Amazon | Hindi |
|---|---|---|---|
| RNNLM (Baseline) | 86.45 | 90.03 | 78.84 |
| Paragraph Vector (Le and Mikolov, 2014) | 92.58 | 91.30 | 74.57 |
| Averaged Vector | 88.42 | 88.52 | 79.62 |
| Weighted Average Vector | 89.56 | 88.63 | 85.90 |
| Composite Document Vector | 93.91 | 92.17 | 90.30 |

Table 4: Comparison of accuracies on 3 datasets (IMDB, Amazon Electronics Reviews, and Hindi Movie Reviews (IITB)) for various types of document composition models. The state of the art for these tasks is: IMDB: 92.58%; Amazon: 85.90%; Hindi: 79.0%.

SLIDE 44

Result on English Dataset

| Method | Accuracy |
|---|---|
| Maas et al. (2011) | 88.89 |
| NBSVM-bi (Wang & Manning, 2012) | 91.22 |
| NBSVM-uni (Wang & Manning, 2012) | 88.29 |
| SVM-uni (Wang & Manning, 2012) | 89.16 |
| Paragraph Vector (Le and Mikolov, 2014) | 92.58 |
| WordVector+Wiki (Our Method) | 88.60 |
| Weighted WordVector+TfIdf (Our Method) | 89.56 |
| Weighted WordVector+TfIdf+Document Vector | 93.91 |
| Ensemble of Enhanced Document Vector and RNNLM | 94.19 |

Table 5: Results on the IMDB Movie Review Dataset

SLIDE 45

Result on English Dataset

| Method | Accuracy |
|---|---|
| WordVector Averaging | 88.42 |
| Weighted WordVector Average | 89.56 |
| Weighted WordVector Averaging+Wiki | 88.60 |
| Weighted WordVector Averaging+TfIdf | 90.67 |
| WordVector Averaging+Document Vector | 93.18 |
| WordVector Averaging+Wiki+Document Vector | 93.18 |
| WordVector Averaging+Document Vector+RNNLM | 93.70 |
| WordVector Averaging+Wiki+Document Vector+RNNLM | 93.57 |
| WordVector Averaging+TfIdf+Document Vector | 93.91 |
| WordVector Averaging+Wiki+Document Vector+TfIdf | 93.55 |
| WordVector Averaging+TfIdf+Document Vector+RNNLM | 94.19 |

Table 6: Comparison of results on the IMDB Movie Review Dataset with various features

SLIDE 46

Result on English Dataset

Figure 6: Accuracies of Different Classifiers with Average Word Vectors on the IMDB Dataset (Random Forest: 84.14%, SVM: 88.42%, Naive Bayes: 75.95%, Logistic Regression: 86.9%, k-NN: 76.76%)

SLIDE 47

Result on English Dataset

| Method | Accuracy |
|---|---|
| Dredze et al. (2008) | 85.90 |
| Max Entropy | 83.79 |
| WordVector Averaging (Our Method) | 89.41 |
| Composite Document Vector (Our Method) | 92.17 |
| Composite Document Vector+RNNLM | 92.91 |

Table 7: Results on the Amazon Electronics Review Dataset

SLIDE 48

Result on Hindi Dataset

| Features | Accuracy(1) | Accuracy(2) |
|---|---|---|
| WordVector Averaging | 78.0 | 79.62 |
| WordVector+tf-idf | 90.73 | 89.52 |
| WordVector+tf-idf without stop words | 91.14 | 89.97 |
| Weighted WordVector | 89.71 | 85.90 |
| Weighted WordVector+tf-idf | 92.89 | 90.30 |

Table 8: Accuracies on the Product Review dataset (Accuracy(1)) and Movie Review dataset (Accuracy(2))

SLIDE 49

Result on Hindi Dataset

| Experiment | Features | Accuracy |
|---|---|---|
| Subjective Lexicon (Bakliwal et al., 2012) | Simple scoring | 79.03 |
| Hindi-SWN Baseline (Arora et al., 2013) | Adjective and adverb presence | 69.30 |
| Word Vector with SVM (Our method) | tf-idf with word vector | 91.14 |
| Weighted Word Vector with SVM (Our method) | tf-idf + weighted word vector | 92.89 |

Table 9: Comparison of Approaches: Product Review Dataset

| Experiment | Features | Accuracy |
|---|---|---|
| In-language using SVM (Joshi et al., 2010) | tf-idf | 78.14 |
| MT-based using SVM (Joshi et al., 2010) | tf-idf | 65.96 |
| Improved Hindi-SWN (Bakliwal et al., 2012) | Adjective and adverb presence | 79.0 |
| WordVector Averaging | word vector | 78.0 |
| Word Vector with SVM (Our method) | tf-idf; word vector | 89.97 |
| Weighted Word Vector with SVM (Our method) | tf-idf + weighted word vector | 90.30 |

Table 10: Comparison of Approaches: Movie Review Dataset

SLIDE 50

Odd One Out

| breakfast | cereal | lunch | dinner |
| eight | seven | owe | nine |
| shopping | math | reading | science |

Table 11: Odd One Out in English

Figure 7: Odd One Out in Hindi

SLIDE 51

Similar Words

| Father | France | XBOX | scratched | megabits |
|---|---|---|---|---|
| grandfather | Germany | XBLA | scraped | gigabits |
| uncle | French | Xbox360 | rubbed | kilobits |
| mother | Greece | SmartGlass | bruised | megabit |
| father-in-law | Netherlands | 360/PS3 | cracked | terabits |
| brother | Scotland | XBLA | discarded | MB/s |
| | | Qubed | shoved | Tbit/s |
| | | Kinect | tripped | |

Table 12: Top Few Similar Words in English

Figure 8: Top Few Similar Words in Hindi

SLIDE 53

Conclusion

1. We present an unsupervised, language-independent model that
   - overcomes the problems of BOW models
   - gives individual importance to words as well as to sentences as a whole
2. We overcome the problems of language-dependent models such as the Recursive Tensor Neural Network (Socher et al., 2013)
3. We release a larger and more generic dataset of Hindi movie reviews
4. We improve the state-of-the-art results on sentiment analysis:
   - On the IMDB dataset, we improve by 1.6%
   - On the Amazon electronics dataset, we improve by 7.01%
   - On the Hindi product and movie reviews, we improve by 13.86% and 11.30% respectively

SLIDE 57

Future Work

1. Better composition methods
2. Enhanced document vectors indicate that current representations are not sufficient to model documents, and that ensembles could also prove useful in tasks such as sentiment analysis
3. Regions of importance in NLP, where we filter sentiment-oriented sentences and phrases out of an unfocused corpus containing text from various domains

SLIDE 61

Thank you!

SLIDE 63

Ensemble of RNNLM and Enhanced Document Vectors

- We first trained a recurrent neural network language model and obtained its predictions on the test reviews in terms of probability
- We trained another classifier (a linear SVM) using Enhanced Document Vectors and obtained its predictions on the test reviews
- We then merged the two predictions using a simple heuristic to obtain the final classification. Let y∗ be the actual output and y the predicted output. The combined score is

s = (RNNLMpred − 1) × 7 + (0.5 − SVMpred)

and the heuristic requires s · y < 0 when y∗ = 1, and s · y ≥ 0 otherwise.
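A toy sketch of this ensemble step. The score s follows the formula above; the decision threshold tau is an assumption (set to the value s takes when both classifiers output 0.5), not something stated on the slide:

```python
def ensemble_label(rnnlm_pred, svm_pred, tau=-3.5):
    # s follows the slide's formula; the RNNLM term dominates (weight 7), so
    # s grows as the RNNLM's positive-class probability grows. tau = -3.5 is
    # an assumed neutral point, reached when both predictions equal 0.5.
    s = (rnnlm_pred - 1.0) * 7.0 + (0.5 - svm_pred)
    return 1 if s > tau else 0

print(ensemble_label(rnnlm_pred=0.9, svm_pred=0.8))  # -> 1 (positive)
print(ensemble_label(rnnlm_pred=0.2, svm_pred=0.1))  # -> 0 (negative)
```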

SLIDE 64

SkipGram

h = x^T W = vwI

where vwI is the vector representation of the input word wI.

uj = v′wj^T h

where uj is the score of each word in the vocabulary and v′wj is the j-th column of the matrix W′.

p(wc,j = wO,c | wI) = yc,j = exp(uc,j) / Σj′=1..V exp(uj′)

where wc,j is the j-th word on the c-th panel of the output layer; wO,c is the actual c-th word among the output context words; wI is the only input word; yc,j is the output of the j-th node on the c-th panel of the output layer; and uc,j is the net input of the j-th node on the c-th panel of the output layer.

SLIDE 65

SkipGram

uc,j = uj = v′wj^T h, for c = 1, 2, . . . , C

The loss function is:

E = −log p(wO,1, wO,2, . . . , wO,C | wI) = −log Πc=1..C [ exp(uc,j∗c) / Σj′=1..V exp(uj′) ]

where j∗c is the index of the actual c-th output context word.
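A toy sketch of this loss, with random weights and arbitrary context indices standing in for a trained model:

```python
import numpy as np

# E is the negative log-likelihood of the C true context words, each scored
# by the same softmax over u_j (the panels share the matrix W').
rng = np.random.default_rng(0)
V_size, N = 10, 4
W = rng.normal(size=(V_size, N))        # input->hidden weights, rows v_w
W_prime = rng.normal(size=(N, V_size))  # hidden->output, columns v'_{w_j}

h = W[3]                           # hidden layer for input word index 3
u = h @ W_prime                    # u_j, identical on every panel c
log_softmax = u - np.log(np.exp(u).sum())
true_context = [1, 7]              # indices j*_c of the C actual context words
E = -sum(log_softmax[j] for j in true_context)  # E = -log p(w_O,1..C | w_I)
print(E)
```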