SLIDE 1

Language Modelling Makes Sense

Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

Daniel Loureiro, Alípio Jorge. ACL 2019, Florence, 31 July 2019

SLIDE 2

Sense Embeddings

Exploiting the latest Neural Language Models (NLMs) for sense-level representation learning.

  • Beat SOTA for English Word Sense Disambiguation (WSD).
  • Full WordNet in NLM-space (+100K common sense concepts).
  • Concept-level analysis of NLMs.

Introduction Related Work Our Approach Performance Applications Conclusions

SLIDE 4

Related Work

SLIDE 5

Related Work

Bag-of-Features Classifiers (SVM): [Zhong and Ng (2010)] [Iacobacci et al. (2016)]

Deep Sequence Classifiers (BiLSTM): [Raganato et al. (2017)] [Luo et al. (2018a)] [Luo et al. (2018b)] [Vial et al. (2018)]

Sense-level Representations (k-NN over NLM reprs.): [Melamud et al. (2016)] [Yuan et al. (2016)] [Peters et al. (2018)]

SLIDE 7

Bag-of-Features Classifiers

It Makes Sense (IMS) [Zhong and Ng (2010)] :

  • POS tags, surrounding words, local collocations.
  • SVM for each word type in training.
  • Fallback: Most Frequent Sense (MFS).
  • Improved with word embedding features. [Iacobacci et al. (2016)]
  • Still competitive (!)
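The feature set listed above can be sketched as follows; this is an illustrative reconstruction, not the original IMS code, and the feature names and window size are my own choices:

```python
# Toy IMS-style bag-of-features extraction for one target word in context.
# Illustrative only: real IMS feeds such features to one SVM per word type,
# falling back on the Most Frequent Sense for unseen words.

def extract_features(tokens, pos_tags, target_idx, window=3):
    """Build IMS-style features for the word at target_idx."""
    feats = {}
    n = len(tokens)
    # 1) POS tags of the target and its neighbours
    for off in range(-window, window + 1):
        i = target_idx + off
        if 0 <= i < n:
            feats[f"pos[{off}]"] = pos_tags[i]
    # 2) Surrounding words (bag of words in the window)
    for off in range(-window, window + 1):
        i = target_idx + off
        if off != 0 and 0 <= i < n:
            feats[f"word={tokens[i].lower()}"] = 1
    # 3) Local collocations (ordered n-grams around the target)
    if target_idx >= 1:
        feats["colloc[-1,0]"] = f"{tokens[target_idx-1]}_{tokens[target_idx]}"
    if target_idx + 1 < n:
        feats["colloc[0,+1]"] = f"{tokens[target_idx]}_{tokens[target_idx+1]}"
    return feats

tokens = ["She", "left", "her", "glasses", "in", "the", "cupboard"]
pos = ["PRP", "VBD", "PRP$", "NNS", "IN", "DT", "NN"]
features = extract_features(tokens, pos, target_idx=3)
```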

SLIDE 8

Bi-directional LSTMs (BiLSTMs):

  • Better with:
  • Attention (as everything else).
  • Auxiliary losses. (POS, lemmas, lexnames) [Raganato et al. (2017)]
  • Glosses, via co-attention mechanisms. [Luo et al. (2018)]
  • Still must fallback on MFS.
  • Not that much better than bag-of-features…

Deep Sequence Classifiers

[Raganato et al. (2017)]

SLIDE 9

Contextual k-NN


Matching Contextual Word Embeddings:

  • Produce Sense Embeddings from NLMs (averaging).
  • Sense embs. can be compared with contextual embs.
  • Disambiguation = Nearest Neighbour search (1-NN).
  • Sense embs. limited to annotations. MFS required.
  • Promising, but early attempts.

[Ruder (2018)]
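The 1-NN disambiguation step described above can be sketched as a cosine nearest-neighbour search; the 3-d vectors stand in for real NLM embeddings and are made up for illustration:

```python
# Toy 1-NN disambiguation: match a contextual embedding against precomputed
# sense embeddings by cosine similarity and return the nearest sensekey.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def disambiguate(context_emb, sense_embs):
    """Return the sensekey whose embedding is nearest (1-NN) to context_emb."""
    return max(sense_embs, key=lambda sk: cosine(context_emb, sense_embs[sk]))

sense_embs = {
    "glass%1:27:00::":          [0.9, 0.1, 0.0],  # the material
    "drinking_glass%1:06:00::": [0.1, 0.9, 0.1],  # the container
}
ctx = [0.2, 0.8, 0.0]  # contextual embedding of "glass" in a drinking context
best = disambiguate(ctx, sense_embs)
```

Senses without annotated examples have no embedding to match, which is why these early attempts still require the MFS fallback.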

SLIDE 10

Our Approach

SLIDE 11

Our Approach

  • Expand the k-NN approach to full-coverage of WordNet.
  • Matching senses becomes trivial, no MFS fallbacks needed.
  • Full-set of sense embeddings in NLM-space is useful beyond WSD.

SLIDE 16

Challenges

SLIDE 20

Challenges

  • Overcome very limited sense annotations (only 16% of senses covered).
  • Infer missing senses correctly so that task performance improves.
  • Rely only on sense embeddings, no lemma or POS features.

  • Bootstrap: Annotated Dataset
  • Propagate: WordNet Ontology
  • Enrich: WordNet Glosses
  • Reinforce: Morphological Embeddings

SLIDE 22

Bootstrapping Sense Embeddings

Can your insurance company aid you in reducing administrative costs ?

insurance_company%1:14:00:: aid%2:41:00:: reduce%2:30:00:: administrative%3:01:00:: cost%1:21:00::

Would it be feasible to limit the menu in order to reduce feeding costs ?

cost%1:21:00:: feasible%5:00:00:possible:00 limit%2:30:00:: menu%1:10:00:: reduce%2:30:00:: feeding%1:04:01::

SLIDE 25

Bootstrapping Sense Embeddings

Sentence 1 (d_1): insurance_company%1:14:00:: aid%2:41:00:: reduce%2:30:00:: administrative%3:01:00:: cost%1:21:00::

Sentence 2 (d_2): cost%1:21:00:: feasible%5:00:00:possible:00 limit%2:30:00:: menu%1:10:00:: reduce%2:30:00:: feeding%1:04:01::

SLIDE 26

Bootstrapping Sense Embeddings

reduce%2:30:00:: (d_1, d_2)   cost%1:21:00:: (d_1, d_2)

SLIDE 28

Bootstrapping Sense Embeddings

w_reduce%2:30:00:: = ( d_1 + d_2 + … + d_n ) / n

w_cost%1:21:00:: = ( d_1 + d_2 + … + d_n ) / n


Outcome: 33,360 sense embeddings (16% coverage)
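The averaging above can be sketched as follows, with toy 3-d vectors standing in for real 1024-d NLM contextual embeddings:

```python
# Toy bootstrapping step: each sensekey embedding is the average of the
# contextual embeddings of its annotated occurrences (d_1, ..., d_n).
from collections import defaultdict

def bootstrap(annotated):
    """annotated: list of (sensekey, contextual_embedding) pairs."""
    sums, counts = {}, defaultdict(int)
    for sk, d in annotated:
        sums[sk] = d if sk not in sums else [a + b for a, b in zip(sums[sk], d)]
        counts[sk] += 1
    return {sk: [x / counts[sk] for x in sums[sk]] for sk in sums}

annotated = [
    ("reduce%2:30:00::", [1.0, 0.0, 2.0]),  # d_1, from sentence 1
    ("reduce%2:30:00::", [3.0, 2.0, 0.0]),  # d_2, from sentence 2
    ("cost%1:21:00::",   [0.0, 4.0, 2.0]),  # d_1
]
sense_embs = bootstrap(annotated)
```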

SLIDE 29

Propagating Sense Embeddings

WordNet’s units, synsets, represent concepts at different levels.

SLIDE 31

Propagating Sense Embeddings

kid%1:18:00:: (sensekey) → child.n.01 (synset) → juvenile.n.01 (hypernym synset) → noun.person (lexname)

SLIDE 34

Propagating Sense Embeddings


hamburger%1:13:01::, burger%1:13:00:: → burger.n.02
hotdog%1:18:00:: → hotdog.n.01
potato_chip%1:13:00:: → chips.n.04
wrap%1:13:00:: → wrap.n.02
sandwich%1:13:00:: → sandwich.n.01
(all synsets under lexname noun.food)

Retrieve Synsets, Relations and Categories

SLIDE 35

Propagating Sense Embeddings


1st stage: Synset Embeddings

SLIDE 36

Propagating Sense Embeddings


2nd Stage: Hypernym Embeddings (ind. Synsets)

SLIDE 37

Propagating Sense Embeddings


3rd Stage: Lexname Embeddings
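The three propagation stages can be sketched as follows; the WordNet structure is hard-coded toy data, and the fallback order (own sense, then synset, then hypernym, then lexname) mirrors the stages above:

```python
# Toy propagation of sense embeddings through WordNet levels.

def average(vectors):
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def propagate(sense_embs, synset_senses, synset_hypernym, synset_lexname):
    # 1st stage: synset embeddings from the sense embeddings available
    synset_embs = {
        syn: average([sense_embs[s] for s in senses if s in sense_embs])
        for syn, senses in synset_senses.items()
        if any(s in sense_embs for s in senses)
    }
    # 2nd stage: hypernym embeddings from the synsets below them
    hyper_groups = {}
    for syn, emb in synset_embs.items():
        hyper_groups.setdefault(synset_hypernym[syn], []).append(emb)
    hyper_embs = {h: average(v) for h, v in hyper_groups.items()}
    # 3rd stage: lexname embeddings from all synsets in the category
    lex_groups = {}
    for syn, emb in synset_embs.items():
        lex_groups.setdefault(synset_lexname[syn], []).append(emb)
    lex_embs = {lx: average(v) for lx, v in lex_groups.items()}
    # fill every sense: own embedding, else synset, else hypernym, else lexname
    full = {}
    for syn, senses in synset_senses.items():
        for s in senses:
            if s in sense_embs:
                full[s] = sense_embs[s]
            elif syn in synset_embs:
                full[s] = synset_embs[syn]
            elif synset_hypernym[syn] in hyper_embs:
                full[s] = hyper_embs[synset_hypernym[syn]]
            else:
                full[s] = lex_embs[synset_lexname[syn]]
    return full

synset_senses = {
    "burger.n.02": ["hamburger%1:13:01::", "burger%1:13:00::"],
    "wrap.n.02":   ["wrap%1:13:00::"],
}
synset_hypernym = {"burger.n.02": "sandwich.n.01", "wrap.n.02": "sandwich.n.01"}
synset_lexname = {"burger.n.02": "noun.food", "wrap.n.02": "noun.food"}
sense_embs = {"hamburger%1:13:01::": [2.0, 0.0]}  # only one annotated sense
full = propagate(sense_embs, synset_senses, synset_hypernym, synset_lexname)
```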

SLIDE 38

Propagating Sense Embeddings

But propagated embeddings cannot differentiate senses that fall back on the same synset, hypernym, or lexname…

SLIDE 40

Enriching Sense Embeddings


Leverage Synset Definitions and Lemmas for Differentiation

sandwich:%1:13:00:: (sandwich.n.01)

Definition: two (or more) slices of bread with a filling between them Lemmas: sandwich

wrap:%1:13:00:: (wrap.n.02)

Definition: a sandwich in which the filling is rolled up in a soft tortilla Lemmas: wrap, tortilla

SLIDE 41

Enriching Sense Embeddings


Compose a new context

sandwich:%1:13:00:: (sandwich.n.01)

sandwich - two (or more) slices of bread with a filling between them

wrap:%1:13:00:: (wrap.n.02)

wrap, tortilla - a sandwich in which the filling is rolled up in a soft tortilla

SLIDE 42

Enriching Sense Embeddings


Make the context specific to sensekey (repeat lemma)

sandwich:%1:13:00::

sandwich - sandwich - two (or more) slices of bread with a filling between them

wrap%1:13:00::

wrap - wrap, tortilla - a sandwich in which the filling is rolled up in a soft tortilla

SLIDE 44

Enriching Sense Embeddings


Obtain contextual embeddings for every token

sandwich:%1:13:00::

sandwich - sandwich - two (or more) slices of bread with a filling between them

wrap%1:13:00::

wrap – wrap, tortilla - a sandwich in which the filling is rolled up in a soft tortilla

SLIDE 45

Enriching Sense Embeddings


Sentence Embedding from avg. of Contextual Embeddings

sandwich:%1:13:00::

sandwich - sandwich - two (or more) slices of bread with a filling between them

wrap%1:13:00::

wrap - wrap - a sandwich in which the filling is rolled up in a soft tortilla

w_e = average of the token contextual embeddings (e = 1024)
SLIDE 46

Enriching Sense Embeddings


Merge Sentence Embedding with previous Sense Embedding

sandwich:%1:13:00::

sandwich - sandwich - two (or more) slices of bread with a filling between them

wrap%1:13:00::

wrap - wrap - a sandwich in which the filling is rolled up in a soft tortilla

w_t = [ previous sense embedding ; w_e ]  for sandwich%1:13:00:: and wrap%1:13:00::

SLIDE 47

Enriching Sense Embeddings

Merged embedding dimensionality: e = 1024 + 1024 = 2048
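The enrichment steps above (compose a sense-specific context, embed it, merge by concatenation) can be sketched as follows; `embed_sentence` is a toy stand-in for averaging NLM token embeddings, and all names are illustrative:

```python
# Toy gloss enrichment: build "lemma - lemmas - definition", embed it, and
# concatenate with the previous sense embedding (doubling its dimensionality).

def compose_context(sense_lemma, synset_lemmas, definition):
    # target lemma repeated first to make the context sensekey-specific
    return f"{sense_lemma} - {', '.join(synset_lemmas)} - {definition}"

def embed_sentence(text, dim=4):
    # toy stand-in for a sentence embedding (average of token embeddings)
    toks = text.split()
    vecs = [[(len(t) % 5) / 4.0] * dim for t in toks]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def enrich(prev_emb, sense_lemma, synset_lemmas, definition):
    ctx = compose_context(sense_lemma, synset_lemmas, definition)
    return prev_emb + embed_sentence(ctx)  # concatenation doubles e

definition = "a sandwich in which the filling is rolled up in a soft tortilla"
ctx = compose_context("wrap", ["wrap", "tortilla"], definition)
emb = enrich([0.1] * 4, "wrap", ["wrap", "tortilla"], definition)
```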

SLIDE 48

Reinforcing Sense Embeddings


Contextual Embeddings aren’t good at preserving morphological relatedness

SLIDE 49

Reinforcing Sense Embeddings


Retrieve char-ngram embeddings (static) for lemmas

w_m = static char-ngram embedding of the lemma, for sandwich%1:13:00:: and wrap%1:13:00::
SLIDE 50

Reinforcing Sense Embeddings


Merge with previous sense embeddings

w_t ← [ w_t ; w_m ]  for sandwich%1:13:00:: and wrap%1:13:00::

SLIDE 51

Reinforcing Sense Embeddings

Final sense embedding dimensionality: e = 2048 + 300 = 2348
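The reinforcement step is another concatenation; a minimal sketch, assuming a fastText-style 300-d static lemma embedding (the dimensions follow the slides: 2048 + 300 = 2348):

```python
# Toy reinforcement: append a static char-ngram lemma embedding so that
# morphologically related lemmas stay close in the final sense space.

def reinforce(sense_emb, lemma_emb):
    return sense_emb + lemma_emb  # list concatenation

w_t = [0.0] * 2048  # enriched sense embedding
w_m = [0.0] * 300   # static char-ngram embedding of the lemma
w_full = reinforce(w_t, w_m)
```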

SLIDE 52

Matching Sense Embeddings


The glasses are in the cupboard.

SLIDE 55

Matching Sense Embeddings


The glasses are in the cupboard.

Query vector w_u for "glasses" is built from its contextual embedding d (plus the static lemma embedding) and matched by 1-NN against the candidate sense embeddings [ w_e ; w_m ; w_t ] of: spectacles%1:06:00::, glass%1:27:00::, drinking_glass%1:06:00::
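The matching step can be sketched as follows; the layout of the query vector (reusing the contextual embedding to fill the sense-level slots) and all vectors are illustrative toy values:

```python
# Toy full-coverage matching: assemble a query vector that mirrors the layout
# of the sense embeddings, then disambiguate by 1-NN cosine similarity.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def build_query(contextual, static):
    # [gloss slot ; char-ngram slot ; annotation slot] ~ [d ; w_m ; d]
    return contextual + static + contextual

def match(query, candidates):
    return max(candidates, key=lambda sk: cosine(query, candidates[sk]))

d = [0.9, 0.2]   # contextual embedding of "glasses"
w_static = [0.5]  # static embedding of the lemma "glasses"
candidates = {
    "spectacles%1:06:00::":     [0.9, 0.1, 0.6, 0.9, 0.1],
    "glass%1:27:00::":          [0.0, 0.9, 0.2, 0.0, 0.9],
    "drinking_glass%1:06:00::": [0.4, 0.6, 0.5, 0.4, 0.6],
}
query = build_query(d, w_static)
best = match(query, candidates)
```

Because every WordNet sense now has an embedding, the candidate set never comes up empty and no MFS fallback is needed.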

SLIDE 56

WSD Results

SLIDE 57

WSD Results

Standard English WSD Evaluation: F1 on the ALL set of the WSD Evaluation Framework (Raganato et al. 2017).

(chart, F1 axis 60-80) Systems compared: MFS baseline; IMS (Zhong and Ng, 2010); IMS + Emb. (Iacobacci et al. 2016); BiLSTM (Raganato et al. 2017); BiLSTM VR (Vial et al. 2018); context2vec (Melamud et al. 2016); ELMo k-NN (Peters et al. 2018); BERT k-NN (adapted from Peters et al.); LMMS-BERT (ours).

SLIDE 58

WSD Results


Uninformed Sense Matching (matching +200K)

Same standard but without filtering candidates by lemmas or POS

(chart, F1 axis 10-80, comparing LMMS 1024, LMMS 2048 and LMMS 2348)
SLIDE 59

Applying Sense Embeddings

SLIDE 60

World Knowledge in NLMs

What’s BERT thinking about when he reads?

SLIDE 61

World Knowledge in NLMs


[E1] played [E2] in [E3]

SLIDE 62

Checking for Biases in NLMs

Putting BERT on the spot

SLIDE 63

Checking for Biases in NLMs


bias(s) = sim( w_man.n.01 , w_s ) - sim( w_woman.n.01 , w_s )
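This bias probe can be sketched directly, taking sim to be cosine similarity; the 2-d vectors are toy values, not real LMMS embeddings:

```python
# Toy gender-bias probe: bias(s) = sim(w_man, w_s) - sim(w_woman, w_s).
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def gender_bias(w_s, w_man, w_woman):
    return cosine(w_man, w_s) - cosine(w_woman, w_s)

w_man, w_woman = [1.0, 0.0], [0.0, 1.0]
bias_neutral = gender_bias([1.0, 1.0], w_man, w_woman)  # equidistant sense
bias_skewed = gender_bias([1.0, 0.2], w_man, w_woman)   # closer to man.n.01
```

A positive score means the sense embedding sits closer to man.n.01 than to woman.n.01, and vice versa.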

SLIDE 64

Conclusion

  • Powerful NLMs allow a simple k-NN to perform really well for WSD.
  • NLMs are improving very rapidly; progress in WSD should follow.
  • Sense embeddings from NLMs are useful not only for WSD, but also for NLM inspection and other probing or downstream tasks.

SLIDE 65

Future Work

  • Pipeline improvements: better NLMs, sentence embeddings, char embeddings, use of WordNet, etc.
  • Multilingual Sense Embeddings.
  • Semi-supervised Refinement.
  • Formalize inspection (probing task), other applications.

SLIDE 66

Thanks


Code and Sense Embeddings: github.com/danlou/LMMS

@danielbloureiro dloureiro@fc.up.pt