Entities as a Gemma Boleda 2 Window into Gabriella Lapesa 1 V Thejas - - PowerPoint PPT Presentation

entities as a
SMART_READER_LITE
LIVE PREVIEW

Entities as a Gemma Boleda 2 Window into Gabriella Lapesa 1 V Thejas - - PowerPoint PPT Presentation

Institute for Natural Language Processing Collaborations with Abhijeet Gupta 1 Marco Baroni 2 Entities as a Gemma Boleda 2 Window into Gabriella Lapesa 1 V Thejas 3 (Distributional) Matthijs Westera 2 Semantics 1 University of Stuttgart


slide-1
SLIDE 1

Institute for Natural Language Processing

Sebastian Padó

Entities as a Window into (Distributional) Semantics

Collaborations with Abhijeet Gupta1 Marco Baroni2 Gemma Boleda2 Gabriella Lapesa1 V Thejas3 Matthijs Westera2

1 University of Stuttgart 2 UPF Barcelona 3 BITS Pilani

slide-2
SLIDE 2

RANLP, September 3, 2019 2

slide-3
SLIDE 3

RANLP, September 3, 2019 3

slide-4
SLIDE 4

RANLP, September 3, 2019 4

  • deal, option are

categories (concepts)

  • Listed in dictionary
  • Macron, Brexit are

individual entities/events

  • Listed in encyclopedia
slide-5
SLIDE 5

Model-theoretic semantics

  • Meaning of language units defined relative to world

model (Gamut 1991: Universe U = set of individuals)

  • Proper nouns and other entities:
  • Mapped onto elements of the universe
  • Common nouns, adjectives, and other categories:
  • Mapped onto sets of elements of the universe

RANLP, September 3, 2019 5

  • E. Macron
  • B. Johnson

politician Brexit events U

slide-6
SLIDE 6

Model-theoretic semantics

  • Meaning of language units defined relative to world

model (Gamut 1991: Universe U = set of individuals)

  • Proper nouns and other entities:
  • Mapped onto elements of the universe
  • Common nouns, adjectives, and other categories:
  • Mapped onto sets of elements of the universe

RANLP, September 3, 2019 6

  • E. Macron
  • B. Johnson

politician Brexit events U

Entities and categories are fundamentally different What about current NLP?

slide-7
SLIDE 7

Distributional Semantics (DS)

  • Dominant paradigm to acquire lexical information:
  • Learn linear algebra

representations of linguistic units from context

  • A.k.a. Vector spaces,

embeddings, distributed representations

  • Still DS because all use the “distributional hypothesis”:

“You shall know a word by the company it keeps” (Firth, Harris, Miller & Charles 1991, etc.)

RANLP, September 3, 2019 7

Macron Brexit deal

  • ption

Johnson

slide-8
SLIDE 8

Distributional Semantics (DS)

  • Dominant paradigm to acquire lexical information:
  • Learn linear algebra

representations of linguistic units from context

  • A.k.a. Vector spaces,

embeddings, distributed representations

  • Still DS because all use the “distributional hypothesis”:

“You shall know a word by the company it keeps” (Firth, Harris, Miller & Charles 1991, etc.)

RANLP, September 3, 2019 8

Macron Brexit deal

  • ption

Johnson

How is this applied to categories / entities in NLP? Split by subcommunity

slide-9
SLIDE 9

Computational Lexical Semantics

  • Strong focus on modelling linguistic aspects of

meaning: categories and relations among categories

  • Hyponymy/hypernymy (entailment),

synonymy, meronymy

  • Also diachronic change

“Interested in generalizations”

RANLP, September 3, 2019 9 From Clarke 2009

slide-10
SLIDE 10

Semantic Web / Information Extraction

  • Complementary focus on modelling world knowledge

aspects of meaning: entities and relations among entities

  • Knowledge bases /

knowledge graphs “Interested in particularities”

RANLP, September 3, 2019 10

slide-11
SLIDE 11

The Current Situation

  • So Distributional Semantics

is applied

  • to both entities and categories
  • to learn fairly different things
  • How is this possible?
  • “It just works”
  • DS is a practice without a theory

RANLP, September 3, 2019 11

Macron Brexit deal

  • ption

Johnson

slide-12
SLIDE 12

Agenda for this presentation

  • Q: Are there relevant differences in the way we can

apply DS to modelling entities and categories?

  • Research strand 1: Knowledge Bases
  • How far can we push DS in learning world knowledge?
  • Research strand 2: The Instantiation Relation
  • How do categories and entities behave distributionally?

RANLP, September 3, 2019 12

Benefit: insights into capabilities and limits

  • f distributional approaches to meaning
slide-13
SLIDE 13

Agenda for this presentation

  • Q: Are there relevant differences in the way we can

apply DS to modelling entities and categories?

  • Research strand 1: Knowledge Bases
  • How far can we push DS in learning world knowledge?
  • Research strand 2: The Instantiation Relation
  • How do categories and entities behave distributionally?

RANLP, September 3, 2019 13

Benefit: insights into capabilities and limits

  • f distributional approaches to meaning
slide-14
SLIDE 14
  • Challenge: KBs are incomplete

[Min et al. 2013, West et al. 2014]

  • Knowledge Base Completion

(KBC): Add missing edges to knowledge graph

  • Very active area of research
  • Representation learning
  • Learn embeddings for

entities and relations

14

Strand 1: Knowledge Base Completion

RANLP, September 3, 2019

slide-15
SLIDE 15
  • KBC embeddings can be learned from text, KB, or both
  • Our Interest: limits of distributional semantics
  • Focus on text-based embeddings of entities
  • Entities have fine-grained attributes with specific values
  • Research Question: Can all attributes be predicted from

vanilla word embeddings? (And if not, why not?)

15

Entity Embeddings and KBC

Italy Population : 61 million Area : 301,000 sq.km Language : Italian Contained by : Europe Currency used: Euro Italy sunny 30 wine 15 beach 12 Rome 10 Naples 6

RANLP, September 3, 2019

slide-16
SLIDE 16
  • Task: Use entity embeddings to predict entity attributes

with Multi-Layer Perceptron (MLP)

  • Numeric: predict value(s)
  • Categorical: predict embedding

for relatum (Italy, currency, Euro)

16

Simple Supervised KBC [Gupta et al. 15,17]

Italy Population : 61 million Area : 301,000 sq.km Language : Italian Contained by : Europe Currency used: Euro

Entity Embedding Hidden Layer (All) Numeric Attribute Values Entity Embedding Hidden Layer Categorical Attribute Value Embedding 1-hot Attribute Vector Output n n n |N| |C| h h tanh tanh tanh σ

RANLP, September 3, 2019

slide-17
SLIDE 17

Evaluation of Attributes

  • Categorical attributes: Mean Reciprocal Rank (MRR)
  • Mean rank of predicted relatum embedding among nearest

neighbors of true relatum embedding

  • Numeric attributes: Correlation
  • Spearman correlation between predicted and true rankings of

entities w.r.t. attribute

RANLP, September 3, 2019 17

(Leaving out details here; see papers)

slide-18
SLIDE 18
  • Em

Embe beddi ddings ngs: Google News vectors (Mikolov et al. 2013)

  • Word2Vec skipgram, 300 dimensions
  • Ex

Expe periment ntal setup: up: Train/Test on 7 FreeBase domains

18

Experimental Setup

Domain # Entities (train/val/test) |C| |N| Animal 279/93/93 22 118 Book 16/5/6 8 2 Citytown 1783/594/595 57 62 Country 155/53/51 79 698 Employer 720/140/141 50 55 Organization 187/63/62 36 32 People 85/28/29 25 76 Sum 3225/976/977 277 1043

RANLP, September 3, 2019

slide-19
SLIDE 19
  • Em

Embe beddi ddings ngs: Google News vectors (Mikolov et al. 2013)

  • Word2Vec skipgram, 300 dimensions
  • Ex

Expe periment ntal setup: up: Train/Test on 7 FreeBase domains

19

Experimental Setup

Domain # Entities (train/val/test) |C| |N| Animal 279/93/93 22 118 Book 16/5/6 8 2 Citytown 1783/594/595 57 62 Country 155/53/51 79 698 Employer 720/140/141 50 55 Organization 187/63/62 36 32 People 85/28/29 25 76 Sum 3225/976/977 277 1043

RANLP, September 3, 2019

Three case studies / observations (My) explanation to follow

slide-20
SLIDE 20

Feature Correlation of MLP Geolocation (Lat. / Long.) 0. 0.93 93 GDP_per_capita 0. 0.89 89 CO2_emissions_per_capita 0. 0.88 88 … … GDP_nominal 0. 0.78 78 … … Date_founded 0.54 Religion_percentage 0.42

Domain Country: Numeric Attributes

21

  • Attributes differ greatly in difficulty
  • Geographical attributes easy (Louwerse et al. 2009)

RANLP, September 3, 2019

best worst

slide-21
SLIDE 21

Geolocation: The Good

RANLP, September 3, 2019 22

Actual Predicted A Hong Kong B Bangladesh C Cocos Islands D Eritrea E Latvia F Belarus G Iran

slide-22
SLIDE 22

23

Geolocation: The Bad

Actual Predicted

A New Caledonia B Cocos Islands C Cook Islands D Mauritius E Niue F Tuvalu G Vanuatu Actual Predicted

RANLP, September 3, 2019

slide-23
SLIDE 23

Feature Correlation of MLP Geolocation (Lat. / Long.) 0. 0.93 93 GDP_per_capita 0. 0.89 89 CO2_emissions_per_capita 0. 0.88 88 … … GDP_nominal 0. 0.78 78 … … Date_founded 0.54 Religion_percentage 0.42

Domain Country: GDP

24

  • Even very similar attributes differ substantially (?)

RANLP, September 3, 2019

best worst

slide-24
SLIDE 24

Feature Correlation of MLP Geolocation (Lat. / Long.) 0. 0.93 93 GDP_per_capita 0. 0.89 89 CO2_emissions_per_capita 0. 0.88 88 … … GDP_nominal 0. 0.78 78 … … Date_founded 0.54 Religion_percentage 0.42

Domain Country: Difficult Attributes

25 RANLP, September 3, 2019

  • The most difficult attributes appear to be very sp

speci cific

best worst

slide-25
SLIDE 25
  • Our KBC task = learn mappings from context-derived

embedding space to attribute space

  • 1. Attribute must correlate with prominent context cues
  • 2. Entities with similar values of attribute must co-occur with

similar context cues

26

Contextual Support

RANLP, September 3, 2019

Switzerland Luxembourg China GDP per capita

slide-26
SLIDE 26
  • Our KBC task = learn mappings from (BOW)

embedding space to attribute space

  • 1. Attribute must correlate with prominent context cues
  • 2. Entities with similar values of attribute must co-occur with

similar context cues

27

Contextual Support

RANLP, September 3, 2019

Germany Luxembourg China GDP per capita

The extent to which this holds: degree of contextual support

slide-27
SLIDE 27

Contextual Support Accounts for…

  • The island displacement: “Hubness effect”
  • Predictions for sparse entities dominated by similar, more

frequent entities

RANLP, September 3, 2019 28

A New Caledonia B Cocos Islands C Cook Islands D Mauritius E Niue F Tuvalu G Vanuatu

slide-28
SLIDE 28

Contextual Support Accounts for…

  • The island displacement: “Hubness effect”
  • Predictions for sparse entities dominated by similar, more

frequent entities

RANLP, September 3, 2019 29

A New Caledonia B Cocos Islands C Cook Islands D Mauritius E Niue F Tuvalu G Vanuatu

Context cues: Ocean, islands, palms, … Hubs with these contexts: Maldives, Seychelles, Mauritius

slide-29
SLIDE 29

Contextual Support Accounts for

  • GDP_per_capita being easier than GDP_nominal
  • GDP per capita comes with more consistent context cues

RANLP, September 3, 2019 30

List of countries 2 Luxembourg Switzerland Norway Ireland Iceland Qatar GDP per capita List of countries 1 United States China Japan Germany UK India GDP nominal

slide-30
SLIDE 30

Contextual Support Accounts for

  • GDP_per_capita being easier than GDP_nominal
  • GDP per capita comes with more consistent context cues

RANLP, September 3, 2019 31

List of countries 2 Luxembourg Switzerland Norway Ireland Iceland Qatar GDP per capita

Context cues: Luxury, finance, tax evasion, …

List of countries 1 United States China Japan Germany UK India GDP nominal

Context cues: …?

slide-31
SLIDE 31

Contextual Support Accounts for

  • Difficulty of learning very specific attributes (date of

foundation, countries exported to..)

  • Indicated by highly specific, low frequency context cues
  • “Drowned out” by other information in pretrained BOW vectors
  • Compare to pattern-based approach (Hearst 1992):

RANLP, September 3, 2019 32

The modern state of Italy was created in the year 1861. In 1861, Italy was largely unified. The Kingdom of Italy was founded

  • n this day in 1861.

Italy Date founded : 1861 Area : 301,000 sq.km Language : Italian Contained by : Europe Currency used: Euro

slide-32
SLIDE 32

Take-home from Strand 1

  • Knowledge can only be learned distributionally if has a

substantial degree of contextual support

  • Future directions:
  • Measuring / quantifying contextual support
  • Increasing contextual support
  • Fine-tuning on labeled data – not a panacea (?)
  • Present specific patterns to learner (Roller & Erk 2016)
  • Use meta-information about

attribute-attribute relations

RANLP, September 3, 2019 33

GDP_per_capita = GDP_nominal / population

slide-33
SLIDE 33

Agenda for this presentation

  • Q: Are there relevant differences in the way we can apply

DS to modelling entities and categories?

  • Research strand 1: Knowledge Bases
  • How far can we push DS in learning world knowledge?
  • Research strand 2: The Instantiation Relation
  • How do categories and entities behave distributionally?

RANLP, September 3, 2019 34

Benefit: insights into capabilities and limits

  • f distributional approaches to meaning
slide-34
SLIDE 34
  • We introduce a new semantic relation: in

instantia iatio ion

  • Hypernymy -- relation between tw

two cate tegories

[Baroni et al. 12, Roller et al. 14, Santus et al. 14, Levy et al. 15, etc.]

  • Instantiation -- relation between en

entity an and ca category

  • Many-to-many, not reflexive, not symmetrical, not transitive

35

Strand 2: Instantiation

[Gupta et al. EACL 2017, ArXiv] politician president president Lincoln

RANLP, September 3, 2019

slide-35
SLIDE 35
  • 22k pairs: 5.5k positive pairs + 3* 5.5k negative pairs
  • Positive: Group entity with category
  • “Instance hypernym” relation from WordNet
  • Negative 1: INVERSE

(switch entity and category)

  • Negative 2: INST2INST(entity + random other entity)
  • Negative 3: NOTINST (entity + wrong related category)

36

An Instantiation Dataset

RANLP, September 3, 2019

Positive Abraham Lincoln – POTUS Mumbai – city Inverse POTUS – Abraham Lincoln city – Mumbai Inst2Inst Abraham Lincoln – Duncan Grant Mumbai – Vicksburg NotInst-inClass Abraham Lincoln – doctor Mumbai – residential area

<latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="bhZPe1CKHbdFPRHTYSlj9D8eFx0=">ACF3icbZDNSgMxFIXv1L9aq23XboJFcFVm3OhScOygv2BtpRMJtOGZpIhuSMtQ1/Are/g07gTF76N6c/Cab0Q+Dgn4Z6cMJXCou/eKWDw6Pjk/Jp5axaOb+o1atdqzPDeIdpqU0/pJZLoXgHBUreTw2nSh5L5w9rvzeKzdWaPWCi5SPEjpRIhaMopPa43rTb/nrIfsQbKEJ2xk3vNow0ixLuEImqbWDwE9xlFODgkm+rAwzy1PKZnTCBw4VTbgd5eucS3LtlIjE2rijkKzVvy9ymli7SEJ3M6E4tbveSvzPG2QY349yodIMuWKbRXEmCWqy+jSJhOEM5cIBZUa4rIRNqaEMXTWFLfN1IWaj1DGtqiOreO+NI1GOz2tQ/d21bgt4JnH8pwCVdwAwHcwQM8QRs6wCN3j3PrxP72vTdMnbVt6Awnjfv6eZozo=</latexit><latexit sha1_base64="jiKMypDM3t7VZbwUWsVnC18OJCc=">AFiHicjVRNb9NAEHWaBkotEXixGVF1YpLK6eXInEBWgRIfATRtJXqBqvN+kq+2HtjksiK3+SG/+EI2Mn0LpxUfdgjWfmvXmemXWcKukxDH81lprLrXv3Vx60H64+ery2vrF67G3muOhxq6w7jcELJY3oUQlTlMnQMdKnMSjgyJ+cimcl9Yc4SQVfQ1DIweSA5LrfKM5ioz4wa3WYJI8kmjlNI9QjIkr/2g87hWPaT2KXAxbVezeX15KOuCU1M1TuKrlXvtanGouyqPkv+YvF/4rwSKoriDsIL7XeUbyRMlmQsyPNgQLvb8W4RchQ2RjUFaIdxWIoTY4QZwoIwJRSbNpmLEKbukwJMqvniIbItm7MehUCda5Zu8RLGKRlktsmu9RHlZoN/EDi5As0/S0AoZtrPDul+Pet8p9DnTMcjCwyVOiHCBp+geJc4AlHeTbHuGpMicq5aDRlYv5DAzHAx78BgRc+x5CMfZ25YEG4tMFLD6wkdfbOrMHkEFJ7mVCutmHY9U2I52iqVE14mwqAExYAuXjmB2NJG6nIkaBl+Dve8/XNcDcsD1s0OnNjM5if7vlGYy2iopmCrxYtrNOmGI/B4eSq2LlMi9olUYwFGdkGtDC9/PytzBlW+RJ2IAUDyz1svReR+SgvZ/omDI14IW/GSucdbGzDAcv+7k0aYbC8FmhQaYWlb8Y1gineCoJmQAd7R0nPELcMCRLlmlyngmteKLrR1Rx3zVO78y1MLOzYtGsd7u51wt/MtDFaCZ8Hz4EXQCfaD18GHoBv0At782fy93Fheaj1tvWq9nTV7qTHv+pOgclqHfwA7Fs2K</latexit><latexit sha1_base64="jiKMypDM3t7VZbwUWsVnC18OJCc=">AFiHicjVRNb9NAEHWaBkotEXixGVF1YpLK6eXInEBWgRIfATRtJXqBqvN+kq+2HtjksiK3+SG/+EI2Mn0LpxUfdgjWfmvXmemXWcKukxDH81lprLrXv3Vx60H64+ery2vrF67G3muOhxq6w7jcELJY3oUQlTlMnQMdKnMSjgyJ+cimcl9Yc4SQVfQ1DIweSA5LrfKM5ioz4wa3WYJI8kmjlNI9QjIkr/2g87hWPaT2KXAxbVezeX15KOuCU1M1TuKrlXvtanGouyqPkv+YvF/4rwSKoriDsIL7XeUbyRMlmQsyPNgQLvb8W4RchQ2RjUFaIdxWIoTY4QZwoIwJRSbNpmLEKbukwJMqvniIbItm7MehUCda5Zu8RLGKRlktsmu9RHlZoN/EDi5As0/S0AoZtrPDul+Pet8p9DnTMcjCwyVOiHCBp+geJc4AlHeTbHuGpMicq5aDRlYv5DAzHAx78BgRc+x5CMfZ25YEG4tMFLD6wkdfbOrMHkEFJ7mVCutmHY9U2I52iqVE14mwqAExYAuXjmB2NJG6nIkaBl+Dve8/XNcDcsD1s0OnNjM5if7vlGYy2iopmCrxYtrNOmGI/B4eSq2LlMi9olUYwFGdkGtDC9/PytzBlW+RJ2IAUDyz1svReR+SgvZ/omDI14IW/GSucdbGzDAcv+7k0aYbC8FmhQaYWlb8Y1gineCoJmQAd7R0nPELcMCRLlmlyngmteKLrR1Rx3zVO78y1MLOzYtGsd7u51wt/MtDFaCZ8Hz4EXQCfaD18GHoBv0At782fy93Fheaj1tvWq9nTV7qTHv+pOgclqHfwA7Fs2K</latexit><latexit sha1_base64="ZF8xoBF6wXViKLYnBUHqYGga4yY=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLKzsXkLiUtgiQKATRtJXqFqvN+kq+2HtjksiK3+SG/+EI7NOoHXjou7BGs/Me/M8M+sk8JBGP5qLDWXW/furzxoP3z0+Mnq2vrTY2dy3iPGWnsaUIdl0LzHgiQ/DSznKpE8pNktO/jJ5fcOmH0EUwy3ld0qMVAMAroOl9vjmLNfzCjFNVpEQswYlrEwMfIVXzSDjr+MY3HLqOMT9vVbH15PdnX4bek5hrLXSX3ytf6VG1A+Jr/kr8Y+K8IB6i6griD8FLrHcVrIdIFOdtC70vq3K0YuwgZSpNQeYVoxwkfCl0ATXJEUCklGTaJiQGk9lcjSr5wiHSLbI+zFVmeQkumZ3EBaTWIm0Ftk1ToC49Oh3iaUXVJHPQuMKabK9Tbpfj3rfMXSYq4QK72ECJki4wO7h4kzAObdJNuaITEy56rlwJHVCznINaOafLBUQ0XPsWAjl+R26Ak3Fxix4fWEFr/ZVpgcUOAO51QrzU+7nik1DEyVynInUq5BUEkoXrxyAonBjVTlEGKOy/B3vOdrG+FOWB6yaERzYyOYn+75emM1xqK5wgrML9tZFGbQL6gFwaRfudxXKURHfIzNDV3PWL8rcwJZvoSckAFQ8M9rL0XkcUVDk3UQlmKgoX7mbMO+tiZzkM3vQLobMcuGazQoNcEjDE/2NIKixnICdoUGZx6RhF9RSBnjJKlXGM6kVX2LMCDvmqt75lcEWRjcbtmgcd3aicCf6Fm7s7s2buRK8CF4Gr4IoeB3sBh+DbtALWPNn8/dyY3mp9bz1trXOpilLjXmGdB5bQO/wA9Hc6I</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit><latexit sha1_base64="v9TSojFt79iPwNtjhrQg0cxGSnA=">AFk3icjVRNb9NAEHWaBkr4aAvixGVF1YpLqzgXkLiUtgiQKATRtJXqBqvN+kq+2HtjksiK3+SG/+EI+Mk0LpxUfdgjWfmvXmemXWcKumx1fpVW6ovN+7dX3nQfPjo8ZPVtfWnx95mjosut8q60xi8UNKILkpU4jR1AnSsxEk83C/iJ5fCeWnNEY5T0dMwMLIvOSC5ztfrw8iIH9xqDSbJI4lWTvIxYi48k/GY7t4TKRT4GLSbOcbS6vJxd1xC2pmaFyV8nd6Wt1qrEoi5r/kr9Y/K8Ij6S6hLiD8KnWO4o3UiYLcral2Vfg/a0YtwgZKBuDukI0o1gMpMkR4kwBAZhSik2ajEVoU5cpQWb5HNEQ2RZ7PwKdKsHCa3abYBGLtEwqkR3rJcrLAv0udnABmn2WhlbIsO1t1vl61P1OocNMxyALD5c4JsIFnqJ7lDgDUN5Nsq0ZkiJzrkoOGlm1kIPMcDsgwODJT3Hkg9nLlBQbi5wEgNryZ09M2uxOQRUHiaU6W0YtrVTInlaMtUTniZCIMSFAO6eNMJxJY2Uk+HEAlahr/jPV/baO20poctGuHc2Ajmp3O+XluNqGimqQIvlu0sbKXYy8Gh5KpYucwLWqUhDMQZmQa08L18+luYsE3yJKxPivuWejn1XkfkoL0f65gyNeCFvxkrnFWxswz7b3q5NGmGwvBZoX6mGFpW/GNYIp3gqMZkAHe0dJzxC3DAkS5ZqcpoJrXki60dUsd82Tu/MtTC8GbDFo3j9k7Y2gm/tTd29+bNXAleBC+DV0EYvA52g49BJ+gGvP6z/nu5trzUeN5429hrHMxSl2pzLOgdBqHfwA9vc6K</latexit>
slide-36
SLIDE 36
  • Architecture: let’s use an MLP again (1 hidden layer)
  • Inspiration: hypernymy classifier (Roller et al. 14)
  • Input: Embeddings for two words, e.g. v = w1||w2
  • Output: Binary decision (instantiation or not)

37

Modeling Instantiation

RANLP, September 3, 2019

(We experimented with different variations)

slide-37
SLIDE 37

Experimental Setup

  • Own dataset
  • Train-dev-test split with memorization filtering
  • No entity or concept appears in more than one section
  • Embeddings: Google News
  • Baseline: Always predict instantiation
  • Evaluation: F1 for class instantiation

RANLP, September 3, 2019 38

slide-38
SLIDE 38
  • Inverse and Inst2Inst are very simple
  • Entities and categories are simple to distinguish
  • NotInst is very difficult: hardly beats baseline!
  • Corresponds well to findings about hypernymy

39

Results

RANLP, September 3, 2019

Dataset Pos:Neg Neg ex. BLpos MLP Pos + Inverse 1:1 POTUS – Lincoln 0.67 0.96 Pos + Inst2Inst 1:1 Lincoln – Grant 0.67 0.91 Pos + NotInst 1:1 Lincoln – doctor 0.67 0.69 Pos + Union 1:3 all 0.40 0.63

<latexit sha1_base64="ogWiD6VaAOdo0AeKDbmoNkH3DGs=">AE5nicfVNbxMxEN02AUr4auHIxSItQkJd7abkoz1VgARIoQTRtJW6Uep1nNTUa69sb2m02r/ADXHlP3Hit3BhdrNJN2mLfZjR+M3z83jGDznTxnH+LC2Xyrdu31m5W7l3/8HDR6trjw+0jBShXSK5VEc+1pQzQbuGU6PQkVx4HN6J+9Sc8Pz6nSTIp9Mw5pL8AjwYaMYAOh/tryb0/Qb0QGARaD2BOMDZLYM/QCuOI9aT4IbRLvQoeY0KQyh41EdsVwHcDkhnJLpEpZe0/vEycF8GpenoJrVSQ59MRE7HBfsSxSmLE801go6SC8vUWG6iMQeg56ki9s0dHqZsaemGD97q93p9cFIdSJ8k6xD62O8jzkBewgYo4nXEBAXqJUm0AcnfclPTfvcL2txEbSbgIwSEkGM3mAde7sxzfS8KyRQjhnLNBl43iksTJY9JXEXSDZyBsFUBqg7RXi9DiZ7jz+MHbtVT7Jwq3aTkPS30XVKBpIYqYrcje1iPpor0EaRMxKTvFo9M6+cS2mO3ZxX2Gwk18lKGytTtTVFQyNgzpMiY2OrmOlR6JxpQ1T6q1XHdrKFrjpu7lStfHX6q389eHIUGEIx1ofu05oejFWhGe9mekKbTfGR7RY3AFDqjuxdn0JWgDIgM0hHoNJfxfFi1mxDjQehz4gAywOdWLZ2nwurPjyAxbvZiJMDJUkMlFw4gjI1E6ymjAFCWGj8HBRDHQisgpVpgYGJm5W2ZzVoj5Up5BxXQC1XIXa3PVOajZ7pZd+1yr7jbyuq1YT61n1gvLtZrWrvXe6lhdi5R2SiclVvpaPi1/L/8o/5xAl5fynCfW3Cr/+gcXkW0b</latexit>
slide-39
SLIDE 39

What makes NotInst hard?

RANLP, September 3, 2019 40

slide-40
SLIDE 40

Category representation

  • Our assumption: Embedding of noun x is a good

representation of category x

  • (Universally assumed in lexical semantic modeling)
  • That is actually questionable:
  • Informativity
  • Ambiguity, including metaphors
  • Lexical choice and speaker

intent (Lapesa et al. 2017, Westera and Boleda 2019)

RANLP, September 3, 2019 41

Grass is green Elephant in the room Fotograf vs. Fotografin (generic/female photographer)

slide-41
SLIDE 41
  • Are there alternatives for concept representation?
  • Formal semantics: (extension of) concept = set of

entities instantiating it

  • In our context: Represent categories by the centroid of

their entity embeddings („centroid embedding“)

  • Vs. traditional approach: „concept embedding“

42

Re-representing Categories

RANLP, September 3, 2019

  • E. Macron
  • B. Johnson

politician U

slide-42
SLIDE 42

43

Does It Work?

RANLP, September 3, 2019

slide-43
SLIDE 43
  • Extension of previous experiment
  • Centroids built on training set
  • Im

Improvement fo for ce centroid-ba based ca category re repre resentation

44

Experimental Validation

RANLP, September 3, 2019

Dataset Pos:Neg BLPos Concept emb. Centroid emb. Pos + Inverse 1:1 0.67 0.96 0.98 Pos + Inst2Inst 1:1 0.67 0.91 0.91 Pos + NotInst 1:1 0.67 0.67 0.79 Pos + Union 1:3 0.40 0.63 0.76

<latexit sha1_base64="iS4w3ReHTQbUg2KbAO920nKAsM=">AEPXicfVPNbtNAEHZtfkr4aQpHLitSEAIpshOUtD1VlANIqAoSaSvFUbReb9JV1ruWd10aWX4InoYj8Bg8ADfElQMXxj9N7AYyZ7Z2W+WX+z9kLOlLbtbxumde36jZubtxq379y9t9Xcvn+sZBwROiSy+jUw4pyJuhQM83paRhRHicnjzw2z/5JxGiknxXi9COg7wTLApI1hDarJtPnMF/UBkEGDhJ65gzE8TV9ML4EqOpH4jlE7dCxViQtNGDRuLKnYogO8fSKYlWyEzys5/eJk4r4Kz09MVtOF6dMZEorEXcxylCUcEULSBgJztQyjmFOE8mVmr7AGfTR6gZS7R/RGUTo5dudSdEjgWya7kDyUApCQ41o4LWzJRU6kswv1q6L3ID5GfeSGSrRc5SdF+DOvgNvu93ro9zv9Qq3C6VrFaBHvSHOpcuryix2USW2BU4d/kHeNPEbvf30mofVD1sSZSNKyfq5gQv7IKnW+fp1XhcCgO5VLoxabstp0bWg+cMmgZpQ0mzd+uL0kcgJKEY6VGjh3qcYIjzQjPxh4rClOd4xkdQShwQNU4yS91ih5DxkdTGcEjNMqz1YoEB0otAg+QAdZn6upelvzb3ijW091xwkQYaypI0Wgac6Qlyv4Q5LOIEs0XEGASMTgrImc4wkTDTax1WV7fSs6Tcg6KqRTUcq5qsx4cd9pOt91512kd9ErdNo2HxiPjqeEYfePAeG0MjKFBzI/mJ/OL+dX6bH23flg/C6i5UdY8MGpm/foDBLJLnw=</latexit>
slide-44
SLIDE 44
  • Categories and entities differ in distributional behavior
  • Can be distinguished easily
  • But capturing entity-category relations is tricky
  • Analogy to difficult attributes in Strand 1
  • How to improve comparability?
  • Here: Centroid-based representation
  • Conceptually appealing
  • Requires more information about categories than just their

names, namely instances

45

Take-home from Strand 2

RANLP, September 3, 2019

How many instances are needed? Sneak preview!

slide-45
SLIDE 45

46

concept 1+ inst 2+ inst 3+ inst 4+inst 5+inst

slide-46
SLIDE 46

Wrap-up

  • The Distributional Hypothesis – usage determines

meaning – is at the heart of many NLP applications

  • But is it really true?
  • My proposal today: Let’s relate the properties of

information we want to learn to the properties of the linguistic material we want to learn it from

  • This presentation: Entities vs. categories
  • Other direction: Speaker intention vs. linguistic usage

RANLP, September 3, 2019 47

slide-47
SLIDE 47

Thank you!

e-mail phone +49 (0) 711 685- www. University of Stuttgart Sebastian Padó 81394 ims.uni-stuttgart.de/~pado pado@ims.uni-stuttgart.de