Grounded Word Sense Translation Chiraag Lala, Pranava Madhyastha and - - PowerPoint PPT Presentation

grounded word sense translation
SMART_READER_LITE
LIVE PREVIEW

Grounded Word Sense Translation Chiraag Lala, Pranava Madhyastha and - - PowerPoint PPT Presentation

Grounded Word Sense Translation Chiraag Lala, Pranava Madhyastha and Lucia Specia Why look at images? Why look at images? A man holding a seal Ein Mann hlt einen Seehund Ein Mann hlt ein Siegel Multimodal Machine


slide-1
SLIDE 1

Grounded Word Sense Translation

Chiraag Lala, Pranava Madhyastha and Lucia Specia

slide-2
SLIDE 2

Why look at images?

slide-3
SLIDE 3

Why look at images?

“A man holding a seal” “Ein Mann hält ein Siegel” “Ein Mann hält einen Seehund”

slide-4
SLIDE 4

Multimodal Machine Translation

slide-5
SLIDE 5

This paper: focus on ambiguous words only

slide-6
SLIDE 6

Tagging Task

slide-7
SLIDE 7

The Dataset

From Multi30K: take words in the source language (En) with multiple translations in the target languages (De, Fr) with different meanings

En-Fr En-De Ambiguous words 661 745 Samples 44,779 53,868 Avg candidates/word 3 4.1 MFT 77% 65%

slide-8
SLIDE 8

Human Annotation

Humans manually labelled the test set and marked cases when they needed images

slide-9
SLIDE 9

Human Annotation

Annotators found image necessary in 7.8% of the samples for En-De, and 8.6% for En-Fr Words like player, hat and coat require the image as text alone is not sufficient to disambiguate

slide-10
SLIDE 10

Computational Models: BLSTM+image

slide-11
SLIDE 11

Computational Models: BLSTM+object_prepend

slide-12
SLIDE 12

Results

Accuracy: proportion of ambiguous words correctly translated Main finding: ULSTM benefits much more from global image features than BLSTM

slide-13
SLIDE 13

Results

Main finding: BLSTM models with pre-pending

  • bject

categories

  • utperform all the other models