Translating Negation: Induction, Search and Model Errors Federico - - PowerPoint PPT Presentation

translating negation
SMART_READER_LITE
LIVE PREVIEW

Translating Negation: Induction, Search and Model Errors Federico - - PowerPoint PPT Presentation

Translating Negation: Induction, Search and Model Errors Federico Fancellu & Bonnie Webber School of Informatics University of Edinburgh f.fancellu@sms.ed.ac.uk, bonnie@inf.ed.ac.uk www.inf.ed.ac.uk Why


slide-1
SLIDE 1

www.inf.ed.ac.uk

Translating Negation: Induction, Search and Model Errors



 Federico Fancellu & Bonnie Webber
 School of Informatics
 University of Edinburgh
 f.fancellu@sms.ed.ac.uk, bonnie@inf.ed.ac.uk


slide-2
SLIDE 2

www.inf.ed.ac.uk

Why bother? - Examples

slide-3
SLIDE 3

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-4
SLIDE 4

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-5
SLIDE 5

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 in an emergency duty , I had two patients have been found . During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-6
SLIDE 6

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 in an emergency duty , I had two patients have been found . During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-7
SLIDE 7

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 in an emergency duty , I had two patients have been found . During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-8
SLIDE 8

www.inf.ed.ac.uk

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 in an emergency duty , I had two patients have been found . During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

Why bother? - Examples

slide-9
SLIDE 9

www.inf.ed.ac.uk

Why bother? - Examples

slide-10
SLIDE 10

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

Why bother? - Examples

slide-11
SLIDE 11

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

Why bother? - Examples

slide-12
SLIDE 12

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

if he took to the things he does not mean that he is not a

Why bother? - Examples

slide-13
SLIDE 13

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

if he took to the things he does not mean that he is not a

Why bother? - Examples

slide-14
SLIDE 14

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

if he took to the things he does not mean that he is not a

Why bother? - Examples

slide-15
SLIDE 15

www.inf.ed.ac.uk

如果 他 拿 了 不 属于 他 的 东西 并不 说明 他 就是 个 惯偷

Even if he took things that do not belong to him, that does n’t mean he is a thief

if he took to the things he does not mean that he is not a

Why bother? - Examples

slide-16
SLIDE 16

www.inf.ed.ac.uk

Why bother? – BLEU scores

slide-17
SLIDE 17

www.inf.ed.ac.uk

Why bother? – BLEU scores

  • BLEU scores also showed a problem in translating negation
slide-18
SLIDE 18

www.inf.ed.ac.uk

Why bother? – BLEU scores

  • BLEU scores also showed a problem in translating negation
slide-19
SLIDE 19

www.inf.ed.ac.uk

Why bother? – BLEU scores

  • BLEU scores also showed a problem in translating negation
  • Similar trend for:

– German à English – Czech à English

slide-20
SLIDE 20

www.inf.ed.ac.uk

What’s exactly wrong with negation?

slide-21
SLIDE 21

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Potential problem

slide-22
SLIDE 22

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Potential problem Translating negation

slide-23
SLIDE 23

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis

Potential problem Translating negation

slide-24
SLIDE 24

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

slide-25
SLIDE 25

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

slide-26
SLIDE 26

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

slide-27
SLIDE 27

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

  • the scoring function does not contain any

negation-related feature (Fancellu & Webber, 2014)

slide-28
SLIDE 28

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis Testing

BLEU?

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

  • the scoring function does not contain any

negation-related feature (Fancellu & Webber, 2014)

slide-29
SLIDE 29

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis Testing

BLEU?

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

  • the scoring function does not contain any

negation-related feature (Fancellu & Webber, 2014)

slide-30
SLIDE 30

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis Testing

BLEU?

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

  • the scoring function does not contain any

negation-related feature (Fancellu & Webber, 2014)

slide-31
SLIDE 31

www.inf.ed.ac.uk

What’s exactly wrong with negation?

Hypothesis Testing

BLEU?

Potential problem Translating negation

  • structural mismatch between source and

target language (Collins et al.,2005)

  • not enough negative training data (Wetzel

& Bond, 2012)

  • the translation rules does not contain

negation (Baker et al., 2013)

  • the scoring function does not contain any

negation-related feature (Fancellu & Webber, 2014) NO ERROR ANALYSIS

slide-32
SLIDE 32

www.inf.ed.ac.uk

Rationale

slide-33
SLIDE 33

www.inf.ed.ac.uk

Rationale

Hypothesis Testing

BLEU?

Potential problem

slide-34
SLIDE 34

www.inf.ed.ac.uk

Rationale

Hypothesis Testing

BLEU?

Error analysis Potential problem Problem(s)

slide-35
SLIDE 35

www.inf.ed.ac.uk

Contributions

slide-36
SLIDE 36

www.inf.ed.ac.uk

Contributions

  • Present ongoing work on:
slide-37
SLIDE 37

www.inf.ed.ac.uk

Contributions

  • Present ongoing work on:

– Finding the causes of negation-related error during decoding

slide-38
SLIDE 38

www.inf.ed.ac.uk

Contributions

  • Present ongoing work on:

– Finding the causes of negation-related error during decoding – Highlighting the shortcomings of previous techniques

  • Constrained decoding
slide-39
SLIDE 39

www.inf.ed.ac.uk

Contributions

  • Present ongoing work on:

– Finding the causes of negation-related error during decoding – Highlighting the shortcomings of previous techniques

  • Constrained decoding

– Develop an informative way to analyze the translation of negation at each step during decoding

  • Chart analysis
slide-40
SLIDE 40

www.inf.ed.ac.uk

Sub-constituents of negation

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

slide-41
SLIDE 41

www.inf.ed.ac.uk

Sub-constituents of negation

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

  • Cue : the morpheme, word or multi-word unit inherently expressing negation.
  • im-possible, breathlessness, 不要脸, 不少,…
  • by no means, save, …
slide-42
SLIDE 42

www.inf.ed.ac.uk

Sub-constituents of negation

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

  • Cue : the morpheme, word or multi-word unit inherently expressing negation.
  • im-possible, breathlessness, 不要脸, 不少,…
  • by no means, save, …
  • Event : the lexical unit the cue directly refers to
slide-43
SLIDE 43

www.inf.ed.ac.uk

Sub-constituents of negation

在 同 一 个 急诊 的 值班 中 , 我 两 次 没有 发现 病患 得了 盲肠炎 。 During my emergency duty , I have n’t diagnosed a patient with appendicitis twice .

  • Cue : the morpheme, word or multi-word unit inherently expressing negation.
  • im-possible, breathlessness, 不要脸, 不少,…
  • by no means, save, …
  • Event : the lexical unit the cue directly refers to
  • Scope: all the elements whose falsity would prove negation to be false.
  • The event is included in the scope
slide-44
SLIDE 44

www.inf.ed.ac.uk

What kind of errors?

slide-45
SLIDE 45

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15)

slide-46
SLIDE 46

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation

slide-47
SLIDE 47

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure

slide-48
SLIDE 48

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure – Classification of the errors into deletion, reordering and insertion errors

slide-49
SLIDE 49

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure – Classification of the errors into deletion, reordering and insertion errors – Results:

slide-50
SLIDE 50

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure – Classification of the errors into deletion, reordering and insertion errors – Results:

  • Cue is easiest to translate followed by event and scope

difficult

slide-51
SLIDE 51

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure – Classification of the errors into deletion, reordering and insertion errors – Results:

  • Cue is easiest to translate followed by event and scope

difficult

  • Deletion across all categories
slide-52
SLIDE 52

www.inf.ed.ac.uk

What kind of errors?

  • Manual analysis of the errors involved in translating negation (Fancellu &

Webber, 2015 – Ex-Prom @ NAACL ‘15) – Annotation of the sub-constituents of negation – HMEANT (Lo & Wu, 2010) to calculate P, R and F1 measure – Classification of the errors into deletion, reordering and insertion errors – Results:

  • Cue is easiest to translate followed by event and scope

difficult

  • Deletion across all categories
  • Scope reordering
slide-53
SLIDE 53

www.inf.ed.ac.uk

What is the source of these errors?

slide-54
SLIDE 54

www.inf.ed.ac.uk

What is the source of these errors?

  • Rule/phrase Table: the best translation cannot be generated because

its necessary phrases/rules are absent from the search space à induction errors

slide-55
SLIDE 55

www.inf.ed.ac.uk

What is the source of these errors?

  • Rule/phrase Table: the best translation cannot be generated because

its necessary phrases/rules are absent from the search space à induction errors

  • Search space: the most probable output is absent from the search

space à search errors

slide-56
SLIDE 56

www.inf.ed.ac.uk

What is the source of these errors?

  • Rule/phrase Table: the best translation cannot be generated because

its necessary phrases/rules are absent from the search space à induction errors

  • Search space: the most probable output is absent from the search

space à search errors

  • Model: the model scores a sub-optimal translation higher than an
  • ptimal one à model errors
slide-57
SLIDE 57

www.inf.ed.ac.uk

Constrained decoding

slide-58
SLIDE 58

www.inf.ed.ac.uk

Constrained decoding

  • Tries to reconstruct the reference
slide-59
SLIDE 59

www.inf.ed.ac.uk

Constrained decoding

  • Tries to reconstruct the reference
  • Reference reachability as a proxy to analyze errors during decoding
slide-60
SLIDE 60

www.inf.ed.ac.uk

Constrained decoding

  • Tries to reconstruct the reference
  • Reference reachability as a proxy to analyze errors during decoding
  • Implemented as a feature in Moses:

– 1 if the hypothesis is a sub-string of the reference – - inf if the hypothesis is not a sub-string of the reference

slide-61
SLIDE 61

www.inf.ed.ac.uk

Constrained Decoding

slide-62
SLIDE 62

www.inf.ed.ac.uk

Constrained Decoding

  • If the reference is reconstructed:
slide-63
SLIDE 63

www.inf.ed.ac.uk

Constrained Decoding

  • If the reference is reconstructed:

– Search vs. model errors (Wisniewski and Yvon, 2013):

  • if p(e) < p(ê): search error
  • if p(e) > p(ê): model error

*e: 1-best hypothesis ê : reconstructed reference

slide-64
SLIDE 64

www.inf.ed.ac.uk

Constrained Decoding

  • If the reference is reconstructed:

– Search vs. model errors (Wisniewski and Yvon, 2013):

  • if p(e) < p(ê): search error
  • if p(e) > p(ê): model error
  • If the reference can not be reconstructed:

*e: 1-best hypothesis ê : reconstructed reference

slide-65
SLIDE 65

www.inf.ed.ac.uk

Constrained Decoding

  • If the reference is reconstructed:

– Search vs. model errors (Wisniewski and Yvon, 2013):

  • if p(e) < p(ê): search error
  • if p(e) > p(ê): model error
  • If the reference can not be reconstructed:

– Increase the translation option limit (Auli and Lopez, 2009)

  • if the reference can now be reconstructed à induction error

*e: 1-best hypothesis ê : reconstructed reference

slide-66
SLIDE 66

www.inf.ed.ac.uk

Constrained Decoding

  • If the reference is reconstructed:

– Search vs. model errors (Wisniewski and Yvon, 2013):

  • if p(e) < p(ê): search error
  • if p(e) > p(ê): model error
  • If the reference can not be reconstructed:

– Increase the translation option limit (Auli and Lopez, 2009)

  • if the reference can now be reconstructed à induction error

– Increase the cube pruning pop limit

  • if the reference can now be reconstructed à search error

*e: 1-best hypothesis ê : reconstructed reference

slide-67
SLIDE 67

www.inf.ed.ac.uk

Locality issues

slide-68
SLIDE 68

www.inf.ed.ac.uk

Locality issues

  • Negation is usually a local phenomenon
slide-69
SLIDE 69

www.inf.ed.ac.uk

Locality issues

  • Negation is usually a local phenomenon

就 拿 住 在 村 东南 一个 小 弯 子 里 的 湾 家人 来 说 吧 , 虽然 那 一家 子 的 家长 有点 不要脸 , 我们 伟大 的 中 村 不是 照样 会 罩 着 这 一 家 吗 ?

slide-70
SLIDE 70

www.inf.ed.ac.uk

Locality issues

  • Negation is usually a local phenomenon
  • If we fail to reconstruct a whole reference, it is unclear whether it is

because of negation

就 拿 住 在 村 东南 一个 小 弯 子 里 的 湾 家人 来 说 吧 , 虽然 那 一家 子 的 家长 有点 不要脸 , 我们 伟大 的 中 村 不是 照样 会 罩 着 这 一 家 吗 ?

slide-71
SLIDE 71

www.inf.ed.ac.uk

Locality issues

  • Negation is usually a local phenomenon
  • If we fail to reconstruct a whole reference, it is unclear whether it is

because of negation

  • Solution: isolate the part containing negation and use them as input to

CD

就 拿 住 在 村 东南 一个 小 弯 子 里 的 湾 家人 来 说 吧 , 虽然 那 一家 子 的 家长 有点 不要脸 , 我们 伟大 的 中 村 不是 照样 会 罩 着 这 一 家 吗 ?

slide-72
SLIDE 72

www.inf.ed.ac.uk

Locality issues

  • Negation is usually a local phenomenon
  • If we fail to reconstruct a whole reference, it is unclear whether it is

because of negation

  • Solution: isolate the part containing negation and use them as input to

CD 那 一家 子 的 家长 有点 不要脸 the parents of the family are somewhat shameless

slide-73
SLIDE 73

www.inf.ed.ac.uk

Results

slide-74
SLIDE 74

www.inf.ed.ac.uk

Results

  • We could generate max. 16
  • ut of 54 sentences (29%)
slide-75
SLIDE 75

www.inf.ed.ac.uk

Results

  • We could generate max. 16
  • ut of 54 sentences (29%)
  • Enlarging translation option

limit and cube pruning pop limit leads to a small improvement – Just a few induction/ search errors

slide-76
SLIDE 76

www.inf.ed.ac.uk

Results

  • We could generate max. 16
  • ut of 54 sentences (29%)
  • Enlarging translation option

limit and cube pruning pop limit leads to a small improvement – Just a few induction/ search errors

  • p(e) always < p(ê)

– model errors

slide-77
SLIDE 77

www.inf.ed.ac.uk

Discussion

slide-78
SLIDE 78

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
slide-79
SLIDE 79

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
  • However:
slide-80
SLIDE 80

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
  • However:

– We are basing our results on less than a half test sentences

  • ! CD is based only one or a few references vs. virtually infinite

ways of translating a sentence

slide-81
SLIDE 81

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
  • However:

– We are basing our results on less than a half test sentences

  • ! CD is based only one or a few references vs. virtually infinite

ways of translating a sentence – If model errors, which score component is the most responsible?

slide-82
SLIDE 82

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
  • However:

– We are basing our results on less than a half test sentences

  • ! CD is based only one or a few references vs. virtually infinite

ways of translating a sentence – If model errors, which score component is the most responsible? – CD treats decoding as a “black box”

slide-83
SLIDE 83

www.inf.ed.ac.uk

Discussion

  • Ad-interim conclusion: one should enhance the model
  • However:

– We are basing our results on less than a half test sentences

  • ! CD is based only one or a few references vs. virtually infinite

ways of translating a sentence – If model errors, which score component is the most responsible? – CD treats decoding as a “black box” – It is hard to connect CD with deletion and reordering errors

slide-84
SLIDE 84

www.inf.ed.ac.uk

Chart analysis

  • Analysis of each step during decoding
  • Access to hypothesis stacks and sub-scores

– In-depth analysis of model errors

  • We can understand the causes of deletion and reordering errors
  • We can analyze the translation of cue, event and scope separately
  • We can analyze patterns of translation amongst these elements
slide-85
SLIDE 85

www.inf.ed.ac.uk

How does it work?

slide-86
SLIDE 86

www.inf.ed.ac.uk

How does it work?

  • Input à decoding chart trace
slide-87
SLIDE 87

www.inf.ed.ac.uk

How does it work?

  • Input à decoding chart trace
  • A good translation of negation needs to meet four conditions:
  • 1. The cue has to be translated
  • 2. The event has to be translated
  • 3. The cue has to refer to the right event
  • 4. The scope elements should be placed in the correct negation

scope

slide-88
SLIDE 88

www.inf.ed.ac.uk

How does it work?

  • Input à decoding chart trace
  • A good translation of negation needs to meet four conditions:
  • 1. The cue has to be translated
  • 2. The event has to be translated
  • 3. The cue has to refer to the right event
  • 4. The scope elements should be placed in the correct negation

scope deletion reordering

slide-89
SLIDE 89

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

slide-90
SLIDE 90

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

slide-91
SLIDE 91

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

event needs to be translated

slide-92
SLIDE 92

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

scope element attached to the right event

slide-93
SLIDE 93

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

cue needs to be translated

slide-94
SLIDE 94

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

cue should refer to the right event

slide-95
SLIDE 95

www.inf.ed.ac.uk

How does it work? – Cont’d

  • Assuming we know the elements of negation on the source, the cell has

to satisfy a given condition if it cover one or more of those elements

放弃 没有 他们 政府

✓ All elements should be translated and should correctly related to each other

slide-96
SLIDE 96

www.inf.ed.ac.uk

Stack analysis – model errors

放弃 没有 他们 政府

  • 1. gave up | p(e|f) p(f|e) p(LM) plex. …
  • 2. not | p(e|f) p(f|e) p(LM) plex. …

[…] 10: did not give up | p(e|f) p(f|e) p(LM) plex. …

  • Analysis whether a component is more responsible for model errors

10 meets all conditions, 1 does not 1: p(e|f) p(f|e) p(LM) plex(e|f) plex(e|f) 10: p(e|f) p(f|e) p(LM) plex(e|f) plex(e|f) ✖ ✓

slide-97
SLIDE 97

www.inf.ed.ac.uk

Stack analysis – search/induction errors

放弃 没有 他们 政府

slide-98
SLIDE 98

www.inf.ed.ac.uk

Stack analysis – search/induction errors

放弃 没有 他们 政府

slide-99
SLIDE 99

www.inf.ed.ac.uk

Stack analysis – search/induction errors

放弃 没有 他们 政府

slide-100
SLIDE 100

www.inf.ed.ac.uk

Stack analysis – search/induction errors

  • cue has to be translated in all

cells marked with

放弃 没有 他们 政府

slide-101
SLIDE 101

www.inf.ed.ac.uk

Stack analysis – search/induction errors

  • cue has to be translated in all

cells marked with

  • If no cue is found in any of these

cells: – Modify translation option limit and cube pruning pop limit to assess the presence of search and model errors

  • Same applies to the other two

elements

放弃 没有 他们 政府

slide-102
SLIDE 102

www.inf.ed.ac.uk

Stack analysis – others/ongoing

放弃 没有 他们 政府

slide-103
SLIDE 103

www.inf.ed.ac.uk

Stack analysis – others/ongoing

放弃 没有 他们 政府

  • Rule trace to study negation

element combinatory tendencies

slide-104
SLIDE 104

www.inf.ed.ac.uk

Stack analysis – others/ongoing

放弃 没有 他们 政府

  • Rule trace to study negation

element combinatory tendencies

  • Is cue translated along side the

event? h

slide-105
SLIDE 105

www.inf.ed.ac.uk

Stack analysis – others/ongoing

放弃 没有 他们 政府

  • Rule trace to study negation

element combinatory tendencies

  • Is cue translated along side the

event?

  • Is cue and event translated

separately and combined together via glue rules? h h h

slide-106
SLIDE 106

www.inf.ed.ac.uk

Stack analysis – others/ongoing

放弃 没有 他们 政府

  • Rule trace to study negation

element combinatory tendencies

  • Is cue translated along side the

event?

  • Is cue and event translated

separately and combined together via glue rules?

  • What about event and scope?

h h h

slide-107
SLIDE 107

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

slide-108
SLIDE 108

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

slide-109
SLIDE 109

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

slide-110
SLIDE 110

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

slide-111
SLIDE 111

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

Not No Neither Impossible By no means […]

cue

*CRF (F1 > 90%)

slide-112
SLIDE 112

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

Not No Neither Impossible By no means […]

cue

*CRF (F1 > 90%)

slide-113
SLIDE 113

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

Not No Neither Impossible By no means […]

cue

*CRF (F1 > 90%)

放弃 防抗 去 […] Give up Protest go […] event

*CCEDIT

slide-114
SLIDE 114

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

Not No Neither Impossible By no means […]

cue

*CRF (F1 > 90%)

放弃 防抗 去 […] Give up Protest go […] event

*CCEDIT

scope ??

slide-115
SLIDE 115

www.inf.ed.ac.uk

Negation detection

  • Source à annotations from manual error analysis
  • Target?
  • 1. gave up ||| […]
  • 2. not ||| […]

10: did not give up ||| […] […] 25: he did not give up ||| […]

Not No Neither Impossible By no means […]

cue

*CRF (F1 > 90%)

放弃 防抗 去 […] Give up Protest go […] event

*CCEDIT

scope ??

  • Better approach: paraphrase + automatic negation

detection (see Future Work)

slide-116
SLIDE 116

www.inf.ed.ac.uk

System and initial results

  • System:

– Zh à En HIERO; 54 sentences containing negation (from the manual error analysis)

  • Results:

– Errors related to the translation of the cue – The cue is never absent from the chart of any sentence

  • no search or induction error

– Analysis of the model sub-scores:

  • Indirect probabilities (translation and lexical) are responsible for >

60% of bad-ranking

  • LM only 25%
slide-117
SLIDE 117

www.inf.ed.ac.uk

Conclusion

  • Translating negation is problematic
  • Previous error detection techniques do not offer an in-depth analysis
  • A chart analysis offers a better insight in the decoding process
slide-118
SLIDE 118

www.inf.ed.ac.uk

Future Work

  • The soldier wasn’t afraid of death
  • The soldier had no fear of death
  • The soldier didn’t fear death
  • The soldier, without any fear of death, […]
  • The soldier was fearless of death

Paraphrase generation

Negation detection component

  • Negation detection in the target hypothesis
  • No list! How to leverage a reference translation?

[…]

n: a soldier no fear of dying

[…]

stack

The soldier wasn’t afraid of death

slide-119
SLIDE 119

www.inf.ed.ac.uk

Thank you!