Chunk-level Reordering of Source Language Sentences with - - PowerPoint PPT Presentation

chunk level reordering of source language sentences with
SMART_READER_LITE
LIVE PREVIEW

Chunk-level Reordering of Source Language Sentences with - - PowerPoint PPT Presentation

NAACL HLT Rochester, NY April 2007 Chunk-level Reordering of Source Language Sentences with Automatically Learned Rules for Statistical Machine Translation Yuqi Zhang, Richard Zens and Hermann Ney Human Language Technology and Pattern


slide-1
SLIDE 1

NAACL HLT – Rochester, NY – April 2007

Chunk-level Reordering of Source Language Sentences with Automatically Learned Rules for Statistical Machine Translation Yuqi Zhang, Richard Zens and Hermann Ney Human Language Technology and Pattern Recognition Lehrstuhl für Informatik 6 Computer Science Department RWTH Aachen University, Germany

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 1

slide-2
SLIDE 2

Overview

  • Introduction
  • Baseline system
  • Chunk parsing
  • Rules extraction
  • Reordering lattice generation
  • Results
  • Conclusions and outlook

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 2

slide-3
SLIDE 3

Introduction

goal: improve MT utilizing syntactic knowledge idea: reordering at the chunk level approach:

  • 1. chunk source sentence
  • 2. reorder chunks
  • 3. represent alternative reorderings in a lattice
  • 4. translate lattice

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 3

slide-4
SLIDE 4

Phrase-based SMT

log-linear combination of several model: P r(eI

1|fJ 1 ) =

exp M

m=1 λmhm(eI 1, fJ 1 )

  • I′,e′I′

1

exp M

m=1 λmhm(e′I′ 1 , fJ 1 )

  • models:
  • phrase translation model
  • phrase count features
  • word-based translation model
  • word and phrase penalty
  • target language model (6-gram)
  • distortion penalty model

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 4

slide-5
SLIDE 5

System Architecture

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 5

slide-6
SLIDE 6

Example

source ke yi dan shi wo men chu zu che bu duo POS v c r v n d m chunks v c r NP VP English gloss yes but we taxi not many reordering rules NP VP → VP NP r NP VP → r VP NP r NP VP → VP r NP

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 6

slide-7
SLIDE 7

Example (cont’d)

  • reordering lattice:

1 ke yi 2 dan shi 3 wo men 11 bu 4 chu zu 8 bu 5 che 9 duo 10 chu zu 7 che 6 bu duo 12 duo wo men

  • translation result:

reference yes, but there are not many rental cars here baseline yes , but we do rent car is not chunk-reordering yes , but we do not have much rental car

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 7

slide-8
SLIDE 8

Chunk Parsing

  • POS tagging + word segmentation with ICTCLAS tool

Institute of Computing Technology, Chinese Academy of Sciences

  • training data for chunker: Chinese Treebank (LDC2005T01)
  • 24 chunk types
  • MaxEnt tagger

– input features: word + POS tag – output: chunk types + chunk boundary

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 8

slide-9
SLIDE 9

Reordering Rules Extraction

  • convert word-to-word alignment to chunk-to-word alignment
  • run standard phrase extraction on chunk-to-word alignment

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 9

slide-10
SLIDE 10

Reordering Rules Extraction (cont’d)

(a) monotone phrase, (b) reordering phrase, (c) cross phrase

  • extract rules from monotone phrases and reordering phrases

– e.g. NP0NP1# NP0 NP1 NP0NP1# NP1 NP0 – within a subsentence, not across punctuations

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 10

slide-11
SLIDE 11

Reordering Lattice Generation I

  • apply reordering rules to chunked source sentence
  • represent alternative reorderings as a lattice
  • example:

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 11

slide-12
SLIDE 12

Reordering Lattice Generation II

  • chunk-level lattice:

4 NP1 1 NP0 6 v 5 v 2 NP0 3 NP0 NP1 7 v v NP0 NP1

  • word-level lattice:

1 f2 8 f0 14 f6 2 f3 3 f4 4 f5 5 f6 20 f0 6 f0 7 f1 9 f1 10 f2 16 f6 11 f3 12 f4 13 f5 f6 15 f0 f1 17 f2 18 f3 19 f4 f5 f1 Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 12

slide-13
SLIDE 13

Reordering Model

  • use language model to weigh lattice
  • training:

– chunk source training data – generate chunk-to-word alignment – reorder source chunks to monotonize alignment – train LM on reordered source training data

  • word-level LM

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 13

slide-14
SLIDE 14

Chunking Result

  • corpus statistics (Chinese Treebank LDC2005T01):

train test sentences 17 785 1 000 words 486 468 21 851 chunks 105 773 4 680

  • tagging results:

word-level chunk-level accuracy [%] precision [%] recall [%] F-measure [%] 74.51 65.2 61.5 63.3

  • number of chunk types: 24
  • both chunk type and boundary have to be correct

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 14

slide-15
SLIDE 15

Corpus Statistics

Chinese English Train Sentences 40k Words 308k 377k Dev(dev4) Sentences 489 Words 5 478 6 008 Test Sentences 500 IWSLT04 Words 3 866 3 581 Test Sentences 506 IWSLT05 Words 3 652 3 579 Test Sentences 500 IWSLT06 Words 5 846 –

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 15

slide-16
SLIDE 16

Statistics of Reordering Rules

10 20 30 40 50 2 4 6 8 10 12 14 number of rules (k) rule length

total

singletons reorder rules

total: 184k, singletons: 88%, reorder rules: 34%

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 16

slide-17
SLIDE 17

Translation Results

WER [%] PER [%] NIST BLEU [%] IWSLT04 baseline 47.3 38.2 7.78 39.1 chunk reordering 46.3 37.2 7.70 40.9 IWSLT05 baseline 45.0 37.3 7.40 41.8 chunk reordering 44.6 36.8 7.51 42.3 IWSLT06 baseline 67.4 50.0 6.65 22.4 chunk reordering 65.6 50.4 6.46 23.3

  • evaluation without punctuation marks and in lower case
  • baseline: RWTH IWSLT 2006 system without rescoring

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 17

slide-18
SLIDE 18

Chunk-level vs. POS-level

Translation performance (IWSLT 2004): WER [%] PER [%] NIST BLEU [%] Baseline 47.3 38.2 7.78 39.1 POS 46.9 37.5 7.38 39.7 Chunk 46.3 37.2 7.70 40.9 Lattice statistics:

  • avg. density used

translation per sent rules time [min:sec] Baseline

  • 1:22

POS 15.7 6 868 7:08 Chunk 8.2 3 685 3:47

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 18

slide-19
SLIDE 19

Translation Examples (IWSLT04)

reference about twenty-five seconds baseline seconds about twenty-five chunk reorder about twenty five seconds reference could n’t you make it a little cheaper baseline could not you some better chunk reorder can’t you make it a little cheaper ones reference how much is admission baseline admission fees how much is it chunk reorder how much is the admission reference may i have that gift wrapped please baseline wrap can i have a gift chunk reorder can i have a gift wrapped please

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 19

slide-20
SLIDE 20

Summary

  • idea:
  • 1. chunk input sentence
  • 2. reorder chunks
  • 3. represent alternative reorderings as lattice
  • 4. translate lattice
  • nice improvements on IWSLT task
  • chunk-level reordering better than POS-level reordering

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 20

slide-21
SLIDE 21

Outlook

  • large data task (e.g. NIST)
  • other language pairs
  • improve chunk parsing
  • better reordering model
  • analyze what kind of rules work well

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 21

slide-22
SLIDE 22

THANK YOU FOR YOUR ATTENTION!

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 22

slide-23
SLIDE 23

ICTCLAS POS Tag Set

n noun r pron nr person name rg pron morpheme ns location name m number ng noun morpheme q quantity t time d adverb s location p prep f position word c conjunction v verb u auxiliary vd verb adv e interjection vn noun verb y modal particle vg verb morpheme

  • nomatopoeia

a adj h prefix ad adv adj k suffix an adj noun w punctuation ag adj morpheme b determiner

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 23

slide-24
SLIDE 24

Syntactic Tag Set of Chunks

ADJP adjective phrase ADVP adverbial phrase headed by AD (adverb) CLP classifier phrase CP clause headed by C (complementizer) DNP phrase formed by ′XP + DEG′ DP determiner phrase DVP phrase formed by ′XP + DEV ′ FRAG fragment IP simple clause headed by I (INFL) LCP phrase formed by ′XP + LC′ LST list marker NP noun phrase PP preposition phrase PRN parenthetical QP quantifier phrase UCP unidentical coordination phrase VP verb phrase Details are in "The bracketing gudelines for penn chinese treebank 3.0", Technical Re- port 00-08, University of Pennsylvania(2000) IRCS Report.

Zhang, Zens, Ney: Chunk-Reordering for SMT HLT/NAACL April, 2007 24