Language Stuff (Slides from Hal Daume III) Digitizing Speech 2 - - PowerPoint PPT Presentation

language stuff
SMART_READER_LITE
LIVE PREVIEW

Language Stuff (Slides from Hal Daume III) Digitizing Speech 2 - - PowerPoint PPT Presentation

Language Stuff (Slides from Hal Daume III) Digitizing Speech 2 Hal Daum III (me@hal3.name) CS421: Intro to AI Speech in an Hour Speech input is an acoustic wave form s p ee ch l a b


slide-1
SLIDE 1

Language Stuff

(Slides from Hal Daume III)

slide-2
SLIDE 2

CS421: Intro to AI 2 Hal Daumé III (me@hal3.name)

Digitizing Speech

slide-3
SLIDE 3

CS421: Intro to AI 3 Hal Daumé III (me@hal3.name)

Speech in an Hour

➢ Speech input is an acoustic wave form

s p ee ch l a b

Graphs from Simon Arnfield’s web tutorial on speech, Sheffield: http://www.psyc.leeds.ac.uk/research/cogn/speech/tutorial/

“l” to “a” transition:

slide-4
SLIDE 4

CS421: Intro to AI 4 Hal Daumé III (me@hal3.name)

Frequency gives pitch; amplitude gives volume

sampling at ~8 kHz phone, ~16 kHz mic (kHz=1000 cycles/sec)

Fourier transform of wave displayed as a spectrogram

darkness indicates energy at each frequency

s p ee ch l a b

f r e q u e n c y a m p l i t u d e

Spectral Analysis

slide-5
SLIDE 5

CS421: Intro to AI 5 Hal Daumé III (me@hal3.name)

Adding 100 Hz + 1000 Hz Waves

Time (s) 0.05 –0.9654 0.99

slide-6
SLIDE 6

CS421: Intro to AI 6 Hal Daumé III (me@hal3.name)

Spectrum

100 1000

Frequency in Hz Amplitude Frequency components (100 and 1000 Hz) on x-axis

slide-7
SLIDE 7

CS421: Intro to AI 7 Hal Daumé III (me@hal3.name)

Part of [ae] from “lab”

Note complex wave repeating nine times in figure

Plus smaller waves which repeats 4 times for every large pattern

Large wave has frequency of 250 Hz (9 times in .036 seconds)

Small wave roughly 4 times this, or roughly 1000 Hz

Two little tiny waves on top of peak of 1000 Hz waves

slide-8
SLIDE 8

CS421: Intro to AI 8 Hal Daumé III (me@hal3.name)

Back to Spectra

Spectrum represents these freq components

Computed by Fourier transform, algorithm which separates

  • ut each frequency component of wave.

x-axis shows frequency, y-axis shows magnitude (in decibels, a log measure of amplitude)

Peaks at 930 Hz, 1860 Hz, and 3020 Hz.

slide-9
SLIDE 9

CS421: Intro to AI 9 Hal Daumé III (me@hal3.name)

Acoustic Feature Sequence

➢ Time slices are translated into acoustic feature

vectors (~39 real numbers per slice)

➢ These are the observations, now we need the hidden

states X

f r e q u e n c y

……………………………………………..e12e13e14e15e16………..

slide-10
SLIDE 10

CS421: Intro to AI 10 Hal Daumé III (me@hal3.name)

State Space

P(E|X) encodes which acoustic vectors are appropriate for each phoneme (each kind of sound)

P(X|X’) encodes how sounds can be strung together

We will have one state for each sound in each word

From some state x, can only:

Stay in the same state (e.g. speaking slowly)

Move to the next position in the word

At the end of the word, move to the start of the next word

We build a little state graph for each word and chain them together to form our state space X

slide-11
SLIDE 11

CS421: Intro to AI 11 Hal Daumé III (me@hal3.name)

HMMs for Speech

slide-12
SLIDE 12

CS421: Intro to AI 12 Hal Daumé III (me@hal3.name)

Markov Process with Bigrams

Figure from Huang et al page 618

slide-13
SLIDE 13

CS421: Intro to AI 13 Hal Daumé III (me@hal3.name)

Decoding

While there are some practical issues, finding the words given the acoustics is an HMM inference problem

We want to know which state sequence x1:T is most likely given the evidence e1:T:

From the sequence x, we can simply read off the words

slide-14
SLIDE 14

CS421: Intro to AI 14 Hal Daumé III (me@hal3.name)

Training (aka “preview of ML”)

Two key components of a speech HMM:

Acoustic model: p(E | X)

Language model: p(X | X')

Where do these come from?

Can we estimate these models from data:

p(E | X) might be estimated from transcribed speech

p(X | X') might be estimated from large amounts of raw text

slide-15
SLIDE 15

CS421: Intro to AI 15 Hal Daumé III (me@hal3.name)

n-gram Language Models

➢ Assign a probability to a sequences of words ➢ If I gave you a copy of the web, how would you

estimate these probabilities? pw1,w2,...,wI = ∏

i=1 I

pwi∣ w1,... ,wi−1 ≈ ∏

i=1 I

pwi∣ wi−k ,...,wi−1

Need to “smooth” estimates intelligently to avoid zero probability n-grams. Language modeling is the art of good smoothing. See [Goodman 1998], [Teh 2007]

slide-16
SLIDE 16

CS421: Intro to AI 16 Hal Daumé III (me@hal3.name)

Acoustic models

➢ What if I gave you data like: ➢ How would you estimate p(E|X)? ➢ What's wrong with this approach?

f r e q u e n c y

………………………...…………..sp ee ch l ae b......

slide-17
SLIDE 17

CS421: Intro to AI 17 Hal Daumé III (me@hal3.name)

Acoustic models II

➢ What does our data really look like: ➢ We'd like to know alignments between transcript

and waveform

➢ Suppose someone gave us a good speech

recognizer.... could we figure out alignments from that?

yesterday I went to visit the speech lab Acc: W:

slide-18
SLIDE 18

CS421: Intro to AI 18 Hal Daumé III (me@hal3.name)

Expectation Maximization

➢ A general framework to do parameter

estimation in the presence of hidden variables

➢ Repeat ad infinitum:

E-step: make probabilistic guesses at latent variables

M-step: fit parameters according to these guesses I LIKE A I W:

slide-19
SLIDE 19

CS421: Intro to AI 19 Hal Daumé III (me@hal3.name)

Expectation Maximization

I LIKE A I W: Acc:

e p( e | “I”) p( e | “LIKE”) p( e | “A”) 0.33 0.33 0.33 0.33 0.33 0.33 0.33 0.33 0.33 → 2 → 1 → 1 → 1 → 1 → 1 → 1 → 1 → 1

slide-20
SLIDE 20

CS421: Intro to AI 20 Hal Daumé III (me@hal3.name)

Expectation Maximization

I LIKE A I W: Acc:

e p( e | “I”) p( e | “LIKE”) p( e | “A”) 0.5 0.33 0.33 0.25 0.33 0.33 0.25 0.33 0.33 → 4 → 1 → 1 → 1 → 2 → 2 → 1 → 2 → 2

slide-21
SLIDE 21

CS421: Intro to AI 21 Hal Daumé III (me@hal3.name)

State of the Art DBNs for Speech

slide-22
SLIDE 22

CS421: Intro to AI 22 Hal Daumé III (me@hal3.name)

Summary

➢ HMMs allow us to “separate” two models:

acoustic model (how does what I want to say sound?)

language model (what do I want to say)

➢ Speech recognition is “just” decoding in an

HMM/DBN

Plus a heck of a lot of engineering

➢ Expectation maximization lets us estimate

parameters in models with hidden variables

➢ Most research today focuses on language

modeling

slide-23
SLIDE 23

CS421: Intro to AI 23 Hal Daumé III (me@hal3.name)

Translate Centauri -> Arcturan

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this Centauri sentence to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-24
SLIDE 24

CS421: Intro to AI 24 Hal Daumé III (me@hal3.name)

Topology of the Field

Human Language Technologies

Automatic Speech Recognition Information Retrieval Computational Linguistics Natural Language Processing ICASSP ??? ACL SIGIR

NLP

Machine Translation Summarization Question Answering Information Extraction Parsing “Understanding” Generation

slide-25
SLIDE 25

CS421: Intro to AI 25 Hal Daumé III (me@hal3.name)

A Bit of History

1940s Computations begins, AI hot, Turing test Machine translation = Code-breaking? 1950s Cold war continues 1960s Chomsky and statistics, ALPAC report 1970s Dry spell 1980s Statistics makes significant advances in speech 1990s Web arrives Statistical revolution in machine translation, parsing, IE, etc Serious “corpus” work, increasing focus on evaluation 2000s Focus on optimizing loss functions, reranking How much can we automate? Huge process in machine translation Gigantic corpora become available, scaling New challenges

slide-26
SLIDE 26

CS421: Intro to AI 26 Hal Daumé III (me@hal3.name)

Ready-to-use Data

1994 1996 1998 2000 2002 2004 20 40 60 80 100 120 140 160 180

French-English Chinese-English Arabic-English

M i l l i

  • n

s

  • f

W

  • r

d s ( E n g l i s h s i d e )

slide-27
SLIDE 27

CS421: Intro to AI 27 Hal Daumé III (me@hal3.name)

Classical MT (1970s and 1980s)

Source Text Source Language Analysis Source Lexicon Target Text Target Language Generation Target Lexicon Knowledge Base Transfer/ Interlingua Representation Transfer Rules

slide-28
SLIDE 28

CS421: Intro to AI 28 Hal Daumé III (me@hal3.name)

Layers of complexity

➢ Text: ➢ Morphology: ➢ Syntax: ➢ Semantics:

John saw his brother John see he brother +past +gen

S NNP VBD PRN NN NP NP VP Agent [ Person John ] Event see (+past) Patient Person brother Poss *

slide-29
SLIDE 29

CS421: Intro to AI 29 Hal Daumé III (me@hal3.name)

How Much Analysis?

Source Words Target Words Source Morphology Source Syntax Source Semantics Interlingua Target Morphology Target Syntax Target Semantics A n a l y s i s Generation Direct Classical Not practical for open domain B e c

  • m

i n g P

  • p

u l a r

slide-30
SLIDE 30

CS421: Intro to AI 30 Hal Daumé III (me@hal3.name)

Syntactic Transfer

➢ Now, just get a bunch of linguists to sit down

and write rules and grammars

The student will see the man D N AX V D N NP NP SUB A V OB S Der Student wird den Mann seben D N AX D N V NP NP SUB A OB V S

slide-31
SLIDE 31

CS421: Intro to AI 31 Hal Daumé III (me@hal3.name)

While We're Busy Writing Grammars

Claude Shannon Information Source Noisy Channel Corrupted Output p(s) p(o | s) p(s | o) ∝ p(s) * p(o | s) “Imagined” Words Speech Process Acoustic Signal Need: p(word sequence) and p(signal | word sequence)

slide-32
SLIDE 32

CS421: Intro to AI 32 Hal Daumé III (me@hal3.name)

Acoustic Modeling: p(a | w)

Signal: Transcription: the man ate

Key notion: acoustic-word alignments

Chicken-and-egg problem that we can solve using Expectation Maximization (EM)

slide-33
SLIDE 33

CS421: Intro to AI 33 Hal Daumé III (me@hal3.name)

Speech Rec = Machine Translation?

➢ Peter F. Brown ➢ Stephen A. Della Pietra ➢ Vincent J. Della Pietra ➢ Robert Mercer ➢ The Mathematics of Statistical Machine

Translation: Parameter Estimation

➢ Computational Linguistics 19 (2), June 1993 ➢ Probably the most important paper in NLP in the

last 20 years

“Brown 93”

slide-34
SLIDE 34

CS421: Intro to AI 34 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-35
SLIDE 35

CS421: Intro to AI 35 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-36
SLIDE 36

CS421: Intro to AI 36 Hal Daumé III (me@hal3.name)

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .
slide-37
SLIDE 37

CS421: Intro to AI 37 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp ???

slide-38
SLIDE 38

CS421: Intro to AI 38 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-39
SLIDE 39

CS421: Intro to AI 39 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-40
SLIDE 40

CS421: Intro to AI 40 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-41
SLIDE 41

CS421: Intro to AI 41 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp ???

slide-42
SLIDE 42

CS421: Intro to AI 42 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp

slide-43
SLIDE 43

CS421: Intro to AI 43 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp process of elimination

slide-44
SLIDE 44

CS421: Intro to AI 44 Hal Daumé III (me@hal3.name)

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

Your assignment, translate this to Arcturan: farok crrrok hihok yorok clok kantok ok-yurp cognate?

slide-45
SLIDE 45

CS421: Intro to AI 45 Hal Daumé III (me@hal3.name)

Your assignment, put these words in order: { jjat, arrat, mat, bat, oloat, at-yurp }

Centauri/Arcturan [Knight 97]

  • 1a. ok-voon ororok sprok .
  • 1b. at-voon bichat dat .
  • 7a. lalok farok ororok lalok sprok izok enemok .
  • 7b. wat jjat bichat wat dat vat eneat .
  • 2a. ok-drubel ok-voon anok plok sprok .
  • 2b. at-drubel at-voon pippat rrat dat .
  • 8a. lalok brok anok plok nok .
  • 8b. iat lat pippat rrat nnat .
  • 3a. erok sprok izok hihok ghirok .
  • 3b. totat dat arrat vat hilat .
  • 9a. wiwok nok izok kantok ok-yurp .
  • 9b. totat nnat quat oloat at-yurp .
  • 4a. ok-voon anok drok brok jok .
  • 4b. at-voon krat pippat sat lat .
  • 10a. lalok mok nok yorok ghirok clok .
  • 10b. wat nnat gat mat bat hilat .
  • 5a. wiwok farok izok stok .
  • 5b. totat jjat quat cat .
  • 11a. lalok nok crrrok hihok yorok zanzanok .
  • 11b. wat nnat arrat mat zanzanat .
  • 6a. lalok sprok izok jok stok .
  • 6b. wat dat krat quat cat .
  • 12a. lalok rarok nok izok hihok mok .
  • 12b. wat nnat forat arrat vat gat .

zero fertility

slide-46
SLIDE 46

CS421: Intro to AI 46 Hal Daumé III (me@hal3.name)

Unsupervised EM Training

… la maison ….. la maison bleue …... la fleur … … the house ….. the blue house ...… the flower … All P(french-word | english-word) equally likely

slide-47
SLIDE 47

CS421: Intro to AI 47 Hal Daumé III (me@hal3.name)

Unsupervised EM Training

… la maison ..… la maison bleue ...… la fleur … … the house ..… the blue house ...… the flower … “la” and “the” observed to co-occur frequently, so P(la | the) is increased.

slide-48
SLIDE 48

CS421: Intro to AI 48 Hal Daumé III (me@hal3.name)

Unsupervised EM Training

… la maison ..… la maison bleue …... la fleur … … the house ..… the blue house ...… the flower … “maison” co-occurs with both “the” and “house”, but P(maison | house) can be raised without limit, to 1.0, while P(maison | the) is limited because of “la” (pigeonhole principle)

slide-49
SLIDE 49

CS421: Intro to AI 49 Hal Daumé III (me@hal3.name)

Unsupervised EM Training

… la maison ….. la maison bleue ...… la fleur … … the house ..… the blue house …... the flower … settling down after another iteration

slide-50
SLIDE 50

CS421: Intro to AI 50 Hal Daumé III (me@hal3.name)

Unsupervised EM Training

… la maison ….. la maison bleue …... la fleur … … the house ….. the blue house ...… the flower … Inherent hidden structure revealed by EM training!

  • “A Statistical MT Tutorial Workbook” (Knight, 1999). Promises free beer.
  • “The Mathematics of Statistical Machine Translation” (Brown et al, 1993)
  • Software: GIZA++
slide-51
SLIDE 51

CS421: Intro to AI 51 Hal Daumé III (me@hal3.name)

The IBM Model [Brown et al., 1993]

Mary did not slap the green witch

Mary not slap slap slap the green witch

n(3|slap)

Maria no daba una botefada a la bruja verde

d(j|i)

Mary not slap slap slap NULL the green witch

P NULL

Maria no daba una botefada a la verde bruja

t(la|the)

Use the EM algorithm for training the parameters

slide-52
SLIDE 52

CS421: Intro to AI 52 Hal Daumé III (me@hal3.name)

Decoding for Machine Translation

English Source Translation Model French Output p(e) p(f | e) p(e | f) ∝ p(e) * p(f | e) Decoding:  e = argmax e pe p f ∣e

Problem in NP-hard; use search:

Beam Search Greedy Search Integer Programming A* Search

slide-53
SLIDE 53

CS421: Intro to AI 53 Hal Daumé III (me@hal3.name)

Progress in Statistical MT

insistent Wednesday may recurred her trips to Libya tomorrow for flying Cairo 6-4 ( AFP ) - an official announced today in the Egyptian lines company for flying Tuesday is a company " insistent for flying " may resumed a consideration of a day Wednesday tomorrow her trips to Libya of Security Council decision trace international the imposed ban comment . And said the official " the institution sent a speech to Ministry of Foreign Affairs of lifting on Libya air , a situation her receiving replying are so a trip will pull to Libya a morning Wednesday " . Egyptair Has Tomorrow to Resume Its Flights to Libya Cairo 4-6 (AFP) - said an official at the Egyptian Aviation Company today that the company egyptair may resume as of tomorrow, Wednesday its flights to Libya after the International Security Council resolution to the suspension of the embargo imposed on Libya. " The official said that the company had sent a letter to the Ministry of Foreign Affairs, information on the lifting of the air embargo on Libya, where it had received a response, the first take off a trip to Libya on Wednesday morning ".

2002 2002 2003 2003 2003 2003

slide from C. Wayne, DARPA

slide-54
SLIDE 54

CS421: Intro to AI 54 Hal Daumé III (me@hal3.name)

Reference translation: The U.S. island of Guam is maintaining a high state of alert after the Guam airport and its offices both received an e-mail from someone calling himself the Saudi Arabian Osama bin Laden and threatening a biological/chemical attack against public places such as the airport . Machine translation: The American [?] international airport and its the office all receives

  • ne calls self the sand Arab rich business [?] and so on electronic

mail , which sends out ; The threat will be able after public place and so on the airport to start the biochemistry attack , [?] highly alerts after the maintenance. Reference translation: The U.S. island of Guam is maintaining a high state of alert after the Guam airport and its offices both received an e-mail from someone calling himself the Saudi Arabian Osama bin Laden and threatening a biological/chemical attack against public places such as the airport . Machine translation: The American [?] international airport and its the office all receives

  • ne calls self the sand Arab rich business [?] and so on electronic

mail , which sends out ; The threat will be able after public place and so on the airport to start the biochemistry attack , [?] highly alerts after the maintenance.

Reference translation: The U.S. island of Guam is maintaining a high state

  • f alert after the Guam airport and its offices both

received an e-mail from someone calling himself the Saudi Arabian Osama bin Laden and threatening a biological/chemical attack against public places such as the airport . Machine translation: The American [?] international airport and its the

  • ffice all receives one calls self the sand Arab rich

business [?] and so on electronic mail , which sends

  • ut ; The threat will be able after public place and so
  • n the airport to start the biochemistry attack , [?]

highly alerts after the maintenance.

Tri-gram match Bi-gram matches

Automatic Evaluation of Translation

“Bleu” metric

slide-55
SLIDE 55

CS421: Intro to AI 55 Hal Daumé III (me@hal3.name)

Minimum Error Rate Training for MT

➢ Desire MT system with high BLEU/??? scores ➢ Algorithm:

Build MT system based on generative parameters

Decode development corpus to get n-best lists (~10k best)

Optimize parameters to get high BLEU scores on n- best lists

Repeat until converged [Och, ACL03]

slide-56
SLIDE 56

CS421: Intro to AI 56 Hal Daumé III (me@hal3.name) 56

Phrase-Based Translation

Australia is with North diplomatic relations have that countries few

  • ne of

Kore a

澳洲 是 北 与 韩 有 邦交 的 国家 之一 少数

Australia is with North Korea is diplomatic relations

  • ne of

the few countries Australia is diplomatic relations with North Korea is

  • ne of

the few countries [Koehn, Och and Marcu, NAACL03]

slide-57
SLIDE 57

CS421: Intro to AI 57 Hal Daumé III (me@hal3.name) 57

Training Phrase-Based MT Systems

Maria no daba una botefada a la bruja verde witch green the slap not did Mary

[Koehn, Och and Marcu, NAACL03]

slide-58
SLIDE 58

CS421: Intro to AI 58 Hal Daumé III (me@hal3.name) 58

Decoding Phrase-Based MT

Maria no daba una botefada a la bruja verde Mary did not slap the green witch

➢ Each step induces a cost attributed to:

Language model probability: p(slap | did not)

T-table probability: p(the | a la) and p(a la | the)

Distortion probability: p(skip 1) [for a la --> verde]

Length penalty

... [Koehn, Och and Marcu, NAACL03]

slide-59
SLIDE 59

CS421: Intro to AI 59 Hal Daumé III (me@hal3.name) 59

Hierarchical Phrase-Based MT

澳洲 是 与北 韩 有 邦交 的 国家 之一 少数

few countries North Korea diplomatic relations have with

之一 的

is Australia Australia is North Korea diplomatic relations few countries

有 的 与

[Chiang, ACL05]

slide-60
SLIDE 60

CS421: Intro to AI 60 Hal Daumé III (me@hal3.name) 60

Hierarchical Phrase-Based MT

few countries North Korea diplomatic relations have with

之一 的

is Australia few countries North Korea diplomatic relations have with that is Australia

  • ne
  • f

the is Australia few countries North Korea diplomatic relations have with that

之一

[Chiang, ACL05]

slide-61
SLIDE 61

CS421: Intro to AI 61 Hal Daumé III (me@hal3.name)

Syntax for MT

I ate lunch

PRO VBD NN NP NP VP S

tabeta ate

VBD

hirugohan lunch

NN NP

watashi I

PRO NP VP

x y y wo x

S

x y x wa y I

PRO NP

ate lunch

VBD NN NP VP

wa ate lunch

VBD NN NP VP

watashi wa ate lunch

VBD NN NP

watashi wa wo watashi wa hirugohan wo tabeta

Kevin Knight, Daniel Marcu, Ignacio Thayer, Jonathan Graehl, Jon May, Steve DeNeefe

slide-62
SLIDE 62

CS421: Intro to AI 62 Hal Daumé III (me@hal3.name)

Syntax for MT

Decoding:

Tree-to-tree/string automata

CKY parsing algorithm

Rule learning:

Parsed English corpus

Aligned data (GIZA++)

Extract rules and assign probabilities Time B L E U Phrase-based MT Syntax-based MT

slide-63
SLIDE 63

Conversation as MT

[Ritter, Cherry, Dolan EMNLP 2011]

SMT INPUT: Foreign Text OUTPUT: English Text LEARNING: Parallel Corpora

slide-64
SLIDE 64

Conversation as MT

[Ritter, Cherry, Dolan EMNLP 2011]

SMT Chat INPUT: Foreign Text User Utterance OUTPUT: English Text Response LEARNING: Parallel Corpora Conversations

slide-65
SLIDE 65

Phrase-based MT

[Ritter, Cherry, Dolan EMNLP 2011]

Who wants to come over for dinner tomorrow? Input: Output:

{

want to Yum ! I

{

be there

{

tomorrow !

{

slide-66
SLIDE 66
slide-67
SLIDE 67

CS421: Intro to AI 63 Hal Daumé III (me@hal3.name)

Summary

➢ Old school tranislation = interlingua

Works well for limited domains

Costs a lot of money

➢ New school translation = statistical

Started out naïve

Becoming more linguistically motivated every year

➢ Translation is currently the “hot topic” in NLP

It looks like linguistics really is going to help, after all

(so long as you use it wisely in conjunction with statistics)