Attention Shifting for Parsing Speech
Keith Hall and Mark Johnson
Brown University
ACL 2004
July 22, 2004
Attention Shifting for Parsing Speech Keith Hall and Mark Johnson - - PowerPoint PPT Presentation
Attention Shifting for Parsing Speech Keith Hall and Mark Johnson Brown University ACL 2004 July 22, 2004 Attention Shifting Iterative best-first word-lattice parsing algorithm Posits a complete syntactic analyses for each path of a
ACL 2004
July 22, 2004
(Oracle Word Error Rate)
(Number of parser operations)
(Word Error Rate)
Blaheta & Charniak demeriting (ACL99)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 1
7/22/2004 ACL04: Attention Shifting for Parsing Speech 2
Langauge Source
Noisy Channel
Language Output
W P(W|A) = arg max W P(A, W)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 3
w1, ..., wi, ..., wn1 ... Language Model w1, ..., wi, ..., wn2 w1, ..., wi, ..., wn3 w1, ..., wi, ..., wn4 w1, ..., wi, ..., wnm
8 2 3 5 1 6 4 7 10 9 the/0 man/0 is/0 duh/1.385 man/0 is/0 surely/0 early/0 mans/1.385 man's/1.385 surly/0 surly/0.692 early/0 early/0
n-best list extractor
P (W ) = Qk
1 P (wi|π(wk, . . . , w1))
– Select n-best strings using some model (trigram) – Process each string independently – Select string with highest P (A, W )
7/22/2004 ACL04: Attention Shifting for Parsing Speech 4
1 <s>/0 2 the/0 3 duh/1.223 4 man/0 5 mans/0.510 6 man’s/0.916 9 man/0 is/0 7 early/0 surly/0.694 early/0 10 is/0 8/1.307 </s>/0 surely/0 NN VB VBZ VP VB DT JJ
(Mohri, Pereira, & Riley CS&L02)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 5
BASED ON A STUDY LIKE THIS (160 arcs, 72 nodes)
2 i/0 i./10.71 1 if/81.46 3 would/0 i/0 4 not/0 8 see/97.62 5 suggest/0 9 suggested/92.75 just/0 to/80.41 the/54.78 that/133.4 they/55.21 it/73.10 7 anyone/0 6 any/55.00 10 anyway/93.69 anyone/0 11 make/0 made/132.9 would/110.67/22/2004 ACL04: Attention Shifting for Parsing Speech 6
1 <s>/0 2 the/0 3 duh/1.223 4 man/0 5 mans/0.510 6 man’s/0.916 9 man/0 is/0 7 early/0 surly/0.694 early/0 10 is/0 8/1.307 </s>/0 surely/0 Agenda (Priority Queue)
NP(0,4) 0.567 NN(3,9) 0.550
Compute FOM Grammar
VB VB VP VB DT NN
details in (Hall & Johnson ASRU03)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 7
Compresed lattice Outside HMM Best-first PCFG Parser Syntactic Category Lattice Inside-Outside Prunning Local Trees Lexicalized Syntactic Language Model (Charniak Parser) Word String from Optimal Parse Word-lattice First Stage Second Stage
(Charniak ACL01)
with search for candidate parses
7/22/2004 ACL04: Attention Shifting for Parsing Speech 8
7/22/2004 ACL04: Attention Shifting for Parsing Speech 9
Identify Unused Words Clear Agenda/ Add Edges for Unused Words Is Agenda Empty? no Continue Multi-stage Parsing yes PCFG Word-lattice Parser
with normal best-first parsing)
(unused word has 0 outside probability)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 10
(speech-normalization via Roark’s normalization)
(Charniak ACL01)
7/22/2004 ACL04: Attention Shifting for Parsing Speech 11
7/22/2004 ACL04: Attention Shifting for Parsing Speech 12
(4X over-parsing)
Model # edge pops Oracle WER WER n–best (Charniak) 2.5 million 7.75 11.8 100x LatParse 3.4 million 8.18 12.0 10x AttShift 564,895 7.78 11.9
7/22/2004 ACL04: Attention Shifting for Parsing Speech 13
Model # edge pops Oracle WER WER acoustic lats N/A 3.26 N/A 100x LatParse 3.4 million 5.45 13.1 10x AttShift 1.6 million 4.17 13.1
7/22/2004 ACL04: Attention Shifting for Parsing Speech 14
7/22/2004 ACL04: Attention Shifting for Parsing Speech 15