Lexical Normalization for Neural Network Parsing
Rob van der Goot, Gertjan van Noord University of Groningen r.van.der.goot@rug.nl 26-01-2018
1 / 31
Lexical Normalization for Neural Network Parsing Rob van der Goot, - - PowerPoint PPT Presentation
Lexical Normalization for Neural Network Parsing Rob van der Goot, Gertjan van Noord University of Groningen r.van.der.goot@rug.nl 26-01-2018 1 / 31 Last Year (CLIN27) kheb da gzien ik heb dat gezien orig gzien kheb da tokenize lookup
1 / 31
2 / 31
3 / 31
4 / 31
5 / 31
root nsubj xcomp advmod parataxis punct advmod advmod
case nmod:tmod
6 / 31
7 / 31
the s2 jumped s1
s0 the b0 lazy b1 dog b2 ROOT b3 fox brown
Scoring:
LSTM f xthe concat LSTM f xbrown concat LSTM f xfox concat LSTM f xjumped concat LSTM f xover concat LSTM f xthe concat LSTM f xlazy concat LSTM f xdog concat LSTM f xROOT concat LSTM b s0 LSTM b s1 LSTM b s2 LSTM b s3 LSTM b s4 LSTM b s5 LSTM b s6 LSTM b s7 LSTM b s8 Vthe Vbrown Vfox Vjumped Vover Vthe Vlazy Vdog VROOT MLP (ScoreLeftArc, ScoreRightArc, ScoreShift)
Taken from Kiperwasser and Goldberg (2016)
8 / 31
9 / 31
word1
LSTM f LSTM b word2
LSTM f LSTM b word3
LSTM f LSTM b 10 / 31
root compound compound amod
11 / 31
root
amod
12 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm.
13 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm.
14 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm.
15 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm.
16 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm. gold
17 / 31
18 / 31
19 / 31
word1
LSTM f LSTM b word2
LSTM f LSTM b word3
LSTM f LSTM b 20 / 31
n
21 / 31
22 / 31
b a s e + c h a r + e x t + c h a r + e x t 48 50 52 54 56 58 60 62 64
raw norm. integr. gold
23 / 31
24 / 31
80 82 84 86 88 90
25 / 31
0.00 0.02 0.04 0.06 0.08 0.10 +8.42e1
26 / 31
27 / 31
28 / 31
29 / 31
30 / 31
31 / 31