Domain Adaptation for Constituency Parsing Using Partial Annotations
Vidur Joshi Matthew Peters Mark Hopkins
Domain Adaptation for Constituency Parsing Using Partial Annotations - - PowerPoint PPT Presentation
Domain Adaptation for Constituency Parsing Using Partial Annotations Vidur Joshi Matthew Peters Mark Hopkins Constituency Parsing is Useful Textual Entailment (Bowman et al., 2016) Semantic Parsing (Hopkins et al., 2017) Sentiment Analysis
Vidur Joshi Matthew Peters Mark Hopkins
Textual Entailment (Bowman et al., 2016) Semantic Parsing (Hopkins et al., 2017) Sentiment Analysis (Socher et al., 2013) Language Modeling (Dyer et al., 2016)
2
3
Geometry Problem: In the rhombus PQRS, PR = 24 and QS = 10. Question: What's the second-most-used vowel in English? Biochemistry: Ethoxycoumarin was metabolized by isolated epidermal cells via dealkylation to 7-hydroxycoumarin ( 7-OHC ) and subsequent conjugation.
4
5
6
7
8
9
Contextualized word representations improve sample
Span-focused models achieve state-of-the-art constituency parsing results. (Stern et al., 2017)
10
11
Review Contextual Word Representations Partial Annotations: Definition Training Parsing as Span Classification The Span Classification Model Experiments and Results: Performance on PTB and new Domains Adapting Using Partial Annotations
12
13
14
15
16
17
18
19
A triangle has [a perimeter of 16] and one side of length 4 . A triangle has [NP a perimeter of 16] and one side of length 4 . A triangle has a perimeter {of 16 and one side of length 4} .
20
(Mielens et al., 2015)
21
22
23
24
Marginalize out components for which no supervision exists.
25
26
*(Mirroshandel and Nasr, 2011; Majidi and Crane, 2013, Nivre et al., 2014; Li et al., 2016)
Assume probability of a parse factors into a product of probabilities.
27
Assume probability of a parse factors into a product of probabilities.
28
Assume probability of a parse factors into a product of probabilities.
29
Assume probability of a parse factors into a product of probabilities. Objective now simplifies to: Easy if model classifies spans!
30
31
32 *(Cross and Huang, 2016; Stern et al., 2017)
33 *(Cross and Huang, 2016; Stern et al., 2017)
34 *(Cross and Huang, 2016; Stern et al., 2017)
35 *(Cross and Huang, 2016; Stern et al., 2017)
36 *(Cross and Huang, 2016; Stern et al., 2017)
37 *(Cross and Huang, 2016; Stern et al., 2017)
38 *(Cross and Huang, 2016; Stern et al., 2017)
39 *(Cross and Huang, 2016; Stern et al., 2017)
40 *(Cross and Huang, 2016; Stern et al., 2017)
41 *(Cross and Huang, 2016; Stern et al., 2017)
42 *(Cross and Huang, 2016; Stern et al., 2017)
▪ A partial annotation is a labeled span. ▪ A full parse labels every span in the sentence. Therefore, training on both is identical under our derived objective.
43
Find maximum using dynamic programming:
44
45
46
47
48
49
She enjoys playing tennis .
50
She enjoys playing tennis .
51
She enjoys playing tennis .
LSTM LSTM LSTM LSTM LSTM LSTM LSTM LSTM LSTM LSTM
52
She enjoys playing tennis .
53
54
MLP
55
Ours Stern et al., 2017 Objective Maximum likelihood on labels Maximum margin
ELMo Yes No POS Tags as Input No Yes
56
Ours Stern et al., 2017 Objective Maximum likelihood on labels Maximum margin
ELMo Yes No POS Tags as Input No Yes
57
Ours Stern et al., 2017 Objective Maximum likelihood on labels Maximum margin
ELMo Yes No POS Tags as Input No Yes
58
Ours Stern et al., 2017 Objective Maximum likelihood on labels Maximum margin
ELMo Yes No POS Tags as Input No Yes
59
60
Stern et al., 2017
+Maximum Likelihood on Labels
61
Effective Inference for Generative Neural Parsing
*New SoTA is 95.1 (Kitaev and Klein, ACL 2018)
62
63
64
Number of parses from Question Bank F1
65
Number of parses from Question Bank F1
66
Number of parses from Question Bank F1
Improvements taper quickly
67
In the diagram at the right, circle O has a radius of 5, and CE = 2. Diameter AC is perpendicular to chord BD at E. What is the length of BD?
68
Ethoxycoumarin was metabolized by isolated epidermal cells via dealkylation to 7-hydroxycoumarin ( 7-OHC ) and subsequent conjugation .
69
70
71
Correct Constituent % Error-Free Sentences %
72
73
Correct Constituent % Error-Free Sentences %
74
75
76
77
78
79
80
81
Demo: http://demo.allennlp.org/constituency-parsing Datasets: https://github.com/vidurj/parser-adaptation/tree/master/data
82