1/58
Deep Neural Ranking Models for Argument Retrieval
Master’s Thesis by Saeed Entezari Referees: Prof. Stein, PD. Dr. Jakoby Supervisor: Michael V¨
- lske
Faculty of Media Bauhaus Universit¨ at Weimar
Deep Neural Ranking Models for Argument Retrieval Masters Thesis by - - PowerPoint PPT Presentation
Deep Neural Ranking Models for Argument Retrieval Masters Thesis by Saeed Entezari Referees: Prof. Stein, PD. Dr. Jakoby Supervisor: Michael V olske Faculty of Media Bauhaus Universit at Weimar September 16, 2020 1/58 Agenda
1/58
Faculty of Media Bauhaus Universit¨ at Weimar
2/58
3/58
ad-hoc retrieval using the collection?
ad-hoc retrieval tasks in the argument retrieval ◮ RQ2.1. Interaction-focused vs. representation-focused? ◮ RQ2.2. Static embedding vs. contextualized embedding? ◮ RQ2.3. Typical Neural ranking model vs. End-to-End?
we require for doing so?
4/58
5/58
6/58
7/58
Figure: The relation between the argument units ([Dumani(2019)])
8/58
9/58
10/58
11/58
12/58
13/58
Figure: Histogram of the unique claims based on the number of tokens
14/58
Table: Normalized claims with the highest number of premises
15/58
16/58
Figure: Histogram of the premises based on their length (number of tokens separated by white space)
17/58
18/58
Figure: Hinge as a pairwise cost function
19/58
claims
20/58
21/58
22/58
Figure: Different datasets and their number of arguments
23/58
Figure: An ideal ranking for a validation query
24/58
25/58
q and d respectively
26/58
Table: Models
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
27/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
Figure: Similarity scores using recurrent neural network
28/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
29/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
30/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
31/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
the top of BERT network
32/58
Model type embedding re-rank GRU rep static yes DRMM int static yes KNRM int static yes CKNRM int static yes Vanilla BERT int contx yes DRMM BERT int contx yes KNRM BERT int contx yes SNRM rep static no
the inputs
33/58
Figure: Training process of SNRM ([Zamani et al.(2018)Zamani, Dehghani, Croft, Learned-Miller, and Kamps])
34/58
35/58
36/58
(a) DRMM (b) Vanilla BERT
37/58
Table: Models
38/58
39/58
Figure: Candidate documents to be re-ranked in the test phase
40/58
Figure: Document retrieval process ([Zamani et al.(2018)Zamani, Dehghani, Croft, Learned-Miller, and Kamps])
41/58
models!
42/58
43/58
44/58
|A∪B| Figure: The heat map of the Jaccard coefficient for the 50 test queries
45/58
46/58
dimension
47/58
48/58
49/58
Model type embedding re-rank MAP@20 nDCG@5 GRU rep static yes 0.241 x DRMM int static yes 0.528 x KNRM int static yes 0.727 0.684 CKNRM int static yes 0.733 x Vanilla BERT int contx yes 0.88 0.404 DRMM BERT int contx yes 0.881 0.371 KNRM BERT int contx yes 0.902 0.319 SNRM rep static no 0.701 x Aggregation x x x x 0.372
50/58
dimensions)
51/58
52/58
Using distant super vision and assigning unrelated documents with Fuzzy similiarty Creat validation set with higher number of unrelated documents
Representation-focused
Contextualized embedding
Improvement needed for end-to-end approach
Linear regression as an aggregation strategy Analysis of result similarity is required
53/58
54/58
55/58
56/58
Figure: An example of MRR calculation
57/58
Figure: An example of MAP calculation
58/58
p
58/58