MPII at the NTCIR-14 WWW-2 Task
Andrew Yates Max Planck Institute for Informatics
MPII at the NTCIR-14 WWW-2 Task Andrew Yates Max Planck Institute - - PowerPoint PPT Presentation
MPII at the NTCIR-14 WWW-2 Task Andrew Yates Max Planck Institute for Informatics Motivation Opportunity to evaluate NIR model (participatingin pool) Previously evaluated on TREC Web Track 09-14 (WSDM '18, EMNLP '17) With long queries
Andrew Yates Max Planck Institute for Informatics
Opportunity to evaluate NIR model (participatingin pool)
(WSDM '18, EMNLP '17)
Significant improvement with a strong signal from WSDM '18? How does it compare to BM25 with short queries (& pool)?
2
3
beats dortmund bayern
Query-document similarity matrix
4
Query Document
bayern beats dortmund
Match patterns (Convolutional kernels)
5
bayern beats dortmund bayern beats dortmund
Document window Query
PACRR: A Position-Aware Neural IR Model for RelevanceMatching. K Hui, A Yates, K Berberich, G de Melo. In: EMNLP '17.
bayern beats dortmund
Ordered match
6
bayern beats dortmund bayern beats dortmund
Document window Query Partial match Reversed ordered match
PACRR: A Position-Aware Neural IR Model for RelevanceMatching. K Hui, A Yates, K Berberich, G de Melo. In: EMNLP '17.
Matches are local: consider NxN regions of the matrix
7
beats dortmund bayern bayern beats dortmund
PACRR: A Position-Aware Neural IR Model for RelevanceMatching. K Hui, A Yates, K Berberich, G de Melo. In: EMNLP '17.
Patterns are exclusive: each region is best matched by a single pattern
8
beats dortmund bayern
PACRR: A Position-Aware Neural IR Model for RelevanceMatching. K Hui, A Yates, K Berberich, G de Melo. In: EMNLP '17.
(1) CNN kernels capture patterns
9
w: kernel
PACRR: A Position-Aware Neural IR Model for RelevanceMatching. K Hui, A Yates, K Berberich, G de Melo. In: EMNLP '17.
(1) CNN kernels capture patterns
10
Signal for this region: w1,1 x1,6 + w1,2 x1,7 + w1,3 x1,8 + … + w2,1 x2,6 + … w3,3 x3,8 6 7 8 1 2 3 w: kernel
(1) CNN kernels capture patterns
11
(2) Max pool kernels
11
Best-matching pattern Signal: 1.0 Signal: 0.3 Signal: 0
(1) CNN kernels capture patterns
12
(2) Max pool kernels
12
(3) K-max pool query signals from doc regions K=2
(1) CNN kernels capture patterns
13
(2) Max pool kernels
13
(3) K-max pool query signals from doc regions For each query term, we now have:
(1) CNN kernels capture patterns
14
(2) Max pool kernels
14
(3) K-max pool query signals from doc regions (4) Combination function (FC layers) produce a score for each query term (5) Document score is the summation [Steps 4 & 5 differ from original papers]
(1) CNN kernels capture patterns
15
(2) Max pool kernels
15
(3) K-max pool query signals from doc regions (4) Combination function (FC layers) produce a score for each query term (5) Document score is the summation [Steps 4 & 5 differ from original papers]
Related to MatchPyramid, but e.g., different pooling strategies
A Study of MatchPyramid Models on Ad-hoc Retrieval. L. Pang,
An experimental comparison of click position-bias models. Craswell et al. WSDM '08.
16
Document A Document B
17
For each query term, PACRR retains top k match signals
Query term FC receives match signals from different cutoffs
Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval. K Hui, A Yates, K Berberich, G de Melo. In: WSDM '18.
18
19
re-rank BM25 run provided by organizers
20
21
Recent work building on PACRR (and other NIR models):
CEDR: Contextualized Embeddings for Document Ranking.