
SLIDE 1

Retrieving Comparative Arguments using Deep Pre-trained Language Models and NLU

Viktoriia Chekalina, Alexander Panchenko

Skolkovo Institute of Science and Technology, Philips Innovations Lab RUS

23 September 2020


SLIDE 2

Task Methodology

◮ Given a set of topics with a comparative query, i.e. a question like “What is better, X or Y?”
◮ For each topic:
  ◮ retrieve documents from the ClueWeb12¹ corpus with the ChatNoir² search engine;
  ◮ find the documents that answer the proposed comparative question most fully and reasonably.

Methodology

◮ for each topic t_i and every retrieved document d_j, compute a similarity measure s_ij = sim(t_i, d_j)
◮ rank the documents according to these similarities

¹ https://lemurproject.org/clueweb12
² https://www.chatnoir.eu/doc
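The retrieve-then-re-rank step above can be sketched as follows. The function names and the toy word-overlap similarity are illustrative, not the authors' actual code; any similarity function s_ij = sim(t_i, d_j) can be plugged in.

```python
# Sketch of the re-ranking step: each retrieved document is scored against
# the topic with some similarity function, then documents are sorted by score.

def rank_documents(topic, documents, sim):
    """Return (document, score) pairs sorted by descending similarity."""
    scored = [(doc, sim(topic, doc)) for doc in documents]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

def overlap_sim(topic, doc):
    """Toy similarity: fraction of topic words that also occur in the document."""
    t, d = set(topic.lower().split()), set(doc.lower().split())
    return len(t & d) / max(len(t), 1)

docs = [
    "a laptop is better than a desktop",
    "cats are popular pets",
    "desktop or laptop, which is better?",
]
ranking = rank_documents("which is better laptop or desktop", docs, overlap_sim)
```

A real run would replace `overlap_sim` with one of the model-based similarities described on the following slides.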


SLIDE 3

Methodology & Approaches

Methodology

[Scheme] Topic (query) → ChatNoir (inverted index + BM25) → candidate documents and their scores → topic–document similarity computation → similarity scores → documents sorted by similarity score

Scheme of our methods: candidate documents are obtained using a traditional inverted index-based IR system and then re-ranked on the basis of topic-document similarity

Approaches

◮ Inverted-index relevance estimation from the ChatNoir system
◮ Approaches based on SOTA NLP models: LSTM, BERT, Transformers
◮ Comparative feature-based approaches: the number of comparative structures in a document and the density of comparative sentences


SLIDE 4

Approaches

ChatNoir build-in ranking

◮ ChatNoir has a custom BM25 ranking function³ based on an inverted index.
◮ similarity s_ij = cn_ij, where cn_ij is the score provided by ChatNoir
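For intuition, here is a minimal sketch of the BM25 scoring family behind ChatNoir's built-in ranking. The k1/b defaults and the tiny corpus are illustrative; ChatNoir's actual Elasticsearch configuration may differ.

```python
import math

# Minimal BM25 sketch: sum, over query terms, an IDF weight times a
# saturated, length-normalized term frequency.

def bm25_scores(query, docs, k1=1.2, b=0.75):
    tokenized = [d.lower().split() for d in docs]
    N = len(tokenized)
    avgdl = sum(len(d) for d in tokenized) / N
    scores = []
    for d in tokenized:
        score = 0.0
        for term in query.lower().split():
            tf = d.count(term)
            df = sum(1 for doc in tokenized if term in doc)
            idf = math.log(1 + (N - df + 0.5) / (df + 0.5))
            score += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(score)
    return scores

scores = bm25_scores("laptop desktop",
                     ["laptop versus desktop comparison", "sunny weather today"])
```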

LSTM-ULMFit

◮ we use the pre-trained bi-directional LSTM language model LSTM-ULMFit
◮ similarity s_ij = cos(h_i, h_j), where h_i and h_j are the hidden states of the model after processing the topic title and the document body, respectively
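The cosine comparison of the two hidden states reduces to a standard vector operation. The toy vectors below stand in for real model hidden states:

```python
import math

# Cosine similarity between two fixed-size vectors, as used to compare the
# hidden state of the topic title with that of the document body.

def cosine(h_i, h_j):
    dot = sum(a * b for a, b in zip(h_i, h_j))
    norm_i = math.sqrt(sum(a * a for a in h_i))
    norm_j = math.sqrt(sum(b * b for b in h_j))
    return dot / (norm_i * norm_j)
```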

³ https://www.elastic.co/guide/en/elasticsearch/reference/current/index-modules-similarity.html


SLIDE 5

Approaches

Bidirectional encoder representations from Transformer (BERT)

◮ the BERT model provides word embeddings
◮ similarity s_ij = cos(e_topic_title, e_doc_title), where e_topic_title and e_doc_title are the averages of the word embeddings in the topic’s title and the retrieved document’s title, respectively
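A sketch of this title-to-title comparison, assuming per-token embeddings are already available. The tiny 2-d embedding table is a made-up stand-in for real BERT word embeddings:

```python
import math

# Average the per-token embeddings of a title into one vector, then compare
# topic and document titles by cosine similarity.

EMB = {  # hypothetical 2-d "embeddings" for illustration only
    "laptop":  [0.9, 0.1],
    "desktop": [0.8, 0.2],
    "banana":  [0.1, 0.9],
}

def avg_embedding(title):
    vecs = [EMB[w] for w in title.lower().split() if w in EMB]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

s = cosine(avg_embedding("laptop desktop"), avg_embedding("laptop"))
```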


SLIDE 6

Approaches

Attention of a Transformer-based language model

◮ We concatenate the topic title and the document body into one sequence
◮ We use the output of the self-attention layer of a pre-trained Transformer model
◮ Based on the obtained token-to-token responses, we build attention maps
◮ In these maps, we select the area that describes the responses of tokens from one sentence to the other. For attention heads 3, 4, 10 and 11, we sum over this area, and the obtained sum is the similarity
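The cross-sentence summation can be sketched as below. The 4×4 map and the sequence split are illustrative stand-ins for a real head's attention output; in the actual method this sum is taken for heads 3, 4, 10 and 11.

```python
# Sum the attention weights in the "cross" blocks of a token-to-token map:
# topic tokens attending to document tokens, and vice versa.

def cross_sentence_score(attn, topic_len):
    """attn: square attention map; the topic occupies the first topic_len
    positions of the concatenated sequence, the document the rest."""
    n = len(attn)
    score = 0.0
    for i in range(n):
        for j in range(n):
            topic_to_doc = i < topic_len <= j
            doc_to_topic = j < topic_len <= i
            if topic_to_doc or doc_to_topic:
                score += attn[i][j]
    return score

attn = [[0.7, 0.1, 0.1, 0.1],   # toy map: 2 topic tokens + 2 document tokens
        [0.1, 0.6, 0.2, 0.1],
        [0.2, 0.1, 0.6, 0.1],
        [0.1, 0.2, 0.1, 0.6]]
s = cross_sentence_score(attn, topic_len=2)
```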


SLIDE 7

Approaches

The attention map for input “[CLS]” + Which is better, a laptop or a desktop?+ “[SEP]” + Laptop is preferable than desktop because it is more portable. + “[SEP]”. Areas highlighted in red participate in the similarity calculation.


SLIDE 8

Approaches

Comparative features

For every topic–document pair we compute an additive-multiplicative term r_ij based on comparative features. This term describes how comparative the document is. Initially r_ij = 1.

◮ From the topic we extract the objects of comparison and look for the same objects in the document.
◮ The term is updated as: r_ij = r_ij ∗ 1.2 if we find one object, r_ij = r_ij ∗ 1.5 if we find two objects.
◮ If we find l aspects (features of comparison) and predicates in the document, we increase r_ij by l.
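The update rule above can be sketched as follows. The object and aspect extraction is assumed to happen elsewhere; how the aspect count l enters (here added to r_ij) is our reading of the slide, and the example values are illustrative.

```python
# Sketch of the comparative-feature term r_ij: start at 1, multiply by 1.2
# if one compared object from the topic occurs in the document, by 1.5 if
# both do, then add the number of aspects/predicates found.

def comparative_term(objects, doc_tokens, n_aspects):
    r = 1.0
    found = sum(1 for obj in objects if obj in doc_tokens)
    if found == 1:
        r *= 1.2
    elif found >= 2:
        r *= 1.5
    r += n_aspects  # l aspects / predicates found in the document
    return r

r = comparative_term(["laptop", "desktop"],
                     "a laptop is lighter than a desktop".split(),
                     n_aspects=1)
```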


SLIDE 9

Approaches

Comparative features

◮ Using a classifier of comparative sentences based on InferSent, we count the number of comparative sentences in the document
◮ r_ij = r_ij + n, where n is the number of comparative sentences. We combine the resulting term with ChatNoir’s and ULMFit’s scores, so the similarity is s_ij = cn_ij ∗ r_ij
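Putting the two pieces together, the final score is the base retrieval score scaled by the comparative term. The numeric values are illustrative:

```python
# Sketch of the final combination: add the comparative-sentence count n to
# r_ij, then multiply the base score cn_ij (e.g. ChatNoir's BM25 score).

def final_score(cn_ij, r_ij, n_comparative_sentences):
    r_ij = r_ij + n_comparative_sentences   # r_ij = r_ij + n
    return cn_ij * r_ij                     # s_ij = cn_ij * r_ij

s = final_score(cn_ij=10.0, r_ij=1.2, n_comparative_sentences=3)
```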


SLIDE 10

Results

The evaluation was carried out manually and is based on the presence of relevant information in the document. The table reports DCG@5 (discounted cumulative gain) scores for the approaches mentioned above.


SLIDE 11

Discussion

◮ We applied several approaches to the Argument Retrieval shared task: the baseline from a search engine, information from pre-trained NLP models, and the presence of comparative structures in documents.
◮ The best scores are achieved by the model based on inverted indices.
◮ The model that takes comparative structures into account gives better results than the ones based only on SOTA NLP models.
