Textual Entailment Alina Petrova EMCL TUD, HLT FBK February 22, - - PowerPoint PPT Presentation

textual entailment
SMART_READER_LITE
LIVE PREVIEW

Textual Entailment Alina Petrova EMCL TUD, HLT FBK February 22, - - PowerPoint PPT Presentation

Textual Entailment Alina Petrova EMCL TUD, HLT FBK February 22, 2012 Alina Petrova EMCL TUD, HLT FBK Textual Entailment Introduction Textual Entailment (TE): What is it? a notion from classical logic is applied to natural language


slide-1
SLIDE 1

Textual Entailment

Alina Petrova

EMCL TUD, HLT FBK

February 22, 2012

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-2
SLIDE 2

Introduction

Textual Entailment (TE):

◮ What is it?

a notion from classical logic is applied to natural language using NLP technologies

◮ Which techniques can be applied?

relevant features for detecting TE via machine learning

◮ What is done by the community?

RTE Challenge

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-3
SLIDE 3

Introduction

Textual Entailment (TE):

◮ What is it?

a notion from classical logic is applied to natural language using NLP technologies

◮ Which techniques can be applied?

relevant features for detecting TE via machine learning

◮ What is done by the community?

RTE Challenge Fondazione Bruno Kessler, Human Language Technology group RTE-7 Challenge participation

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-4
SLIDE 4

Natural Language Processing Nowadays

Definition

NLP is an interdisciplinary field which seeks to enable computer to process, understand and generate natural language.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-5
SLIDE 5

Natural Language Processing Nowadays

Definition

NLP is an interdisciplinary field which seeks to enable computer to process, understand and generate natural language. Modern NLP consists of multiple subareas which can be defined by the tasks they aim to solve.

◮ Machine Translation ◮ Information Retrieval ◮ Question Answering ◮ Word Sense Disambiguation ◮ ... ◮ Recognizing Textual Entailment

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-6
SLIDE 6

Textual Entailment

Intuition: Recognizing Textual Entailment is a generic task that captures major semantic inference between pieces of text.

Definition

Given two text fragments, Text (T) and Hypothesis (H): T entails H iff the meaning of H can be inferred from the meaning

  • f T by human reading.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-7
SLIDE 7

Textual Entailment

Intuition: Recognizing Textual Entailment is a generic task that captures major semantic inference between pieces of text.

Definition

Given two text fragments, Text (T) and Hypothesis (H): T entails H iff the meaning of H can be inferred from the meaning

  • f T by human reading.

Notes:

◮ why ”human reading”? ◮ what is a ”text fragment”?

Example: T: If you help the needy, God will reward you. H: Giving money to a poor man has good consequences.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-8
SLIDE 8

TE: How-To

2 opposite approaches: Using formal sematics:

◮ translation of natural language fragments into some logical

systems

◮ classical approach which brings together logic, language and

psychology

◮ successful for narrow domains, but not working on

comprehensive data!

◮ few training data

Using surface structure:

◮ counterintuitive, but proved to be fruitful.

Why? A wide range of entailments follow general patterns that arise from surface (lexical and syntactic) considerations.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-9
SLIDE 9

TE: How-To cont’d

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-10
SLIDE 10

Surface approach

Main feature is lexical similarity.

◮ naive word overlap ◮ n-grams (= sequences of neighboring words) overlap

Ex: A student Computational Logic workshop took place in

  • Vienna. ⇒ Workshop took place in Vienna.

◮ normalized forms

working = work, brought = bring

◮ paraphrasing (different lexical forms with similar meaning)

Ex: A student workshop was organised in the capital of Austria. ⇒ A student workshop took place in Vienna.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-11
SLIDE 11

Surface Approach - cont’d

The entailment holds iff the word overlap reaches a certain threshold. It is set via supervised learning.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-12
SLIDE 12

Surface Approach - cont’d

The entailment holds iff the word overlap reaches a certain threshold. It is set via supervised learning. Statistics on F-measure (2010 data):

◮ best performance - 48.01% ◮ average performance - 33.77% ◮ up to 40% using only lexical matching

But this seems to be a limit for lexical matching.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-13
SLIDE 13

NLP vs. Textual Entailment

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-14
SLIDE 14

NLP contribution to TE

Using extra features from other areas of NLP improve lexical match results:

◮ Semantic Roles ◮ Named Entity Recognition ◮ lexical knowledge bases (VerbOcean, WordNet) ◮ coreference ◮ syntactic parsing

etc.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-15
SLIDE 15

Applications

Textual entailment recognition is used in several NLP tasks:

◮ Question Answering ◮ Information Extraction ◮ Information Retrieval ◮ Text Summarization

and many more.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-16
SLIDE 16

Applications

Textual entailment recognition is used in several NLP tasks:

◮ Question Answering ◮ Information Extraction ◮ Information Retrieval ◮ Text Summarization

and many more. What is it? How TE is used?

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-17
SLIDE 17

Applications

Textual entailment recognition is used in several NLP tasks:

◮ Question Answering ◮ Information Extraction ◮ Information Retrieval ◮ Text Summarization

and many more. What is it? How TE is used? Example: T: The technological triumph known as GPS was incubated in the mind of Ivan Getting. ⇓ entails (1) H: X invented the GPS

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-18
SLIDE 18

Textual Entailment in the Community

Recognizing Textual Entailment challenge. Main Task: given a corpus of T (real data) and a set of H, determine such pairs T-H in which one fragment entails the other.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-19
SLIDE 19

Textual Entailment in the Community

Recognizing Textual Entailment challenge. Main Task: given a corpus of T (real data) and a set of H, determine such pairs T-H in which one fragment entails the other.

◮ compares the performance of TE systems ◮ launched in 2004 by FBK ◮ supported by Microsoft Research

Mehdad, Negri, de Souza, Petrova. FBK Participation in the RTE-7 Main Task. Text Analysis Conference, 2011

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-20
SLIDE 20

FBK System for RTE-7

Multifeature system with lexical similarity being the key feature. An algorithm to compute n-gram match scores for every level of n:

◮ start from 5-grams ◮ eliminate a string when matched ◮ repeat for (n-1) level

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-21
SLIDE 21

FBK System for RTE-7

Multifeature system with lexical similarity being the key feature. An algorithm to compute n-gram match scores for every level of n:

◮ start from 5-grams ◮ eliminate a string when matched ◮ repeat for (n-1) level

Extra NLP features: Semantic Roles, Named Entities, Wordnet, Syntactic Dependencies

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-22
SLIDE 22

Conclusion

◮ TE is an example of how logical notion can be projected to

natural language.

◮ Area of active research. ◮ Straightforward surface techniques outperform semantic

representation approaches...

◮ ...but clever way of computing lexical similarity should be

found to achieve high performance.

Alina Petrova EMCL TUD, HLT FBK Textual Entailment

slide-23
SLIDE 23

Bibliography

◮ Mehdad, Negri, de Souza, Petrova. FBK Participation in the

RTE-7 Main Task. Text Analysis Conference, 2011

◮ Jia, Huang, Ma, Wan, Xiao. RKUTM Participation at TAC

2010 RTE and Summarization Track. Text Analysis Conference, 2010

◮ Majumdar, Bhattacharyya. Lexical Based Text Entailment

System for Main Task of RTE6. Text Analysis Conference, 2010

Alina Petrova EMCL TUD, HLT FBK Textual Entailment