lti Summary Simple transformational approach for modeling - - PowerPoint PPT Presentation

lti
SMART_READER_LITE
LIVE PREVIEW

lti Summary Simple transformational approach for modeling - - PowerPoint PPT Presentation

Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions Michael Heilman and Noah A. Smith lti Summary Simple transformational approach for modeling sentence pair relations. Experiments for


slide-1
SLIDE 1

Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions

lti

Michael Heilman and Noah A. Smith

slide-2
SLIDE 2

Summary

Simple transformational approach for

modeling sentence pair relations.

Experiments for multiple problems:

  • Recognizing textual entailment
  • Paraphrase identification

lti

  • Paraphrase identification
  • Answer selection for question answering

Competitive but not standout performance.

slide-3
SLIDE 3

Intuition

Tree edits are syntactic transformations that can modify semantic properties in various ways.

  • blique

subj. pp-obj.

  • blique

pp-obj.

lti

Mexico Canada

  • We represent sentence pairs as sequences
  • f edits that convert one tree into the other.

L.A. belonged to Before1848 before1848

slide-4
SLIDE 4

Outline

Introduction Connections to Prior Work Finding & Classifying Edit Sequences Experiments

lti

slide-5
SLIDE 5

Prior Work on Sentence Pairs

Numerous approaches for sentence pair

relations, some task-specific.

Considerable work involving tree and

phrase alignments.

lti

Less work on transformational or tree edit

approaches.

  • Harmeling, 07; Bar Haim et al., 07

Das & Smith, 09; MacCartney et al., 08; Zanzotto, 09; Chang et al., NAACL-10; inter alia

slide-6
SLIDE 6

Prior Work on Tree Edit Distance

  • 1. Local edits without reordering.
  • insert, relabel, delete
  • 2. No learning of associations between

labels and features of edit sequences.

lti

  • Chawathe et al., 97; Punyakanok et al., 04;

Wan et al., 06; Bernard et al., 08; inter alia

slide-7
SLIDE 7

Our Method

  • 1. Includes edits for reordering children and

moving subtrees.

  • 2. Learns associations between edit

sequences and features of labeled data.

  • 3. Does not require:

lti

  • 3. Does not require:
  • WordNet
  • Distributional Similarity
  • NER
  • Heavy task-specific tuning
  • Coreference resolution
  • Etc.
  • Possible

future work

slide-8
SLIDE 8

Outline

Introduction Connections to Prior Work Finding & Classifying Edit Sequences Experiments

lti

slide-9
SLIDE 9

Bush shot back: “You're looking pretty young these days.”

DELETE (smile) DELETE (a) DELETE (wry)

Feature Value # edits 8

With a wry smile, Mr. Bush replied, “You're looking pretty young these days.”

θ

lti

  • DELETE (with)

DELETE (smile) DELETE (Mr.) RELABEL (replied, shot) INSERT (back, shot) RELABEL (comma, :)

# edits 8 # unedited nodes 11 # DELETE 5 # INSERT 1 # delete subject …

PARAPHRASE

θ

slide-10
SLIDE 10

Types of Tree Edits

Inserting, Deleting,

Relabeling Nodes

  • INSERT-CHILD
  • INSERT-PARENT
  • DELETE-LEAF
  • DELETE-AND-MERGE
  • RELABEL-NODE

Socrates taught to Plato philosophy. Socrates taught philosophy to Plato.

MOVE-SIBLING

lti

  • RELABEL-EDGE

Reordering Children

  • MOVE-SIBLING

Moving Subtrees

  • MOVE-SUBTREE
  • NEW-ROOT
  • MOVE-SUBTREE

I saw the man with the telescope I saw the man with the telescope

slide-11
SLIDE 11

Complexity

Tree edit distance with insert, relabel,

delete edits:

O(n3logn)

Klein, 98

lti

With reordering and moving subtrees:

Polynominal runtime algorithms not available

slide-12
SLIDE 12

Greedy Best-First Search

We choose the next tree according to the

heuristic function only.

  • We ignore path cost.

Target Tree

Pearl, 84

  • Initial Tree

(e.g., premise) Target Tree (e.g., hypothesis)

slide-13
SLIDE 13

Tree Kernel Search Heuristic

Heuristic compares current tree to target tree ( ). Tree kernel: similarity measure between trees

based on similarities of all their subtrees.

  • Efficient dynamic programming solution.
  • D. Haussler, 99;

Collins & Duffy, 01; Zanzotto & Moschitti, 06; Zelenko et al., 06

slide-14
SLIDE 14

Tree Kernel Search Heuristic

In general, larger trees will have larger kernel

values.

So we “normalize” to [0, 1]:

) , ( 1 ) ( X X K X H − =

  • )

, ( ) , ( ) , ( 1 ) ( X X K X X K X X K X H × − =

heuristic function tree kernel function

slide-15
SLIDE 15

Finding Edit Sequences

Operations are very expressive.

  • Search rarely fails (< 0.5%).

Resulting sequences:

  • Succinct and plausible upon inspection

lti

  • Succinct and plausible upon inspection
  • Internally consistent representation
  • Lead to good performance
slide-16
SLIDE 16

Example Edit Sequence Premise

lti

  • Hypothesis
slide-17
SLIDE 17

Example Edit Sequence

RELABEL-NODE(nearby) MOVE-SUBTREE(Blvd.)

lti

  • MOVE-SUBTREE(Pierce)

Multiple RELABEL-EDGE, DELETE-LEAF, DELETE- AND-MERGE edits

slide-18
SLIDE 18

Classifying by Edit Sequences

Logistic Regression with 33 features.

  • total number of edits
  • number of X edits
  • number of edits removing a subject
  • number of unedited nodes

etc. lti

  • etc.

We learn separate parameters for each

task from labeled sentence pairs.

slide-19
SLIDE 19

Outline

Introduction Connections to Prior Work Finding & Classifying Edit Sequences Experiments

lti

slide-20
SLIDE 20

Recognizing Textual Entailment

Challenge: Decide whether a hypothesis follows from a premise.

Testing: RTE-3 test data. Training: RTE-3 dev. data and data from

Giampiccolo et al., 07

lti

Training: RTE-3 dev. data and data from

previous RTE tasks.

slide-21
SLIDE 21

60 65 70 75 80 Accuracy (%)

RTE-3 Results

de Marneffe et al. 06

Syntactic alignment + classification

Tree edit model MacCartney & Manning 08: Hybrid

  • 50

55

08: Hybrid

de Marneffe et al. 06 + Natural Logic technique

slide-22
SLIDE 22

Paraphrase Identification

Paraphrase ≈ bidirectional entailment. Microsoft Research Paraphrase Corpus

Challenge: Decide whether 2 sentences are paraphrases of each other.

lti

Microsoft Research Paraphrase Corpus

Standard training and testing splits

  • Dolan et al., 04
slide-23
SLIDE 23

60 65 70 75 80 Accuracy (%)

Paraphrase Identification Results

Tree edit model Wan et al. 06

SVM with syntactic dependency overlap, BLEU scores, tree edit distance, etc.

  • 50

55 A

distance, etc.

Das & Smith 09

Quasi-synchronous Grammar to model syntactic alignments + n-gram overlap

slide-24
SLIDE 24

Answer Selection for QA

Challenge: rank sentences by correctness as answers to a given question.

We find edit sequences from answers to

questions.

  • We rank by the estimated probabilities of

correctness.

slide-25
SLIDE 25

Answer Selection Data

Q&A pairs from TREC-8 through TREC-13. Training, Dev., Testing data sets: about 100

questions and 500-1500 answers each

lti

slide-26
SLIDE 26

Answer Selection Results

Punyakanok et al. 04

Tree edit distance

Wang et al. 07

Quasi-synchronous Grammar to model syntactic alignments

0.35 0.45 0.55 0.65 0.75 Ranking Quality an Average Precision)

  • syntactic alignments

Wang et al. 07 + WN

plus lexical semantics from WordNet

Tree edit model

0.25 0.35 R (Mean

slide-27
SLIDE 27

Answer Selection Results

Punyakanok et al. 04

Tree edit distance

Wang et al. 07

Quasi-synchronous Grammar to model syntactic alignments

0.35 0.45 0.55 0.65 0.75 Ranking Quality an Average Precision)

  • syntactic alignments

Wang et al. 07 + WN

plus lexical semantics from WordNet

Tree edit model

0.25 0.35 R (Mean

slide-28
SLIDE 28

Answer Selection Results

Punyakanok et al. 04

Tree edit distance

Wang et al. 07

Quasi-synchronous Grammar to model syntactic alignments

0.35 0.45 0.55 0.65 0.75 Ranking Quality an Average Precision)

  • syntactic alignments

Wang et al. 07 + WN

plus lexical semantics from WordNet

Tree edit model

0.25 0.35 R (Mean

slide-29
SLIDE 29

Conclusions

Syntax-based tree edit algorithm for classifying sentence pairs according to semantic relationships.

Expressive: includes tree edits for

reordering and moving subtrees.

lti

reordering and moving subtrees.

Data Driven: learns parameters from

labeled examples.

Useful for various tasks

slide-30
SLIDE 30

lti